Transcription summaries and actions

swtb · March 4, 2025, 10:25pm

What is currently the best small model for summarising transcripts and extracting actions?

I’m looking at the <5B parameter or maybe <10B parameter classes

Transcripts will be produced by whisper + pyanote/diarization.

Audio clips will be at least 1hours long possibly as long as 6 hours in rare cases. So we can expect large transcripts.

John6666 · March 5, 2025, 11:15am

For smaller models, I think the Llama 3.2 or Qwen 2.5 series are safe, but there may be specific benchmarks on the leaderboard. The URL below is for the long-context-support version of Qwen.

swtb · March 5, 2025, 3:19pm

Thanks for this, I wasnt aware of Qwens long context model

Any thoughts on wether it will be better to use long context and try to summarise in one go compared to chunking the input into intermediate summaries?

John6666 · March 5, 2025, 3:29pm

It would probably be more accurate to have the model directly summarize long contexts, but it would probably require a huge amount of VRAM and latency to process long contexts at once, so it would probably be smarter to process them in chunks. I think it would be easier to summarize short texts in chunks even with a small model.

Topic		Replies	Views
Teams Transcript Summarisation Intermediate	0	258	April 17, 2023
Summarization on long documents 🤗Transformers	63	59976	August 16, 2024
Summarization pipeline on long text Beginners	6	4841	December 14, 2022
Help to choose a model for compact summarization 🤗Hub	1	308	November 8, 2024
Pegasus Summarization API_Inference Beginners	4	393	May 28, 2021

Transcription summaries and actions

Related topics