Open-Source Fine-tuned LLM Models for Data Extraction Tasks

bubbleMilkTea · September 24, 2024, 9:43am

I’ve used OpenAI GPT-4 for data extraction, but since it’s a general-purpose commercial model, it’s not specifically fine-tuned for data extraction tasks. I believe GPT-4 may not perform as well as models fine-tuned exclusively for this purpose. Therefore, I’m looking for open-source LLMs that are specifically trained for data extraction and offer high accuracy and efficiency. Could you recommend any models that fit these criteria?

John6666 · September 24, 2024, 12:25pm

I’m totally ignorant about LLMs for specific applications, but why not try a few candidates yourself to get a feel for which language model would work best as a base?
In general on HF, the quickest route is to find a Space that uses an LLM for a similar use case, then read its source code or ask the author a question.
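
A rough sketch of what "trying it out" could look like, assuming each candidate model is wrapped in a function that takes text and returns a dict of extracted fields. The two extractor functions and the sample below are hypothetical placeholders, not real models or real data; in practice each function would call the model being evaluated.

```python
from typing import Callable, Dict, List, Tuple

# An extractor takes raw text and returns the fields it found.
Extractor = Callable[[str], Dict[str, str]]


def field_accuracy(extract: Extractor,
                   samples: List[Tuple[str, Dict[str, str]]]) -> float:
    """Fraction of expected fields that the extractor reproduces exactly."""
    correct = 0
    total = 0
    for text, expected in samples:
        predicted = extract(text)
        for key, value in expected.items():
            total += 1
            if predicted.get(key) == value:
                correct += 1
    return correct / total if total else 0.0


# Hypothetical stand-ins (assumption): replace the bodies with calls
# to the candidate models you want to compare.
def candidate_a(text: str) -> Dict[str, str]:
    return {"name": "Ada Lovelace", "year": "1843"}


def candidate_b(text: str) -> Dict[str, str]:
    return {"name": "Ada Lovelace"}


# Hand-labeled examples from your own data (made up here for illustration).
samples = [
    ("Ada Lovelace published her notes in 1843.",
     {"name": "Ada Lovelace", "year": "1843"}),
]

scores = {fn.__name__: field_accuracy(fn, samples)
          for fn in (candidate_a, candidate_b)}
print(scores)
```

Exact-match scoring is the simplest choice; for free-text fields a looser comparison (normalized whitespace, case-insensitive match) may be more appropriate.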
