OCR dataset results #46

cagey-squirrel · 2025-01-07T22:59:16Z

Could you please share results from using OCR text extracting approaches on the datasets shared on HF? I am having trouble with PPOCR and would like to replicate results with text for RAG.

What would work best for me is the output of calling eval on ChartQA, InfoVQA, MP-DocVQA and SlideVQA datasets but with their text content being used for RAG instead of images (model used and OCR method does not matter too much, would prefer the one with best results).

tcy6 · 2025-01-11T15:12:42Z

@cagey-squirrel

Could you please share results from using OCR text extracting approaches on the datasets shared on HF? I am having trouble with PPOCR and would like to replicate results with text for RAG.

I’m very sorry, but we couldn’t find the OCR results after conducting the experiments.

What would work best for me is the output of calling eval on ChartQA, InfoVQA, MP-DocVQA and SlideVQA datasets but with their text content being used for RAG instead of images (model used and OCR method does not matter too much, would prefer the one with best results).

Could you please explain this in more detail? I’m having trouble understanding it.

cagey-squirrel · 2025-01-11T15:27:29Z

Hi, thanks for the response.

When eval.sh script is called it produces embeddings.corpus, embeddings.query, test_result.log and test..rec files
It would be great if you had the test..trec files which are generated after running eval.sh on OCR versions of datasets ChartQA, InfoVQA, MP-DocVQA and SlideVQA.

tcy6 · 2025-01-11T16:21:03Z

Hi, thanks for the response.

When eval.sh script is called it produces embeddings.corpus, embeddings.query, test_result.log and test..rec files It would be great if you had the test..trec files which are generated after running eval.sh on OCR versions of datasets ChartQA, InfoVQA, MP-DocVQA and SlideVQA.

Let me try to find it~
BTW, which OCR are you referring to?

cagey-squirrel · 2025-01-11T18:46:14Z

Not too important, if there are multiple then one with the best results.
Also I wanted to know how did you partition the text into chunks for RAG with OCR? Did you extract text page by page or did you group the chunks differently?

tcy6 · 2025-01-12T06:45:14Z

Not too important, if there are multiple then one with the best results. Also I wanted to know how did you partition the text into chunks for RAG with OCR? Did you extract text page by page or did you group the chunks differently?

We extract text page by page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OCR dataset results #46

OCR dataset results #46

cagey-squirrel commented Jan 7, 2025 •

edited

Loading

tcy6 commented Jan 11, 2025

cagey-squirrel commented Jan 11, 2025

tcy6 commented Jan 11, 2025 •

edited

Loading

cagey-squirrel commented Jan 11, 2025

tcy6 commented Jan 12, 2025

OCR dataset results #46

OCR dataset results #46

Comments

cagey-squirrel commented Jan 7, 2025 • edited Loading

tcy6 commented Jan 11, 2025

cagey-squirrel commented Jan 11, 2025

tcy6 commented Jan 11, 2025 • edited Loading

cagey-squirrel commented Jan 11, 2025

tcy6 commented Jan 12, 2025

cagey-squirrel commented Jan 7, 2025 •

edited

Loading

tcy6 commented Jan 11, 2025 •

edited

Loading