can I use this model to extract text from an entire document?

by sergenti - opened Feb 25, 2023

Feb 25, 2023

Hey there, I am working on a PDF parsing project.

Is there a way to use this model to extract an entire page?

OR - are there any other models capable of extracting text from images like these? (don't mind the red rectangle)
I tried other python libraries and the results are bad

P.S. yes, I am using another model to detect tables and remove them in order to improve the parsing

P.P.S. yes, the image above is taken from "attention is all you need" lol

wanbiguizhao

Apr 26, 2023

Hey there, I am working on a PDF parsing project.

Is there a way to use this model to extract an entire page?

OR - are there any other models capable of extracting text from images like these? (don't mind the red rectangle)
I tried other python libraries and the results are bad

P.S. yes, I am using another model to detect tables and remove them in order to improve the parsing

P.P.S. yes, the image above is taken from "attention is all you need" lol
maybe you can try layoutlmv3 ,which can analysis document layout,help detect table ,title,text,etc

wanbiguizhao

Apr 26, 2023

https://arxiv.org/pdf/2204.08387.pdf paper

ldemiguel

May 30, 2023

At the end did you find an answer for extract an entire page?

wanbiguizhao

Jun 2, 2023

At the end did you find an answer for extract an entire page?

i'm trying

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment