Vision Import for PDFs

1 point, posted a month ago
by artursapek

3 Comments

damnitbuilds

a month ago

Blurb: "Unlike traditional PDF text extraction, this approach actually 'reads' your PDF like a human would, preserving formatting, tables, and document structure with high accuracy."

Input text in their example:

QUENE ELI-

sabet, Quene of England

Their output text from their example:

QUEENE ELIZABETH

Elizabeth, Queene of England

Try harder.

artursapek

a month ago

Classic HN snark. It’s an example that is supposed to show the edge of its capabilities. You won’t find another word processor that can even come close.

damnitbuilds

a month ago

No, this is clearly fair criticism that shows them failing at what they say they do well.

"Come close"? Nonsense. A free online OCR tool got me a much better result:

QVENE ELI-

fabet, Quene of England,