What does HackerNews think of tabula-java?

Extract tables from PDF files

Language: Java

There are probably newer AI-powered tools, but Tabula is the main library I know of: https://github.com/tabulapdf/tabula-java

tabula-java [0], a library for extracting tables from PDF files. It started as a monolithic webapp written in JRuby, and we later extracted the table detection and segmentation logic into a Java library.

[0] https://github.com/tabulapdf/tabula-java