There's probably newer AI powered tools but Tabula is the main library I know of https://github.com/tabulapdf/tabula-java
tabula-java [0], a library for extracting tables from PDF files. It started as a monolithic webapp written in JRuby, and we later extracted the table detection and segmentation logic into a Java library.