How does this work? Do you first scrape the PDF, or do you have GPT-4 multimodal access? The privacy policy link is broken at the moment, so I can’t tell for sure.

I can't answer for theirs but I made one too:

https://docalysis.com/

The way it works is you first parse the PDF to extract its text, then pass the relevant parts of that text to an LLM along with the user's question.
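
To make that concrete, here is a rough sketch of the "pick the relevant text, then ask the LLM" step. It is not the actual Docalysis code: it assumes the PDF text has already been extracted into a string, uses plain keyword overlap as a stand-in for whatever relevance ranking a real service uses, and stops at building the prompt rather than calling any model. All function names are made up for illustration.

```python
import re
from collections import Counter


def chunk_text(text, max_words=80):
    """Split already-extracted PDF text into chunks of at most max_words words."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def tokenize(s):
    """Lowercase and split into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", s.lower())


def score(chunk, question):
    """Count occurrences of the question's distinct words in the chunk.

    Assumption: keyword overlap is only a placeholder for a real ranking method.
    """
    counts = Counter(tokenize(chunk))
    return sum(counts[w] for w in set(tokenize(question)))


def top_chunks(chunks, question, k=2):
    """Return the k chunks that score highest against the question."""
    return sorted(chunks, key=lambda c: score(c, question), reverse=True)[:k]


def build_prompt(chunks, question):
    """Assemble the text that would be sent to the LLM (hypothetical prompt wording)."""
    context = "\n---\n".join(chunks)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )
```

You would call `chunk_text` on the extracted document, pass the result through `top_chunks` with the user's question, and send the output of `build_prompt` to whichever model you use.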

Can you elaborate on how you parse the PDF? Are you simply converting it to text using a Python library, or using something more robust like GROBID[1]?

1: https://github.com/kermitt2/grobid