That is funny. For audio books I'm currently working on an `epub` command for `tone` which will be able to extract text from `epub` files, e.g.:

  tone epub --format="markdown" --extract-sentences --one-file-per-chapter output-path/

As a result, you can use https://github.com/readbeyond/aeneas with the generated text / markdown files to create a json mapping file looking like this:

  {
   "fragments": [
    {
     "begin": "0.000",
     "children": [], 
     "end": "7.920",
     "id": "f000001",
     "language": "eng",
     "lines": [
      "This is the first sentence of the audio book."
     ]   
    }
  }

Since aeneas is a bit inaccurate, I'm also working on an improvement with silence detection for these mapping files.

If you are looking for something that is "ready to use", you could check out https://github.com/r4victor/syncabook or the according library https://github.com/r4victor/afaligner

If you have audio files, that are NOT audio books, the epub approach will not help you and the other comments are more helpful.