What does HackerNews think of scancode-toolkit?
:mag_right: ScanCode detects licenses, copyrights, dependencies by "scanning code" ... to discover and inventory open source and third-party packages used in your code. Sponsored by NLnet project https://nlnet.nl/project/vulnerabilitydatabase, the Google Summer of Code, Azure credits, nexB and others generous sponsors!
Language:
Python
We run a script that scans our source code and generates a manifest of all included libraries including license/copyright. Here's one of them:
The most accurate license detector is ScanCode. I think it uses some sort of rolling checksum thing for detection.