Speaking of being data-driven, I think I read that the Rust compiler team tests the compiler against some massive list of popular crates to make sure it doesn't break anything.

Would it be a reasonable use of resources to run all those test suites and identify hot spots for community-wide optimization?

The tool you're referring to is called Crater: https://github.com/rust-lang/crater.