This is very interesting.
I’m still noodling on how to send a full page screenshot to a model and get it to return the individual images (or the bounds of them) in the page.
Have you looked at https://github.com/facebookresearch/segment-anything ?
This is very interesting.
I’m still noodling on how to send a full page screenshot to a model and get it to return the individual images (or the bounds of them) in the page.