AI Friends

If you want to read user elements on a screen and get a structured response, is a...

If you want to read user elements on a screen and get a structured response, https://huggingface.co/microsoft/OmniParser is a sick tool. I recently https://github.com/addy999/omniparser-api it in an API to self host
huggingface.co github.com
CodeCode Repositorygithub.com