- @simon.fedi.simonwillison.net.ap.brid.gy Have you tried Gemini Computer Use? It connects the LLM with your Browser including launching browser, inputting text, clicking buttons, scrolling, taking PNGs for multi modal I/O Strongly recommendOct 11, 2025 19:47