Automation MCP
Give models access to your mouse, keyboard and screen
Control your Mac with detailed mouse, keyboard, screen, and window management capabilities. Works really well with Gemini 2.5 Pro and good prompting :D Let me know if you have any feedback or find any bugs :)
Swiddle
This looks super cool!
Do you have some example use cases that you’ve tested? For example, can it launch a browser and perform a web search?
Swiddle
@tleyden Yes, it can technically do anything you can, but performance varies by model.
I've tested browser searches and WhatsApp + Discord messages.
Gemini 2.5 Pro works well, but it isn't 100% there yet.
I suggest adding this at the end of the prompt:
screenInfo and screenshot are required for you to move the mouse to the correct position. Take a screenshot after each action to be sure you execute correctly.
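If you build prompts in code, here is a minimal sketch of attaching that suffix automatically. It assumes the prompt is a plain string; the helper name `with_screen_guidance` and the constant `SCREEN_GUIDANCE` are made up for illustration and are not part of Automation MCP.

```python
# Suffix quoted from the suggestion above. The tool names screenInfo and
# screenshot come from that text; everything else here is a hypothetical helper.
SCREEN_GUIDANCE = (
    "screenInfo and screenshot are required for you to move the mouse to the "
    "correct position. Take a screenshot after each action to be sure you "
    "execute correctly."
)


def with_screen_guidance(prompt: str) -> str:
    """Return the prompt with the guidance appended at the end, at most once."""
    base = prompt.rstrip()
    if base.endswith(SCREEN_GUIDANCE):
        return base  # already present, so do not duplicate it
    if not base:
        return SCREEN_GUIDANCE
    return f"{base}\n\n{SCREEN_GUIDANCE}"


if __name__ == "__main__":
    print(with_screen_guidance("Open a browser and search for the weather in Berlin."))
```

Keeping the guidance as the last thing in the prompt follows the advice above, and the duplicate check means the helper is safe to call more than once on the same prompt.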