aiComputerUse | prompt | var1

The aiComputerUse command integrates mouse and keyboard control of Anthropic Claude Computer Use into our Open-Source RPA software. This makes the Ui.Vision browser extension an easy way to demo and use this technology on our Mac, Linux or Windows machine.

How to use Claude computer use

To get started, install the Ui.Vision Chrome extension and the XModules. Then run one of the demo macros that ship with the software. For example the "CU_PlayTicTacToe" macro is fun: Sit back and watch your Ui.Vision browser extension to play a game of Tic Tac Toe all by itself. Or run CU_FillForm, and see how you can now automate form filling without specificing each field.


Ui.Vision plays TicTacToe with the help of Anthropic Computer Use.

Syntax

The syntax is easy: aiComputerUse | prompt | var1. The last response of Claude is stored in the variable. See also the other AI commands.


Computer Use Demo 2: The aiComputerUse prompt is "Fill web form with artificial data".

Computer Use Insights from our Experience

Here are some insights we gained from using Computer Use (CU):

- CU was trained to move the mouse before clicking. This is often unncessary. Tell it "No mouse movements, only clicks". This saves time.

- Keep the screen (for desktop automation) or browser viewport (for browser automation) as small as possible. The smaller the screenshot area, the faster and cheaper the API response. Smaller screen sizes also improve the accuracy of the returned x,y values.

- Shorter prompter work often better than longer prompts. If you ask Claude for prompting help, it will write you long and often much too detailed prompts.

Prompts can be as easy as "Fill out this form with random data" or "Search for a flight". To get started, see the new Computer Use demo macros for example prompts that work. Or search our user forum for the tag "prompting".

- In general, we find that - at least for now - Computer Use works better for browser automation than desktop automation. Google's Project Jarvis team seems to have realized this as well.

What is the difference between aiScreenXY and aiComputerUse

The aiScreenXY command is optimized for finding items in screenshots. Compared to aiComputerUse, this is a very narrow task. But unlike aiComputerUse the output of aiScreenXY is very predictable. Plus, aiScreenXY autotomatically stores x,y values in the ${!ai1} and ${!ai2} variables for further use with XClick and XMove.

Anthropic Claude and Computer Use Demo macros
Test Computer Use and other AI commands with the demo macros.

Anything wrong or missing on this page? Suggestions?

...then please contact us.

Fresh from the Ui.Vision Forum: The Latest 3 RPA Software Discussions

← Meet the Ui.Vision team and users on our RPA software forum.