API Reference

This section documents all public classes, methods, and functions provided by the CUA SDK.

ComputerAgent

Class: ComputerAgent

The main interface for creating and controlling agents.

Constructor

ComputerAgent(name: str = "Merlin", mode: str = "interactive", config: dict | None = None)

Methods

run_task(task: str) -> Any

Runs the task described by task, a natural-language instruction, and returns its result.

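# Create an agent with the default name, mode, and config.
agent = ComputerAgent()
# Pass the task as a plain-language instruction; the return type is Any.
result = agent.run_task("Open Notepad and type 'Hello'")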

stop() -> None

Stops the agent and cleans up resources.

agent.stop()
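
Because stop() releases resources, it is worth calling even when run_task raises. The following is a minimal sketch of one way to guarantee that, using a context manager. Two things in it are assumptions rather than part of the SDK: the ComputerAgent class shown here is a stand-in stub that only mirrors the signatures documented above (so the example runs without the SDK installed), and managed_agent is a hypothetical helper name.

```python
from contextlib import contextmanager
from typing import Any, Iterator, Optional


class ComputerAgent:
    """Stand-in stub, NOT the real SDK class; it mirrors the documented signatures only."""

    def __init__(self, name: str = "Merlin", mode: str = "interactive",
                 config: Optional[dict] = None) -> None:
        self.name = name
        self.mode = mode
        self.config = dict(config or {})  # copy so instances never share a dict
        self.stopped = False

    def run_task(self, task: str) -> Any:
        # The real agent would carry out the task; the stub just echoes it.
        return f"{self.name} ran: {task}"

    def stop(self) -> None:
        # The real agent would release its resources here.
        self.stopped = True


@contextmanager
def managed_agent(**kwargs: Any) -> Iterator[ComputerAgent]:
    """Hypothetical helper: yield an agent and always call stop() on exit."""
    agent = ComputerAgent(**kwargs)
    try:
        yield agent
    finally:
        agent.stop()  # runs on normal exit and when the body raises


with managed_agent(name="Merlin") as agent:
    result = agent.run_task("Open Notepad and type 'Hello'")
```

With the real class, the same try/finally shape should apply: create the agent, call run_task inside the try body, and call stop() in the finally clause.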

Docker Utilities

If using Docker, the following scripts/utilities are available:

Configuration Utilities

Additional Functions

Document any other public functions or utilities here as your SDK evolves.


For more usage examples, see Usage Examples.