BrowserAutomation
Natural language browser automation via Playwright. Navigate, click, type, screenshot — describe what you want, no CSS selectors needed.
Log in once, sessions persist in ~/.co/browser_profile/. Uses a vision LLM to find elements by description.
Installation
code
Quick Start
With an agent
main.py
Direct usage
main.py
API Reference
Navigation
- go_to(url)
- get_current_url()
- get_text()
- get_links_from_page(filter?)
Interaction
- click(description)
- keyboard_type(text)
- keyboard_press(key)
- scroll(times?, description?)
Screenshot
- take_screenshot(path?, full_page?)
- set_viewport(width, height)
Waiting
- wait(seconds)
- wait_for_element(description)
- wait_for_text(text)
- wait_for_manual_login(site)
Persistent Sessions
Log in once — cookies and sessions persist to ~/.co/browser_profile/ automatically:
main.py
Screenshots
main.py
