Powered by NVIDIA NIM
AI that browses
the web for you
Give the agent a task in plain English. It plans it with a 120B model, executes it in a real browser, and streams every action live to your screen.
Six NVIDIA Models. One Agent.
Planner
nemotron-super-120b
Executor
llama-3.1-nemotron-70b
Fallback
llama-nemotron-8b
Embeddings
nv-embedqa-1b
Reranking
nv-rerankqa-4b
Vision
llama-3.2-vision
How It Works
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β USER TASK (natural language) β
ββββββββββββββββββββββββββββ¬ββββββββββββββββββββββββββββββββββββββββ
β
ββββββββββΌβββββββββ
β PLANNER AGENT β β nemotron-super-120b
β (120B model) β Breaks task into steps
ββββββββββ¬βββββββββ
β JSON plan
ββββββββββΌβββββββββ
β EXECUTOR AGENT β β llama-3.1-nemotron-70b
β (70B model) β Converts steps β actions
ββββββββββ¬βββββββββ
β browser commands
ββββββββββββββββΌβββββββββββββββ
β PLAYWRIGHT BROWSER β
β navigate Β· click Β· type β
β scroll Β· screenshot Β· wait β
ββββββββ¬βββββββββββββββ¬βββββββββ
β β
ββββββββββββΌβββ ββββββββΌβββββββββ
β VISION AGENTβ β MEMORY AGENT β
β (vision mdl)β β (embeddings) β
β screenshot β β store+recall β
β analysis β β past steps β
βββββββββββββββ βββββββββββββββββ
β
ββββββββββΌβββββββββ
β WEBSOCKET β β Live stream to browser
β STREAMER β
βββββββββββββββββββ