IBM CUGA

IBM's Computer using generalist agent ( CUGA ) is capable of achieving a wide range of tasks across different domains. By leveraging advanced capabilities, it can seamlessly operate on the web, interact with desktop applications, and integrate with APIs. Much like a human adapting to various activities, this agent does not require specialized programming for each task, making it highly versatile and efficient.

Benchmark Results

Name Accuracy Trajectories Link
Webarena 61.7% View Trajectories

Video Demos

Find a subreddit focused on topics related to NYC, and post my question, "is car necessary" there

What is the duration required to first walk from Univ of Pittsburgh to starbucks on Craig Street, and then drive to Pittsburgh International Airport?

Delete all reviews from scammer carlo

Invite Jakub K, Alex Dills, Alex Hutnik and BenoƮt Blanchon as collaborator to my time tracking tool project repo

Create a repo named nolan_old_fans with movies directed by Christopher Nolan before 2010 in a README file