When you purchase through links on our site, we may earn an affiliate commission.Heres how it works.

The Hugging Face team to date has totalled a very credible 55.15% average.

These scores reflect an agents ability to autonomously cope with 450 non-trivial questions and solve them.

ChatGPT search interface

Weve quickly gone up from the previous SoTA…to ourcurrent performance of 55.15% on the validation set.

The Hugging Face initiative is a work in progress.

The developers are inviting the public to contribute to help build out the final product.

A graphic showing the Hugging Face smiley face next to text saying “Open Deep-Research”

Those who are interested in seeing what progress has been made so far, should visit thedemo sitedirectly.

This is a Hugging Face Space, but it is acting like extremely overloaded at the moment.

When I tried to run an agent I received an warning pop-up, which Im assuming reflects teething problems.

Arrow

Still this is definitely a project worth watching.

More from Tom’s Guide

Arrow

Apple 13" MacBook Air (M3,…

Lenovo Chromebook Duet 3…

ASUS Zenbook S 13 OLED Laptop…

ASUS

Asus ROG Zephyrus G14 2023

Best Buy

Lenovo IdeaPad Duet 3

Macbook pro

Apple MacBook Pro (2024) 14.2…

P.C. Richard & Son

Apple 2024 MacBook Pro Laptop…

Apple 13" MacBook Air (M3,…

ASUS Zenbook S 13 OLED…

DeepSeek R1 illustrations

alibaba image on mobile

Perplexity deep research screenshot

iPhone 16 Pro Max, Galaxy S25 Ultra and Pixel 9 Pro

ChatGPT generated image

Nintendo Switch 2

Motorola Razr 2024 Review.

Close-up of the Samsung Galaxy Watch 7 on a user�s wrist next to the Garmin Vivoactive 6

A person holding a Nintendo Switch 2 playing Mario Kart World

Carhartt apparel for men and women