To date, the Hugging Face team has achieved a very credible average score of 55.15%.
These scores reflect an agent's ability to autonomously work through and solve 450 non-trivial questions.
We've quickly gone up from the previous SoTA… to our current performance of 55.15% on the validation set.
The Hugging Face initiative is a work in progress.
The developers are inviting the public to contribute and help build out the final product.
Those interested in seeing the progress made so far should visit the demo site directly.
This is a Hugging Face Space, but it appears to be extremely overloaded at the moment.
When I tried to run an agent, I received a warning pop-up, which I'm assuming reflects teething problems.
Still, this is definitely a project worth watching.