Proposed New Test of AI Capabilities:)

When Microsoft can turn a fleet of LLMs loose on the Azure UX, and Google can do the same for the Google Adwords UX, and reduce their level of dreadfulness substantially, it will go a long way to showing that frontier models are as good as claimed.

1 points | by VikRubenfeld 2 hours ago

0 comments