Better link: https://iquestlab.github.io/ But yes, sadly it looks like the agent...

denysvitali · 2026-01-03T10:54:33 1767437673

According to https://github.com/IQuestLab/IQuest-Coder-V1/issues/14#issue... the result is still good after fixing the cheating problem: 76.2% (down from 81.4%), which still beats Opus 4.5 (74.4%)!

ipython · 2026-01-03T13:14:50 1767446090

Unfortunately, they seem to have neglected to update their front-page README with this information, so it continues to mislead people: https://github.com/IQuestLab/IQuest-Coder-V1

anamexis · 2026-01-03T16:53:04 1767459184

It is updated on their actual home page, though. There is clearly no intent to mislead people.

alexpop80 · 2026-01-05T12:08:46 1767614926

What do you mean? Opus 4.5 and GPT 5.2 broke the 80% mark, and no other models seem to have passed this important milestone yet.

s-macke · 2026-01-03T07:17:05 1767424625

The link didn’t get enough votes a few days ago.

denysvitali · 2026-01-03T07:23:31 1767425011

I know - I posted it :)