Hacker News

I recently thought of a related question, though I'm almost certain foundation model trainers have already considered it: to what extent are popular modern benchmarks (or any references to them, descriptions of them, etc.) being scrubbed from the training data? Or are popular benchmarks designed so that they can be re-parametrized for each run? Either way, it seems like a surprisingly hard problem to deal with.
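
To make the re-parametrization idea concrete, here is a minimal sketch of what it could look like. It is purely illustrative: `make_item`, the seed scheme, and the question template are all hypothetical, and I'm not claiming any real benchmark works this way. The point is just that the surface form (the numbers) changes with each run while the underlying skill being tested stays fixed, so a memorized answer from a leaked copy wouldn't help.

```python
import random


def make_item(seed):
    """Build one arithmetic word problem whose numbers depend on the seed.

    Hypothetical illustration only: not taken from any real benchmark.
    """
    rng = random.Random(seed)  # local RNG, so a given seed always yields the same item
    apples = rng.randint(10, 99)
    eaten = rng.randint(1, 9)
    question = f"Sam has {apples} apples and eats {eaten}. How many are left?"
    answer = apples - eaten
    return question, answer


if __name__ == "__main__":
    # A fresh seed per evaluation run gives a fresh variant of the "same" problem.
    for run_seed in (1, 2, 3):
        question, answer = make_item(run_seed)
        print(run_seed, question, answer)
```

Of course this only handles the easy case where the problem is a template. It does nothing for benchmarks whose items can't be regenerated, which is where scrubbing the training data would have to do the work.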

