Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
I'm sorry, but those are vanity evals (hex.tech)
9 points by izzymiller on April 14, 2025 | hide | past | favorite | 1 comment


Stoked to get to publish some of our private eval results and a bit of the behind the scenes of our framework! We've been using this approach for almost a year and found it extremely high leverage for making meaningful improvements to the AI parts of our product




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: