Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Teams I work with use the abstain rate to flag what goes to a human. Disagreement between models is the same idea. Your 67% is what makes "two cheap models, escalate when they fight" actually work. Without abstain it mostly looks like noise.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: