Given that HellaSwag performance seems to correlate with reasoning ability more ... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		cosmojg on June 18, 2023 \| parent \| context \| favorite \| on: Falcon LLM – A 40B Model Given that HellaSwag performance seems to correlate with reasoning ability more than other benchmarks, Falcon certainly look promising! Hopefully this is a clean result and not the product of dataset contamination.

avereveard on June 18, 2023 [–]

I've given it a try, to having a chat is good, to follow langchain prompts it's not.

I guess it depends on the type of work you want to extract from it.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact