Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> Try coming up with a dataset that doesn’t have any copyrighted material in them.

Isn't this what Mistral AI did?



Did they? That'd be interesting to take a look at. Do they publish contents of their dataset?


The RAW Weights here: https://docs.mistral.ai/models/




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: