Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
oofbey
3 months ago
|
parent
|
context
|
favorite
| on:
The Dragon Hatchling: The missing link between the...
Attention mechanisms are wonderfully interpretable as is. You can literally see which tokens each token is attending to. People don’t bother much these days. But that’s not a strong selling point.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: