
I mean, that's really the mystery of it. One of the most notable advances of recent LLMs has been emergent, one-shot, and highly-contextually-aware behaviors that seem to only manifest in models with extremely large numbers of parameters.

Clearly the latent space of the model is able to encode some sort of reasoning about how time flows, typical information about houses, how to transform and format information in a JSON snippet, etc. That's the "magic" of it all: amazing and powerful emergent behaviors from billions of weights.

Of course, that's also the limitation. They're opaque and incredibly difficult (impossible?) to inspect and reason about.



“seem to only manifest in models with extremely large numbers of parameters”

You can get the same kind of behavior from models trained on your laptop in a few seconds. The trick is to let the model use attention over what it has learned; the same mechanism also works on images and other types of data.

The benefit of a large parameter count is more that you can train in parallel, train faster, and "remember" more from the data.
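The attention mechanism referred to above can be sketched in a few lines of NumPy. This is a toy single-head version with random weights, not any particular model's implementation; the shapes and variable names are made up for illustration:

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """Single-head scaled dot-product self-attention over a sequence X."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)  # pairwise token affinities
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over rows
    return weights @ V  # each token's output mixes in its context

rng = np.random.default_rng(0)
X = rng.normal(size=(5, 8))  # 5 "tokens", each an 8-dim embedding
Wq, Wk, Wv = (rng.normal(size=(8, 8)) for _ in range(3))
out = self_attention(X, Wq, Wk, Wv)
print(out.shape)  # one contextualized vector per input token
```

Because attention is input-agnostic (it only sees sequences of vectors), the same code applies to image patches or any other embedded data.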


Thanks for the pointer.

Any idea how clever the wrapper around these things is? For example, would OP's use case simply get forwarded to the neural network as one single input (a string of words), or is there some clever preprocessing going on?


The only significant bit of preprocessing by the language model is tokenization. You can see how it works here: https://beta.openai.com/tokenizer
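To make that concrete: tokenization maps raw text to integer IDs before the network ever sees it. Real tokenizers (like the BPE tokenizer behind the link above) learn their subword vocabulary from data; the tiny vocabulary below is invented purely to show the greedy longest-match idea:

```python
# Toy subword vocabulary (hypothetical; real vocabularies have ~50k entries).
VOCAB = {"un": 0, "believ": 1, "able": 2, "!": 3, "<unk>": 4}

def tokenize(text: str) -> list[int]:
    """Greedy longest-match segmentation of text into vocab IDs."""
    ids = []
    i = 0
    while i < len(text):
        for j in range(len(text), i, -1):  # try longest piece first
            if text[i:j] in VOCAB:
                ids.append(VOCAB[text[i:j]])
                i = j
                break
        else:
            ids.append(VOCAB["<unk>"])  # no match: emit unknown token
            i += 1
    return ids

print(tokenize("unbelievable!"))  # → [0, 1, 2, 3]
```

The model itself only ever sees these ID sequences; everything else (the prompt string, the JSON in OP's example) is just text from its point of view.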



