Hacker News | sgd99's comments

I was just thinking of getting into RL and had made some progress with Q-learning. Perfect timing!


I love this: "Our singular focus means no distraction by management overhead or product cycles, and our business model means safety, security, and progress are all insulated from short-term commercial pressures."


SSI here is "Safe SuperIntelligence Inc."


Participating in the discussion on this page, the GP presumably knew that. They were talking about what it means for them.


Can anyone explain how these visual tokens, which are concatenated with the tokenizer outputs for the encoder, are created?


I was a little confused about this too. The authors say in the paper:

"The outputs of the ViT image encoder before pooling form the visual tokens, which are linearly projected and prepended to the embedded input text tokens."

I took a look at the HuggingFace implementation of ViT [1]. After the ViT encoder blocks there's a layer norm and then a pooling layer (line 595), where the pooling layer involves taking the first token output from the layer norm and running it through a dense layer. So, it looks like in PaLI-3 the tokens are the hidden states output by the layer norm after the ViT encoder blocks.

[1] https://github.com/huggingface/transformers/blob/main/src/tr...
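To make that reading concrete, here's a rough sketch in plain Python of what "linearly projected and prepended" could look like. Everything here is a toy assumption on my part: the sizes, the random matrices standing in for the ViT hidden states and the text embeddings, and the bias-free projection `w_proj`. It isn't the PaLI-3 or HuggingFace code, just the shape of the idea.

```python
import random


def linear(x, w):
    """Apply a bias-free linear map row by row: (n, d_in) x (d_in, d_out) -> (n, d_out)."""
    return [[sum(xi * wi for xi, wi in zip(row, col)) for col in zip(*w)] for row in x]


def rand_matrix(rows, cols, rng):
    """Random matrix used as a stand-in for real activations or weights."""
    return [[rng.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)

# Toy sizes (all hypothetical): image tokens, ViT width, text-model width, text tokens.
num_patches, d_vit, d_model, num_text = 4, 8, 6, 3

# Stand-in for the per-token ViT hidden states taken before the pooling step,
# i.e. one vector per token rather than a single pooled vector.
vit_hidden = rand_matrix(num_patches, d_vit, rng)

# Assumed linear projection from the ViT width to the text embedding width.
w_proj = rand_matrix(d_vit, d_model, rng)
visual_tokens = linear(vit_hidden, w_proj)

# Stand-in for the embedded input text tokens.
text_embeds = rand_matrix(num_text, d_model, rng)

# Prepend the visual tokens to the text embeddings along the sequence axis.
encoder_input = visual_tokens + text_embeds

print(len(encoder_input), len(encoder_input[0]))
```

The point is that the pooling layer would collapse the image to one vector, while taking the states before pooling keeps one token per patch, so the encoder sees a sequence of `num_patches + num_text` vectors.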


thank you!


Not self-hosted/local, but from what I've heard Claude by Anthropic is really good, though the API is not publicly available. It's apparently accessible via Poe (https://poe.com).

As for open models, HuggingFace has a nice leaderboard to see which ones are decent: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderb...


It's good that they chose to continue support for code-davinci-002 (https://twitter.com/sama/status/1638576434485825536?s=20), but it'd be much better if they open-sourced it sooner or later, as even OpenAI didn't expect the model to be so widely used.


or just pay for the photos instead?



Pretty impressive! Plus it's open source, and we can try it out on our own supported car or in a simulator. CommaAI has come a long way.


You can try Vim within VSCode with this extension: https://marketplace.visualstudio.com/items?itemName=vscodevi...

It's the best of both worlds

