I wanted to build this for myself. Could never figure out how to get audio output from Mac. Tried almost all audio loopback driver (Blackhole, Soundflower ...). There was problem everywhere wrt security.
Even tried making a teams meeting bot. But Teams doesn't give live audio to developer unless you are a special partner.
One of the things I would want to do is - As the meeting is going on - I would like to ask a LLM what questions I could ask at that point in time. Especially if it's a subject I am not expert in.
Would I be able to create an extension that could do this?
you can definitely do that in the future. but we had that on our mind as well from multiple requests - planning to add "eli5 - explain like i'm five" and "mmss - make me sound smart" ;)
(edit: grammar fix)
i think there are tools like cluely - where they propose to "cheat" on everything in real-time. or just wearables like waves that shows ar displays with real-time assist. (i've never used both of them before, but i understood their products like this) so proactive ai agents are somewhat becoming a thing I guess. but it all boils down to privacy for us.
mmss was something that a lot of users suggested - they wanted to be saved from public humiliation
System audio (i.e. Zoom calls) can be captured only on Chromium browsers on Windows and ChromeOS when sharing the entire screen; tab audio has wider OS support; Safari and Firefox do not support system or tab audio capture https://addpipe.com/docs/recording-client/screen-recording/#...
The MDN support table does not differentiate in this regard. (le: it actually does if you click to see the implementation notes)
Image in the readme would really be helpful.
In fact anytime there is a visual output it makes sense to put an image.
Thanks for creating this though - Will give it a try for an upcoming project.
Even tried making a teams meeting bot. But Teams doesn't give live audio to developer unless you are a special partner.
Glad you made this. Will play around