Hey this is cool. I've been exploring something similar. I was stuck in some similar issues. Would love to talk to you.
I really believe that Google Meet, Zoom, etc., should be optimized for the agents so that anytime you are communicatin with each other, agents can seamlessly capture data. Right now, it's very hard to capture that kind of data from Zoom, from Google Meet, etc.
Even agents should be able to communicate and call each other.
This is really creative video and it got me thinking is it possible to edit such videos using AI.
The above video contains two parts one is bike crashing and the other is transition to shop advertisement.
The Editor can ask" Take this two videos and combine them together add the transition from bike crash to and entry into shop"
The challenge over here is to figure out what all extra frames needs to be removed from the two videos.
Figuring out what are the two frames in the respective videos that will be merged.
The first challenge is still doable. I can analyse the frames and see which frames shows the bike crashing and which frame shows the man running.
The major challenge how will I merge them so that the transition looks smooth.
I am former Tech co-founder of Zenlor. I can build product from scratch by talking to customers and coding them. I have worked in companies like Directi, Blooreach, Microsoft and have built and launched 11 products in the past 4-5 months and have close to 4000 users using them.
I value low ego, growth mindset, shipping fast culture.
I have been using vimium for some time now and it has become my tool for navigating around the webpages. It's so much easier to navigate around webpages.
But as soon as I go out of the browser, I have to start using mouse again and it becomes very irritating.
Is there vimium equivalent for Desktop?
If No, How difficult will be to make something like that?
I really believe that Google Meet, Zoom, etc., should be optimized for the agents so that anytime you are communicatin with each other, agents can seamlessly capture data. Right now, it's very hard to capture that kind of data from Zoom, from Google Meet, etc.
Even agents should be able to communicate and call each other.
reply