This is the repo for the Video-LLaMA project, which aims to give large language models video and audio understanding capabilities. Video-LLaMA is built on top of BLIP-2 and MiniGPT-4.
Popular GitHub repos like Microsoft’s “Generative AI for Beginners” and “LLMs from Scratch” teach modern AI concepts step by ...