Caption video in real time with FastVLM
Generate images with Stable Diffusion 3.5 (SD3.5)
Interact with a multimodal chatbot that analyzes images and text