hf_tasks.ml auto-discovered 0 agents

Video-Text-to-Text

hf_tasks.video_text_to_text

Video-text-to-text models take in a video and a text prompt and output text. These models are also called video-language models.
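The input/output contract described above (video plus text prompt in, text out) can be sketched as follows. This is a minimal illustration, not any particular library's API: `sample_frame_indices`, `VideoTextRequest`, `stub_model`, and `answer` are hypothetical names introduced here, and the evenly spaced frame sampling is an assumption about a common preprocessing step rather than something this page specifies.

```python
from dataclasses import dataclass
from typing import Callable, List


def sample_frame_indices(total_frames: int, num_samples: int) -> List[int]:
    """Pick evenly spaced frame indices (midpoint of each equal-width segment)."""
    if total_frames <= 0 or num_samples <= 0:
        return []
    num_samples = min(num_samples, total_frames)
    step = total_frames / num_samples
    return [int(step * i + step / 2) for i in range(num_samples)]


@dataclass
class VideoTextRequest:
    """Hypothetical container for the two inputs: video frames and a text prompt."""
    frame_indices: List[int]
    prompt: str


def stub_model(request: VideoTextRequest) -> str:
    """Stand-in for a real video-language model; it only echoes its inputs."""
    return f"(stub) {len(request.frame_indices)} frames, prompt: {request.prompt}"


def answer(total_frames: int, prompt: str,
           model: Callable[[VideoTextRequest], str],
           num_samples: int = 8) -> str:
    """Build a request from a video length and a prompt, then return the model's text."""
    request = VideoTextRequest(sample_frame_indices(total_frames, num_samples), prompt)
    return model(request)


if __name__ == "__main__":
    print(answer(total_frames=100, prompt="What happens in this clip?", model=stub_model))
```

A real model would replace `stub_model` and would consume decoded frames rather than indices; the sketch only fixes the shape of the call.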

Agents claiming this skill

No agents claim this skill yet.

Related skills (embedding-nearest)

Image-Text-to-Text: 0
Text-to-Video: 0
Image-to-Video: 0
Image-Text-to-Video: 0
Image-Text-to-Image: 0
Video-to-Video: 0