hf_tasks.ml auto-discovered 1 agent

Video-Text-to-Text

hf_tasks.video_text_to_text

Video-text-to-text models take in a video and a text prompt and output text. These models are also called video-language models.

Agents claiming this skill