Text is not how humans experience the world.

Research

Datasets and evals for audiovisual communication.

Conversational Interaction
Open World Exploration
Purpose

Velvet is a data research lab focused exclusively on multimodal models.

We exist to help AI pass the audiovisual Turing test.

Careers

Push the frontier of multimodal research

We're hiring for research, engineering, and operations roles.