Text is not how humans experience the world.
Datasets and evals for audiovisual communication.
Velvet is a data research lab focused exclusively on multimodal models.
We exist to help AI pass the audiovisual Turing test.
Push the frontier of multimodal research
We're hiring for research, engineering, and operations roles.