Who We Are
At Twelve Labs, we are pioneering the development of frontier multimodal foundation models that can see, hear and understand the world as humans do. Our models have redefined the standards in video-language modeling, allowing developers to build programs with state-of-the-art semantic search, summarization and analysis capabilities.
Twelve Labs has raised $107 million in Seed + Series A funding from world-class VC & corporate partners: NVIDIA, NEA, Radical Ventures, Index Ventures, Snowflake and Databricks. Our advisory team features AI visionaries and founders such as Fei-Fei Li, Silvio Savarese, Alexandr Wang and more. Headquartered in San Francisco, with an influential APAC presence in Seoul, our global footprint underscores our commitment to driving worldwide innovation.
About The Role
As a Software Engineer, Data at Twelve Labs, you will build core data infrastructure for acquiring, preprocessing, cleaning, filtering, and labeling multimodal text-vision datasets for model training. In this role, you will have a larger impact on the quality of our models than perhaps any other engineering role at the entire company: well filtered & labeled data is core to everything we do. This role is a perfect fit for distributed systems engineers who want to advance video understanding by delivering world class systems for unstructured multimodal corpora.
In This Role, You Will:
Acquire, filter, label (leveraging techniques like RLAIF), and sanitize large-scale vision-language datasets for LLM/VLM pretraining
Scale our data systems to enable our evolution from double-digit to triple-digit billion parameter models (and beyond!)
Establish strong relationships with 3rd party data vendors and human-in-the-loop data labeling services
Build the highest impact, not the flashiest, libraries and services
Work across teams to understand and manage project priorities and product deliverables, evaluate trade-offs, and drive technical initiatives from ideation to execution to shipment
You May Be A Good Fit If You Have:
6+ years of industry experience
Strong experience as a backend and/or data engineer, with an interest in ML/AI systems
Managed data acquisition for large generative or contrastive models
Strong Python expertise and considerable prior work history with at least one statically typed language (we use Golang)
Strong Candidates May Also Have Experience:
Working as a technical lead
Building model-bootstrapped language or vision-language datasets (RLAIF, etc.)
Working with FFmpeg or other high performance image/video processing libraries
A PhD, or a Master's degree, in machine learning or a closely related discipline
Interview Process
Recruiter Phone Screen
Initial Technical Assessment
Technical Interview 1: Coding
Technical Interview 2: System Design & Project Deep Dive
Final Interview: Culture
Even if there are a few checkboxes that aren’t ticked through your prior experience, we still encourage you to apply! If you are a 0-1 achiever, a ferocious learner, and a kind and fun team player who motivates others, you will find a home at Twelve Labs.
We are a global company that values the uniqueness of each person’s journey. It is the differences in our cultural, educational, and life experiences that allow us to constantly challenge the status quo. We are looking for individuals who are motivated by our mission and eager to make an impact as we push the bounds of technology to transform the world. Join us as we revolutionize video understanding and multimodal AI.
Benefits and Perks
🤝 An open and inclusive culture and work environment.
🧑💻 Work closely with a collaborative, mission-driven team on cutting-edge AI technology.
🦷 Full health, dental, and vision benefits.
✈️ Flexible PTO and parental leave policy. Office closed the week of Christmas and New Years.
🛂 VISA support (such as H1B and OPT transfer for US employees).
...A not-for-profit integrated system in North Carolina is adding a full-time BC/BE Psychiatrist to their practice in Thomasville, NC .... ...to working with geriatric population (memory specific group) Epic EHR utilized system-wide Recruitment Package Employed position...
Seeking a BE/BC Gastroenterologist to join a well-established multi-specialty group. Work in a state-of-the-art clinic, endo suites, & OR's. Due to the ongoing demands & excellent referral system, new physician will be busy from day one. Enjoy an experienced staff & supportive...
...Hardware Support Security Android Bash Linux Network OpenAPI Perl Python Ruby Unix More: At Leidos in San Diego, CA, I am passionate about making a difference. I am looking for a Senior Systems Engineer to join my team focused on...
We're looking for a virtual assistant who can work remotely with our team on varied administrative tasks. The ideal candidate is someone who is very organised, pays attention to detail, and can work under tight deadlines. Similar job experience is not required, but we do...
...outreach through participation in system, division, and department programs and events. Requirements Education:Bachelors Degree in Exercise Physiology required; Masters degree in Exercise Physiology preferred; Concentration in Cardiac Rehab / Clinical Exercise...