How Twelve Labs Teaches A.I. to ‘See’ and Transform Video Understanding: Interview

Date:

Revolutionizing Video Understanding: How Twelve Labs is Teaching A.I. to “See”

Sure, the score of a football game is important. But sporting events can also foster cultural moments that slip under the radar—such as Travis Kelce signing a heart to Taylor Swift in the stands. While such footage could be social-media gold, it’s easily missed by traditional content tagging systems. That’s where Twelve Labs comes in.

Every sports team or sports league has decades of footage that they’ve captured in-game, around the stadium, about players. However, these archives are often underutilized due to inconsistent and outdated content management. “To date, most of the processes for tagging content have been manual,” Soyoung Lee, co-founder and head of GTM at Twelve Labs, told Observer.

Photo by Vaughn Ridley/Web Summit via Sportsfile via Getty Images

Soyoung Lee, co-founder and head of GTM at Twelve Labs, pictured at Web Summit Vancouver 2025.

Unlocking the Value of Video Content

Twelve Labs, a San Francisco-based startup specializing in video-understanding A.I., wants to unlock the value of video content by offering models that can search vast archives, generate text summaries and create short-form clips from long-form footage. Its work extends far beyond sports, touching industries from entertainment and advertising to security.

“Large language models can read and write really well,” said Lee. “But we want to move on to create a world in which A.I. can also see.” Twelve Labs has raised more than $100 million from investors such as Nvidia, Intel, and Firstman Studio, the studio of Squid Game creator Hwang Dong-hyuk.

Expertise and Applications

Twelve Labs counts thousands of developers and hundreds of enterprise customers. Demand is highest in entertainment and media, spanning Hollywood studios, sports leagues, social media influencers and advertising firms that rely on Twelve Labs tools to automate clip generation, assist with scene selection or enable contextual ad placements.

Government agencies also use the startup’s technology for video search and event retrieval. Beyond its work with the U.S. and other nations, Lee said that Twelve Labs has a deployment in South Korea’s Sejong City to help CCTV operators monitor thousands of camera feeds and locate specific incidents. To reduce security risks, the company has removed capabilities for facial and biometric recognition, she added.

Addressing Job Concerns and Democratizing Creative Work

Many of the industries Twelve Labs serves are already debating whether A.I. threatens human jobs—a concern Lee argues is only partly warranted. “I don’t know if jobs will be lost, per se, but jobs will have to transition,” she said, comparing the shift to how tools like Photoshop reshaped creative roles.

If anything, Lee believes systems like Twelve Labs’ will democratize creative work traditionally limited to companies with big budgets. “You are now able to do things with less, which means you have more stories that can be created from independent creatives who do not have that same capital,” she said. “It actually allows for the scaling of content creation and personalizing distribution.”

Image Source: observer.com

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Share post:

Subscribe

Subscribe to get our latest news delivered straight to your inbox.

We don’t spam! Read our privacy policy for more info.

Popular

More like this
Related

Chad Baker-Mazara, USC’s main scorer, dismissed from males’s basketball group

USC Basketball Star Chad Baker-Mazara Dismissed from Program Amidst...

Jim Carrey interview at French movie awards shocks followers: ‘Impersonator’

Jim Carrey's Rare Red Carpet Appearance Sparks Speculation Comedian Jim...