Google’s new machine learning API recognizes objects in videos

At its Cloud Next conference in San Francisco, Google today announced the launch of a new machine learning API for automatically recognizing objects in videos and making them searchable.

The new Video Intelligence API will allow developers to build applications that can automatically extract entities from a video. Until now, most comparable recognition APIs available in the cloud have focused only on still images, but with the help of this new API, developers will be able to build applications that let users search and discover information in videos. That means you can search for “dog” or “flower,” for example.

Besides extracting metadata, the API allows you to tag scene changes in a video.

Those videos have to be stored in Google’s cloud storage service. You can see a demo of how this works here. If you are a developer, you can sign up for the private beta here.
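For developers wondering what a request might look like, here is a minimal sketch in Python that only builds the JSON body for an annotation call and does not contact Google. The field names `inputUri` and `features`, and the feature names `LABEL_DETECTION` and `SHOT_CHANGE_DETECTION`, follow the shape of the publicly documented REST interface and may differ in the private beta; the helper `build_annotate_request` and the bucket path are hypothetical.

```python
import json

# Feature names are assumptions based on the public REST interface;
# the private beta described in this article may use different names.
LABELS = "LABEL_DETECTION"        # entities such as "dog" or "flower"
SHOTS = "SHOT_CHANGE_DETECTION"   # scene changes within the video


def build_annotate_request(gcs_uri, features=(LABELS, SHOTS)):
    """Build the JSON body for a video annotation request (hypothetical helper)."""
    # The article notes that videos must live in Google's cloud storage service.
    if not gcs_uri.startswith("gs://"):
        raise ValueError("video must be stored in Google Cloud Storage (gs://...)")
    return {"inputUri": gcs_uri, "features": list(features)}


# Hypothetical bucket and object name, for illustration only.
body = build_annotate_request("gs://example-bucket/example-video.mp4")
print(json.dumps(body, indent=2))
```

Sending this body to the service would also require authentication and the correct endpoint for your API version, both of which are left out here.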

As Fei-Fei Li, chief scientist of AI and Machine Learning at Google Cloud, noted in today’s keynote, the world of pixels goes beyond images. Videos have long been a challenge for machine learning researchers. This new service, though, now makes extracting information from videos as easy as doing the same for images.

In addition, the Cloud Machine Learning Engine, the company’s tool for building custom machine learning models using its TensorFlow framework, is now generally available.

As Li noted in today’s keynote, the company wants to democratize the machine learning technologies it has developed in-house. The Vision API is another example of this.
