Microsoft's AI-based video metadata extraction service now generally available

Home » Microsoft

1 min. read

Published on September 17, 2018

by Pradeep Viswav

published on September 17, 2018

Share this article

Improve this guide

Readers help support MSpoweruser. We may get a commission if you buy through our links.

Microsoft Video Indexer is a cloud service that enables you to extract visual and speech metadata from your videos, which can be used to build enhanced search experiences in your existing apps. At Build developer conference last year, Microsoft first announced the public preview of the Video Indexer service. At IBC 2018 last week, Microsoft announced the general availability of Video Indexer service. Along with the information about GA, Microsoft announced the following new capabilities.

The Emotion recognition model which detects emotional moments in video and audio assets based on speech content and voice tonality.
A Topic inferencing model built to understand the high-level topics of the video or audio files based on spoken words and visual cues. Topics in this model are sourced from IPTC taxonomy among others to align to industry standards.
Enhanced celebrity recognition model which now covers one million faces based on commonly requested data sources such as IMDB, Wikipedia, and top LinkedIn influencers.

Learn more about this announcement here.

Pradeep Viswav

Software and Services Expert

Pradeep is a Computer Science and Engineering Graduate. He was also a Microsoft Student Partner. He is currently working in a leading IT company.

User forum

0 messages

Sort by:

Leave a Reply Cancel reply