VideoSpace's Video Search Engine is arguably one of world's most advanced video search engine because it is the first to combine the following search elements into a single video search platform:

  • Speech Recognition (up to 12 languages)
  • Words or Text (more than 25 languages)
  • Objects (detect objects)
  • Motion (detect specific zones in a video)
  • Faces (detect up to 64 faces in a single frame)
  • Emotion (detect 8 major emotions) 
  • Offensive Content (detect pornography, nudity, profanity, violence)
  • Custom Search (Logos, Landmarks, Objects, etc)

This combination makes our Video search Engine one most powerful in the world!

Current search engines can only search for "Title" and "Metadata" of your video. What if you want to search the content INSIDE the videos? Now you can with VideoSpace Search Engine! 

Currently, for speech recognition, we are able to index and search words spoken:

  • English
  • English (British)
  • Chinese
  • Spanish
  • Spanish (Mexican
  • French
  • German
  • Italian
  • Portuguese (Brazilian)
  • Arabic (Egyptian)
  • Japanese
  • Russian

Our Video Search Engine also has the option to utilizes Video OCR (Optical Character Recognition) to detect text content in video files so that we can index and search your media by text. Video OCR will enhance the discoverability of your video content. This is extremely useful in highly textual video, like a screen-capture of a video slideshow presentation.

Our Video OCR detects up to 26 languages, they are: Arabic, Chinese Simplified, Chinese Traditional, Czech, Danish, Dutch, English, Finnish, French, German, Greek, Hungarian, Italian, Japanese, Korean, Norwegian, Polish, Portuguese, Romanian, Russian, Serbian Cyrillic, Serbian Latin, Slovak, Spanish, Swedish, Turkish.


Our Search Engine automatically makes your media deeply searchable without the need for manually applied metadata. Using deep neural net (DNN)-based speech recognition technology from Microsoft Research, our engine converts digital audio into natural language and automatically extracts meaningful metadata from your media.


Innovative custom vocabulary adaptation

With its custom vocabulary adaptation, our engine consistently outperforms industry-standard speech transcription technology. Indexing medical lecture content?

Submit custom words like “aneurysm” alongside your job, and watch as we scours the Internet to include related words such as “hemorrhage” or “embolism” to its internal dictionary, dramatically increasing accuracy.


Auto-generated closed captions

Reduce the effort required to make your multimedia accessible by passing your content through our engine. Use the output caption file (in your preferred format) to provide closed captions for your users.



Our indexing engine can generate keywords from speech content in your multimedia and produce an XML file containing the frequency and time offset of each spoken keyword and other valuable data. Use the file to perform speech analytics, tag your content, or power a recommendation engine.