VideoSpace's Video Search Engine is arguably one of world's most advanced video search engine because it is the first to combine:

  • Speech Recognition (up to 12 major languages)
  • Video OCR (Optical Character Recognition - more than 25 major languages)
  • Translated Search (in more than 450 language pairs)

into a single video search platform. This combination makes it one of the world's most powerful video search engine.

Current search engines can only search for "Title" and "Metadata" of your video. What if you want to search the content INSIDE the videos? Now you can with VideoSpace Search Engine! 

Search the Spoken Word INSIDE your Video 

Currently, for speech recognition, we are able to index and search words spoken:

  • English
  • English (British)
  • Chinese
  • Spanish
  • Spanish (Mexican
  • French
  • German
  • Italian
  • Portuguese (Brazilian)
  • Arabic (Egyptian)
  • Japanese
  • Russian

Our Video Search Engine also has the option to utilizes Video OCR (Optical Character Recognition) to detect text content in video files so that we can index and search your media by text. Video OCR will enhance the discoverability of your video content. This is extremely useful in highly textual video, like a screen-capture of a video slideshow presentation.

Our Video OCR detects up to 26 languages, they are: Arabic, Chinese Simplified, Chinese Traditional, Czech, Danish, Dutch, English, Finnish, French, German, Greek, Hungarian, Italian, Japanese, Korean, Norwegian, Polish, Portuguese, Romanian, Russian, Serbian Cyrillic, Serbian Latin, Slovak, Spanish, Swedish, Turkish.


Our Search Engine automatically makes your media deeply searchable without the need for manually applied metadata. Using deep neural net (DNN)-based speech recognition technology from Microsoft Research, our engine converts digital audio into natural language and automatically extracts meaningful metadata from your media.


Innovative custom vocabulary adaptation

With its custom vocabulary adaptation, our engine consistently outperforms industry-standard speech transcription technology. Indexing medical lecture content?

Submit custom words like “aneurysm” alongside your job, and watch as we scours the Internet to include related words such as “hemorrhage” or “embolism” to its internal dictionary, dramatically increasing accuracy.


Auto-generated closed captions

Reduce the effort required to make your multimedia accessible by passing your content through our engine. Use the output caption file (in your preferred format) to provide closed captions for your users.



Our indexing engine can generate keywords from speech content in your multimedia and produce an XML file containing the frequency and time offset of each spoken keyword and other valuable data. Use the file to perform speech analytics, tag your content, or power a recommendation engine.