VideoSpace Search Engine is arguably the world's best video search engine, as it is the first to combine 8 search elements into a single video search platform:

  • Speech Recognition (up to 12 languages)
  • Words or Text (more than 25 languages)
  • Objects (detects over 10,000 objects)
  • Motion (detects motion in specific zones)
  • Faces (detects up to 64 faces in a single frame)
  • Emotion (detects up to 8 major emotions) 
  • Offensive Content (detects pornography, nudity, profanity, violence)
  • Custom Search (e.g. logos, landmarks, objects)

This combination makes VideoSpace Search Engine one of the most powerful video search engines in the world!

Current search engines can search only the title and metadata of your video. What if you want to search the content INSIDE the videos? Now you can, with VideoSpace Search Engine!

For speech recognition, we can index and search words spoken in 12 languages: English, English (British), Chinese, Spanish, Spanish (Mexican), French, German, Italian, Portuguese (Brazilian), Arabic (Egyptian), Japanese, and Russian. We are currently working on other languages and will release updates as they become available.

Our Video Search Engine can also use Video OCR (Optical Character Recognition) to detect text in video files, so that we can index and search your media by text. Video OCR enhances the discoverability of your video content. This is extremely useful for highly textual video, such as a screen capture of a slideshow presentation.

Our Video OCR detects text in 26 languages: Arabic, Chinese Simplified, Chinese Traditional, Czech, Danish, Dutch, English, Finnish, French, German, Greek, Hungarian, Italian, Japanese, Korean, Norwegian, Polish, Portuguese, Romanian, Russian, Serbian Cyrillic, Serbian Latin, Slovak, Spanish, Swedish, and Turkish.


Our Search Engine automatically makes your media deeply searchable without the need for manually applied metadata. Using deep neural net (DNN)-based speech recognition technology from Microsoft Research, our engine converts digital audio into natural language and automatically extracts meaningful metadata from your media.


Innovative custom vocabulary adaptation

With its custom vocabulary adaptation, our engine consistently outperforms industry-standard speech transcription technology.

Indexing medical lecture content? Submit custom words like “aneurysm” alongside your job, and watch as our engine scours the Internet to add related words such as “hemorrhage” or “embolism” to its internal dictionary, dramatically increasing accuracy.


Auto-generated closed captions

Reduce the effort required to make your multimedia accessible by passing your content through our engine. Use the output caption file (in your preferred format) to provide closed captions for your users.
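
To give a feel for what a caption file contains, here is a minimal Python sketch that writes timed transcript segments in WebVTT, one common caption format. The function names and the (start, end, text) segment layout are assumptions for illustration only; they are not the engine's actual output or API.

```python
def format_timestamp(seconds):
    """Convert a time in seconds to the WebVTT form HH:MM:SS.mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"


def to_webvtt(segments):
    """Build WebVTT text from (start_seconds, end_seconds, text) tuples.

    The segment layout is a hypothetical stand-in for whatever a
    transcription job returns.
    """
    lines = ["WEBVTT", ""]  # required header followed by a blank line
    for start, end, text in segments:
        lines.append(f"{format_timestamp(start)} --> {format_timestamp(end)}")
        lines.append(text)
        lines.append("")  # a blank line ends each cue
    return "\n".join(lines)


# Example segments (made-up content)
captions = to_webvtt([
    (0.0, 2.5, "Welcome to the lecture."),
    (2.5, 6.0, "Today we discuss aneurysms."),
])
print(captions)
```

A file like this can be attached to a web video player as a caption track.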



Our indexing engine can generate keywords from speech content in your multimedia and produce an XML file containing the frequency and time offset of each spoken keyword and other valuable data. Use the file to perform speech analytics, tag your content, or power a recommendation engine.
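
As a rough illustration of how such a keyword file might be used, the Python sketch below reads keyword frequencies and time offsets from XML and picks the most frequent keywords as tags. The element and attribute names (keywords, keyword, offset, text, frequency, seconds) are hypothetical; the real file layout may differ, so treat this as a sketch under those assumptions.

```python
import xml.etree.ElementTree as ET

# Hypothetical keyword file: the schema below is an assumption made
# for this example, not the engine's documented output.
SAMPLE_XML = """
<keywords>
  <keyword text="aneurysm" frequency="3">
    <offset seconds="12.5"/>
    <offset seconds="47.0"/>
    <offset seconds="101.2"/>
  </keyword>
  <keyword text="embolism" frequency="1">
    <offset seconds="63.8"/>
  </keyword>
</keywords>
"""


def load_keywords(xml_text):
    """Return {keyword: {"frequency": int, "offsets": [seconds, ...]}}."""
    root = ET.fromstring(xml_text.strip())
    index = {}
    for kw in root.findall("keyword"):
        offsets = [float(o.get("seconds")) for o in kw.findall("offset")]
        index[kw.get("text")] = {
            "frequency": int(kw.get("frequency")),
            "offsets": offsets,
        }
    return index


def top_tags(index, n):
    """Return the n most frequently spoken keywords, most frequent first."""
    return sorted(index, key=lambda k: index[k]["frequency"], reverse=True)[:n]


keyword_index = load_keywords(SAMPLE_XML)
print(top_tags(keyword_index, 1))
print(keyword_index["aneurysm"]["offsets"])
```

The offsets make it possible to jump straight to the moment a keyword is spoken, while the frequencies can feed tagging or a recommendation engine.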