Microsoft Project Oxford OCR
Introduction
Recently, Microsoft released a set of artificial-intelligence-based vision, speech, and language APIs. The vision set contains the Computer Vision, Face, Emotion, and Video APIs. The speech set contains the Speech, Speaker Recognition, and Custom Recognition Intelligent Service APIs. The language set contains the Spell Check, Language Understanding Intelligent Service, and Web Language Model APIs. Today, I'm going to test the optical character recognition (OCR) feature of the Computer Vision API.
Prerequisites
- A Microsoft account
- A subscription to Microsoft Project Oxford
Demo
Here is the demo site, which does not ship with a subscription key. You will need to fill in your own subscription key to make it work properly.
After you enter the subscription key and the image URL, the result is returned beneath.
Code
This simple one-page application lets the user enter the subscription key and the image URL, then sends a request to the Microsoft Project Oxford API using jQuery's ajax function. Some request headers need to be set before the AJAX (asynchronous HTTP) request is sent, including Content-Type and Ocp-Apim-Subscription-Key. When the result arrives, the string is appended below.
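The flow described above can be sketched roughly as follows. This is a minimal illustration, not the demo's actual source: the endpoint URL, the query parameters, the element ids (#key, #imageUrl, #submit, #result), the helper names buildOcrRequest and ocrResultToText, and the regions > lines > words response shape are all assumptions, so check them against the API reference for your subscription.

```javascript
// Assumed OCR endpoint; confirm the URL that matches your subscription.
var OCR_ENDPOINT = "https://api.projectoxford.ai/vision/v1/ocr";

// Hypothetical helper: builds the settings object passed to jQuery's $.ajax.
function buildOcrRequest(subscriptionKey, imageUrl) {
  return {
    url: OCR_ENDPOINT + "?language=unk&detectOrientation=true",
    type: "POST",
    // contentType sets the Content-Type request header.
    contentType: "application/json",
    headers: {
      "Ocp-Apim-Subscription-Key": subscriptionKey
    },
    // The image is passed by URL inside a JSON body.
    data: JSON.stringify({ url: imageUrl })
  };
}

// Hypothetical helper: flattens the assumed response shape
// (regions > lines > words) into plain text, one OCR line per text line.
function ocrResultToText(result) {
  return (result.regions || []).map(function (region) {
    return (region.lines || []).map(function (line) {
      return (line.words || []).map(function (word) {
        return word.text;
      }).join(" ");
    }).join("\n");
  }).join("\n\n");
}

// Wire up the form only when jQuery is loaded (i.e. in the browser).
if (typeof jQuery !== "undefined") {
  jQuery("#submit").on("click", function () {
    var settings = buildOcrRequest(
      jQuery("#key").val(),
      jQuery("#imageUrl").val()
    );
    jQuery.ajax(settings)
      .done(function (result) {
        // Append the recognized text below the form.
        jQuery("#result").append(jQuery("<pre>").text(ocrResultToText(result)));
      })
      .fail(function (xhr) {
        jQuery("#result").append(
          jQuery("<pre>").text("Request failed with status " + xhr.status)
        );
      });
  });
}
```

Keeping the request building and the response flattening in plain functions, separate from the jQuery wiring, makes both easy to try out without a browser or a valid key.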
Conclusion
A growing number of such services are offered by vendors such as Microsoft, Google, and IBM. We should embrace these tools to strengthen and enrich the functionality of our applications. An idiom comes to mind: don't reinvent the wheel.
Though reinventing the wheel is a great way to learn the background knowledge of a domain, it is more energy-saving and efficient to use these APIs.