Google's Cloud Vision API Will Allow for Cloud Based Image, Face and Emotion Detection

Saturday, December 5, 2015


Image Recognition

Google has released a cloud-based image recognition API that lets developers build applications that can see and, more importantly, understand the content of images. The company showed off the software with a simple robot that can recognize objects such as a banana, as well as a user's smiling face.


Google recently announced the launch of Cloud Vision, one of the company’s image recognition technologies. It is available to developers as an API in limited preview on the Google Cloud Platform.

"The uses of Cloud Vision API are game changing to developers of all types of applications and we are very excited to see what happens next," writes Ram Ramanathan, Product Manager for the Google Cloud Platform.

Google’s image recognition technology is one of the strongest around, applicable to many domains that include optical character recognition (OCR), face detection, and object recognition.

The Cloud Vision API quickly classifies images into thousands of categories, detects faces with associated emotions, and recognizes printed words in many languages. Developers using the Cloud Vision API will be able to build metadata for an image catalog, moderate offensive content, or enable new marketing scenarios through image sentiment analysis.

Google Cloud Vision


The following set of Google Cloud Vision API features can be applied in any combination on an image:

  • Label/Entity Detection picks out the dominant entity (e.g., a car, a cat) within an image, from a broad set of object categories. You can use the API to easily build metadata on your image catalog, enabling new scenarios like image based searches or recommendations.
  • Optical Character Recognition to retrieve text from an image. Cloud Vision API provides automatic language identification, and supports a wide variety of languages.
  • Safe Search Detection to detect inappropriate content within your image. Powered by Google SafeSearch, the feature enables you to easily moderate crowd-sourced content.
  • Facial Detection can detect when a face appears in photos, along with associated facial features such as eye, nose and mouth placement, and the likelihood of more than eight attributes such as joy and sorrow. Google says it does not support facial recognition and does not store facial detection information on any Google server.
  • Landmark Detection to identify popular natural and manmade structures, along with the associated latitude and longitude of the landmark.
  • Logo Detection to identify product logos within an image. Cloud Vision API returns the identified product brand logo, with the associated bounding polygon.
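
To make "any combination" concrete, here is a minimal Python sketch that builds a single request body asking for several features at once. It assumes the request shape and feature type names from the publicly documented v1 REST reference (`images:annotate`); the limited preview described here may have differed. The helper name `build_annotate_request` and the short feature aliases are hypothetical, chosen for illustration. The sketch only constructs the JSON and does not send it.

```python
import base64
import json

# Feature type names as listed in the public Cloud Vision v1 REST reference.
# Assumption: the limited-preview API may have used different names.
# The short keys on the left are illustrative aliases, not part of the API.
FEATURE_TYPES = {
    "labels": "LABEL_DETECTION",
    "text": "TEXT_DETECTION",
    "safe_search": "SAFE_SEARCH_DETECTION",
    "faces": "FACE_DETECTION",
    "landmarks": "LANDMARK_DETECTION",
    "logos": "LOGO_DETECTION",
}


def build_annotate_request(image_bytes, features, max_results=5):
    """Build a JSON-serializable body for one image and any mix of features."""
    unknown = [name for name in features if name not in FEATURE_TYPES]
    if unknown:
        raise ValueError(f"unknown feature(s): {unknown}")
    return {
        "requests": [
            {
                # Image bytes travel inline as base64 text.
                "image": {"content": base64.b64encode(image_bytes).decode("ascii")},
                "features": [
                    {"type": FEATURE_TYPES[name], "maxResults": max_results}
                    for name in features
                ],
            }
        ]
    }


if __name__ == "__main__":
    # Placeholder bytes stand in for a real image file.
    body = build_annotate_request(b"placeholder image bytes", ["labels", "faces", "logos"])
    print(json.dumps(body, indent=2))
```

In the documented v1 API, a body like this would be sent as an authenticated POST to the `images:annotate` endpoint. Authentication and error handling are left out of the sketch.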

To demonstrate the Vision API, Google developers built a working Raspberry Pi based platform with just a few hundred lines of Python code that call the Vision API.

In Google's demo video, the robot roams and identifies objects, including smiling faces.
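
How a client like the robot might decide that a face is smiling can be sketched as follows. This is not Google's demo code. It assumes the response field names (`faceAnnotations`, `joyLikelihood`) and the likelihood scale from the publicly documented v1 API; the sample response is invented for illustration, and `count_smiling_faces` is a hypothetical helper.

```python
# Likelihood values from the public Cloud Vision v1 reference, weakest to strongest.
LIKELIHOOD_ORDER = [
    "UNKNOWN",
    "VERY_UNLIKELY",
    "UNLIKELY",
    "POSSIBLE",
    "LIKELY",
    "VERY_LIKELY",
]


def is_at_least(likelihood, threshold="LIKELY"):
    """Return True if `likelihood` is at or above `threshold` on the scale."""
    return LIKELIHOOD_ORDER.index(likelihood) >= LIKELIHOOD_ORDER.index(threshold)


def count_smiling_faces(response, threshold="LIKELY"):
    """Count detected faces whose joy likelihood meets the threshold."""
    faces = response.get("faceAnnotations", [])
    return sum(
        1
        for face in faces
        if is_at_least(face.get("joyLikelihood", "UNKNOWN"), threshold)
    )


# Invented sample data shaped like a single-image response, for illustration only.
SAMPLE_RESPONSE = {
    "faceAnnotations": [
        {"joyLikelihood": "VERY_LIKELY", "sorrowLikelihood": "VERY_UNLIKELY"},
        {"joyLikelihood": "UNLIKELY", "sorrowLikelihood": "POSSIBLE"},
    ]
}

if __name__ == "__main__":
    print(count_smiling_faces(SAMPLE_RESPONSE))
```

The API reports each attribute as a coarse likelihood bucket instead of a numeric score, so the threshold here is a design choice: `LIKELY` is conservative, while `POSSIBLE` would flag more faces at the cost of more false positives.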

Cloud Vision is partially powered by Google's TensorFlow machine learning platform that was recently open-sourced.


SOURCE  Google


By 33rd Square

