What Is The Computer Vision API? Reference Guide

User Manual:

Open the PDF directly: View PDF .
Page Count: 270

Download
Open PDF In Browser	View PDF

Contents
Computer Vision API Documentation
Overview
What is Computer Vision?
Quickstarts
Using the REST API
Analyze a remote image
cURL
Go
Java
JavaScript
Node.js
PHP
Python
Ruby
Analyze a local image
C#
Python
Generate a thumbnail
C#
cURL
Go
Java
JavaScript
Node.js
PHP
Python
Ruby
Extract printed text
C#

cURL
Go
Java
JavaScript
Node.js
PHP
Python
Ruby
Extract handwritten text
C#
Java
JavaScript
Python
Use a domain model
PHP
Python
Using the .NET SDK
Analyze an image
Generate a thumbnail
Extract text
Using the Python SDK
Tutorials
Generate metadata for images
Concepts
Tagging images
Detecting objects
Detecting brands
Categorizing images
Describing images
Detecting faces
Detecting image types
Detecting domain-specific content

Detecting color schemes
Generating thumbnails
Recognize printed and handwritten text
Detecting adult and racy content
How-to guides
Use Computer Vision
Java
JavaScript
Python
Call the Computer Vision API
Use containers
Install and run containers
Configure containers
Use the Computer Vision Connected Service
Analyze videos in real time
Reference
Azure CLI
Azure PowerShell
Computer Vision API v2.0
Computer Vision API v1.0
SDKs
.NET
Node.js
Python
Go
Android (Java)
Swift
Resources
Samples
Explore an image processing app
Other Computer Vision samples
FAQ

Category taxonomy
Language support
Pricing and limits
UserVoice
Stack Overflow
Azure roadmap
Regional availability
Compliance

What is Computer Vision?
5/29/2019 • 5 minutes to read • Edit Online

Azure's Computer Vision service provides developers with access to advanced algorithms that process images and
return information. To analyze an image, you can either upload an image or specify an image URL. The images
processing algorithms can analyze content in several different ways, depending on the visual features you're
interested in. For example, Computer Vision can determine if an image contains adult or racy content, or it can find
all of the human faces in an image.
You can use Computer Vision in your application by using either a native SDK or invoking the REST API directly.
This page broadly covers what you can do with Computer Vision.

Analyze images for insight
You can analyze images to detect and provide insights about their visual features and characteristics. All of the
features in the table below are provided by the Analyze Image API.
ACTION

DESCRIPTION

Tag visual features

Identify and tag visual features in an image, from a set of
thousands of recognizable objects, living things, scenery, and
actions. When the tags are ambiguous or not common
knowledge, the API response provides 'hints' to clarify the
meaning of the tag in the context of a known setting. Tagging
isn't limited to the main subject, such as a person in the
foreground, but also includes the setting (indoor or outdoor),
furniture, tools, plants, animals, accessories, gadgets, and so
on.

Detect objects

Object detection is similar to tagging, but the API returns the
bounding box coordinates for each tag applied. For example, if
an image contains a dog, cat and person, the Detect operation
will list those objects together with their coordinates in the
image. You can use this functionality to process further
relationships between the objects in an image. It also lets you
know when there are multiple instances of the same tag in an
image.

Detect brands

Identify commercial brands in images or videos from a
database of thousands of global logos. You can use this
feature, for example, to discover which brands are most
popular on social media or most prevalent in media product
placement.

Categorize an image

Identify and categorize an entire image, using a category
taxonomy with parent/child hereditary hierarchies. Categories
can be used alone, or with our new tagging models.
Currently, English is the only supported language for tagging
and categorizing images.

ACTION

DESCRIPTION

Describe an image

Generate a description of an entire image in human-readable
language, using complete sentences. Computer Vision's
algorithms generate various descriptions based on the objects
identified in the image. The descriptions are each evaluated
and a confidence score generated. A list is then returned
ordered from highest confidence score to lowest.

Detect faces

Detect faces in an image and provide information about each
detected face. Computer Vision returns the coordinates,
rectangle, gender, and age for each detected face.
Computer Vision provides a subset of the functionality that
can be found in Face, and you can use the Face service for
more detailed analysis, such as facial identification and pose
detection.

Detect image types

Detect characteristics about an image, such as whether an
image is a line drawing or the likelihood of whether an image
is clip art.

Detect domain-specific content

Use domain models to detect and identify domain-specific
content in an image, such as celebrities and landmarks. For
example, if an image contains people, Computer Vision can use
a domain model for celebrities included with the service to
determine if the people detected in the image match known
celebrities.

Detect the color scheme

Analyze color usage within an image. Computer Vision can
determine whether an image is black & white or color and, for
color images, identify the dominant and accent colors.

Generate a thumbnail

Analyze the contents of an image to generate an appropriate
thumbnail for that image. Computer Vision first generates a
high-quality thumbnail and then analyzes the objects within
the image to determine the area of interest. Computer Vision
then crops the image to fit the requirements of the area of
interest. The generated thumbnail can be presented using an
aspect ratio that is different from the aspect ratio of the
original image, depending on your needs.

Get the area of interest

Analyze the contents of an image to return the coordinates of
the area of interest. This is the same function that is used to
generate a thumbnail, but instead of cropping the image,
Computer Vision returns the bounding box coordinates of the
region, so the calling application can modify the original image
as desired.

Extract text from images
You can use Computer Vision to extract text from an image into a machine-readable character stream using optical
character recognition (OCR ). If needed, OCR corrects the rotation of the recognized text and provides the frame
coordinates of each word. OCR supports 25 languages and automatically detects the language of the recognized
text.
You can also use the Read API to extract both printed and handwritten text from images and text-heavy documents.
The Read API uses updated models and works for a variety objects with different surfaces and backgrounds, such
as receipts, posters, business cards, letters, and whiteboards. Currently, English is the only supported language.

Moderate content in images
You can use Computer Vision to detect adult and racy content in an image and return a confidence score for both.
The filter for adult and racy content detection can be set on a sliding scale to accommodate your preferences.

Use containers
Use Computer Vision containers to recognize printed and handwritten text locally by installing a standardized
Docker container closer to your data.

Image requirements
Computer Vision can analyze images that meet the following requirements:
The image must be presented in JPEG, PNG, GIF, or BMP format
The file size of the image must be less than 4 megabytes (MB )
The dimensions of the image must be greater than 50 x 50 pixels
For OCR, the dimensions of the image must be between 50 x 50 and 4200 x 4200 pixels

Data privacy and security
As with all of the Cognitive Services, developers using the Computer Vision service should be aware of Microsoft's
policies on customer data. See the Cognitive Services page on the Microsoft Trust Center to learn more.

Next steps
Get started with Computer Vision by following a quickstart guide:
Quickstart: Analyze an image
Quickstart: Extract handwritten text
Quickstart: Generate a thumbnail

Quickstart: Analyze a remote image using the REST
API and cURL in Computer Vision
4/18/2019 • 2 minutes to read • Edit Online

In this quickstart, you analyze a remotely stored image to extract visual features using Computer Vision's REST
API. With the Analyze Image method, you can extract visual features based on image content.
If you don't have an Azure subscription, create a free account before you begin.

Prerequisites
You must have cURL.
You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services.
Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your
key.

Create and run the sample command
To create and run the sample, do the following steps:
1. Copy the following command into a text editor.
2. Make the following changes in the command where needed:
a. Replace the value of  with your subscription key.
b. Replace the request URL ( https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/analyze ) with
the endpoint URL for the Analyze Image method from the Azure region where you obtained your
subscription keys, if necessary.
c. Optionally, change the language parameter of the request URL ( language=en ) to use a different
supported language.
d. Optionally, change the image URL in the request body (
http://upload.wikimedia.org/wikipedia/commons/3/3c/Shaki_waterfall.jpg\ ) to the URL of a different
image to be analyzed.
3. Open a command prompt window.
4. Paste the command from the text editor into the command prompt window, and then run the command.
curl -H "Ocp-Apim-Subscription-Key: " -H "Content-Type: application/json"
"https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/analyze?
visualFeatures=Categories,Description&details=Landmarks&language=en" -d "
{\"url\":\"http://upload.wikimedia.org/wikipedia/commons/3/3c/Shaki_waterfall.jpg\"}"

Examine the response
A successful response is returned in JSON. The sample application parses and displays a successful response in the
command prompt window, similar to the following example:

{
"categories": [
{
"name": "outdoor_water",
"score": 0.9921875,
"detail": {
"landmarks": []
}
}
],
"description": {
"tags": [
"nature",
"water",
"waterfall",
"outdoor",
"rock",
"mountain",
"rocky",
"grass",
"hill",
"covered",
"hillside",
"standing",
"side",
"group",
"walking",
"white",
"man",
"large",
"snow",
"grazing",
"forest",
"slope",
"herd",
"river",
"giraffe",
"field"
],
"captions": [
{
"text": "a large waterfall over a rocky cliff",
"confidence": 0.916458423253597
}
]
},
"requestId": "b6e33879-abb2-43a0-a96e-02cb5ae0b795",
"metadata": {
"height": 959,
"width": 1280,
"format": "Jpeg"
}
}

Next steps
Explore the Computer Vision API used to analyze an image, detect celebrities and landmarks, create a thumbnail,
and extract printed and handwritten text. To rapidly experiment with the Computer Vision API, try the Open API
testing console.
Explore the Computer Vision API

Quickstart: Analyze a remote image using the REST
API and Go in Computer Vision
4/18/2019 • 3 minutes to read • Edit Online

In this quickstart, you analyze a remotely stored image to extract visual features by using Computer Vision's REST
API. With the Analyze Image method, you can extract visual features based on image content.
If you don't have an Azure subscription, create a free account before you begin.

Prerequisites
You must have Go installed.
You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services.
Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your
key.

Create and run the sample
To create and run the sample, do the following steps:
1. Copy the below code into a text editor.
2. Make the following changes in code where needed:
a. Replace the value of subscriptionKey with your subscription key.
b. Replace the value of uriBase with the endpoint URL for the Analyze Image method from the Azure
region where you obtained your subscription keys, if necessary.
c. Optionally, replace the value of imageUrl with the URL of a different image that you want to analyze.
3. Save the code as a file with a .go extension. For example, analyze-image.go .
4. Open a command prompt window.
5. At the prompt, run the go build command to compile the package from the file. For example,
go build analyze-image.go .
6. At the prompt, run the compiled package. For example, analyze-image .
package main
import (
"encoding/json"
"fmt"
"io/ioutil"
"net/http"
"strings"
"time"
)
func main() {
// Replace  with your valid subscription key.
const subscriptionKey = ""
//
//
//
//
//
//

You must use the same Azure region in your REST API method as you used to
get your subscription keys. For example, if you got your subscription keys
from the West US region, replace "westcentralus" in the URL
below with "westus".
Free trial subscription keys are generated in the "westus" region.

// Free trial subscription keys are generated in the "westus" region.
// If you use a free trial subscription key, you shouldn't need to change
// this region.
const uriBase =
"https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/analyze"
const imageUrl =
"https://upload.wikimedia.org/wikipedia/commons/3/3c/Shaki_waterfall.jpg"
const params = "?visualFeatures=Description&details=Landmarks&language=en"
const uri = uriBase + params
const imageUrlEnc = "{\"url\":\"" + imageUrl + "\"}"
reader := strings.NewReader(imageUrlEnc)
// Create the HTTP client
client := &http.Client{
Timeout: time.Second * 2,
}
// Create the POST request, passing the image URL in the request body
req, err := http.NewRequest("POST", uri, reader)
if err != nil {
panic(err)
}
// Add request headers
req.Header.Add("Content-Type", "application/json")
req.Header.Add("Ocp-Apim-Subscription-Key", subscriptionKey)
// Send the request and retrieve the response
resp, err := client.Do(req)
if err != nil {
panic(err)
}
defer resp.Body.Close()
// Read the response body
// Note, data is a byte array
data, err := ioutil.ReadAll(resp.Body)
if err != nil {
panic(err)
}
// Parse the JSON data from the byte array
var f interface{}
json.Unmarshal(data, &f)
// Format and display the JSON result
jsonFormatted, _ := json.MarshalIndent(f, "", " ")
fmt.Println(string(jsonFormatted))
}

Examine the response
A successful response is returned in JSON. The sample application parses and displays a successful response in the
command prompt window, similar to the following example:

{
"categories": [
{
"detail": {
"landmarks": []
},
"name": "outdoor_water",
"score": 0.9921875
}
],
"description": {
"captions": [
{
"confidence": 0.916458423253597,
"text": "a large waterfall over a rocky cliff"
}
],
"tags": [
"nature",
"water",
"waterfall",
"outdoor",
"rock",
"mountain",
"rocky",
"grass",
"hill",
"covered",
"hillside",
"standing",
"side",
"group",
"walking",
"white",
"man",
"large",
"snow",
"grazing",
"forest",
"slope",
"herd",
"river",
"giraffe",
"field"
]
},
"metadata": {
"format": "Jpeg",
"height": 959,
"width": 1280
},
"requestId": "a92f89ab-51f8-4735-a58d-507da2213fc2"
}

Next steps
Explore the Computer Vision API used to analyze an image, detect celebrities and landmarks, create a thumbnail,
and extract printed and handwritten text. To rapidly experiment with the Computer Vision API, try the Open API
testing console.
Explore the Computer Vision API

Quickstart: Analyze a remote image using the
Computer Vision REST API and Java
4/18/2019 • 4 minutes to read • Edit Online

In this quickstart, you analyze a remotely stored image to extract visual features by using Computer Vision's REST
API. With the Analyze Image method, you can extract visual features based on image content.
If you don't have an Azure subscription, create a free account before you begin.

Prerequisites
You must have Java™ Platform, Standard Edition Development Kit 7 or 8 (JDK 7 or 8) installed.
You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services.
Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your
key.

Create and run the sample application
To create and run the sample, do the following steps:
1. Create a new Java project in your favorite IDE or editor. If the option is available, create the Java project from
a command line application template.
2. Import the following libraries into your Java project. If you're using Maven, the Maven coordinates are
provided for each library.
Apache HTTP client (org.apache.httpcomponents:httpclient:4.5.5)
Apache HTTP core (org.apache.httpcomponents:httpcore:4.4.9)
JSON library (org.json:json:20180130)
3. Add the following import statements to the file that contains the Main public class for your project.
import
import
import
import
import
import
import
import
import
import

java.net.URI;
org.apache.http.HttpEntity;
org.apache.http.HttpResponse;
org.apache.http.client.methods.HttpPost;
org.apache.http.entity.StringEntity;
org.apache.http.client.utils.URIBuilder;
org.apache.http.impl.client.CloseableHttpClient;
org.apache.http.impl.client.HttpClientBuilder;
org.apache.http.util.EntityUtils;
org.json.JSONObject;

4. Replace the
needed:

Main

public class with the following code, then make the following changes in code where

a. Replace the value of subscriptionKey with your subscription key.
b. Replace the value of uriBase with the endpoint URL for the Analyze Image method from the Azure
region where you obtained your subscription keys, if necessary.
c. Optionally, replace the value of imageToAnalyze with the URL of a different image that you want to
analyze.

public
//
//
//

class Main {
**********************************************
*** Update or verify the following values. ***
**********************************************

// Replace  with your valid subscription key.
private static final String subscriptionKey = "";
// You must use the same Azure region in your REST API method as you used to
// get your subscription keys. For example, if you got your subscription keys
// from the West US region, replace "westcentralus" in the URL
// below with "westus".
//
// Free trial subscription keys are generated in the "westus" region.
// If you use a free trial subscription key, you shouldn't need to change
// this region.
private static final String uriBase =
"https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/analyze";
private static final String imageToAnalyze =
"https://upload.wikimedia.org/wikipedia/commons/" +
"1/12/Broadway_and_Times_Square_by_night.jpg";
public static void main(String[] args) {
CloseableHttpClient httpClient = HttpClientBuilder.create().build();
try {
URIBuilder builder = new URIBuilder(uriBase);
// Request parameters. All of them are optional.
builder.setParameter("visualFeatures", "Categories,Description,Color");
builder.setParameter("language", "en");
// Prepare the URI for the REST API method.
URI uri = builder.build();
HttpPost request = new HttpPost(uri);
// Request headers.
request.setHeader("Content-Type", "application/json");
request.setHeader("Ocp-Apim-Subscription-Key", subscriptionKey);
// Request body.
StringEntity requestEntity =
new StringEntity("{\"url\":\"" + imageToAnalyze + "\"}");
request.setEntity(requestEntity);
// Call the REST API method and get the response entity.
HttpResponse response = httpClient.execute(request);
HttpEntity entity = response.getEntity();
if (entity != null) {
// Format and display the JSON response.
String jsonString = EntityUtils.toString(entity);
JSONObject json = new JSONObject(jsonString);
System.out.println("REST Response:\n");
System.out.println(json.toString(2));
}
} catch (Exception e) {
// Display error message.
System.out.println(e.getMessage());
}
}
}

Compile and run the program

1. Save, then build the Java project.
2. If you're using an IDE, run Main .
Alternately, if you're running the program from a command line window, run the following commands. These
commands presume your libraries are in a folder named libs that is in the same folder as Main.java ; if not, you
will need to replace libs with the path to your libraries.
1. Compile the file

Main.java

.

javac -cp ".;libs/*" Main.java

2. Run the program. It will send the request to the QnA Maker API to create the KB, then it will poll for the
results every 30 seconds. Each response is printed to the command line window.
java -cp ".;libs/*" Main

Examine the response
A successful response is returned in JSON. The sample application parses and displays a successful response in the
console window, similar to the following example:

REST Response:
{
"metadata": {
"width": 1826,
"format": "Jpeg",
"height": 2436
},
"color": {
"dominantColorForeground": "Brown",
"isBWImg": false,
"accentColor": "B74314",
"dominantColorBackground": "Brown",
"dominantColors": ["Brown"]
},
"requestId": "bbffe1a1-4fa3-4a6b-a4d5-a4964c58a811",
"description": {
"captions": [{
"confidence": 0.8241405091548035,
"text": "a group of people on a city street filled with traffic at night"
}],
"tags": [
"outdoor",
"building",
"street",
"city",
"busy",
"people",
"filled",
"traffic",
"many",
"table",
"car",
"group",
"walking",
"bunch",
"crowded",
"large",
"night",
"light",
"standing",
"man",
"tall",
"umbrella",
"riding",
"sign",
"crowd"
]
},
"categories": [{
"score": 0.625,
"name": "outdoor_street"
}]
}

Clean up resources
When no longer needed, delete the Java project, including the compiled class and imported libraries.

Next steps
Explore a Java Swing application that uses Computer Vision to perform optical character recognition (OCR ); create
smart-cropped thumbnails; plus detect, categorize, tag, and describe visual features, including faces, in an image. To
rapidly experiment with the Computer Vision API, try the Open API testing console.

Computer Vision API Java Tutorial

Quickstart: Analyze a remote image using the REST
API and JavaScript in Computer Vision
4/19/2019 • 3 minutes to read • Edit Online

In this quickstart, you analyze a remotely stored image to extract visual features by using Computer Vision's REST
API. With the Analyze Image method, you can extract visual features based on image content.
If you don't have an Azure subscription, create a free account before you begin.

Prerequisites
You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or,
follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key.

Create and run the sample
To create and run the sample, do the following steps:
1. Copy the following code into a text editor.
2. Make the following changes in code where needed:
a. Replace the value of subscriptionKey with your subscription key.
b. Replace the value of uriBase with the endpoint URL for the Analyze Image method from the Azure
region where you obtained your subscription keys, if necessary.
c. Optionally, replace the value of the value attribute for the inputImage control with the URL of a
different image that you want to analyze.
3. Save the code as a file with an .html extension. For example, analyze-image.html .
4. Open a browser window.
5. In the browser, drag and drop the file into the browser window.
6. When the webpage is displayed in the browser, choose the Analyze Image button.



Analyze Sample




Analyze image:
Enter the URL to an image, then click the Analyze image button.



Image to analyze:







Response:






Source image:

Examine the response A successful response is returned in JSON. The sample webpage parses and displays a successful response in the browser window, similar to the following example: { "categories": [ { "name": "outdoor_water", "score": 0.9921875, "detail": { "landmarks": [] } } ], "description": { "tags": [ "nature", "water", "waterfall", "outdoor", "rock", "mountain", "rocky", "grass", "hill", "covered", "hillside", "standing", "side", "group", "walking", "white", "man", "large", "snow", "grazing", "forest", "slope", "herd", "river", "giraffe", "field" ], "captions": [ { "text": "a large waterfall over a rocky cliff", "confidence": 0.916458423253597 } ] }, "color": { "dominantColorForeground": "Grey", "dominantColorBackground": "Green", "dominantColors": [ "Grey", "Green" ], "accentColor": "4D5E2F", "isBwImg": false }, "requestId": "73ef10ce-a4ea-43c6-aee7-70325777e4b3", "metadata": { "height": 959, "width": 1280, "format": "Jpeg" } } Next steps Explore a JavaScript application that uses Computer Vision to perform optical character recognition (OCR ); create smart-cropped thumbnails; plus detect, categorize, tag, and describe visual features, including faces, in an image. To rapidly experiment with the Computer Vision API, try the Open API testing console. Computer Vision API JavaScript Tutorial Quickstart: Analyze a remote image using the REST API with Node.js in Computer Vision 4/18/2019 • 2 minutes to read • Edit Online In this quickstart, you analyze a remotely stored image to extract visual features by using Computer Vision's REST API. With the Analyze Image method, you can extract visual features based on image content. If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have Node.js 4.x or later installed. You must have npm installed. You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the sample To create and run the sample, do the following steps: 1. Install the npm request package. a. Open a command prompt window as an administrator. b. Run the following command: npm install request c. After the package is successfully installed, close the command prompt window. 2. Copy the following code into a text editor. 3. Make the following changes in code where needed: a. Replace the value of subscriptionKey with your subscription key. b. Replace the value of uriBase with the endpoint URL for the Analyze Image method from the Azure region where you obtained your subscription keys, if necessary. c. Optionally, replace the value of imageUrl with the URL of a different image that you want to analyze. d. Optionally, replace the value of the language request parameter with a different language. 4. Save the code as a file with a .js extension. For example, analyze-image.js . 5. Open a command prompt window. 6. At the prompt, use the node command to run the file. For example, node analyze-image.js . 'use strict'; const request = require('request'); // Replace with your valid subscription key. const subscriptionKey = ''; // You must use the same location in your REST call as you used to get your // subscription keys. For example, if you got your subscription keys from // westus, replace "westcentralus" in the URL below with "westus". const uriBase = 'https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/analyze'; const imageUrl = 'https://upload.wikimedia.org/wikipedia/commons/3/3c/Shaki_waterfall.jpg'; // Request parameters. const params = { 'visualFeatures': 'Categories,Description,Color', 'details': '', 'language': 'en' }; const options = { uri: uriBase, qs: params, body: '{"url": ' + '"' + imageUrl + '"}', headers: { 'Content-Type': 'application/json', 'Ocp-Apim-Subscription-Key' : subscriptionKey } }; request.post(options, (error, response, body) => { if (error) { console.log('Error: ', error); return; } let jsonResponse = JSON.stringify(JSON.parse(body), null, ' '); console.log('JSON Response\n'); console.log(jsonResponse); }); Examine the response A successful response is returned in JSON. The sample parses and displays a successful response in the command prompt window, similar to the following example: { "categories": [ { "name": "outdoor_water", "score": 0.9921875, "detail": { "landmarks": [] } } ], "description": { "tags": [ "nature", "water", "waterfall", "outdoor", "rock", "mountain", "rocky", "grass", "hill", "covered", "hillside", "standing", "side", "group", "walking", "white", "man", "large", "snow", "grazing", "forest", "slope", "herd", "river", "giraffe", "field" ], "captions": [ { "text": "a large waterfall over a rocky cliff", "confidence": 0.916458423253597 } ] }, "color": { "dominantColorForeground": "Grey", "dominantColorBackground": "Green", "dominantColors": [ "Grey", "Green" ], "accentColor": "4D5E2F", "isBwImg": false }, "requestId": "81b4e400-e3c1-41f1-9020-e6871ad9f0ed", "metadata": { "height": 959, "width": 1280, "format": "Jpeg" } } Clean up resources When no longer needed, delete the file, and then uninstall the npm the following steps: request package. To uninstall the package, do 1. Open a command prompt window as an administrator. 2. Run the following command: npm uninstall request 3. After the package is successfully uninstalled, close the command prompt window. Next steps Explore the Computer Vision APIs used to analyze an image, detect celebrities and landmarks, create a thumbnail, and extract printed and handwritten text. To rapidly experiment with the Computer Vision API, try the Open API testing console. Explore the Computer Vision API Quickstart: Analyze a remote image using the REST API and PHP in Computer Vision 4/18/2019 • 2 minutes to read • Edit Online In this quickstart, you analyze a remotely stored image to extract visual features by using Computer Vision's REST API. With the Analyze Image method, you can extract visual features based on image content. If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have PHP installed. You must have Pear installed. You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the sample To create and run the sample, do the following steps: 1. Install the PHP5 HTTP_Request2 package. a. Open a command prompt window as an administrator. b. Run the following command: pear install HTTP_Request2 c. After the package is successfully installed, close the command prompt window. 2. Copy the following code into a text editor. 3. Make the following changes in code where needed: a. Replace the value of subscriptionKey with your subscription key. b. Replace the value of uriBase with the endpoint URL for the Analyze Image method from the Azure region where you obtained your subscription keys, if necessary. c. Optionally, replace the value of imageUrl with the URL of a different image that you want to analyze. d. Optionally, replace the value of the language request parameter with a different language. 4. Save the code as a file with a .php extension. For example, analyze-image.php . 5. Open a browser window with PHP support. 6. Drag and drop the file into the browser window. Analyze Image Sample with a valid subscription key. $ocpApimSubscriptionKey = ''; // You must use the same location in your REST call as you used to obtain // your subscription keys. For example, if you obtained your subscription keys // from westus, replace "westcentralus" in the URL below with "westus". $uriBase = 'https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/'; $imageUrl = 'https://upload.wikimedia.org/wikipedia/commons/3/3c/Shaki_waterfall.jpg'; require_once 'HTTP/Request2.php'; $request = new Http_Request2($uriBase . '/analyze'); $url = $request->getUrl(); $headers = array( // Request headers 'Content-Type' => 'application/json', 'Ocp-Apim-Subscription-Key' => $ocpApimSubscriptionKey ); $request->setHeader($headers); $parameters = array( // Request parameters 'visualFeatures' => 'Categories,Description', 'details' => '', 'language' => 'en' ); $url->setQueryVariables($parameters); $request->setMethod(HTTP_Request2::METHOD_POST); // Request body parameters $body = json_encode(array('url' => $imageUrl)); // Request body $request->setBody($body); try { $response = $request->send(); echo "

" .
json_encode(json_decode($response->getBody()), JSON_PRETTY_PRINT) . "

"; } catch (HttpException $ex) { echo "

" . $ex . "

"; } ?> Examine the response A successful response is returned in JSON. The sample website parses and displays a successful response in the browser window, similar to the following example: { "categories": [ { "name": "outdoor_water", "score": 0.9921875, "detail": { "landmarks": [] } } ], "description": { "tags": [ "nature", "water", "waterfall", "outdoor", "rock", "mountain", "rocky", "grass", "hill", "covered", "hillside", "standing", "side", "group", "walking", "white", "man", "large", "snow", "grazing", "forest", "slope", "herd", "river", "giraffe", "field" ], "captions": [ { "text": "a large waterfall over a rocky cliff", "confidence": 0.916458423253597 } ] }, "requestId": "ebf5a1bc-3ba2-4c56-99b4-bbd20ba28705", "metadata": { "height": 959, "width": 1280, "format": "Jpeg" } } Clean up resources When no longer needed, delete the file, and then uninstall the PHP5 package, do the following steps: 1. Open a command prompt window as an administrator. 2. Run the following command: HTTP_Request2 package. To uninstall the pear uninstall HTTP_Request2 3. After the package is successfully uninstalled, close the command prompt window. Next steps Explore the Computer Vision API used to analyze an image, detect celebrities and landmarks, create a thumbnail, and extract printed and handwritten text. To rapidly experiment with the Computer Vision API, try the Open API testing console. Explore the Computer Vision API Quickstart: Analyze a remote image using the REST API and Python in Computer Vision 4/19/2019 • 3 minutes to read • Edit Online In this quickstart, you analyze a remotely stored image to extract visual features by using Computer Vision's REST API. With the Analyze Image method, you can extract visual features based on image content. You can run this quickstart in a step-by step fashion using a Jupyter notebook on MyBinder. To launch Binder, select the following button: launch binder If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have Python installed if you want to run the sample locally. You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. You must have the following Python packages installed. You can use pip to install Python packages. requests matplotlib pillow Create and run the sample To create and run the sample, do the following steps: 1. Copy the following code into a text editor. 2. Make the following changes in code where needed: a. Replace the value of subscription_key with your subscription key. b. Replace the value of vision_base_url with the endpoint URL for the Computer Vision resource in the Azure region where you obtained your subscription keys, if necessary. c. Optionally, replace the value of image_url with the URL of a different image that you want to analyze. 3. Save the code as a file with an .py extension. For example, analyze-image.py . 4. Open a command prompt window. 5. At the prompt, use the python command to run the sample. For example, python analyze-image.py . import requests # If you are using a Jupyter notebook, uncomment the following line. #%matplotlib inline import matplotlib.pyplot as plt import json from PIL import Image from io import BytesIO # Replace with your valid subscription key. subscription_key = "" assert subscription_key # You must use the same region in your REST call as you used to get your # subscription keys. For example, if you got your subscription keys from # westus, replace "westcentralus" in the URI below with "westus". # # Free trial subscription keys are generated in the "westus" region. # If you use a free trial subscription key, you shouldn't need to change # this region. vision_base_url = "https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/" analyze_url = vision_base_url + "analyze" # Set image_url to the URL of an image that you want to analyze. image_url = "https://upload.wikimedia.org/wikipedia/commons/thumb/1/12/" + \ "Broadway_and_Times_Square_by_night.jpg/450px-Broadway_and_Times_Square_by_night.jpg" headers = {'Ocp-Apim-Subscription-Key': subscription_key } params = {'visualFeatures': 'Categories,Description,Color'} data = {'url': image_url} response = requests.post(analyze_url, headers=headers, params=params, json=data) response.raise_for_status() # The 'analysis' object contains various fields that describe the image. The most # relevant caption for the image is obtained from the 'description' property. analysis = response.json() print(json.dumps(response.json())) image_caption = analysis["description"]["captions"][0]["text"].capitalize() # Display the image and overlay it with the caption. image = Image.open(BytesIO(requests.get(image_url).content)) plt.imshow(image) plt.axis("off") _ = plt.title(image_caption, size="x-large", y=-0.1) plt.show() Examine the response A successful response is returned in JSON. The sample webpage parses and displays a successful response in the command prompt window, similar to the following example: { "categories": [ { "name": "outdoor_", "score": 0.00390625, "detail": { "landmarks": [] } }, { "name": "outdoor_street", "score": 0.33984375, "detail": { "landmarks": [] "landmarks": [] } } ], "description": { "tags": [ "building", "outdoor", "street", "city", "people", "busy", "table", "walking", "traffic", "filled", "large", "many", "group", "night", "light", "crowded", "bunch", "standing", "man", "sign", "crowd", "umbrella", "riding", "tall", "woman", "bus" ], "captions": [ { "text": "a group of people on a city street at night", "confidence": 0.9122243847383961 } ] }, "color": { "dominantColorForeground": "Brown", "dominantColorBackground": "Brown", "dominantColors": [ "Brown" ], "accentColor": "B54316", "isBwImg": false }, "requestId": "c11894eb-de3e-451b-9257-7c8b168073d1", "metadata": { "height": 600, "width": 450, "format": "Jpeg" } } Next steps Explore a Python application that uses Computer Vision to perform optical character recognition (OCR ); create smart-cropped thumbnails; plus detect, categorize, tag, and describe visual features, including faces, in an image. To rapidly experiment with the Computer Vision API, try the Open API testing console. Computer Vision API Python Tutorial Quickstart: Analyze a remote image using the REST API and Ruby in Computer Vision 4/18/2019 • 2 minutes to read • Edit Online In this quickstart, you analyze a remotely stored image to extract visual features by using Computer Vision's REST API. With the Analyze Image method, you can extract visual features based on image content. If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have Ruby 2.4.x or later installed. You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the sample To create and run the sample, do the following steps: 1. Copy the following code into a text editor. 2. Make the following changes in code where needed: a. Replace with your subscription key. b. Replace https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/analyze with the endpoint URL for the Analyze Image method in the Azure region where you obtained your subscription keys, if necessary. c. Optionally, replace the value of the language request parameter with a different language. d. Optionally, replace http://upload.wikimedia.org/wikipedia/commons/3/3c/Shaki_waterfall.jpg\ with the URL of a different image that you want to analyze. 3. Save the code as a file with an .rb extension. For example, analyze-image.rb . 4. Open a command prompt window. 5. At the prompt, use the ruby command to run the sample. For example, ruby analyze-image.rb . require 'net/http' # You must use the same location in your REST call as you used to get your # subscription keys. For example, if you got your subscription keys from westus, # replace "westcentralus" in the URL below with "westus". uri = URI('https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/analyze') uri.query = URI.encode_www_form({ # Request parameters 'visualFeatures' => 'Categories, Description', 'details' => 'Landmarks', 'language' => 'en' }) request = Net::HTTP::Post.new(uri.request_uri) # Request headers # Replace with your valid subscription key. request['Ocp-Apim-Subscription-Key'] = '' request['Content-Type'] = 'application/json' request.body = "{\"url\": \"http://upload.wikimedia.org/wikipedia/commons/3/3c/Shaki_waterfall.jpg\"}" response = Net::HTTP.start(uri.host, uri.port, :use_ssl => uri.scheme == 'https') do |http| http.request(request) end puts response.body Examine the response A successful response is returned in JSON. The sample parses and displays a successful response in the command prompt window, similar to the following example: { "categories": [ { "name": "abstract_", "score": 0.00390625 }, { "name": "people_", "score": 0.83984375, "detail": { "celebrities": [ { "name": "Satya Nadella", "faceRectangle": { "left": 597, "top": 162, "width": 248, "height": 248 }, "confidence": 0.999028444 } ] } } ], "adult": { "isAdultContent": false, "isRacyContent": false, "adultScore": 0.0934349000453949, "racyScore": 0.068613491952419281 }, }, "tags": [ { "name": "person", "confidence": 0.98979085683822632 }, { "name": "man", "confidence": 0.94493889808654785 }, { "name": "outdoor", "confidence": 0.938492476940155 }, { "name": "window", "confidence": 0.89513939619064331 } ], "description": { "tags": [ "person", "man", "outdoor", "window", "glasses" ], "captions": [ { "text": "Satya Nadella sitting on a bench", "confidence": 0.48293603002174407 } ] }, "requestId": "0dbec5ad-a3d3-4f7e-96b4-dfd57efe967d", "metadata": { "width": 1500, "height": 1000, "format": "Jpeg" }, "faces": [ { "age": 44, "gender": "Male", "faceRectangle": { "left": 593, "top": 160, "width": 250, "height": 250 } } ], "color": { "dominantColorForeground": "Brown", "dominantColorBackground": "Brown", "dominantColors": [ "Brown", "Black" ], "accentColor": "873B59", "isBWImg": false }, "imageType": { "clipArtType": 0, "lineDrawingType": 0 } } Next steps Explore the Computer Vision API used to analyze an image, detect celebrities and landmarks, create a thumbnail, and extract printed and handwritten text. To rapidly experiment with the Computer Vision API, try the Open API testing console. Explore the Computer Vision API Quickstart: Analyze a local image using the REST API and C# in Computer Vision 4/18/2019 • 4 minutes to read • Edit Online In this quickstart, you will analyze a locally stored image to extract visual features by using Computer Vision's REST API. With the Analyze Image method, you can extract visual feature information based on image content. If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have Visual Studio 2015 or later. You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the sample application To create the sample in Visual Studio, do the following steps: 1. Create a new Visual Studio solution in Visual Studio, using the Visual C# Console App (.NET Framework) template. 2. Install the Newtonsoft.Json NuGet package. a. On the menu, click Tools, select NuGet Package Manager, then Manage NuGet Packages for Solution. b. Click the Browse tab, and in the Search box type "Newtonsoft.Json". c. Select Newtonsoft.Json when it displays, then click the checkbox next to your project name, and Install. 3. Replace the code in Program.cs with the following code, and then make the following changes in code where needed: a. Replace the value of subscriptionKey with your subscription key. b. Replace the value of uriBase with the endpoint URL for the Analyze Image method from the Azure region where you obtained your subscription keys, if necessary. 4. Run the program. 5. At the prompt, enter the path to a local image. using using using using using using Newtonsoft.Json.Linq; System; System.IO; System.Net.Http; System.Net.Http.Headers; System.Threading.Tasks; namespace CSHttpClientSample { static class Program { // Replace with your valid subscription key. const string subscriptionKey = ""; // You must use the same Azure region in your REST API method as you used to // get your subscription keys. For example, if you got your subscription keys // from the West US region, replace "westcentralus" in the URL // from the West US region, replace "westcentralus" in the URL // below with "westus". // // Free trial subscription keys are generated in the "westus" region. // If you use a free trial subscription key, you shouldn't need to change // this region. const string uriBase = "https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/analyze"; static void Main() { // Get the path and filename to process from the user. Console.WriteLine("Analyze an image:"); Console.Write( "Enter the path to the image you wish to analyze: "); string imageFilePath = Console.ReadLine(); if (File.Exists(imageFilePath)) { // Call the REST API method. Console.WriteLine("\nWait a moment for the results to appear.\n"); MakeAnalysisRequest(imageFilePath).Wait(); } else { Console.WriteLine("\nInvalid file path"); } Console.WriteLine("\nPress Enter to exit..."); Console.ReadLine(); } ///

/// Gets the analysis of the specified image file by using /// the Computer Vision REST API. ///

/// The image file to analyze. static async Task MakeAnalysisRequest(string imageFilePath) { try { HttpClient client = new HttpClient(); // Request headers. client.DefaultRequestHeaders.Add( "Ocp-Apim-Subscription-Key", subscriptionKey); // Request parameters. A third optional parameter is "details". // The Analyze Image method returns information about the following // visual features: // Categories: categorizes image content according to a // taxonomy defined in documentation. // Description: describes the image content with a complete // sentence in supported languages. // Color: determines the accent color, dominant color, // and whether an image is black & white. string requestParameters = "visualFeatures=Categories,Description,Color"; // Assemble the URI for the REST API method. string uri = uriBase + "?" + requestParameters; HttpResponseMessage response; // Read the contents of the specified local image // into a byte array. byte[] byteData = GetImageAsByteArray(imageFilePath); // Add the byte array as an octet stream to the request body. using (ByteArrayContent content = new ByteArrayContent(byteData)) { { // This example uses the "application/octet-stream" content type. // The other content types you can use are "application/json" // and "multipart/form-data". content.Headers.ContentType = new MediaTypeHeaderValue("application/octet-stream"); // Asynchronously call the REST API method. response = await client.PostAsync(uri, content); } // Asynchronously get the JSON response. string contentString = await response.Content.ReadAsStringAsync(); // Display the JSON response. Console.WriteLine("\nResponse:\n\n{0}\n", JToken.Parse(contentString).ToString()); } catch (Exception e) { Console.WriteLine("\n" + e.Message); } } ///

/// Returns the contents of the specified file as a byte array. ///

/// Gets a thumbnail image from the specified image file by using /// the Computer Vision REST API. ///

/// The image file to use to create the thumbnail image. static async Task MakeThumbNailRequest(string imageFilePath) { try { HttpClient client = new HttpClient(); // Request headers. client.DefaultRequestHeaders.Add( "Ocp-Apim-Subscription-Key", subscriptionKey); // Request parameters. // The width and height parameters specify a thumbnail that's // 200 pixels wide and 150 pixels high. // The smartCropping parameter is set to true, to enable smart cropping. string requestParameters = "width=200&height=150&smartCropping=true"; // Assemble the URI for the REST API method. string uri = uriBase + "?" + requestParameters; HttpResponseMessage response; // Read the contents of the specified local image // into a byte array. byte[] byteData = GetImageAsByteArray(imageFilePath); // Add the byte array as an octet stream to the request body. using (ByteArrayContent content = new ByteArrayContent(byteData)) { // This example uses the "application/octet-stream" content type. // The other content types you can use are "application/json" // and "multipart/form-data". content.Headers.ContentType = new MediaTypeHeaderValue("application/octet-stream"); // Asynchronously call the REST API method. response = await client.PostAsync(uri, content); } // Check the HTTP status code of the response. If successful, display // display the response and save the thumbnail. if (response.IsSuccessStatusCode) { // Display the response data. Console.WriteLine("\nResponse:\n{0}", response); // Get the image data for the thumbnail from the response. byte[] thumbnailImageData = await response.Content.ReadAsByteArrayAsync(); // Save the thumbnail to the same folder as the original image, // using the original name with the suffix "_thumb". // Note: This will overwrite an existing file of the same name. string thumbnailFilePath = imageFilePath.Insert(imageFilePath.Length - 4, "_thumb"); File.WriteAllBytes(thumbnailFilePath, thumbnailImageData); Console.WriteLine("\nThumbnail written to: {0}", thumbnailFilePath); } else { // Display the JSON error data. string errorString = await response.Content.ReadAsStringAsync(); Console.WriteLine("\n\nResponse:\n{0}\n", JToken.Parse(errorString).ToString()); } } catch (Exception e) { Console.WriteLine("\n" + e.Message); } } ///

/// Returns the contents of the specified file as a byte array. ///

/// The image file to read. /// The byte array of the image data. static byte[] GetImageAsByteArray(string imageFilePath) { // Open a read-only file stream for the specified file. using (FileStream fileStream = new FileStream(imageFilePath, FileMode.Open, FileAccess.Read)) { // Read the file's contents into a byte array. BinaryReader binaryReader = new BinaryReader(fileStream); return binaryReader.ReadBytes((int)fileStream.Length); } } } } Examine the response A successful response is returned as binary data, which represents the image data for the thumbnail. If the request succeeds, the thumbnail is saved to the same folder as the local image, using the original name with the suffix "_thumb". If the request fails, the response contains an error code and a message to help determine what went wrong. The sample application displays a successful response in the console window, similar to the following example: Response: StatusCode: 200, ReasonPhrase: 'OK', Version: 1.1, Content: System.Net.Http.StreamContent, Headers: { Pragma: no-cache apim-request-id: 131eb5b4-5807-466d-9656-4c1ef0a64c9b Strict-Transport-Security: max-age=31536000; includeSubDomains; preload x-content-type-options: nosniff Cache-Control: no-cache Date: Tue, 06 Jun 2017 20:54:07 GMT X-AspNet-Version: 4.0.30319 X-Powered-By: ASP.NET Content-Length: 5800 Content-Type: image/jpeg Expires: -1 } Clean up resources When no longer needed, delete the Visual Studio solution. To do so, open File Explorer, navigate to the folder in which you created the Visual Studio solution, and delete the folder. Next steps Explore a basic Windows application that uses Computer Vision to perform optical character recognition (OCR ); create smart-cropped thumbnails; plus detect, categorize, tag, and describe visual features, including faces, in an image. To rapidly experiment with the Computer Vision APIs, try the Open API testing console. Computer Vision API C# Tutorial Quickstart: Generate a thumbnail using the REST API and cURL in Computer Vision 4/18/2019 • 2 minutes to read • Edit Online In this quickstart, you generate a thumbnail from an image using Computer Vision's REST API. You specify the desired height and width, which can differ in aspect ration from the input image. Computer Vision uses smart cropping to intelligently identify the area of interest and generate cropping coordinates around that region. If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have cURL. You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Get Thumbnail request With the Get Thumbnail method, you can generate a thumbnail of an image. To run the sample, do the following steps: 1. 2. 3. 4. Copy the following code into an editor. Replace with your valid subscription key. Replace with the path and filename to save the thumbnail. Change the Request URL ( https://westcentralus.api.cognitive.microsoft.com/vision/v2.0 ) to use the location where you obtained your subscription keys, if necessary. 5. Optionally, change the image ( {\"url\":\"... ) to analyze. 6. Open a command window on a computer with cURL installed. 7. Paste the code in the window and run the command. NOTE You must use the same location in your REST call as you used to obtain your subscription keys. For example, if you obtained your subscription keys from westus, replace "westcentralus" in the URL below with "westus". Create and run the sample command To create and run the sample, do the following steps: 1. Copy the following command into a text editor. 2. Make the following changes in the command where needed: a. Replace the value of with your subscription key. b. Replace the value of with the path and name of the file in which to save the thumbnail. c. Replace the request URL ( https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/generateThumbnail ) with the endpoint URL for the Get Thumbnail method from the Azure region where you obtained your subscription keys, if necessary. d. Optionally, change the image URL in the request body ( https://upload.wikimedia.org/wikipedia/commons/thumb/5/56/Shorkie_Poo_Puppy.jpg/1280pxShorkie_Poo_Puppy.jpg\ ) to the URL of a different image from which to generate a thumbnail. 3. Open a command prompt window. 4. Paste the command from the text editor into the command prompt window, and then run the command. curl -H "Ocp-Apim-Subscription-Key: " -o -H "Content-Type: application/json" "https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/generateThumbnail? width=100&height=100&smartCropping=true" -d " {\"url\":\"https://upload.wikimedia.org/wikipedia/commons/thumb/5/56/Shorkie_Poo_Puppy.jpg/1280pxShorkie_Poo_Puppy.jpg\"}" Examine the response A successful response writes the thumbnail image to the file specified in . If the request fails, the response contains an error code and a message to help determine what went wrong. If the request seems to succeed but the created thumbnail is not a valid image file, it might be that your subscription key is not valid. Next steps Explore the Computer Vision API to how to analyze an image, detect celebrities and landmarks, create a thumbnail, and extract printed and handwritten text. To rapidly experiment with the Computer Vision API, try the Open API testing console. Explore the Computer Vision API Quickstart: Generate a thumbnail using the REST API and Go in Computer Vision 4/18/2019 • 2 minutes to read • Edit Online In this quickstart, you generate a thumbnail from an image using Computer Vision's REST API. You specify the height and width, which can differ in aspect ratio from the input image. Computer Vision uses smart cropping to intelligently identify the area of interest and generate cropping coordinates based on that region. If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have Go installed. You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the sample To create and run the sample, do the following steps: 1. Copy the following code into a text editor. 2. Make the following changes in code where needed: a. Replace the value of subscriptionKey with your subscription key. b. Replace the value of uriBase with the endpoint URL for the Get Thumbnail method from the Azure region where you obtained your subscription keys, if necessary. c. Optionally, replace the value of imageUrl with the URL of a different image from which you want to generate a thumbnail. 3. Save the code as a file with a .go extension. For example, get-thumbnail.go . 4. Open a command prompt window. 5. At the prompt, run the go build command to compile the package from the file. For example, go build get-thumbnail.go . 6. At the prompt, run the compiled package. For example, get-thumbnail . package main import ( "encoding/json" "fmt" "io/ioutil" "net/http" "strings" "time" ) func main() { // Replace with your valid subscription key. const subscriptionKey = "" // You must use the same Azure region in your REST API method as you used to // get your subscription keys. For example, if you got your subscription keys // from the West US region, replace "westcentralus" in the URL // from the West US region, replace "westcentralus" in the URL // below with "westus". // // Free trial subscription keys are generated in the "westus" region. // If you use a free trial subscription key, you shouldn't need to change // this region. const uriBase = "https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/generateThumbnail" const imageUrl = "https://upload.wikimedia.org/wikipedia/commons/9/94/Bloodhound_Puppy.jpg" const params = "?width=100&height=100&smartCropping=true" const uri = uriBase + params const imageUrlEnc = "{\"url\":\"" + imageUrl + "\"}" reader := strings.NewReader(imageUrlEnc) // Create the HTTP client client := &http.Client{ Timeout: time.Second * 2, } // Create the POST request, passing the image URL in the request body req, err := http.NewRequest("POST", uri, reader) if err != nil { panic(err) } // Add headers req.Header.Add("Content-Type", "application/json") req.Header.Add("Ocp-Apim-Subscription-Key", subscriptionKey) // Send the request and retrieve the response resp, err := client.Do(req) if err != nil { panic(err) } defer resp.Body.Close() // Read the response body. // Note, data is a byte array data, err := ioutil.ReadAll(resp.Body) if err != nil { panic(err) } // Parse the JSON data var f interface{} json.Unmarshal(data, &f) // Format and display the JSON result jsonFormatted, _ := json.MarshalIndent(f, "", " ") fmt.Println(string(jsonFormatted)) } Examine the response A successful response contains the thumbnail image binary data. If the request fails, the response contains an error code and a message to help determine what went wrong. Next steps Explore the Computer Vision API to analyze an image, detect celebrities and landmarks, create a thumbnail, and extract printed and handwritten text. To rapidly experiment with the Computer Vision API, try the Open API testing console. Explore the Computer Vision API Quickstart: Generate a thumbnail using the REST API and Java in Computer Vision 4/18/2019 • 4 minutes to read • Edit Online In this quickstart, you generate a thumbnail from an image by using Computer Vision's REST API. You specify the height and width, which can differ from the aspect ratio of the input image. Computer Vision uses smart cropping to intelligently identify the area of interest and generate cropping coordinates based on that region. If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have Java™ Platform, Standard Edition Development Kit 7 or 8 (JDK 7 or 8) installed. You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the sample application To create and run the sample, do the following steps: 1. Create a new Java project in your favorite IDE or editor. If the option is available, create the Java project from a command line application template. 2. Import the following libraries into your Java project. If you're using Maven, the Maven coordinates are provided for each library. Apache HTTP client (org.apache.httpcomponents:httpclient:4.5.5) Apache HTTP core (org.apache.httpcomponents:httpcore:4.4.9) JSON library (org.json:json:20180130) 3. Add the following import statements to the file that contains the Main public class for your project. import import import import import import import import import import import import import import import java.awt.*; javax.swing.*; java.net.URI; java.io.InputStream; javax.imageio.ImageIO; java.awt.image.BufferedImage; org.apache.http.HttpEntity; org.apache.http.HttpResponse; org.apache.http.client.methods.HttpPost; org.apache.http.entity.StringEntity; org.apache.http.client.utils.URIBuilder; org.apache.http.impl.client.CloseableHttpClient; org.apache.http.impl.client.HttpClientBuilder; org.apache.http.util.EntityUtils; org.json.JSONObject; 4. Replace the needed: Main public class with the following code, then make the following changes in code where a. Replace the value of subscriptionKey with your subscription key. b. Replace the value of uriBase with the endpoint URL for the Get Thumbnail method from the Azure region where you obtained your subscription keys, if necessary. c. Optionally, replace the value of imageToAnalyze with the URL of a different image for which you want to generate a thumbnail. 5. Save, then build the Java project. 6. If you're using an IDE, run Main . Otherwise, open a command prompt window and then use the command to run the compiled class. For example, java Main . // // // // This sample uses the following libraries: - Apache HTTP client (org.apache.httpcomponents:httpclient:4.5.5) - Apache HTTP core (org.apache.httpcomponents:httpccore:4.4.9) - JSON library (org.json:json:20180130). public // // // class Main { ********************************************** *** Update or verify the following values. *** ********************************************** // Replace with your valid subscription key. private static final String subscriptionKey = ""; // You must use the same Azure region in your REST API method as you used to // get your subscription keys. For example, if you got your subscription keys // from the West US region, replace "westcentralus" in the URL // below with "westus". // // Free trial subscription keys are generated in the "westus" region. // If you use a free trial subscription key, you shouldn't need to change // this region. private static final String uriBase = "https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/generateThumbnail"; private static final String imageToAnalyze = "https://upload.wikimedia.org/wikipedia/commons/9/94/Bloodhound_Puppy.jpg"; public static void main(String[] args) { CloseableHttpClient httpClient = HttpClientBuilder.create().build(); try { URIBuilder uriBuilder = new URIBuilder(uriBase); // Request parameters. uriBuilder.setParameter("width", "100"); uriBuilder.setParameter("height", "150"); uriBuilder.setParameter("smartCropping", "true"); // Prepare the URI for the REST API method. URI uri = uriBuilder.build(); HttpPost request = new HttpPost(uri); // Request headers. request.setHeader("Content-Type", "application/json"); request.setHeader("Ocp-Apim-Subscription-Key", subscriptionKey); // Request body. StringEntity requestEntity = new StringEntity("{\"url\":\"" + imageToAnalyze + "\"}"); request.setEntity(requestEntity); // Call the REST API method and get the response entity. HttpResponse response = httpClient.execute(request); HttpEntity entity = response.getEntity(); // Check for success. java // Check for success. if (response.getStatusLine().getStatusCode() == 200) { // Display the thumbnail. System.out.println("\nDisplaying thumbnail.\n"); displayImage(entity.getContent()); } else { // Format and display the JSON error message. String jsonString = EntityUtils.toString(entity); JSONObject json = new JSONObject(jsonString); System.out.println("Error:\n"); System.out.println(json.toString(2)); } } catch (Exception e) { System.out.println(e.getMessage()); } } // Displays the given input stream as an image. private static void displayImage(InputStream inputStream) { try { BufferedImage bufferedImage = ImageIO.read(inputStream); ImageIcon imageIcon = new ImageIcon(bufferedImage); JLabel jLabel = new JLabel(); jLabel.setIcon(imageIcon); JFrame jFrame = new JFrame(); jFrame.setLayout(new FlowLayout()); jFrame.setSize(100, 150); jFrame.add(jLabel); jFrame.setVisible(true); jFrame.setDefaultCloseOperation(JFrame.EXIT_ON_CLOSE); } catch (Exception e) { System.out.println(e.getMessage()); } } } Examine the response A successful response is returned as binary data, which represents the image data for the thumbnail. If the request succeeds, the thumbnail is generated from the binary data in the response and displayed in a separate window created by the sample application. If the request fails, the response is displayed in the console window. The response for the failed request contains an error code and a message to help determine what went wrong. Next steps Explore a Java Swing application that uses Computer Vision to perform optical character recognition (OCR ); create smart-cropped thumbnails; plus detect, categorize, tag, and describe visual features, including faces, in an image. To rapidly experiment with the Computer Vision API, try the Open API testing console. Computer Vision API Java Tutorial Quickstart: Generate a thumbnail using the REST API and JavaScript in Computer Vision 4/18/2019 • 3 minutes to read • Edit Online In this quickstart, you generate a thumbnail from an image by using Computer Vision's REST API. You specify the height and width, which can differ in aspect ratio from the input image. Computer Vision uses smart cropping to intelligently identify the area of interest and generate cropping coordinates based on that region. If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the sample To create and run the sample, do the following steps: 1. Copy the following code into a text editor. 2. Make the following changes in code where needed: a. Replace the value of subscriptionKey with your subscription key. b. Replace the value of uriBase with the endpoint URL for the Get Thumbnail method from the Azure region where you obtained your subscription keys, if necessary. c. Optionally, replace the value of the value attribute for the inputImage control with the URL of a different image that you want to analyze. 3. Save the code as a file with an .html extension. For example, get-thumbnail.html . 4. Open a browser window. 5. In the browser, drag and drop the file into the browser window. 6. When the webpage is displayed in the browser, choose the Generate thumbnail button. Thumbnail Sample

Generate thumbnail image:

Enter the URL to an image to use in creating a thumbnail image, then click the Generate thumbnail button.

Image for thumbnail:

Response:

Source image:

Thumbnail:

Examine the response A successful response is returned as binary data, which represents the image data for the thumbnail. If the request succeeds, the thumbnail is generated from the binary data in the response and displayed in the browser window. If the request fails, the response is displayed in the console window. The response for the failed request contains an error code and a message to help determine what went wrong. Next steps Explore a JavaScript application that uses Computer Vision to perform optical character recognition (OCR ); create smart-cropped thumbnails; plus detect, categorize, tag, and describe visual features, including faces, in an image. To rapidly experiment with the Computer Vision API, try the Open API testing console. Computer Vision API JavaScript Tutorial Quickstart: Generate a thumbnail using the REST API and Node.js in Computer Vision 4/18/2019 • 2 minutes to read • Edit Online In this quickstart, you generate a thumbnail from an image by using Computer Vision's REST API. With the Get Thumbnail method, you can generate a thumbnail of an image. You specify the height and width, which can differ from the aspect ratio of the input image. Computer Vision uses smart cropping to intelligently identify the area of interest and generate cropping coordinates based on that region. If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have Node.js 4.x or later installed. You must have npm installed. You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the sample To create and run the sample, do the following steps: 1. Install the npm request package. a. Open a command prompt window as an administrator. b. Run the following command: npm install request c. After the package is successfully installed, close the command prompt window. 2. Copy the following code into a text editor. 3. Make the following changes in code where needed: a. Replace the value of subscriptionKey with your subscription key. b. Replace the value of uriBase with the endpoint URL for the Get Thumbnail method from the Azure region where you obtained your subscription keys, if necessary. c. Optionally, replace the value of imageUrl with the URL of a different image that you want to analyze. 4. Save the code as a file with a .js extension. For example, get-thumbnail.js . 5. Open a command prompt window. 6. At the prompt, use the node command to run the file. For example, node get-thumbnail.js . 'use strict'; const request = require('request'); // Replace with your valid subscription key. const subscriptionKey = ''; // You must use the same location in your REST call as you used to get your // subscription keys. For example, if you got your subscription keys from // westus, replace "westcentralus" in the URL below with "westus". const uriBase = 'https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/generateThumbnail'; const imageUrl = 'https://upload.wikimedia.org/wikipedia/commons/9/94/Bloodhound_Puppy.jpg'; // Request parameters. const params = { 'width': '100', 'height': '100', 'smartCropping': 'true' }; const options = { uri: uriBase, qs: params, body: '{"url": ' + '"' + imageUrl + '"}', headers: { 'Content-Type': 'application/json', 'Ocp-Apim-Subscription-Key' : subscriptionKey } }; request.post(options, (error, response, body) => { if (error) { console.log('Error: ', error); return; } }); Examine the response A successful response is returned as binary data, which represents the image data for the thumbnail. If the request fails, the response is displayed in the console window. The response for the failed request contains an error code and a message to help determine what went wrong. Next steps Explore the Computer Vision API used to analyze an image, detect celebrities and landmarks, create a thumbnail, and extract printed and handwritten text. To rapidly experiment with the Computer Vision API, try the Open API testing console. Explore the Computer Vision API Quickstart: Generate a thumbnail using the REST API and PHP in Computer Vision 4/18/2019 • 2 minutes to read • Edit Online In this quickstart, you generate a thumbnail from an image by using Computer Vision's REST API. With the Get Thumbnail method, you can generate a thumbnail of an image. You specify the height and width, which can differ from the aspect ratio of the input image. Computer Vision uses smart cropping to intelligently identify the area of interest and generate cropping coordinates based on that region. If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have PHP installed. You must have Pear installed. You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the sample To create and run the sample, do the following steps: 1. Install the PHP5 HTTP_Request2 package. a. Open a command prompt window as an administrator. b. Run the following command: pear install HTTP_Request2 c. After the package is successfully installed, close the command prompt window. 2. Copy the following code into a text editor. 3. Make the following changes in code where needed: a. Replace the value of subscriptionKey with your subscription key. b. Replace the value of uriBase with the endpoint URL for the Get Thumbnail method from the Azure region where you obtained your subscription keys, if necessary. c. Optionally, replace the value of imageUrl with the URL of a different image for which you want to generate a thumbnail. 4. Save the code as a file with a .php extension. For example, get-thumbnail.php . 5. Open a browser window with PHP support. 6. Drag and drop the file into the browser window. Get Thumbnail Sample with a valid subscription key. $ocpApimSubscriptionKey = ''; // You must use the same location in your REST call as you used to obtain // your subscription keys. For example, if you obtained your subscription keys // from westus, replace "westcentralus" in the URL below with "westus". $uriBase = 'https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/'; $imageUrl = 'https://upload.wikimedia.org/wikipedia/commons/9/94/Bloodhound_Puppy.jpg'; require_once 'HTTP/Request2.php'; $request = new Http_Request2($uriBase . 'generateThumbnail'); $url = $request->getUrl(); $headers = array( // Request headers 'Content-Type' => 'application/json', 'Ocp-Apim-Subscription-Key' => $ocpApimSubscriptionKey ); $request->setHeader($headers); $parameters = array( // Request parameters 'width' => '100', // Width of the thumbnail. 'height' => '100', // Height of the thumbnail. 'smartCropping' => 'true', ); $url->setQueryVariables($parameters); $request->setMethod(HTTP_Request2::METHOD_POST); // Request body parameters $body = json_encode(array('url' => $imageUrl)); // Request body $request->setBody($body); try { $response = $request->send(); echo "

" .
json_encode(json_decode($response->getBody()), JSON_PRETTY_PRINT) . "

"; } catch (HttpException $ex) { echo "

" . $ex . "

"; } ?> Examine the response A successful response is returned as binary data, which represents the image data for the thumbnail. If the request fails, the response is displayed in the browser window. The response for the failed request contains an error code and a message to help determine what went wrong. Clean up resources When no longer needed, delete the file, and then uninstall the PHP5 package, do the following steps: HTTP_Request2 package. To uninstall the 1. Open a command prompt window as an administrator. 2. Run the following command: pear uninstall HTTP_Request2 3. After the package is successfully uninstalled, close the command prompt window. Next steps Explore the Computer Vision API to analyze an image, detect celebrities and landmarks, create a thumbnail, and extract printed and handwritten text. To rapidly experiment with the Computer Vision API, try the Open API testing console. Explore the Computer Vision API Quickstart: Generate a thumbnail using the REST API and Python in Computer Vision 4/18/2019 • 2 minutes to read • Edit Online In this quickstart, you will generate a thumbnail from an image using Computer Vision's REST API. With the Get Thumbnail method, you can specify the desired height and width, and Computer Vision uses smart cropping to intelligently identify the area of interest and generate cropping coordinates based on that region. If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. A code editor such as Visual Studio Code Create and run the sample To create and run the sample, copy the following code into the code editor. import requests # If you are using a Jupyter notebook, uncomment the following line. #%matplotlib inline import matplotlib.pyplot as plt from PIL import Image from io import BytesIO # Replace with your valid subscription key. subscription_key = "" assert subscription_key # You must use the same region in your REST call as you used to get your # subscription keys. For example, if you got your subscription keys from # westus, replace "westcentralus" in the URI below with "westus". # # Free trial subscription keys are generated in the "westus" region. # If you use a free trial subscription key, you shouldn't need to change # this region. vision_base_url = "https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/" thumbnail_url = vision_base_url + "generateThumbnail" # Set image_url to the URL of an image that you want to analyze. image_url = "https://upload.wikimedia.org/wikipedia/commons/9/94/Bloodhound_Puppy.jpg" headers = {'Ocp-Apim-Subscription-Key': subscription_key} params = {'width': '50', 'height': '50', 'smartCropping': 'true'} data = {'url': image_url} response = requests.post(thumbnail_url, headers=headers, params=params, json=data) response.raise_for_status() thumbnail = Image.open(BytesIO(response.content)) # Display the thumbnail. plt.imshow(thumbnail) plt.axis("off") # Verify the thumbnail size. print("Thumbnail is {0}-by-{1}".format(*thumbnail.size)) Next, do the following: 1. Replace the value of subscription_key with your subscription key. 2. Replace the value of vision_base_url with the endpoint URL for the Computer Vision resource in the Azure region where you obtained your subscription keys, if necessary. 3. Optionally, replace the value of image_url with the URL of a different image for which you want to generate a thumbnail. 4. Save the code as a file with an .py extension. For example, get-thumbnail.py . 5. Open a command prompt window. 6. At the prompt, use the python command to run the sample. For example, python get-thumbnail.py . Examine the response A successful response is returned as binary data which represents the image data for the thumbnail. The sample should display this image. If the request fails, the response is displayed in the command prompt window and should contain an error code. Run in Jupyter (optional) You can optionally run this quickstart in a step-by step fashion using a Jupyter notebook on MyBinder. To launch Binder, select the following button: launch binder Next steps Next, learn more detailed information about the thumbnail generation feature. Generating thumbnails Quickstart: Generate a thumbnail using the REST API and Ruby in Computer Vision 4/18/2019 • 2 minutes to read • Edit Online In this quickstart, you generate a thumbnail from an image by using Computer Vision's REST API. With the Get Thumbnail method, you can generate a thumbnail of an image. You specify the height and width, which can differ from the aspect ratio of the input image. Computer Vision uses smart cropping to intelligently identify the area of interest and generate cropping coordinates based on that region. If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have Ruby 2.4.x or later installed. You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the sample To create and run the sample, do the following steps: 1. Copy the following code into a text editor. 2. Make the following changes in code where needed: a. Replace with your subscription key. b. Replace https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/analyze with the endpoint URL for the Get Thumbnail method in the Azure region where you obtained your subscription keys, if necessary. c. Optionally, replace https://upload.wikimedia.org/wikipedia/commons/thumb/5/56/Shorkie_Poo_Puppy.jpg/1280pxShorkie_Poo_Puppy.jpg\ with the URL of a different image for which you want to generate a thumbnail. 3. Save the code as a file with an .rb extension. For example, get-thumbnail.rb . 4. Open a command prompt window. 5. At the prompt, use the ruby command to run the sample. For example, ruby get-thumbnail.rb . require 'net/http' # You must use the same location in your REST call as you used to get your # subscription keys. For example, if you got your subscription keys from westus, # replace "westcentralus" in the URL below with "westus". uri = URI('https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/generateThumbnail') uri.query = URI.encode_www_form({ # Request parameters 'width' => '100', 'height' => '100', 'smartCropping' => 'true' }) request = Net::HTTP::Post.new(uri.request_uri) # Request headers # Replace with your valid subscription key. request['Ocp-Apim-Subscription-Key'] = '' request['Content-Type'] = 'application/json' request.body = "{\"url\": \"https://upload.wikimedia.org/wikipedia/commons/thumb/5/56/" + "Shorkie_Poo_Puppy.jpg/1280px-Shorkie_Poo_Puppy.jpg\"}" response = Net::HTTP.start(uri.host, uri.port, :use_ssl => uri.scheme == 'https') do |http| http.request(request) end #puts response.body Examine the response A successful response is returned as binary data, which represents the image data for the thumbnail. If the request fails, the response is displayed in the console window. The response for the failed request contains an error code and a message to help determine what went wrong. Next steps Explore the Computer Vision API used to analyze an image, detect celebrities and landmarks, create a thumbnail, and extract printed and handwritten text. To rapidly experiment with the Computer Vision API, try the Open API testing console. Explore the Computer Vision API Quickstart: Extract printed text (OCR) using the REST API and C# in Computer Vision 4/18/2019 • 4 minutes to read • Edit Online In this quickstart, you will extract printed text with optical character recognition (OCR ) from an image by using Computer Vision's REST API. With the OCR feature, you can detect printed text in an image and extract recognized characters into a machine-usable character stream. If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have Visual Studio 2015 or later. You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the sample application To create the sample in Visual Studio, do the following steps: 1. Create a new Visual Studio solution in Visual Studio, using the Visual C# Console App template. 2. Install the Newtonsoft.Json NuGet package. a. On the menu, click Tools, select NuGet Package Manager, then Manage NuGet Packages for Solution. b. Click the Browse tab, and in the Search box type "Newtonsoft.Json". c. Select Newtonsoft.Json when it displays, then click the checkbox next to your project name, and Install. 3. Replace the code in Program.cs with the following code, and then make the following changes in code where needed: a. Replace the value of subscriptionKey with your subscription key. b. Replace the value of uriBase with the endpoint URL for the OCR method from the Azure region where you obtained your subscription keys, if necessary. 4. Run the program. 5. At the prompt, enter the path to a local image. using using using using using using Newtonsoft.Json.Linq; System; System.IO; System.Net.Http; System.Net.Http.Headers; System.Threading.Tasks; namespace CSHttpClientSample { static class Program { // Replace with your valid subscription key. const string subscriptionKey = ""; // You must use the same Azure region in your REST API method as you used to // get your subscription keys. For example, if you got your subscription keys // from the West US region, replace "westcentralus" in the URL // from the West US region, replace "westcentralus" in the URL // below with "westus". // // Free trial subscription keys are generated in the "westus" region. // If you use a free trial subscription key, you shouldn't need to change // this region. const string uriBase = "https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/ocr"; static void Main() { // Get the path and filename to process from the user. Console.WriteLine("Optical Character Recognition:"); Console.Write("Enter the path to an image with text you wish to read: "); string imageFilePath = Console.ReadLine(); if (File.Exists(imageFilePath)) { // Call the REST API method. Console.WriteLine("\nWait a moment for the results to appear.\n"); MakeOCRRequest(imageFilePath).Wait(); } else { Console.WriteLine("\nInvalid file path"); } Console.WriteLine("\nPress Enter to exit..."); Console.ReadLine(); } ///

/// Gets the text visible in the specified image file by using /// the Computer Vision REST API. ///

/// The image file with printed text. static async Task MakeOCRRequest(string imageFilePath) { try { HttpClient client = new HttpClient(); // Request headers. client.DefaultRequestHeaders.Add( "Ocp-Apim-Subscription-Key", subscriptionKey); // Request parameters. // The language parameter doesn't specify a language, so the // method detects it automatically. // The detectOrientation parameter is set to true, so the method detects and // and corrects text orientation before detecting text. string requestParameters = "language=unk&detectOrientation=true"; // Assemble the URI for the REST API method. string uri = uriBase + "?" + requestParameters; HttpResponseMessage response; // Read the contents of the specified local image // into a byte array. byte[] byteData = GetImageAsByteArray(imageFilePath); // Add the byte array as an octet stream to the request body. using (ByteArrayContent content = new ByteArrayContent(byteData)) { // This example uses the "application/octet-stream" content type. // The other content types you can use are "application/json" // and "multipart/form-data". content.Headers.ContentType = new MediaTypeHeaderValue("application/octet-stream"); // Asynchronously call the REST API method. response = await client.PostAsync(uri, content); } // Asynchronously get the JSON response. string contentString = await response.Content.ReadAsStringAsync(); // Display the JSON response. Console.WriteLine("\nResponse:\n\n{0}\n", JToken.Parse(contentString).ToString()); } catch (Exception e) { Console.WriteLine("\n" + e.Message); } } ///

/// Returns the contents of the specified file as a byte array. ///

/// The image file to read. /// The byte array of the image data. static byte[] GetImageAsByteArray(string imageFilePath) { // Open a read-only file stream for the specified file. using (FileStream fileStream = new FileStream(imageFilePath, FileMode.Open, FileAccess.Read)) { // Read the file's contents into a byte array. BinaryReader binaryReader = new BinaryReader(fileStream); return binaryReader.ReadBytes((int)fileStream.Length); } } } } Examine the response A successful response is returned in JSON. The sample application parses and displays a successful response in the console window, similar to the following example: { "language": "en", "textAngle": -1.5000000000000335, "orientation": "Up", "regions": [ { "boundingBox": "154,49,351,575", "lines": [ { "boundingBox": "165,49,340,117", "words": [ { "boundingBox": "165,49,63,109", "text": "A" }, { "boundingBox": "261,50,244,116", "text": "GOAL" } ] }, { "boundingBox": "165,169,339,93", "words": [ { "boundingBox": "165,169,339,93", "text": "WITHOUT" } ] }, { "boundingBox": "159,264,342,117", "words": [ { "boundingBox": "159,264,64,110", "text": "A" }, { "boundingBox": "255,266,246,115", "text": "PLAN" } ] }, { "boundingBox": "161,384,338,119", "words": [ { "boundingBox": "161,384,86,113", "text": "IS" }, { "boundingBox": "274,387,225,116", "text": "JUST" } ] }, { "boundingBox": "154,506,341,118", "words": [ { "boundingBox": "154,506,62,111", "text": "A" }, { "boundingBox": "248,508,247,116", "text": "WISH" } ] } ] } ] } Clean up resources When no longer needed, delete the Visual Studio solution. To do so, open File Explorer, navigate to the folder in which you created the Visual Studio solution, and delete the folder. Next steps Explore a basic Windows application that uses Computer Vision to perform optical character recognition (OCR ); create smart-cropped thumbnails; plus detect, categorize, tag, and describe visual features, including faces, in an image. Computer Vision API C# Tutorial Quickstart: Extract printed text (OCR) using the REST API and cURL in Computer Vision 4/18/2019 • 2 minutes to read • Edit Online In this quickstart, you extract printed text with optical character recognition (OCR ) from an image by using Computer Vision's REST API. With the OCR method, you can detect printed text in an image and extract recognized characters into a machine-usable character stream. If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have cURL. You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the sample command To create and run the sample, do the following steps: 1. Copy the following command into a text editor. 2. Make the following changes in the command where needed: a. Replace the value of with your subscription key. b. Replace the request URL ( https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/ocr ) with the endpoint URL for the OCR method from the Azure region where you obtained your subscription keys, if necessary. c. Optionally, change the image URL in the request body ( https://upload.wikimedia.org/wikipedia/commons/thumb/a/af/Atomist_quote_from_Democritus.png/338pxAtomist_quote_from_Democritus.png\ ) to the URL of a different image to be analyzed. 3. Open a command prompt window. 4. Paste the command from the text editor into the command prompt window, and then run the command. curl -H "Ocp-Apim-Subscription-Key: " -H "Content-Type: application/json" "https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/ocr?language=unk&detectOrientation=true" -d " {\"url\":\"https://upload.wikimedia.org/wikipedia/commons/thumb/a/af/Atomist_quote_from_Democritus.png/338pxAtomist_quote_from_Democritus.png\"}" Examine the response A successful response is returned in JSON. The sample application parses and displays a successful response in the command prompt window, similar to the following example: { "language": "en", "orientation": "Up", "textAngle": 0, "regions": [ { { "boundingBox": "21,16,304,451", "lines": [ { "boundingBox": "28,16,288,41", "words": [ { "boundingBox": "28,16,288,41", "text": "NOTHING" } ] }, { "boundingBox": "27,66,283,52", "words": [ { "boundingBox": "27,66,283,52", "text": "EXISTS" } ] }, { "boundingBox": "27,128,292,49", "words": [ { "boundingBox": "27,128,292,49", "text": "EXCEPT" } ] }, { "boundingBox": "24,188,292,54", "words": [ { "boundingBox": "24,188,292,54", "text": "ATOMS" } ] }, { "boundingBox": "22,253,297,32", "words": [ { "boundingBox": "22,253,105,32", "text": "AND" }, { "boundingBox": "144,253,175,32", "text": "EMPTY" } ] }, { "boundingBox": "21,298,304,60", "words": [ { "boundingBox": "21,298,304,60", "text": "SPACE." } ] }, { "boundingBox": "26,387,294,37", "words": [ { "boundingBox": "26,387,210,37", "text": "Everything" }, { "boundingBox": "249,389,71,27", "boundingBox": "249,389,71,27", "text": "else" } ] }, { "boundingBox": "127,431,198,36", "words": [ { "boundingBox": "127,431,31,29", "text": "is" }, { "boundingBox": "172,431,153,36", "text": "opinion." } ] } ] } ] } Next steps Explore the Computer Vision API used to analyze an image, detect celebrities and landmarks, create a thumbnail, and extract printed and handwritten text. To rapidly experiment with the Computer Vision API, try the Open API testing console. Explore the Computer Vision API Quickstart: Extract printed text (OCR) using the REST API and Go in Computer Vision 4/18/2019 • 3 minutes to read • Edit Online In this quickstart, you extract printed text with optical character recognition (OCR ) from an image by using Computer Vision's REST API. With the OCR method, you can detect printed text in an image and extract recognized characters into a machine-usable character stream. If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have Go installed. You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the sample To create and run the sample, do the following steps: 1. Copy the following code into a text editor. 2. Make the following changes in code where needed: a. Replace the value of subscriptionKey with your subscription key. b. Replace the value of uriBase with the endpoint URL for the OCR method from the Azure region where you obtained your subscription keys, if necessary. c. Optionally, replace the value of imageUrl with the URL of a different image that you want to analyze. 3. Save the code as a file with a .go extension. For example, get-printed-text.go . 4. Open a command prompt window. 5. At the prompt, run the go build command to compile the package from the file. For example, go build get-printed-text.go . 6. At the prompt, run the compiled package. For example, get-printed-text . package main import ( "encoding/json" "fmt" "io/ioutil" "net/http" "strings" "time" ) func main() { // Replace with your valid subscription key. const subscriptionKey = "" // // // // You must use the same Azure region in your REST API method as you used to get your subscription keys. For example, if you got your subscription keys from the West US region, replace "westcentralus" in the URL below with "westus". // // Free trial subscription keys are generated in the "westus" region. // If you use a free trial subscription key, you shouldn't need to change // this region. const uriBase = "https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/ocr" const imageUrl = "https://upload.wikimedia.org/wikipedia/commons/thumb/a/af/" + "Atomist_quote_from_Democritus.png/338px-Atomist_quote_from_Democritus.png" const params = "?language=unk&detectOrientation=true" const uri = uriBase + params const imageUrlEnc = "{\"url\":\"" + imageUrl + "\"}" reader := strings.NewReader(imageUrlEnc) // Create the Http client client := &http.Client{ Timeout: time.Second * 2, } // Create the Post request, passing the image URL in the request body req, err := http.NewRequest("POST", uri, reader) if err != nil { panic(err) } // Add headers req.Header.Add("Content-Type", "application/json") req.Header.Add("Ocp-Apim-Subscription-Key", subscriptionKey) // Send the request and retrieve the response resp, err := client.Do(req) if err != nil { panic(err) } defer resp.Body.Close() // Read the response body. // Note, data is a byte array data, err := ioutil.ReadAll(resp.Body) if err != nil { panic(err) } // Parse the Json data var f interface{} json.Unmarshal(data, &f) // Format and display the Json result jsonFormatted, _ := json.MarshalIndent(f, "", " ") fmt.Println(string(jsonFormatted)) } Examine the response A successful response is returned in JSON. The sample application parses and displays a successful response in the command prompt window, similar to the following example: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "21,16,304,451", "lines": [ "lines": [ { "boundingBox": "28,16,288,41", "words": [ { "boundingBox": "28,16,288,41", "text": "NOTHING" } ] }, { "boundingBox": "27,66,283,52", "words": [ { "boundingBox": "27,66,283,52", "text": "EXISTS" } ] }, { "boundingBox": "27,128,292,49", "words": [ { "boundingBox": "27,128,292,49", "text": "EXCEPT" } ] }, { "boundingBox": "24,188,292,54", "words": [ { "boundingBox": "24,188,292,54", "text": "ATOMS" } ] }, { "boundingBox": "22,253,297,32", "words": [ { "boundingBox": "22,253,105,32", "text": "AND" }, { "boundingBox": "144,253,175,32", "text": "EMPTY" } ] }, { "boundingBox": "21,298,304,60", "words": [ { "boundingBox": "21,298,304,60", "text": "SPACE." } ] }, { "boundingBox": "26,387,294,37", "words": [ { "boundingBox": "26,387,210,37", "text": "Everything" }, { "boundingBox": "249,389,71,27", "text": "else" } } ] }, { "boundingBox": "127,431,198,36", "words": [ { "boundingBox": "127,431,31,29", "text": "is" }, { "boundingBox": "172,431,153,36", "text": "opinion." } ] } ] } ], "textAngle": 0 } Next steps Explore the Computer Vision API used to analyze an image, detect celebrities and landmarks, create a thumbnail, and extract printed and handwritten text. To rapidly experiment with the Computer Vision API, try the Open API testing console. Explore the Computer Vision API Quickstart: Extract printed text (OCR) using the REST API and Java in Computer Vision 4/18/2019 • 3 minutes to read • Edit Online In this quickstart, you extract printed text with optical character recognition (OCR ) from an image by using Computer Vision's REST API. With the OCR method, you can detect printed text in an image and extract recognized characters into a machine-usable character stream. If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have Java™ Platform, Standard Edition Development Kit 7 or 8 (JDK 7 or 8) installed. You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the sample application To create and run the sample, do the following steps: 1. Create a new Java project in your favorite IDE or editor. If the option is available, create the Java project from a command line application template. 2. Import the following libraries into your Java project. If you're using Maven, the Maven coordinates are provided for each library. Apache HTTP client (org.apache.httpcomponents:httpclient:4.5.5) Apache HTTP core (org.apache.httpcomponents:httpcore:4.4.9) JSON library (org.json:json:20180130) 3. Add the following import statements to the file that contains the Main public class for your project. import import import import import import import import import import java.net.URI; org.apache.http.HttpEntity; org.apache.http.HttpResponse; org.apache.http.client.methods.HttpPost; org.apache.http.entity.StringEntity; org.apache.http.client.utils.URIBuilder; org.apache.http.impl.client.CloseableHttpClient; org.apache.http.impl.client.HttpClientBuilder; org.apache.http.util.EntityUtils; org.json.JSONObject; 4. Replace the needed: Main public class with the following code, then make the following changes in code where a. Replace the value of subscriptionKey with your subscription key. b. Replace the value of uriBase with the endpoint URL for the OCR method from the Azure region where you obtained your subscription keys, if necessary. c. Optionally, replace the value of imageToAnalyze with the URL of a different image from which you want to extract printed text. 5. Save, then build the Java project. 6. If you're using an IDE, run Main . Otherwise, open a command prompt window and then use the command to run the compiled class. For example, java Main . java public // // // class Main { ********************************************** *** Update or verify the following values. *** ********************************************** // Replace with your valid subscription key. private static final String subscriptionKey = ""; // You must use the same Azure region in your REST API method as you used to // get your subscription keys. For example, if you got your subscription keys // from the West US region, replace "westcentralus" in the URL // below with "westus". // // Free trial subscription keys are generated in the "westus" region. // If you use a free trial subscription key, you shouldn't need to change // this region. private static final String uriBase = "https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/ocr"; private static final String imageToAnalyze = "https://upload.wikimedia.org/wikipedia/commons/thumb/a/af/" + "Atomist_quote_from_Democritus.png/338px-Atomist_quote_from_Democritus.png"; public static void main(String[] args) { CloseableHttpClient httpClient = HttpClientBuilder.create().build(); try { URIBuilder uriBuilder = new URIBuilder(uriBase); uriBuilder.setParameter("language", "unk"); uriBuilder.setParameter("detectOrientation", "true"); // Request parameters. URI uri = uriBuilder.build(); HttpPost request = new HttpPost(uri); // Request headers. request.setHeader("Content-Type", "application/json"); request.setHeader("Ocp-Apim-Subscription-Key", subscriptionKey); // Request body. StringEntity requestEntity = new StringEntity("{\"url\":\"" + imageToAnalyze + "\"}"); request.setEntity(requestEntity); // Call the REST API method and get the response entity. HttpResponse response = httpClient.execute(request); HttpEntity entity = response.getEntity(); if (entity != null) { // Format and display the JSON response. String jsonString = EntityUtils.toString(entity); JSONObject json = new JSONObject(jsonString); System.out.println("REST Response:\n"); System.out.println(json.toString(2)); } } catch (Exception e) { // Display error message. System.out.println(e.getMessage()); } } } Examine the response A successful response is returned in JSON. The sample application parses and displays a successful response in the console window, similar to the following example: REST Response: { "orientation": "Up", "regions": [{ "boundingBox": "21,16,304,451", "lines": [ { "boundingBox": "28,16,288,41", "words": [{ "boundingBox": "28,16,288,41", "text": "NOTHING" }] }, { "boundingBox": "27,66,283,52", "words": [{ "boundingBox": "27,66,283,52", "text": "EXISTS" }] }, { "boundingBox": "27,128,292,49", "words": [{ "boundingBox": "27,128,292,49", "text": "EXCEPT" }] }, { "boundingBox": "24,188,292,54", "words": [{ "boundingBox": "24,188,292,54", "text": "ATOMS" }] }, { "boundingBox": "22,253,297,32", "words": [ { "boundingBox": "22,253,105,32", "text": "AND" }, { "boundingBox": "144,253,175,32", "text": "EMPTY" } ] }, { "boundingBox": "21,298,304,60", "words": [{ "boundingBox": "21,298,304,60", "text": "SPACE." }] }, { "boundingBox": "26,387,294,37", "words": [ { "boundingBox": "26,387,210,37", "text": "Everything" }, { "boundingBox": "249,389,71,27", "text": "else" "text": "else" } ] }, { "boundingBox": "127,431,198,36", "words": [ { "boundingBox": "127,431,31,29", "text": "is" }, { "boundingBox": "172,431,153,36", "text": "opinion." } ] } ] }], "textAngle": 0, "language": "en" } Clean up resources When no longer needed, delete the Java project, including the compiled class and imported libraries. Next steps Explore a Java Swing application that uses Computer Vision to perform optical character recognition (OCR ); create smart-cropped thumbnails; plus detect, categorize, tag, and describe visual features, including faces, in an image. To rapidly experiment with the Computer Vision API, try the Open API testing console. Computer Vision API Java Tutorial Quickstart: Extract printed text (OCR) using the REST API and JavaScript in Computer Vision 4/18/2019 • 3 minutes to read • Edit Online In this quickstart, you extract printed text with optical character recognition (OCR ) from an image by using Computer Vision's REST API. With the OCR method, you can detect printed text in an image and extract recognized characters into a machine-usable character stream. If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the sample To create and run the sample, do the following steps: 1. Copy the following code into a text editor. 2. Make the following changes in code where needed: a. Replace the value of subscriptionKey with your subscription key. b. Replace the value of uriBase with the endpoint URL for the OCR method from the Azure region where you obtained your subscription keys, if necessary. c. Optionally, replace the value of the value attribute for the inputImage control with the URL of a different image that you want to analyze. 3. Save the code as a file with an .html extension. For example, get-printed-text.html . 4. Open a browser window. 5. In the browser, drag and drop the file into the browser window. 6. When the webpage is displayed in the browser, choose the Read image button. OCR Sample

Optical Character Recognition (OCR):

Enter the URL to an image of printed text, then click the Read image button.

Image to read:

Response:

Source image: Source image:

Examine the response A successful response is returned in JSON. The sample webpage parses and displays a successful response in the browser window, similar to the following example: { "language": "en", "orientation": "Up", "textAngle": 0, "regions": [ { "boundingBox": "21,16,304,451", "lines": [ { "boundingBox": "28,16,288,41", "words": [ { "boundingBox": "28,16,288,41", "text": "NOTHING" } ] }, { "boundingBox": "27,66,283,52", "words": [ { "boundingBox": "27,66,283,52", "text": "EXISTS" } ] }, { "boundingBox": "27,128,292,49", "words": [ { "boundingBox": "27,128,292,49", "text": "EXCEPT" } ] }, { "boundingBox": "24,188,292,54", "words": [ { "boundingBox": "24,188,292,54", "text": "ATOMS" } ] }, { "boundingBox": "22,253,297,32", "words": [ { "boundingBox": "22,253,105,32", "text": "AND" }, { "boundingBox": "144,253,175,32", "text": "EMPTY" "text": "EMPTY" } ] }, { "boundingBox": "21,298,304,60", "words": [ { "boundingBox": "21,298,304,60", "text": "SPACE." } ] }, { "boundingBox": "26,387,294,37", "words": [ { "boundingBox": "26,387,210,37", "text": "Everything" }, { "boundingBox": "249,389,71,27", "text": "else" } ] }, { "boundingBox": "127,431,198,36", "words": [ { "boundingBox": "127,431,31,29", "text": "is" }, { "boundingBox": "172,431,153,36", "text": "opinion." } ] } ] } ] } Next steps Explore a JavaScript application that uses Computer Vision to perform optical character recognition (OCR ); create smart-cropped thumbnails; plus detect, categorize, tag, and describe visual features, including faces, in an image. To rapidly experiment with the Computer Vision API, try the Open API testing console. Computer Vision API JavaScript Tutorial Quickstart: Extract printed text (OCR) using the REST API and Node.js in Computer Vision 4/18/2019 • 3 minutes to read • Edit Online In this quickstart, you extract printed text with optical character recognition (OCR ) from an image by using Computer Vision's REST API. With the OCR method, you can detect printed text in an image and extract recognized characters into a machine-usable character stream. If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have Node.js 4.x or later installed. You must have npm installed. You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the sample To create and run the sample, do the following steps: 1. Install the npm request package. a. Open a command prompt window as an administrator. b. Run the following command: npm install request c. After the package is successfully installed, close the command prompt window. 2. Copy the following code into a text editor. 3. Make the following changes in code where needed: a. Replace the value of subscriptionKey with your subscription key. b. Replace the value of uriBase with the endpoint URL for the OCR method from the Azure region where you obtained your subscription keys, if necessary. c. Optionally, replace the value of imageUrl with the URL of a different image from which you want to extract printed text. 4. Save the code as a file with a .js extension. For example, get-printed-text.js . 5. Open a command prompt window. 6. At the prompt, use the node command to run the file. For example, node get-printed-text.js . 'use strict'; const request = require('request'); // Replace with your valid subscription key. const subscriptionKey = ''; // You must use the same location in your REST call as you used to get your // subscription keys. For example, if you got your subscription keys from // westus, replace "westcentralus" in the URL below with "westus". const uriBase = 'https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/ocr'; const imageUrl = 'https://upload.wikimedia.org/wikipedia/commons/thumb/a/af/' + 'Atomist_quote_from_Democritus.png/338px-Atomist_quote_from_Democritus.png'; // Request parameters. const params = { 'language': 'unk', 'detectOrientation': 'true', }; const options = { uri: uriBase, qs: params, body: '{"url": ' + '"' + imageUrl + '"}', headers: { 'Content-Type': 'application/json', 'Ocp-Apim-Subscription-Key' : subscriptionKey } }; request.post(options, (error, response, body) => { if (error) { console.log('Error: ', error); return; } let jsonResponse = JSON.stringify(JSON.parse(body), null, ' '); console.log('JSON Response\n'); console.log(jsonResponse); }); Examine the response A successful response is returned in JSON. The sample parses and displays a successful response in the command prompt window, similar to the following example: { "language": "en", "orientation": "Up", "textAngle": 0, "regions": [ { "boundingBox": "21,16,304,451", "lines": [ { "boundingBox": "28,16,288,41", "words": [ { "boundingBox": "28,16,288,41", "text": "NOTHING" } ] }, { { "boundingBox": "27,66,283,52", "words": [ { "boundingBox": "27,66,283,52", "text": "EXISTS" } ] }, { "boundingBox": "27,128,292,49", "words": [ { "boundingBox": "27,128,292,49", "text": "EXCEPT" } ] }, { "boundingBox": "24,188,292,54", "words": [ { "boundingBox": "24,188,292,54", "text": "ATOMS" } ] }, { "boundingBox": "22,253,297,32", "words": [ { "boundingBox": "22,253,105,32", "text": "AND" }, { "boundingBox": "144,253,175,32", "text": "EMPTY" } ] }, { "boundingBox": "21,298,304,60", "words": [ { "boundingBox": "21,298,304,60", "text": "SPACE." } ] }, { "boundingBox": "26,387,294,37", "words": [ { "boundingBox": "26,387,210,37", "text": "Everything" }, { "boundingBox": "249,389,71,27", "text": "else" } ] }, { "boundingBox": "127,431,198,36", "words": [ { "boundingBox": "127,431,31,29", "text": "is" }, { { "boundingBox": "172,431,153,36", "text": "opinion." } ] } ] } ] } Clean up resources When no longer needed, delete the file, and then uninstall the npm the following steps: request package. To uninstall the package, do 1. Open a command prompt window as an administrator. 2. Run the following command: npm uninstall request 3. After the package is successfully uninstalled, close the command prompt window. Next steps Explore the Computer Vision API used to analyze an image, detect celebrities and landmarks, create a thumbnail, and extract printed and handwritten text. To rapidly experiment with the Computer Vision API, try the Open API testing console. Explore the Computer Vision API Quickstart: Extract printed text (OCR) using the REST API and PHP in Computer Vision 4/18/2019 • 3 minutes to read • Edit Online In this quickstart, you extract printed text with optical character recognition (OCR ) from an image by using Computer Vision's REST API. With the OCR method, you can detect printed text in an image and extract recognized characters into a machine-usable character stream. If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have PHP installed. You must have Pear installed. You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the sample To create and run the sample, do the following steps: 1. Install the PHP5 HTTP_Request2 package. a. Open a command prompt window as an administrator. b. Run the following command: pear install HTTP_Request2 c. After the package is successfully installed, close the command prompt window. 2. Copy the following code into a text editor. 3. Make the following changes in code where needed: a. Replace the value of subscriptionKey with your subscription key. b. Replace the value of uriBase with the endpoint URL for the OCR method from the Azure region where you obtained your subscription keys, if necessary. c. Optionally, replace the value of imageUrl with the URL of a different image from which you want to extract printed text. 4. Save the code as a file with a .php extension. For example, get-printed-text.php . 5. Open a browser window with PHP support. 6. Drag and drop the file into the browser window. OCR Sample with a valid subscription key. $ocpApimSubscriptionKey = ''; // You must use the same location in your REST call as you used to obtain // your subscription keys. For example, if you obtained your subscription keys // from westus, replace "westcentralus" in the URL below with "westus". $uriBase = 'https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/'; $imageUrl = 'https://upload.wikimedia.org/wikipedia/commons/thumb/a/af/' . 'Atomist_quote_from_Democritus.png/338px-Atomist_quote_from_Democritus.png'; require_once 'HTTP/Request2.php'; $request = new Http_Request2($uriBase . 'ocr'); $url = $request->getUrl(); $headers = array( // Request headers 'Content-Type' => 'application/json', 'Ocp-Apim-Subscription-Key' => $ocpApimSubscriptionKey ); $request->setHeader($headers); $parameters = array( // Request parameters 'language' => 'unk', 'detectOrientation' => 'true' ); $url->setQueryVariables($parameters); $request->setMethod(HTTP_Request2::METHOD_POST); // Request body parameters $body = json_encode(array('url' => $imageUrl)); // Request body $request->setBody($body); try { $response = $request->send(); echo "

" .
json_encode(json_decode($response->getBody()), JSON_PRETTY_PRINT) . "

"; } catch (HttpException $ex) { echo "

" . $ex . "

"; } ?> Examine the response A successful response is returned in JSON. The sample website parses and displays a successful response in the browser window, similar to the following example: { { "language": "en", "orientation": "Up", "textAngle": 0, "regions": [ { "boundingBox": "21,16,304,451", "lines": [ { "boundingBox": "28,16,288,41", "words": [ { "boundingBox": "28,16,288,41", "text": "NOTHING" } ] }, { "boundingBox": "27,66,283,52", "words": [ { "boundingBox": "27,66,283,52", "text": "EXISTS" } ] }, { "boundingBox": "27,128,292,49", "words": [ { "boundingBox": "27,128,292,49", "text": "EXCEPT" } ] }, { "boundingBox": "24,188,292,54", "words": [ { "boundingBox": "24,188,292,54", "text": "ATOMS" } ] }, { "boundingBox": "22,253,297,32", "words": [ { "boundingBox": "22,253,105,32", "text": "AND" }, { "boundingBox": "144,253,175,32", "text": "EMPTY" } ] }, { "boundingBox": "21,298,304,60", "words": [ { "boundingBox": "21,298,304,60", "text": "SPACE." } ] }, { "boundingBox": "26,387,294,37", "words": [ { { "boundingBox": "26,387,210,37", "text": "Everything" }, { "boundingBox": "249,389,71,27", "text": "else" } ] }, { "boundingBox": "127,431,198,36", "words": [ { "boundingBox": "127,431,31,29", "text": "is" }, { "boundingBox": "172,431,153,36", "text": "opinion." } ] } ] } ] } Clean up resources When no longer needed, delete the file, and then uninstall the PHP5 package, do the following steps: HTTP_Request2 package. To uninstall the 1. Open a command prompt window as an administrator. 2. Run the following command: pear uninstall HTTP_Request2 3. After the package is successfully uninstalled, close the command prompt window. Next steps Explore the Computer Vision API to analyze an image, detect celebrities and landmarks, create a thumbnail, and extract printed and handwritten text. To rapidly experiment with the Computer Vision API, try the Open API testing console. Explore the Computer Vision API Quickstart: Extract printed text (OCR) using the REST API and Python in Computer Vision 4/18/2019 • 3 minutes to read • Edit Online In this quickstart, you extract printed text with optical character recognition (OCR ) from an image by using Computer Vision's REST API. With the OCR method, you can detect printed text in an image and extract recognized characters into a machine-usable character stream. You can run this quickstart in a step-by step fashion using a Jupyter notebook on MyBinder. To launch Binder, select the following button: launch binder If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have Python installed if you want to run the sample locally. You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the sample To create and run the sample, do the following steps: 1. Copy the following code into a text editor. 2. Make the following changes in code where needed: a. Replace the value of subscription_key with your subscription key. b. Replace the value of vision_base_url with the endpoint URL for the Computer Vision resource in the Azure region where you obtained your subscription keys, if necessary. c. Optionally, replace the value of image_url with the URL of a different image from which you want to extract printed text. 3. Save the code as a file with an .py extension. For example, get-printed-text.py . 4. Open a command prompt window. 5. At the prompt, use the python command to run the sample. For example, python get-printed-text.py . import requests # If you are using a Jupyter notebook, uncomment the following line. #%matplotlib inline import matplotlib.pyplot as plt from matplotlib.patches import Rectangle from PIL import Image from io import BytesIO # Replace with your valid subscription key. subscription_key = "" assert subscription_key # You must use the same region in your REST call as you used to get your # subscription keys. For example, if you got your subscription keys from # westus, replace "westcentralus" in the URI below with "westus". # # Free trial subscription keys are generated in the "westus" region. # If you use a free trial subscription key, you shouldn't need to change # this region. vision_base_url = "https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/" ocr_url = vision_base_url + "ocr" # Set image_url to the URL of an image that you want to analyze. image_url = "https://upload.wikimedia.org/wikipedia/commons/thumb/a/af/" + \ "Atomist_quote_from_Democritus.png/338px-Atomist_quote_from_Democritus.png" headers = {'Ocp-Apim-Subscription-Key': subscription_key} params = {'language': 'unk', 'detectOrientation': 'true'} data = {'url': image_url} response = requests.post(ocr_url, headers=headers, params=params, json=data) response.raise_for_status() analysis = response.json() # Extract the word bounding boxes and text. line_infos = [region["lines"] for region in analysis["regions"]] word_infos = [] for line in line_infos: for word_metadata in line: for word_info in word_metadata["words"]: word_infos.append(word_info) word_infos # Display the image and overlay it with the extracted text. plt.figure(figsize=(5, 5)) image = Image.open(BytesIO(requests.get(image_url).content)) ax = plt.imshow(image, alpha=0.5) for word in word_infos: bbox = [int(num) for num in word["boundingBox"].split(",")] text = word["text"] origin = (bbox[0], bbox[1]) patch = Rectangle(origin, bbox[2], bbox[3], fill=False, linewidth=2, color='y') ax.axes.add_patch(patch) plt.text(origin[0], origin[1], text, fontsize=20, weight="bold", va="top") plt.axis("off") Examine the response A successful response is returned in JSON. The sample webpage parses and displays a successful response in the command prompt window, similar to the following example: { "language": "en", "orientation": "Up", "orientation": "Up", "textAngle": 0, "regions": [ { "boundingBox": "21,16,304,451", "lines": [ { "boundingBox": "28,16,288,41", "words": [ { "boundingBox": "28,16,288,41", "text": "NOTHING" } ] }, { "boundingBox": "27,66,283,52", "words": [ { "boundingBox": "27,66,283,52", "text": "EXISTS" } ] }, { "boundingBox": "27,128,292,49", "words": [ { "boundingBox": "27,128,292,49", "text": "EXCEPT" } ] }, { "boundingBox": "24,188,292,54", "words": [ { "boundingBox": "24,188,292,54", "text": "ATOMS" } ] }, { "boundingBox": "22,253,297,32", "words": [ { "boundingBox": "22,253,105,32", "text": "AND" }, { "boundingBox": "144,253,175,32", "text": "EMPTY" } ] }, { "boundingBox": "21,298,304,60", "words": [ { "boundingBox": "21,298,304,60", "text": "SPACE." } ] }, { "boundingBox": "26,387,294,37", "words": [ { "boundingBox": "26,387,210,37", "text": "Everything" "text": "Everything" }, { "boundingBox": "249,389,71,27", "text": "else" } ] }, { "boundingBox": "127,431,198,36", "words": [ { "boundingBox": "127,431,31,29", "text": "is" }, { "boundingBox": "172,431,153,36", "text": "opinion." } ] } ] } ] } Next steps Explore a Python application that uses Computer Vision to perform optical character recognition (OCR ); create smart-cropped thumbnails; plus detect, categorize, tag, and describe visual features, including faces, in an image. To rapidly experiment with the Computer Vision API, try the Open API testing console. Computer Vision API Python Tutorial Quickstart: Extract printed text (OCR) using the REST API and Ruby in Computer Vision 4/18/2019 • 2 minutes to read • Edit Online In this quickstart, you extract printed text with optical character recognition (OCR ) from an image by using Computer Vision's REST API. With the OCR method, you can detect printed text in an image and extract recognized characters into a machine-usable character stream. If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have Ruby 2.4.x or later installed. You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the sample To create and run the sample, do the following steps: 1. Copy the following code into a text editor. 2. Make the following changes in code where needed: a. Replace with your subscription key. b. Replace https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/ocr with the endpoint URL for the OCR method in the Azure region where you obtained your subscription keys, if necessary. c. Optionally, replace https://upload.wikimedia.org/wikipedia/commons/thumb/a/af/Atomist_quote_from_Democritus.png/338pxAtomist_quote_from_Democritus.png\ with the URL of a different image from which you want to extract printed text. 3. Save the code as a file with an .rb extension. For example, get-printed-text.rb . 4. Open a command prompt window. 5. At the prompt, use the ruby command to run the sample. For example, ruby get-printed-text.rb . require 'net/http' # You must use the same location in your REST call as you used to get your # subscription keys. For example, if you got your subscription keys from westus, # replace "westcentralus" in the URL below with "westus". uri = URI('https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/ocr') uri.query = URI.encode_www_form({ # Request parameters 'language' => 'unk', 'detectOrientation' => 'true' }) request = Net::HTTP::Post.new(uri.request_uri) # Request headers # Replace with your valid subscription key. request['Ocp-Apim-Subscription-Key'] = '' request['Content-Type'] = 'application/json' request.body = "{\"url\": \"https://upload.wikimedia.org/wikipedia/commons/thumb/a/af/" + "Atomist_quote_from_Democritus.png/338px-Atomist_quote_from_Democritus.png\"}" response = Net::HTTP.start(uri.host, uri.port, :use_ssl => uri.scheme == 'https') do |http| http.request(request) end puts response.body Examine the response A successful response is returned in JSON. The sample parses and displays a successful response in the command prompt window, similar to the following example: { "language": "en", "textAngle": -2.0000000000000338, "orientation": "Up", "regions": [ { "boundingBox": "462,379,497,258", "lines": [ { "boundingBox": "462,379,497,74", "words": [ { "boundingBox": "462,379,41,73", "text": "A" }, { "boundingBox": "523,379,153,73", "text": "GOAL" }, { "boundingBox": "694,379,265,74", "text": "WITHOUT" } ] }, { "boundingBox": "565,471,289,74", "words": [ { "boundingBox": "565,471,41,73", "text": "A" }, { "boundingBox": "626,471,150,73", "text": "PLAN" }, { "boundingBox": "801,472,53,73", "text": "IS" } ] }, { "boundingBox": "519,563,375,74", "words": [ { "boundingBox": "519,563,149,74", "text": "JUST" }, { "boundingBox": "683,564,41,72", "text": "A" }, { "boundingBox": "741,564,153,73", "text": "WISH" } ] } ] } ] } Next steps Explore the Computer Vision API to analyze an image, detect celebrities and landmarks, create a thumbnail, and extract printed and handwritten text. To rapidly experiment with the Computer Vision API, try the Open API testing console. Explore the Computer Vision API Quickstart: Extract handwritten text using the REST API and C# in Computer Vision 5/29/2019 • 6 minutes to read • Edit Online In this quickstart, you will extract handwritten text from an image by using Computer Vision's REST API. With the Batch Read API and the Read Operation Result API, you can detect handwritten text in an image and extract recognized characters into a machine-readable character stream. IMPORTANT Unlike the OCR method, the Batch Read method runs asynchronously. This method does not return any information in the body of a successful response. Instead, the Read method returns a URI in the Operation-Location response header field. You can then call this URI, which represents the Read Operation Result method, in order to check the status and return the results of the Batch Read method call. If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have Visual Studio 2015 or later. You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the sample application To create the sample in Visual Studio, do the following steps: 1. Create a new Visual Studio solution in Visual Studio, using the Visual C# Console App template. 2. Install the Newtonsoft.Json NuGet package. a. On the menu, click Tools, select NuGet Package Manager, then Manage NuGet Packages for Solution. b. Click the Browse tab, and in the Search box type "Newtonsoft.Json". c. Select Newtonsoft.Json when it displays, then click the checkbox next to your project name, and Install. 3. Replace the code in Program.cs with the following code, and then make the following changes in code where needed: a. Replace the value of subscriptionKey with your subscription key. b. Replace the value of uriBase with the endpoint URL for the Batch Read method from the Azure region where you obtained your subscription keys, if necessary. 4. Run the program. 5. At the prompt, enter the path to a local image. using using using using using using using Newtonsoft.Json.Linq; System; System.IO; System.Linq; System.Net.Http; System.Net.Http.Headers; System.Threading.Tasks; using System.Threading.Tasks; namespace CSHttpClientSample { static class Program { // Replace with your valid subscription key. const string subscriptionKey = ""; // You must use the same Azure region in your REST API method as you used to // get your subscription keys. For example, if you got your subscription keys // from the West US region, replace "westcentralus" in the URL // below with "westus". // // Free trial subscription keys are generated in the "westus" region. // If you use a free trial subscription key, you shouldn't need to change // this region. const string uriBase = "https://westus.api.cognitive.microsoft.com/vision/v2.0/read/core/asyncBatchAnalyze"; static void Main() { // Get the path and filename to process from the user. Console.WriteLine("Handwriting Recognition:"); Console.Write( "Enter the path to an image with handwritten text you wish to read: "); string imageFilePath = Console.ReadLine(); if (File.Exists(imageFilePath)) { // Call the REST API method. Console.WriteLine("\nWait a moment for the results to appear.\n"); ReadHandwrittenText(imageFilePath).Wait(); } else { Console.WriteLine("\nInvalid file path"); } Console.WriteLine("\nPress Enter to exit..."); Console.ReadLine(); } ///

/// Gets the handwritten text from the specified image file by using /// the Computer Vision REST API. ///

/// The image file with handwritten text. static async Task ReadHandwrittenText(string imageFilePath) { try { HttpClient client = new HttpClient(); // Request headers. client.DefaultRequestHeaders.Add( "Ocp-Apim-Subscription-Key", subscriptionKey); // Assemble the URI for the REST API method. string uri = uriBase; HttpResponseMessage response; // Two REST API methods are required to extract handwritten text. // One method to submit the image for processing, the other method // to retrieve the text found in the image. // operationLocation stores the URI of the second REST API method, // returned by the first REST API method. string operationLocation; // Reads the contents of the specified local image // into a byte array. byte[] byteData = GetImageAsByteArray(imageFilePath); // Adds the byte array as an octet stream to the request body. using (ByteArrayContent content = new ByteArrayContent(byteData)) { // This example uses the "application/octet-stream" content type. // The other content types you can use are "application/json" // and "multipart/form-data". content.Headers.ContentType = new MediaTypeHeaderValue("application/octet-stream"); // The first REST API method, Batch Read, starts // the async process to analyze the written text in the image. response = await client.PostAsync(uri, content); } // // // // if The response header for the Batch Read method contains the URI of the second method, Read Operation Result, which returns the results of the process in the response body. The Batch Read operation does not return anything in the response body. (response.IsSuccessStatusCode) operationLocation = response.Headers.GetValues("Operation-Location").FirstOrDefault(); else { // Display the JSON error data. string errorString = await response.Content.ReadAsStringAsync(); Console.WriteLine("\n\nResponse:\n{0}\n", JToken.Parse(errorString).ToString()); return; } // If the first REST API method completes successfully, the second // REST API method retrieves the text written in the image. // // Note: The response may not be immediately available. Handwriting // recognition is an asynchronous operation that can take a variable // amount of time depending on the length of the handwritten text. // You may need to wait or retry this operation. // // This example checks once per second for ten seconds. string contentString; int i = 0; do { System.Threading.Thread.Sleep(1000); response = await client.GetAsync(operationLocation); contentString = await response.Content.ReadAsStringAsync(); ++i; } while (i < 10 && contentString.IndexOf("\"status\":\"Succeeded\"") == -1); if (i == 10 && contentString.IndexOf("\"status\":\"Succeeded\"") == -1) { Console.WriteLine("\nTimeout error.\n"); return; } // Display the JSON response. Console.WriteLine("\nResponse:\n\n{0}\n", JToken.Parse(contentString).ToString()); } catch (Exception e) { Console.WriteLine("\n" + e.Message); } } } ///

/// Returns the contents of the specified file as a byte array. ///

/// The image file to read. /// The byte array of the image data. static byte[] GetImageAsByteArray(string imageFilePath) { // Open a read-only file stream for the specified file. using (FileStream fileStream = new FileStream(imageFilePath, FileMode.Open, FileAccess.Read)) { // Read the file's contents into a byte array. BinaryReader binaryReader = new BinaryReader(fileStream); return binaryReader.ReadBytes((int)fileStream.Length); } } } } Examine the response A successful response is returned in JSON. The sample application parses and displays a successful response in the console window, similar to the following example: { "status": "Succeeded", "recognitionResults": [ { "page": 1, "clockwiseOrientation": 349.59, "width": 3200, "height": 3200, "unit": "pixel", "lines": [ { "boundingBox": [202,618,2047,643,2046,840,200,813], "text": "Our greatest glory is not", "words": [ { "boundingBox": [204,627,481,628,481,830,204,829], "text": "Our" }, { "boundingBox": [519,628,1057,630,1057,832,518,830], "text": "greatest" }, { "boundingBox": [1114,630,1549,631,1548,833,1114,832], "text": "glory" }, { "boundingBox": [1586,631,1785,632,1784,834,1586,833], "text": "is" }, { "boundingBox": [1822,632,2115,633,2115,835,1822,834], "text": "not" } ] }, { "boundingBox": [420,1273,2954,1250,2958,1488,422,1511], "text": "but in rising every time we fall", "words": [ { "boundingBox": [423,1269,634,1268,635,1507,424,1508], "text": "but" }, { "boundingBox": [667,1268,808,1268,809,1506,668,1507], "text": "in" }, { "boundingBox": [874,1267,1289,1265,1290,1504,875,1506], "text": "rising" }, { "boundingBox": [1331,1265,1771,1263,1772,1502,1332,1504], "text": "every" }, { "boundingBox": [1812, 1263, 2178, 1261, 2179, 1500, 1813, 1502], "text": "time" }, { "boundingBox": [2219, 1261, 2510, 1260, 2511, 1498, 2220, 1500], "text": "we" }, { "boundingBox": [2551, 1260, 3016, 1258, 3017, 1496, 2552, 1498], "text": "fall" } ] }, { "boundingBox": [1612, 903, 2744, 935, 2738, 1139, 1607, 1107], "text": "in never failing ,", "words": [ { "boundingBox": [1611, 934, 1707, 933, 1708, 1147, 1613, 1147], "text": "in" }, { "boundingBox": [1753, 933, 2132, 930, 2133, 1144, 1754, 1146], "text": "never" }, { "boundingBox": [2162, 930, 2673, 927, 2674, 1140, 2164, 1144], "text": "failing" }, { "boundingBox": [2703, 926, 2788, 926, 2790, 1139, 2705, 1140], "text": ",", "confidence": "Low" } ] } ] } ] } Clean up resources When no longer needed, delete the Visual Studio solution. To do so, open File Explorer, navigate to the folder in which you created the Visual Studio solution, and delete the folder. Next steps Explore a basic Windows application that uses Computer Vision to perform optical character recognition (OCR ). Create smart-cropped thumbnails; plus detect, categorize, tag, and describe visual features, including faces, in an image. Computer Vision API C# Tutorial Quickstart: Extract handwritten text using the REST API and Java in Computer Vision 5/29/2019 • 6 minutes to read • Edit Online In this quickstart, you extract handwritten text from an image by using Computer Vision's REST API. With the Batch Read API and the Read Operation Result API, you can detect handwritten text in an image, then extract recognized characters into a machine-usable character stream. IMPORTANT Unlike the OCR method, the Batch Read method runs asynchronously. This method does not return any information in the body of a successful response. Instead, the Batch Read method returns a URI in the value of the Operation-Content response header field. You can then call this URI, which represents the Read Operation Result method, in order to check the status and return the results of the Batch Read method call. If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have Java™ Platform, Standard Edition Development Kit 7 or 8 (JDK 7 or 8) installed. You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the sample application To create and run the sample, do the following steps: 1. Create a new Java project in your favorite IDE or editor. If the option is available, create the Java project from a command line application template. 2. Import the following libraries into your Java project. If you're using Maven, the Maven coordinates are provided for each library. Apache HTTP client (org.apache.httpcomponents:httpclient:4.5.5) Apache HTTP core (org.apache.httpcomponents:httpcore:4.4.9) JSON library (org.json:json:20180130) 3. Add the following import statements to the file that contains the Main public class for your project. import import import import import import import import import import import import java.net.URI; org.apache.http.HttpEntity; org.apache.http.HttpResponse; org.apache.http.client.methods.HttpGet; org.apache.http.client.methods.HttpPost; org.apache.http.client.utils.URIBuilder; org.apache.http.entity.StringEntity; org.apache.http.impl.client.CloseableHttpClient; org.apache.http.impl.client.HttpClientBuilder; org.apache.http.util.EntityUtils; org.apache.http.Header; org.json.JSONObject; 4. Replace the needed: Main public class with the following code, then make the following changes in code where a. Replace the value of subscriptionKey with your subscription key. b. Replace the value of uriBase with the endpoint URL for the Batch Read method from the Azure region where you obtained your subscription keys, if necessary. c. Optionally, replace the value of imageToAnalyze with the URL of a different image from which you want to extract handwritten text. 5. Save, then build the Java project. 6. If you're using an IDE, run Main . Otherwise, open a command prompt window and then use the command to run the compiled class. For example, java Main . public // // // class Main { ********************************************** *** Update or verify the following values. *** ********************************************** // Replace with your valid subscription key. private static final String subscriptionKey = ""; // You must use the same Azure region in your REST API method as you used to // get your subscription keys. For example, if you got your subscription keys // from the West US region, replace "westcentralus" in the URL // below with "westus". // // Free trial subscription keys are generated in the "westus" region. // If you use a free trial subscription key, you shouldn't need to change // this region. private static final String uriBase = "https://westus.api.cognitive.microsoft.com/vision/v2.0/read/core/asyncBatchAnalyze"; private static final String imageToAnalyze = "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/" + "Cursive_Writing_on_Notebook_paper.jpg/800px-Cursive_Writing_on_Notebook_paper.jpg"; public static void main(String[] args) { CloseableHttpClient httpTextClient = HttpClientBuilder.create().build(); CloseableHttpClient httpResultClient = HttpClientBuilder.create().build();; try { // This operation requires two REST API calls. One to submit the image // for processing, the other to retrieve the text found in the image. URIBuilder builder = new URIBuilder(uriBase); // Prepare the URI for the REST API method. URI uri = builder.build(); HttpPost request = new HttpPost(uri); java // Request headers. request.setHeader("Content-Type", "application/json"); request.setHeader("Ocp-Apim-Subscription-Key", subscriptionKey); // Request body. StringEntity requestEntity = new StringEntity("{\"url\":\"" + imageToAnalyze + "\"}"); request.setEntity(requestEntity); // Two REST API methods are required to extract handwritten text. // One method to submit the image for processing, the other method // to retrieve the text found in the image. // Call the first REST API method to detect the text. HttpResponse response = httpTextClient.execute(request); // Check for success. if (response.getStatusLine().getStatusCode() != 202) { // Format and display the JSON error message. HttpEntity entity = response.getEntity(); String jsonString = EntityUtils.toString(entity); JSONObject json = new JSONObject(jsonString); System.out.println("Error:\n"); System.out.println(json.toString(2)); return; } // Store the URI of the second REST API method. // This URI is where you can get the results of the first REST API method. String operationLocation = null; // The 'Operation-Location' response header value contains the URI for // the second REST API method. Header[] responseHeaders = response.getAllHeaders(); for (Header header : responseHeaders) { if (header.getName().equals("Operation-Location")) { operationLocation = header.getValue(); break; } } if (operationLocation == null) { System.out.println("\nError retrieving Operation-Location.\nExiting."); System.exit(1); } // // // // // // // If the first REST API method completes successfully, the second REST API method retrieves the text written in the image. Note: The response may not be immediately available. Handwriting recognition is an asynchronous operation that can take a variable amount of time depending on the length of the handwritten text. You may need to wait or retry this operation. System.out.println("\nHandwritten text submitted.\n" + "Waiting 10 seconds to retrieve the recognized text.\n"); Thread.sleep(10000); // Call the second REST API method and get the response. HttpGet resultRequest = new HttpGet(operationLocation); resultRequest.setHeader("Ocp-Apim-Subscription-Key", subscriptionKey); HttpResponse resultResponse = httpResultClient.execute(resultRequest); HttpEntity responseEntity = resultResponse.getEntity(); if (responseEntity != null) { // Format and display the JSON response. String jsonString = EntityUtils.toString(responseEntity); JSONObject json = new JSONObject(jsonString); JSONObject json = new JSONObject(jsonString); System.out.println("Text recognition result response: \n"); System.out.println(json.toString(2)); } } catch (Exception e) { System.out.println(e.getMessage()); } } } Examine the response A successful response is returned in JSON. The sample application parses and displays a successful response in the console window, similar to the following example: Handwritten text submitted. Waiting 10 seconds to retrieve the recognized text. Text recognition result response: { "status": "Succeeded", "recognitionResults": [ { "page": 1, "clockwiseOrientation": 349.59, "width": 3200, "height": 3200, "unit": "pixel", "lines": [ { "boundingBox": [202,618,2047,643,2046,840,200,813], "text": "Our greatest glory is not", "words": [ { "boundingBox": [204,627,481,628,481,830,204,829], "text": "Our" }, { "boundingBox": [519,628,1057,630,1057,832,518,830], "text": "greatest" }, { "boundingBox": [1114,630,1549,631,1548,833,1114,832], "text": "glory" }, { "boundingBox": [1586,631,1785,632,1784,834,1586,833], "text": "is" }, { "boundingBox": [1822,632,2115,633,2115,835,1822,834], "text": "not" } ] }, { "boundingBox": [420,1273,2954,1250,2958,1488,422,1511], "text": "but in rising every time we fall", "words": [ { "boundingBox": [423,1269,634,1268,635,1507,424,1508], "text": "but" }, { "boundingBox": [667,1268,808,1268,809,1506,668,1507], "text": "in" }, }, { "boundingBox": [874,1267,1289,1265,1290,1504,875,1506], "text": "rising" }, { "boundingBox": [1331,1265,1771,1263,1772,1502,1332,1504], "text": "every" }, { "boundingBox": [1812, 1263, 2178, 1261, 2179, 1500, 1813, 1502], "text": "time" }, { "boundingBox": [2219, 1261, 2510, 1260, 2511, 1498, 2220, 1500], "text": "we" }, { "boundingBox": [2551, 1260, 3016, 1258, 3017, 1496, 2552, 1498], "text": "fall" } ] }, { "boundingBox": [1612, 903, 2744, 935, 2738, 1139, 1607, 1107], "text": "in never failing ,", "words": [ { "boundingBox": [1611, 934, 1707, 933, 1708, 1147, 1613, 1147], "text": "in" }, { "boundingBox": [1753, 933, 2132, 930, 2133, 1144, 1754, 1146], "text": "never" }, { "boundingBox": [2162, 930, 2673, 927, 2674, 1140, 2164, 1144], "text": "failing" }, { "boundingBox": [2703, 926, 2788, 926, 2790, 1139, 2705, 1140], "text": ",", "confidence": "Low" } ] } ] } ] } Clean up resources When no longer needed, delete the Java project, including the compiled class and imported libraries. Next steps Explore a Java Swing application that uses Computer Vision to perform optical character recognition (OCR ); create smart-cropped thumbnails; plus detect, categorize, tag, and describe visual features, including faces, in an image. To rapidly experiment with the Computer Vision API, try the Open API testing console. Computer Vision API Java Tutorial Quickstart: Extract handwritten text using the REST API and JavaScript in Computer Vision 5/29/2019 • 5 minutes to read • Edit Online In this quickstart, you extract handwritten text from an image by using Computer Vision's REST API. With the Batch Read API and the Read Operation Result API, you can detect handwritten text in an image, then extract recognized characters into a machine-usable character stream. IMPORTANT Unlike the OCR method, the Batch Read method runs asynchronously. This method does not return any information in the body of a successful response. Instead, the Batch Read method returns a URI in the value of the Operation-Content response header field. You can then call this URI, which represents the Read Operation Result method, to both check the status and return the results of the Batch Read method call. If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the sample To create and run the sample, do the following steps: 1. Copy the following code into a text editor. 2. Make the following changes in code where needed: a. Replace the value of subscriptionKey with your subscription key. b. Replace the value of uriBase with the endpoint URL for the Batch Read method from the Azure region where you obtained your subscription keys, if necessary. c. Optionally, replace the value of the value attribute for the inputImage control with the URL of a different image from which you want to extract handwritten text. 3. Save the code as a file with an .html extension. For example, get-handwriting.html . 4. Open a browser window. 5. In the browser, drag and drop the file into the browser window. 6. When the webpage is displayed in the browser, choose the Read image button. Handwriting Sample

Read handwritten image:

Enter the URL to an image of handwritten text, then click the Read image button.

Image to read:

Response:

Source image:

Examine the response A successful response is returned in JSON. The sample webpage parses and displays a successful response in the browser window, similar to the following example: { { "status": "Succeeded", "recognitionResults": [ { "page": 1, "clockwiseOrientation": 349.59, "width": 3200, "height": 3200, "unit": "pixel", "lines": [ { "boundingBox": [202,618,2047,643,2046,840,200,813], "text": "Our greatest glory is not", "words": [ { "boundingBox": [204,627,481,628,481,830,204,829], "text": "Our" }, { "boundingBox": [519,628,1057,630,1057,832,518,830], "text": "greatest" }, { "boundingBox": [1114,630,1549,631,1548,833,1114,832], "text": "glory" }, { "boundingBox": [1586,631,1785,632,1784,834,1586,833], "text": "is" }, { "boundingBox": [1822,632,2115,633,2115,835,1822,834], "text": "not" } ] }, { "boundingBox": [420,1273,2954,1250,2958,1488,422,1511], "text": "but in rising every time we fall", "words": [ { "boundingBox": [423,1269,634,1268,635,1507,424,1508], "text": "but" }, { "boundingBox": [667,1268,808,1268,809,1506,668,1507], "text": "in" }, { "boundingBox": [874,1267,1289,1265,1290,1504,875,1506], "text": "rising" }, { "boundingBox": [1331,1265,1771,1263,1772,1502,1332,1504], "text": "every" }, { "boundingBox": [1812, 1263, 2178, 1261, 2179, 1500, 1813, 1502], "text": "time" }, { "boundingBox": [2219, 1261, 2510, 1260, 2511, 1498, 2220, 1500], "text": "we" }, { "boundingBox": [2551, 1260, 3016, 1258, 3017, 1496, 2552, 1498], "text": "fall" } ] }, }, { "boundingBox": [1612, 903, 2744, 935, 2738, 1139, 1607, 1107], "text": "in never failing ,", "words": [ { "boundingBox": [1611, 934, 1707, 933, 1708, 1147, 1613, 1147], "text": "in" }, { "boundingBox": [1753, 933, 2132, 930, 2133, 1144, 1754, 1146], "text": "never" }, { "boundingBox": [2162, 930, 2673, 927, 2674, 1140, 2164, 1144], "text": "failing" }, { "boundingBox": [2703, 926, 2788, 926, 2790, 1139, 2705, 1140], "text": ",", "confidence": "Low" } ] } ] } ] } Clean up resources When no longer needed, delete the file. Next steps Explore a JavaScript application that uses Computer Vision to perform optical character recognition (OCR ); create smart-cropped thumbnails; plus detect, categorize, tag, and describe visual features, including faces, in an image. To rapidly experiment with the Computer Vision API, try the Open API testing console. Computer Vision API JavaScript Tutorial Quickstart: Extract handwritten text using the REST API and Python in Computer Vision 5/29/2019 • 5 minutes to read • Edit Online In this quickstart, you extract handwritten text from an image by using Computer Vision's REST API. With the Batch Read API and the Read Operation Result API, you can detect handwritten text in an image, then extract recognized characters into a machine-usable character stream. IMPORTANT Unlike the OCR method, the Batch Read method runs asynchronously. This method does not return any information in the body of a successful response. Instead, the Batch Read method returns a URI in the value of the Operation-Content response header field. You can then call this URI, which represents the Read Operation Result API, to both check the status and return the results of the Batch Read method call. You can run this quickstart in a step-by step fashion using a Jupyter notebook on MyBinder. To launch Binder, select the following button: launch binder If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have Python installed if you want to run the sample locally. You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the sample To create and run the sample, do the following steps: 1. Copy the following code into a text editor. 2. Make the following changes in code where needed: a. Replace the value of subscription_key with your subscription key. b. Replace the value of vision_base_url with the endpoint URL for the Computer Vision resource in the Azure region where you obtained your subscription keys, if necessary. c. Optionally, replace the value of image_url with the URL of a different image from which you want to extract handwritten text. 3. Save the code as a file with an .py extension. For example, get-handwritten-text.py . 4. Open a command prompt window. 5. At the prompt, use the python command to run the sample. For example, python get-handwritten-text.py . import requests import time # If you are using a Jupyter notebook, uncomment the following line. #%matplotlib inline import matplotlib.pyplot as plt from matplotlib.patches import Polygon from matplotlib.patches import Polygon from PIL import Image from io import BytesIO # Replace with your valid subscription key. subscription_key = "" assert subscription_key # You must use the same region in your REST call as you used to get your # subscription keys. For example, if you got your subscription keys from # westus, replace "westcentralus" in the URI below with "westus". # # Free trial subscription keys are generated in the "westus" region. # If you use a free trial subscription key, you shouldn't need to change # this region. vision_base_url = "https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/" text_recognition_url = vision_base_url + "read/core/asyncBatchAnalyze" # Set image_url to the URL of an image that you want to analyze. image_url = "https://upload.wikimedia.org/wikipedia/commons/d/dd/Cursive_Writing_on_Notebook_paper.jpg" headers = {'Ocp-Apim-Subscription-Key': subscription_key} data = {'url': image_url} response = requests.post( text_recognition_url, headers=headers, json=data) response.raise_for_status() # Extracting handwritten text requires two API calls: One call to submit the # image for processing, the other to retrieve the text found in the image. # Holds the URI used to retrieve the recognized text. operation_url = response.headers["Operation-Location"] # The recognized text isn't immediately available, so poll to wait for completion. analysis = {} poll = True while (poll): response_final = requests.get( response.headers["Operation-Location"], headers=headers) analysis = response_final.json() print(analysis) time.sleep(1) if ("recognitionResults" in analysis): poll= False if ("status" in analysis and analysis['status'] == 'Failed'): poll= False polygons=[] if ("recognitionResults" in analysis): # Extract the recognized text, with bounding boxes. polygons = [(line["boundingBox"], line["text"]) for line in analysis["recognitionResults"][0]["lines"]] # Display the image and overlay it with the extracted text. plt.figure(figsize=(15, 15)) image = Image.open(BytesIO(requests.get(image_url).content)) ax = plt.imshow(image) for polygon in polygons: vertices = [(polygon[0][i], polygon[0][i+1]) for i in range(0, len(polygon[0]), 2)] text = polygon[1] patch = Polygon(vertices, closed=True, fill=False, linewidth=2, color='y') ax.axes.add_patch(patch) plt.text(vertices[0][0], vertices[0][1], text, fontsize=20, va="top") Examine the response A successful response is returned in JSON. The sample webpage parses and displays a successful response in the command prompt window, similar to the following example: { "status": "Succeeded", "recognitionResult": { "lines": [ { "boundingBox": [ 2, 52, 65, 46, 69, 89, 7, 95 ], "text": "dog", "words": [ { "boundingBox": 0, 59, 63, 43, 77, 86, 3, 102 ], "text": "dog" } ] }, { "boundingBox": [ 6, 2, 771, 13, 770, 75, 5, 64 ], "text": "The quick "words": [ { "boundingBox": 0, 4, 92, 5, 77, 71, 0, 71 ], "text": "The" }, { "boundingBox": 74, 4, 189, 5, 174, [ brown fox jumps over the lazy", [ [ 174, 72, 60, 71 ], "text": "quick" }, { "boundingBox": [ 176, 5, 321, 6, 306, 73, 161, 72 ], "text": "brown" }, { "boundingBox": [ 308, 6, 387, 6, 372, 73, 293, 73 ], "text": "fox" }, { "boundingBox": [ 382, 6, 506, 7, 491, 74, 368, 73 ], "text": "jumps" }, { "boundingBox": [ 492, 7, 607, 8, 592, 75, 478, 74 ], "text": "over" }, { "boundingBox": [ 589, 8, 673, 8, 658, 75, 575, 75 ], ], "text": "the" }, { "boundingBox": [ 660, 8, 783, 9, 768, 76, 645, 75 ], "text": "lazy" } ] }, { "boundingBox": [ 2, 84, 783, 96, 782, 154, 1, 148 ], "text": "Pack my box with five dozen liquor jugs", "words": [ { "boundingBox": [ 0, 86, 94, 87, 72, 151, 0, 149 ], "text": "Pack" }, { "boundingBox": [ 76, 87, 164, 88, 142, 152, 54, 150 ], "text": "my" }, { "boundingBox": [ 155, 88, 243, 89, 222, 152, 134, 151 ], "text": "box" }, { "boundingBox": [ 226, 89, 344, 90, 323, 154, 204, 152 ], "text": "with" }, { "boundingBox": [ 336, 90, 432, 91, 411, 154, 314, 154 ], "text": "five" }, { "boundingBox": [ 419, 91, 538, 92, 516, 154, 398, 154 ], "text": "dozen" }, { "boundingBox": [ 547, 92, 701, 94, 679, 154, 525, 154 ], "text": "liquor" }, { "boundingBox": [ 696, 94, 800, 95, 780, 154, 675, 154 ], "text": "jugs" } ] } ] } } Next steps Explore a Python application that uses Computer Vision to perform optical character recognition (OCR ); create smart-cropped thumbnails; plus detect, categorize, tag, and describe visual features, including faces, in an image. To rapidly experiment with the Computer Vision API, try the Open API testing console. Computer Vision API Python Tutorial Quickstart: Recognize domain-specific content using the REST API and PHP with Computer Vision 4/19/2019 • 2 minutes to read • Edit Online In this quickstart, you use a domain model to identify landmarks or, optionally, celebrities in a remotely stored image by using Computer Vision's REST API. With the Recognize Domain Specific Content method, you can apply a domain-specific model to recognize content within an image. If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have PHP installed. You must have Pear installed. You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the sample To create and run the sample, do the following steps: 1. Install the PHP5 HTTP_Request2 package. a. Open a command prompt window as an administrator. b. Run the following command: pear install HTTP_Request2 c. After the package is successfully installed, close the command prompt window. 2. Copy the following code into a text editor. 3. Make the following changes in code where needed: a. Replace the value of subscriptionKey with your subscription key. b. Replace the value of uriBase with the endpoint URL for the Recognize Domain Specific Content method from the Azure region where you obtained your subscription keys, if necessary. c. Optionally, replace the value of imageUrl with the URL of a different image that you want to analyze. d. Optionally, replace the value of the domain request parameter with celebrites if you want to use the celebrities domain model instead of the landmarks domain model. 4. Save the code as a file with a .php extension. For example, use-domain-model.php . 5. Open a browser window with PHP support. 6. Drag and drop the file into the browser window. Analyze Domain Model Sample with a valid subscription key. $ocpApimSubscriptionKey = ''; // You must use the same location in your REST call as you used to obtain // your subscription keys. For example, if you obtained your subscription keys // from westus, replace "westcentralus" in the URL below with "westus". $uriBase = 'https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/'; // Change 'landmarks' to 'celebrities' to use the Celebrities model. $domain = 'landmarks'; $imageUrl = 'https://upload.wikimedia.org/wikipedia/commons/2/23/Space_Needle_2011-07-04.jpg'; require_once 'HTTP/Request2.php'; $request = new Http_Request2($uriBase . 'models/' . $domain . '/analyze'); $url = $request->getUrl(); $headers = array( // Request headers 'Content-Type' => 'application/json', 'Ocp-Apim-Subscription-Key' => $ocpApimSubscriptionKey ); $request->setHeader($headers); $parameters = array( // Request parameters 'model' => $domain ); $url->setQueryVariables($parameters); $request->setMethod(HTTP_Request2::METHOD_POST); // Request body parameters $body = json_encode(array('url' => $imageUrl)); // Request body $request->setBody($body); try { $response = $request->send(); echo "

" .
json_encode(json_decode($response->getBody()), JSON_PRETTY_PRINT) . "

"; } catch (HttpException $ex) { echo "

" . $ex . "

"; } ?> Examine the response A successful response is returned in JSON. The sample website parses and displays a successful response in the browser window, similar to the following example: { "result": { "landmarks": [ { "name": "Space Needle", "confidence": 0.9998177886009216 } ] }, "requestId": "4d26587b-b2b9-408d-a70c-1f8121d84b0d", "metadata": { "height": 4132, "width": 2096, "format": "Jpeg" } } Clean up resources When no longer needed, delete the file, and then uninstall the PHP5 package, do the following steps: HTTP_Request2 package. To uninstall the 1. Open a command prompt window as an administrator. 2. Run the following command: pear uninstall HTTP_Request2 3. After the package is successfully uninstalled, close the command prompt window. Next steps Explore the Computer Vision API used to analyze an image, detect celebrities and landmarks, create a thumbnail, and extract printed and handwritten text. To rapidly experiment with the Computer Vision API, try the Open API testing console. Explore the Computer Vision API Quickstart: Use a domain model using the REST API and Python in Computer Vision 4/19/2019 • 4 minutes to read • Edit Online In this quickstart, you use a domain model to identify landmarks or, optionally, celebrities in a remotely stored image by using Computer Vision's REST API. With the Recognize Domain Specific Content method, you can apply a domain-specific model to recognize content within an image. You can run this quickstart in a step-by step fashion using a Jupyter notebook on MyBinder. To launch Binder, select the following button: launch binder If you don't have an Azure subscription, create a free account before you begin. Prerequisites You must have Python installed if you want to run the sample locally. You must have a subscription key for Computer Vision. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Create and run the landmarks sample To create and run the landmark sample, do the following steps: 1. Copy the following code into a text editor. 2. Make the following changes in code where needed: a. Replace the value of subscription_key with your subscription key. b. Replace the value of vision_base_url with the endpoint URL for the Computer Vision resource in the Azure region where you obtained your subscription keys, if necessary. c. Optionally, replace the value of image_url with the URL of a different image in which you want to detect landmarks. 3. Save the code as a file with an .py extension. For example, get-landmarks.py . 4. Open a command prompt window. 5. At the prompt, use the python command to run the sample. For example, python get-landmarks.py . import requests # If you are using a Jupyter notebook, uncomment the following line. #%matplotlib inline import matplotlib.pyplot as plt from PIL import Image from io import BytesIO # Replace with your valid subscription key. subscription_key = "" assert subscription_key # You must use the same region in your REST call as you used to get your # subscription keys. For example, if you got your subscription keys from # westus, replace "westcentralus" in the URI below with "westus". # # Free trial subscription keys are generated in the "westus" region. # If you use a free trial subscription key, you shouldn't need to change # this region. vision_base_url = "https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/" landmark_analyze_url = vision_base_url + "models/landmarks/analyze" # Set image_url to the URL of an image that you want to analyze. image_url = "https://upload.wikimedia.org/wikipedia/commons/f/f6/" + \ "Bunker_Hill_Monument_2005.jpg" headers = {'Ocp-Apim-Subscription-Key': subscription_key} params = {'model': 'landmarks'} data = {'url': image_url} response = requests.post( landmark_analyze_url, headers=headers, params=params, json=data) response.raise_for_status() # The 'analysis' object contains various fields that describe the image. The # most relevant landmark for the image is obtained from the 'result' property. analysis = response.json() assert analysis["result"]["landmarks"] is not [] print(analysis) landmark_name = analysis["result"]["landmarks"][0]["name"].capitalize() # Display the image and overlay it with the landmark name. image = Image.open(BytesIO(requests.get(image_url).content)) plt.imshow(image) plt.axis("off") _ = plt.title(landmark_name, size="x-large", y=-0.1) Examine the response for the landmarks sample A successful response is returned in JSON. The sample webpage parses and displays a successful response in the command prompt window, similar to the following example: { "result": { "landmarks": [ { "name": "Bunker Hill Monument", "confidence": 0.9768505096435547 } ] }, "requestId": "659a10cd-44bb-44db-9147-a295b853b2b8", "metadata": { "height": 1600, "width": 1200, "format": "Jpeg" } } Create and run the celebrities sample To create and run the landmark sample, do the following steps: 1. Copy the following code into a text editor. 2. Make the following changes in code where needed: a. Replace the value of subscription_key with your subscription key. b. Replace the value of vision_base_url with the endpoint URL for the Computer Vision resource in the Azure region where you obtained your subscription keys, if necessary. c. Optionally, replace the value of image_url with the URL of a different image in which you want to detect celebrities. 3. Save the code as a file with an .py extension. For example, get-celebrities.py . 4. Open a command prompt window. 5. At the prompt, use the python command to run the sample. For example, python get-celebrities.py . import requests # If you are using a Jupyter notebook, uncomment the following line. #%matplotlib inline import matplotlib.pyplot as plt from PIL import Image from io import BytesIO # Replace with your valid subscription key. subscription_key = "" assert subscription_key vision_base_url = "https://westcentralus.api.cognitive.microsoft.com/vision/v2.0/" celebrity_analyze_url = vision_base_url + "models/celebrities/analyze" # Set image_url to the URL of an image that you want to analyze. image_url = "https://upload.wikimedia.org/wikipedia/commons/d/d9/" + \ "Bill_gates_portrait.jpg" headers = {'Ocp-Apim-Subscription-Key': subscription_key} params = {'model': 'celebrities'} data = {'url': image_url} response = requests.post( celebrity_analyze_url, headers=headers, params=params, json=data) response.raise_for_status() # The 'analysis' object contains various fields that describe the image. The # most relevant celebrity for the image is obtained from the 'result' property. analysis = response.json() assert analysis["result"]["celebrities"] is not [] print(analysis) celebrity_name = analysis["result"]["celebrities"][0]["name"].capitalize() # Display the image and overlay it with the celebrity name. image = Image.open(BytesIO(requests.get(image_url).content)) plt.imshow(image) plt.axis("off") _ = plt.title(celebrity_name, size="x-large", y=-0.1) Examine the response for the celebrities sample A successful response is returned in JSON. The sample webpage parses and displays a successful response in the command prompt window, similar to the following example: { "result": { "celebrities": [ { "faceRectangle": { "top": 123, "left": 156, "width": 187, "height": 187 }, "name": "Bill Gates", "confidence": 0.9993845224380493 } ] }, "requestId": "f14ec1d0-62d4-4296-9ceb-6b5776dc2020", "metadata": { "height": 521, "width": 550, "format": "Jpeg" } } Clean up resources When no longer needed, delete the files for both samples. Next steps Explore a Python application that uses Computer Vision to perform optical character recognition (OCR ); create smart-cropped thumbnails; plus detect, categorize, tag, and describe visual features, including faces, in an image. To rapidly experiment with the Computer Vision API, try the Open API testing console. Computer Vision API Python Tutorial Quickstart: Analyze an image using the Computer Vision SDK and C# 4/18/2019 • 3 minutes to read • Edit Online In this quickstart, you will analyze both a local and a remote image to extract visual features using the Computer Vision client library for C#. If you wish, you can download the code in this guide as a complete sample app from the Cognitive Services Csharp Vision repo on GitHub. Prerequisites A Computer Vision subscription key. You can get a free trial subscription key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Any edition of Visual Studio 2015 or 2017. The Microsoft.Azure.CognitiveServices.Vision.ComputerVision client library NuGet package. It isn't necessary to download the package. Installation instructions are provided below. Create and run the sample application To run the sample, do the following steps: 1. Create a new Visual C# Console App in Visual Studio. 2. Install the Computer Vision client library NuGet package. a. On the menu, click Tools, select NuGet Package Manager, then Manage NuGet Packages for Solution. b. Click the Browse tab, and in the Search box type "Microsoft.Azure.CognitiveServices.Vision.ComputerVision". c. Select Microsoft.Azure.CognitiveServices.Vision.ComputerVision when it displays, then click the checkbox next to your project name, and Install. 3. Replace the contents of Program.cs with the following code. The AnalyzeImageAsync and AnalyzeImageInStreamAsync methods wrap the Analyze Image REST API for remote and local images, respectively. using Microsoft.Azure.CognitiveServices.Vision.ComputerVision; using Microsoft.Azure.CognitiveServices.Vision.ComputerVision.Models; using using using using System; System.Collections.Generic; System.IO; System.Threading.Tasks; namespace ImageAnalyze { class Program { // subscriptionKey = "0123456789abcdef0123456789ABCDEF" private const string subscriptionKey = ""; // localImagePath = @"C:\Documents\LocalImage.jpg" private const string localImagePath = @""; private const string remoteImageUrl = "https://upload.wikimedia.org/wikipedia/commons/3/3c/Shaki_waterfall.jpg"; // Specify the features to return private static readonly List features = new List() { VisualFeatureTypes.Categories, VisualFeatureTypes.Description, VisualFeatureTypes.Faces, VisualFeatureTypes.ImageType, VisualFeatureTypes.Tags }; static void Main(string[] args) { ComputerVisionClient computerVision = new ComputerVisionClient( new ApiKeyServiceClientCredentials(subscriptionKey), new System.Net.Http.DelegatingHandler[] { }); // // // // // // // You must use the same region as you used to get your subscription keys. For example, if you got your subscription keys from westus, replace "westcentralus" with "westus". Free trial subscription keys are generated in the "westus" region. If you use a free trial subscription key, you shouldn't need to change the region. // Specify the Azure region computerVision.Endpoint = "https://westcentralus.api.cognitive.microsoft.com"; Console.WriteLine("Images being analyzed ..."); var t1 = AnalyzeRemoteAsync(computerVision, remoteImageUrl); var t2 = AnalyzeLocalAsync(computerVision, localImagePath); Task.WhenAll(t1, t2).Wait(5000); Console.WriteLine("Press ENTER to exit"); Console.ReadLine(); } // Analyze a remote image private static async Task AnalyzeRemoteAsync( ComputerVisionClient computerVision, string imageUrl) { if (!Uri.IsWellFormedUriString(imageUrl, UriKind.Absolute)) { Console.WriteLine( "\nInvalid remoteImageUrl:\n{0} \n", imageUrl); return; } ImageAnalysis analysis = await computerVision.AnalyzeImageAsync(imageUrl, features); DisplayResults(analysis, imageUrl); } // Analyze a local image private static async Task AnalyzeLocalAsync( ComputerVisionClient computerVision, string imagePath) { if (!File.Exists(imagePath)) { Console.WriteLine( "\nUnable to open or read localImagePath:\n{0} \n", imagePath); return; } using (Stream imageStream = File.OpenRead(imagePath)) { ImageAnalysis analysis = await computerVision.AnalyzeImageInStreamAsync( imageStream, features); imageStream, features); DisplayResults(analysis, imagePath); } } // Display the most relevant caption for the image private static void DisplayResults(ImageAnalysis analysis, string imageUri) { Console.WriteLine(imageUri); if (analysis.Description.Captions.Count != 0) { Console.WriteLine(analysis.Description.Captions[0].Text + "\n"); } else { Console.WriteLine("No description generated."); } } } } 4. Replace 5. Change computerVision.Endpoint 6. Replace 7. Optionally, set with your valid subscription key. to the Azure region associated with your subscription keys, if necessary. with the path and file name of a local image. remoteImageUrl to a different image URL. 8. Run the program. Examine the response A successful response displays the most relevant caption for each image. You can change the method to output different image data. See the AnalyzeLocalAsync method to learn more. DisplayResults See API Quickstarts: Analyze a local image with C# for an example of a raw JSON output. https://upload.wikimedia.org/wikipedia/commons/3/3c/Shaki_waterfall.jpg a large waterfall over a rocky cliff Next steps Explore the Computer Vision APIs used to analyze an image, detect celebrities and landmarks, create a thumbnail, and extract printed and handwritten text. Explore Computer Vision APIs Quickstart: Generate a thumbnail using the Computer Vision SDK and C# 4/18/2019 • 3 minutes to read • Edit Online In this quickstart, you will generate a smart-cropped thumbnail from an image using the Computer Vision SDK for C#. If you wish, you can download the code in this guide as a complete sample app from the Cognitive Services Csharp Vision repo on GitHub. Prerequisites A Computer Vision subscription key. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Any edition of Visual Studio 2015 or 2017. The Microsoft.Azure.CognitiveServices.Vision.ComputerVision client library NuGet package. It isn't necessary to download the package. Installation instructions are provided below. GenerateThumbnailAsync method You can use these methods to generate a thumbnail of an image. You specify the height and width, which can differ from the aspect ratio of the input image. Computer Vision uses smart cropping to intelligently identify the area of interest and generate cropping coordinates based on that region. To run the sample, do the following steps: 1. Create a new Visual C# Console App in Visual Studio. 2. Install the Computer Vision client library NuGet package. a. On the menu, click Tools, select NuGet Package Manager, then Manage NuGet Packages for Solution. b. Click the Browse tab, and in the Search box type "Microsoft.Azure.CognitiveServices.Vision.ComputerVision". c. Select Microsoft.Azure.CognitiveServices.Vision.ComputerVision when it displays, then click the checkbox next to your project name, and Install. 3. Replace Program.cs with the following code. The GenerateThumbnailAsync and GenerateThumbnailInStreamAsync methods wrap the Get Thumbnail API for remote and local images, respectively. using Microsoft.Azure.CognitiveServices.Vision.ComputerVision; using System; using System.IO; using System.Threading.Tasks; namespace ImageThumbnail { class Program { private const bool writeThumbnailToDisk = false; // subscriptionKey = "0123456789abcdef0123456789ABCDEF" private const string subscriptionKey = ""; // localImagePath = @"C:\Documents\LocalImage.jpg" private const string localImagePath = @""; private const string remoteImageUrl = "https://upload.wikimedia.org/wikipedia/commons/9/94/Bloodhound_Puppy.jpg"; private const int thumbnailWidth = 100; private const int thumbnailHeight = 100; static void Main(string[] args) { ComputerVisionClient computerVision = new ComputerVisionClient( new ApiKeyServiceClientCredentials(subscriptionKey), new System.Net.Http.DelegatingHandler[] { }); // // // // // // // You must use the same region as you used to get your subscription keys. For example, if you got your subscription keys from westus, replace "westcentralus" with "westus". Free trial subscription keys are generated in the "westus" region. If you use a free trial subscription key, you shouldn't need to change the region. // Specify the Azure region computerVision.Endpoint = "https://westcentralus.api.cognitive.microsoft.com"; Console.WriteLine("Images being analyzed ...\n"); var t1 = GetRemoteThumbnailAsync(computerVision, remoteImageUrl); var t2 = GetLocalThumbnailAsnc(computerVision, localImagePath); Task.WhenAll(t1, t2).Wait(5000); Console.WriteLine("Press ENTER to exit"); Console.ReadLine(); } // Create a thumbnail from a remote image private static async Task GetRemoteThumbnailAsync( ComputerVisionClient computerVision, string imageUrl) { if (!Uri.IsWellFormedUriString(imageUrl, UriKind.Absolute)) { Console.WriteLine( "\nInvalid remoteImageUrl:\n{0} \n", imageUrl); return; } Stream thumbnail = await computerVision.GenerateThumbnailAsync( thumbnailWidth, thumbnailHeight, imageUrl, true); string path = Environment.CurrentDirectory; string imageName = imageUrl.Substring(imageUrl.LastIndexOf('/') + 1); string thumbnailFilePath = path + "\\" + imageName.Insert(imageName.Length - 4, "_thumb"); // Save the thumbnail to the current working directory, // using the original name with the suffix "_thumb". SaveThumbnail(thumbnail, thumbnailFilePath); } // Create a thumbnail from a local image private static async Task GetLocalThumbnailAsnc( ComputerVisionClient computerVision, string imagePath) { if (!File.Exists(imagePath)) { Console.WriteLine( "\nUnable to open or read localImagePath:\n{0} \n", imagePath); return; } } using (Stream imageStream = File.OpenRead(imagePath)) { Stream thumbnail = await computerVision.GenerateThumbnailInStreamAsync( thumbnailWidth, thumbnailHeight, imageStream, true); string thumbnailFilePath = localImagePath.Insert(localImagePath.Length - 4, "_thumb"); // Save the thumbnail to the same folder as the original image, // using the original name with the suffix "_thumb". SaveThumbnail(thumbnail, thumbnailFilePath); } } // Save the thumbnail locally. // NOTE: This will overwrite an existing file of the same name. private static void SaveThumbnail(Stream thumbnail, string thumbnailFilePath) { if (writeThumbnailToDisk) { using (Stream file = File.Create(thumbnailFilePath)) { thumbnail.CopyTo(file); } } Console.WriteLine("Thumbnail {0} written to: {1}\n", writeThumbnailToDisk ? "" : "NOT", thumbnailFilePath); } } } 4. Replace 5. Change computerVision.Endpoint 6. Optionally, replace with your valid subscription key. to the Azure region associated with your subscription keys, if necessary. with the path and file name of a local image (will be ignored if not set). 7. Optionally, set remoteImageUrl to a different image. 8. Optionally, set writeThumbnailToDisk to true to save the thumbnail to disk. 9. Run the program. Examine the response A successful response saves the thumbnail for each image locally and displays the thumbnail's location, for example: Thumbnail written to: C:\Documents\LocalImage_thumb.jpg Thumbnail written to: ...\bin\Debug\Bloodhound_Puppy_thumb.jpg Next steps Explore the Computer Vision APIs used to analyze an image, detect celebrities and landmarks, create a thumbnail, and extract printed and handwritten text. Explore Computer Vision APIs Quickstart: Extract handwritten text using the Computer Vision C# SDK 5/29/2019 • 3 minutes to read • Edit Online In this quickstart, you will extract handwritten or printed text from an image using the Computer Vision SDK for C#. If you wish, you can download the code in this guide as a complete sample app from the Cognitive Services Csharp Vision repo on GitHub. Prerequisites A Computer Vision subscription key. You can get a free trial key from Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services account to subscribe to Computer Vision and get your key. Any edition of Visual Studio 2015 or 2017. The Microsoft.Azure.CognitiveServices.Vision.ComputerVision client library NuGet package. It isn't necessary to download the package. Installation instructions are provided below. Create and run the sample app To run the sample, do the following steps: 1. Create a new Visual C# Console App in Visual Studio. 2. Install the Computer Vision client library NuGet package. a. On the menu, click Tools, select NuGet Package Manager, then Manage NuGet Packages for Solution. b. Click the Browse tab, and in the Search box type "Microsoft.Azure.CognitiveServices.Vision.ComputerVision". c. Select Microsoft.Azure.CognitiveServices.Vision.ComputerVision when it displays, then click the checkbox next to your project name, and Install. 3. Replace Program.cs with the following code. The BatchReadFileAsync and BatchReadFileInStreamAsync methods wrap the Batch Read API for remote and local images, respectively. The GetReadOperationResultAsync method wraps the Get Read Operation Result API. using Microsoft.Azure.CognitiveServices.Vision.ComputerVision; using Microsoft.Azure.CognitiveServices.Vision.ComputerVision.Models; using System; using System.IO; using System.Threading.Tasks; namespace ExtractText { class Program { // subscriptionKey = "0123456789abcdef0123456789ABCDEF" private const string subscriptionKey = ""; // localImagePath = @"C:\Documents\LocalImage.jpg" private const string localImagePath = @""; private const string remoteImageUrl = "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Cursive_Writing_on_Notebook_paper.jpg/800px- "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Cursive_Writing_on_Notebook_paper.jpg/800pxCursive_Writing_on_Notebook_paper.jpg"; private const int numberOfCharsInOperationId = 36; static void Main(string[] args) { ComputerVisionClient computerVision = new ComputerVisionClient( new ApiKeyServiceClientCredentials(subscriptionKey), new System.Net.Http.DelegatingHandler[] { }); // // // // // // // You must use the same region as you used to get your subscription keys. For example, if you got your subscription keys from westus, replace "westcentralus" with "westus". Free trial subscription keys are generated in the westcentralus region. If you use a free trial subscription key, you shouldn't need to change the region. // Specify the Azure region computerVision.Endpoint = "https://westus.api.cognitive.microsoft.com"; Console.WriteLine("Images being analyzed ..."); var t1 = ExtractRemoteTextAsync(computerVision, remoteImageUrl); var t2 = ExtractLocalTextAsync(computerVision, localImagePath); Task.WhenAll(t1, t2).Wait(5000); Console.WriteLine("Press ENTER to exit"); Console.ReadLine(); } // Read text from a remote image private static async Task ExtractRemoteTextAsync( ComputerVisionClient computerVision, string imageUrl) { if (!Uri.IsWellFormedUriString(imageUrl, UriKind.Absolute)) { Console.WriteLine( "\nInvalid remoteImageUrl:\n{0} \n", imageUrl); return; } // Start the async process to read the text BatchReadFileHeaders textHeaders = await computerVision.BatchReadFileAsync( imageUrl); await GetTextAsync(computerVision, textHeaders.OperationLocation); } // Recognize text from a local image private static async Task ExtractLocalTextAsync( ComputerVisionClient computerVision, string imagePath) { if (!File.Exists(imagePath)) { Console.WriteLine( "\nUnable to open or read localImagePath:\n{0} \n", imagePath); return; } using (Stream imageStream = File.OpenRead(imagePath)) { // Start the async process to recognize the text BatchReadFileInStreamHeaders textHeaders = await computerVision.BatchReadFileInStreamAsync( imageStream); await GetTextAsync(computerVision, textHeaders.OperationLocation); } } // Retrieve the recognized text private static async Task GetTextAsync( ComputerVisionClient computerVision, string operationLocation) { // Retrieve the URI where the recognized text will be // stored from the Operation-Location header string operationId = operationLocation.Substring( operationLocation.Length - numberOfCharsInOperationId); Console.WriteLine("\nCalling GetHandwritingRecognitionOperationResultAsync()"); ReadOperationResult result = await computerVision.GetReadOperationResultAsync(operationId); // Wait for the operation to complete int i = 0; int maxRetries = 10; while ((result.Status == TextOperationStatusCodes.Running || result.Status == TextOperationStatusCodes.NotStarted) && i++ < maxRetries) { Console.WriteLine( "Server status: {0}, waiting {1} seconds...", result.Status, i); await Task.Delay(1000); result = await computerVision.GetReadOperationResultAsync(operationId); } // Display the results Console.WriteLine(); var recResults = result.RecognitionResults; foreach (TextRecognitionResult recResult in recResults) { foreach (Line line in recResult.Lines) { Console.WriteLine(line.Text); } } Console.WriteLine(); } } } 4. Replace 5. Change computerVision.Endpoint 6. Replace 7. Optionally, set with your valid subscription key. to the Azure region associated with your subscription keys, if necessary. with the path and file name of a local image. remoteImageUrl to a different image. 8. Run the program. Examine the response A successful response prints the lines of recognized text for each image. Calling GetHandwritingRecognitionOperationResultAsync() Calling GetHandwritingRecognitionOperationResultAsync() Server status: Running, waiting 1 seconds... Server status: Running, waiting 1 seconds... dog The quick brown fox jumps over the lazy Pack my box with five dozen liquor jugs See Quickstart: Extract handwritten text - REST, C# for an example of the raw JSON output from the API call. Next steps Explore the Computer Vision APIs used to analyze an image, detect celebrities and landmarks, create a thumbnail, and extract printed and handwritten text. Explore Computer Vision APIs Azure Cognitive Services Computer Vision SDK for Python 5/29/2019 • 6 minutes to read • Edit Online The Computer Vision service provides developers with access to advanced algorithms for processing images and returning information. Computer Vision algorithms analyze the content of an image in different ways, depending on the visual features you're interested in. Analyze an image Get subject domain list Analyze an image by domain Get text description of an image Get handwritten text from image Generate thumbnail For more information about this service, see What is Computer Vision?. Looking for more documentation? SDK reference documentation Cognitive Services Computer Vision documentation Prerequisites Python 3.6+ Free Computer Vision key and associated endpoint. You need these values when you create the instance of the ComputerVisionClient client object. Use one of the following methods to get these values. If you don't have an Azure Subscription Create a free key valid for 7 days with the Try It experience for the Computer Vision service. When the key is created, copy the key and endpoint name. You will need this to create the client. Keep the following after the key is created: Key value: a 32 character string with the format of xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx Key endpoint: the base endpoint URL, https://westcentralus.api.cognitive.microsoft.com If you have an Azure Subscription The easiest method to create a resource in your subscription is to use the following Azure CLI command. This creates a Cognitive Service key that can be used across many cognitive services. You need to choose the existing resource group name, for example, "my-cogserv-group" and the new computer vision resource name, such as "mycomputer-vision-resource". RES_REGION=westeurope RES_GROUP= ACCT_NAME= az cognitiveservices account create \ --resource-group $RES_GROUP \ --name $ACCT_NAME \ --location $RES_REGION \ --kind CognitiveServices \ --sku S0 \ --yes Install the SDK Install the Azure Cognitive Services Computer Vision SDK for Python package with pip: pip install azure-cognitiveservices-vision-computervision Authentication Once you create your Computer Vision resource, you need its endpoint, and one of its account keys to instantiate the client object. Use these values when you create the instance of the ComputerVisionClient client object. For example, use the Bash terminal to set the environment variables: ACCOUNT_ENDPOINT= ACCT_NAME= For Azure subscription users, get credentials for key and endpoint If you do not remember your endpoint and key, you can use the following method to find them. If you need to create a key and endpoint, you can use the method for Azure subscription holders or for users without an Azure subscription. Use the Azure CLI snippet below to populate two environment variables with the Computer Vision account endpoint and one of its keys (you can also find these values in the Azure portal ). The snippet is formatted for the Bash shell. RES_GROUP= ACCT_NAME= export ACCOUNT_ENDPOINT=$(az cognitiveservices account show \ --resource-group $RES_GROUP \ --name $ACCT_NAME \ --query endpoint \ --output tsv) export ACCOUNT_KEY=$(az cognitiveservices account keys list \ --resource-group $RES_GROUP \ --name $ACCT_NAME \ --query key1 \ --output tsv) Create client Get the endpoint and key from environment variables then create the ComputerVisionClient client object. from azure.cognitiveservices.vision.computervision import ComputerVisionClient from azure.cognitiveservices.vision.computervision.models import VisualFeatureTypes from msrest.authentication import CognitiveServicesCredentials # Get endpoint and key from environment variables import os endpoint = os.environ['ACCOUNT_ENDPOINT'] key = os.environ['ACCOUNT_KEY'] # Set credentials credentials = CognitiveServicesCredentials(key) # Create client client = ComputerVisionClient(endpoint, credentials) Examples You need a ComputerVisionClient client object before using any of the following tasks. Analyze an image You can analyze an image for certain features with analyze_image . Use the visual_features property to set the types of analysis to perform on the image. Common values are VisualFeatureTypes.tags and VisualFeatureTypes.description . url = "https://upload.wikimedia.org/wikipedia/commons/thumb/1/12/Broadway_and_Times_Square_by_night.jpg/450pxBroadway_and_Times_Square_by_night.jpg" image_analysis = client.analyze_image(url,visual_features=[VisualFeatureTypes.tags]) for tag in image_analysis.tags: print(tag) Get subject domain list Review the subject domains used to analyze your image with list_models . These domain names are used when analyzing an image by domain. An example of a domain is landmarks . models = client.list_models() for x in models.models_property: print(x) Analyze an image by domain You can analyze an image by subject domain with domains in order to use the correct domain name. analyze_image_by_domain . Get the list of supported subject # type of prediction domain = "landmarks" # Public domain image of Eiffel tower url = "https://images.pexels.com/photos/338515/pexels-photo-338515.jpeg" # English language response language = "en" analysis = client.analyze_image_by_domain(domain, url, language) for landmark in analysis.result["landmarks"]: print(landmark["name"]) print(landmark["confidence"]) Get text description of an image You can get a language-based text description of an image with describe_image . Request several descriptions with the max_description property if you are doing text analysis for keywords associated with the image. Examples of a text description for the following image include a train crossing a bridge over a body of water , a large bridge over a body of water , and a train crossing a bridge over a large body of water . domain = "landmarks" url = "http://www.public-domain-photos.com/free-stock-photos-4/travel/san-francisco/golden-gate-bridge-in-sanfrancisco.jpg" language = "en" max_descriptions = 3 analysis = client.describe_image(url, max_descriptions, language) for caption in analysis.captions: print(caption.text) print(caption.confidence) Get text from image You can get any handwritten or printed text from an image. This requires two calls to the SDK: batch_read_file and get_read_operation_result . The call to batch_read_file is asynchronous. In the results of the get_read_operation_result call, you need to check if the first call completed with TextOperationStatusCodes before extracting the text data. The results include the text as well as the bounding box coordinates for the text. # import models from azure.cognitiveservices.vision.computervision.models import TextOperationStatusCodes import time url = "https://azurecomcdn.azureedge.net/cvt1979217d3d0d31c5c87cbd991bccfee2d184b55eeb4081200012bdaf6a65601a/images/shared/cognitive-services-demos/readtext/read-1-thumbnail.png" raw = True custom_headers = None numberOfCharsInOperationId = 36 # Async SDK call rawHttpResponse = client.batch_read_file(url, custom_headers, raw) # Get ID from returned headers operationLocation = rawHttpResponse.headers["Operation-Location"] idLocation = len(operationLocation) - numberOfCharsInOperationId operationId = operationLocation[idLocation:] # SDK call while True: result = client.get_read_operation_result(operationId) if result.status not in ['NotStarted', 'Running']: break time.sleep(1) # Get data if result.status == TextOperationStatusCodes.succeeded: for textResult in result.recognition_results: for line in textResult.lines: print(line.text) print(line.bounding_box) Generate thumbnail You can generate a thumbnail (JPG ) of an image with the same proportions as the original image. generate_thumbnail . The thumbnail does not need to be in Install Pillow to use this example: pip install Pillow Once Pillow is installed, use the package in the following code example to generate the thumbnail image. # Pillow package from PIL import Image # IO package to create local image import io width = 50 height = 50 url = "http://www.public-domain-photos.com/free-stock-photos-4/travel/san-francisco/golden-gate-bridge-in-sanfrancisco.jpg" thumbnail = client.generate_thumbnail(width, height, url) for x in thumbnail: image = Image.open(io.BytesIO(x)) image.save('thumbnail.jpg') Troubleshooting General When you interact with the ComputerVisionClient client object using the Python SDK, the ComputerVisionErrorException class is used to return errors. Errors returned by the service correspond to the same HTTP status codes returned for REST API requests. For example, if you try to analyze an image with an invalid key, a 401 error is returned. In the following snippet, the error is handled gracefully by catching the exception and displaying additional information about the error. domain = "landmarks" url = "http://www.public-domain-photos.com/free-stock-photos-4/travel/san-francisco/golden-gate-bridge-in-sanfrancisco.jpg" language = "en" max_descriptions = 3 try: analysis = client.describe_image(url, max_descriptions, language) for caption in analysis.captions: print(caption.text) print(caption.confidence) except HTTPFailure as e: if e.status_code == 401: print("Error unauthorized. Make sure your key and endpoint are correct.") else: raise Handle transient errors with retries While working with the ComputerVisionClient client, you might encounter transient failures caused by rate limits enforced by the service, or other transient problems like network outages. For information about handling these types of failures, see Retry pattern in the Cloud Design Patterns guide, and the related Circuit Breaker pattern. Next steps Applying content tags to images Tutorial: Use Computer Vision to generate image metadata in Azure Storage 5/10/2019 • 5 minutes to read • Edit Online In this tutorial, you will learn how to integrate the Azure Computer Vision service into a web app to generate metadata for uploaded images. A full app guide can be found in the Azure Storage and Cognitive Services Lab on GitHub, and this tutorial essentially covers Exercise 5 of the lab. You may wish to create the end-to-end application by following every step, but if you'd just like to see how Computer Vision can be integrated into an existing web app, read along here. This tutorial shows you how to: Create a Computer Vision resource in Azure Perform image analysis on Azure Storage images Attach metadata to Azure Storage images Check image metadata using Azure Storage Explorer If you don't have an Azure subscription, create a free account before you begin. Prerequisites Visual Studio 2017 Community edition or higher, with the "ASP.NET and web development" and "Azure development" workloads installed. An Azure Storage account with a blob container allocated for images (follow Exercises 1 of the Azure Storage Lab if you need help with this step). The Azure Storage Explorer tool (follow Exercise 2 of the Azure Storage Lab if you need help with this step). An ASP.NET web application with access to Azure Storage (follow Exercise 3 of the Azure Storage Lab to create such an app quickly). Create a Computer Vision resource You will need to create a Computer Vision resource for your Azure account; this resource manages your access to Azure's Computer Vision service. 1. Follow the instructions in Create an Azure Cognitive Services resource to create a Computer Vision resource. 2. Then go to the menu for your resource group and click the Computer Vision API subscription that you just created. Copy the URL under Endpoint to somewhere you can easily retrieve it in a moment. Then click Show access keys. 3. In the next window, copy the value of KEY 1 to the clipboard. Add Computer Vision credentials Next, you will add required credentials to your app so that it can access Computer Vision resources Open your ASP.NET web application in Visual Studio and navigate to the Web.config file at the root of the project. Add the following statements to the section of the file, replacing VISION_KEY with the key you copied in the previous step, and VISION_ENDPOINT with the URL you saved in the step before that. Then in the Solution Explorer, right-click the project and use the Manage NuGet Packages command to install the package Microsoft.Azure.CognitiveServices.Vision.ComputerVision. This package contains the types needed to call the Computer Vision API. Add metadata generation code Next, you will add the code that actually leverages the Computer Vision service to create metadata for images. These steps will apply to the ASP.NET app in the lab, but you can adapt them to your own app. What's important is that at this point you have an ASP.NET web application that can upload images to an Azure Storage container, read images from it, and display them in the view. If you're unsure about this, it's best to follow Exercise 3 of the Azure Storage Lab. 1. Open the HomeController.cs file in the project's Controllers folder and add the following at the top of the file: using statements using Microsoft.Azure.CognitiveServices.Vision.ComputerVision; using Microsoft.Azure.CognitiveServices.Vision.ComputerVision.Models; 2. Then, go to the Upload method; this method converts and uploads images to blob storage. Add the following code immediately after the block that begins with // Generate a thumbnail (or at the end of your image-blob-creation process). This code takes the blob containing the image ( photo ), and uses Computer Vision to generate a description for that image. The Computer Vision API also generates a list of keywords that apply to the image. The generated description and keywords are stored in the blob's metadata so that they can be retrieved later on. // Submit the image to Azure's Computer Vision API ComputerVisionClient vision = new ComputerVisionClient( new ApiKeyServiceClientCredentials(ConfigurationManager.AppSettings["SubscriptionKey"]), new System.Net.Http.DelegatingHandler[] { }); vision.Endpoint = ConfigurationManager.AppSettings["VisionEndpoint"]; VisualFeatureTypes[] features = new VisualFeatureTypes[] { VisualFeatureTypes.Description }; var result = await vision.AnalyzeImageAsync(photo.Uri.ToString(), features); // Record the image description and tags in blob metadata photo.Metadata.Add("Caption", result.Description.Captions[0].Text); for (int i = 0; i < result.Description.Tags.Count; i++) { string key = String.Format("Tag{0}", i); photo.Metadata.Add(key, result.Description.Tags[i]); } await photo.SetMetadataAsync(); 3. Next, go to the Index method in the same file; this method enumerates the stored image blobs in the targeted blob container (as IListBlobItem instances) and passes them to the application view. Replace the foreach block in this method with the following code. This code calls CloudBlockBlob.FetchAttributes to get each blob's attached metadata. It extracts the computer-generated description ( caption ) from the metadata and adds it to the BlobInfo object, which gets passed to the view. foreach (IListBlobItem item in container.ListBlobs()) { var blob = item as CloudBlockBlob; if (blob != null) { blob.FetchAttributes(); // Get blob metadata var caption = blob.Metadata.ContainsKey("Caption") ? blob.Metadata["Caption"] : blob.Name; blobs.Add(new BlobInfo() { ImageUri = blob.Uri.ToString(), ThumbnailUri = blob.Uri.ToString().Replace("/photos/", "/thumbnails/"), Caption = caption }); } } Test the app Save your changes in Visual Studio and press Ctrl+F5 to launch the application in your browser. Use the app to upload a few images, either from the "photos" folder in the lab's resources or from your own folder. When you hover the cursor over one of the images in the view, a tooltip window should appear and display the computergenerated caption for the image. To view all of the attached metadata, use the Azure Storage Explorer to view the storage container you're using for images. Right-click any of the blobs in the container and select Properties. In the dialog, you'll see a list of keyvalue pairs. The computer-generated image description is stored in the item "Caption," and the search keywords are stored in "Tag0," "Tag1," and so on. When you're finished, click Cancel to close the dialog. Clean up resources If you'd like to keep working on your web app, see the Next steps section. If you don't plan to continue using this application, you should delete all app-specific resources. To do this, you can simply delete the resource group that contains your Azure Storage subscription and Computer Vision resource. This will remove the storage account, the blobs uploaded to it, and the App Service resource needed to connect with the ASP.NET web app. To delete the resource group, open the Resource groups blade in the portal, navigate to the resource group you used for this project, and click Delete resource group at the top of the view. You will be asked to type the resource group's name to confirm that you want to delete it, because once deleted, a resource group can't be recovered. Next steps In this tutorial, you integrated Azure's Computer Vision service into an existing web app to automatically generate captions and keywords for blob images as they're uploaded. Next, refer to the Azure Storage Lab, Exercise 6, to learn how to add search functionality to your web app. This takes advantage of the search keywords that the Computer Vision service generates. Add search to your app Applying content tags to images 2/15/2019 • 2 minutes to read • Edit Online Computer Vision returns tags based on thousands of recognizable objects, living beings, scenery, and actions. When tags are ambiguous or not common knowledge, the API response provides 'hints' to clarify the meaning of the tag in context of a known setting. Tags are not organized as a taxonomy and no inheritance hierarchies exist. A collection of content tags forms the foundation for an image 'description' displayed as human readable language formatted in complete sentences. Note, that at this point English is the only supported language for image description. After uploading an image or specifying an image URL, Computer Vision algorithms output tags based on the objects, living beings, and actions identified in the image. Tagging is not limited to the main subject, such as a person in the foreground, but also includes the setting (indoor or outdoor), furniture, tools, plants, animals, accessories, gadgets etc. Image tagging example The following JSON response illustrates what Computer Vision returns when tagging visual features detected in the example image. . { "tags": [ { "name": "grass", "confidence": 0.9999995231628418 }, { "name": "outdoor", "confidence": 0.99992108345031738 }, { "name": "house", "confidence": 0.99685388803482056 }, { "name": "sky", "confidence": 0.99532157182693481 }, { "name": "building", "confidence": 0.99436837434768677 }, { "name": "tree", "confidence": 0.98880356550216675 }, { "name": "lawn", "confidence": 0.788884699344635 }, { "name": "green", "confidence": 0.71250593662261963 }, { "name": "residential", "confidence": 0.70859086513519287 }, { "name": "grassy", "confidence": 0.46624681353569031 } ], "requestId": "06f39352-e445-42dc-96fb-0a1288ad9cf1", "metadata": { "height": 200, "width": 300, "format": "Jpeg" } } Next steps Learn concepts about categorizing images and describing images. Detect common objects in images 4/19/2019 • 2 minutes to read • Edit Online Object detection is similar to tagging, but the API returns the bounding box coordinates (in pixels) for each object found. For example, if an image contains a dog, cat and person, the Detect operation will list those objects together with their coordinates in the image. You can use this functionality to process the relationships between the objects in an image. It also lets you determine whether there are multiple instances of the same tag in an image. The Detect API applies tags based on the objects or living things identified in the image. There is currently no formal relationship between the tagging taxonomy and the object detection taxonomy. At a conceptual level, the Detect API only finds objects and living things, while the Tag API can also include contextual terms like "indoor", which can't be localized with bounding boxes. Object detection example The following JSON response illustrates what Computer Vision returns when detecting objects in the example image. { "objects":[ { "rectangle":{ "x":730, "y":66, "w":135, "h":85 }, "object":"kitchen appliance", "confidence":0.501 }, { "rectangle":{ "x":523, "y":377, "w":185, "h":46 }, "object":"computer keyboard", "confidence":0.51 }, { "rectangle":{ "x":471, "y":218, "w":289, "h":226 }, "object":"Laptop", "confidence":0.85, "parent":{ "object":"computer", "confidence":0.851 } }, { "rectangle":{ "x":654, "y":0, "w":584, "h":473 }, "object":"person", "confidence":0.855 } ], "requestId":"a7fde8fd-cc18-4f5f-99d3-897dcd07b308", "metadata":{ "width":1260, "height":473, "format":"Jpeg" } } Limitations It's important to note the limitations of object detection so you can avoid or mitigate the effects of false negatives (missed objects) and limited detail. Objects are generally not detected if they're small (less than 5% of the image). Objects are generally not detected if they're arranged closely together (a stack of plates, for example). Objects are not differentiated by brand or product names (different types of sodas on a store shelf, for example). However, you can get brand information from an image by using the Brand detection feature. Use the API The object detection feature is part of the Analyze Image API. You can call this API through a native SDK or through REST calls. Include Objects in the visualFeatures query parameter. Then, when you get the full JSON response, simply parse the string for the contents of the "objects" section. Quickstart: Analyze an image (.NET SDK) Quickstart: Analyze an image (REST API) Detect popular brands in images 4/19/2019 • 2 minutes to read • Edit Online Brand detection is a specialized mode of object detection that uses a database of thousands of global logos to identify commercial brands in images or video. You can use this feature, for example, to discover which brands are most popular on social media or most prevalent in media product placement. The Computer Vision service detects whether there are brand logos in a given image; if so, it returns the brand name, a confidence score, and the coordinates of a bounding box around the logo. The built-in logo database covers popular brands in consumer electronics, clothing, and more. If you find that the brand you're looking for is not detected by the Computer Vision service, you may be better served creating and training your own logo detector using the Custom Vision service. Brand detection example The following JSON responses illustrate what Computer Vision returns when detecting brands in the example images. { "brands":[ { "name":"Microsoft", "confidence":0.706, "rectangle":{ "x":470, "y":862, "w":338, "h":327 } } ], "requestId":"5fda6b40-3f60-4584-bf23-911a0042aa13", "metadata":{ "width":2286, "height":1715, "format":"Jpeg" } } In some cases, the brand detector will pick up both the logo image and the stylized brand name as two separate logos. { "brands":[ { "name":"Microsoft", "confidence":0.657, "rectangle":{ "x":436, "y":473, "w":568, "h":267 } }, { "name":"Microsoft", "confidence":0.85, "rectangle":{ "x":101, "y":561, "w":273, "h":263 } } ], "requestId":"10dcd2d6-0cf6-4a5e-9733-dc2e4b08ac8d", "metadata":{ "width":1286, "height":1715, "format":"Jpeg" } } Use the API The brand detection feature is part of the Analyze Image API. You can call this API through a native SDK or through REST calls. Include Brands in the visualFeatures query parameter. Then, when you get the full JSON response, simply parse the string for the contents of the "brands" section. Quickstart: Analyze an image (.NET SDK) Quickstart: Analyze an image (REST API) Categorize images by subject matter 4/19/2019 • 2 minutes to read • Edit Online In addition to tags and a description, Computer Vision returns the taxonomy-based categories detected in an image. Unlike tags, categories are organized in a parent/child hereditary hierarchy, and there are fewer of them (86, as opposed to thousands of tags). All category names are in English. Categorization can be done by itself or alongside the newer tags model. The 86-category concept Computer vision can categorize an image broadly or specifically, using the list of 86 categories in the following diagram. For the full taxonomy in text format, see Category Taxonomy. Image categorization examples The following JSON response illustrates what Computer Vision returns when categorizing the example image based on its visual features. { "categories": [ { "name": "people_", "score": 0.81640625 } ], "requestId": "bae7f76a-1cc7-4479-8d29-48a694974705", "metadata": { "height": 200, "width": 300, "format": "Jpeg" } } The following table illustrates a typical image set and the category returned by Computer Vision for each image. IMAGE CATEGORY people_group animal_dog outdoor_mountain IMAGE CATEGORY food_bread Next steps Learn concepts about tagging images and describing images. Describe images with human-readable language 2/15/2019 • 2 minutes to read • Edit Online Computer Vision can analyze an image and generate a human-readable sentence that describes its contents. The algorithm actually retruns several descriptions based on different visual features, and each description is given a confidence score. The final output is a list of descriptions ordered from highest to lowest confidence. Image description example The following JSON response illustrates what Computer Vision returns when describing the example image based on its visual features. { "description": { "tags": ["outdoor", "building", "photo", "city", "white", "black", "large", "sitting", "old", "water", "skyscraper", "many", "boat", "river", "group", "street", "people", "field", "tall", "bird", "standing"], "captions": [ { "text": "a black and white photo of a city", "confidence": 0.95301952483304808 }, { "text": "a black and white photo of a large city", "confidence": 0.94085190563213816 }, { "text": "a large white building in a city", "confidence": 0.93108362931954824 } ] }, "requestId": "b20bfc83-fb25-4b8d-a3f8-b2a1f084b159", "metadata": { "height": 300, "width": 239, "format": "Jpeg" } } Next steps Learn concepts about tagging images and categorizing images. Face detection with Computer Vision 4/19/2019 • 2 minutes to read • Edit Online Computer Vision can detect human faces within an image and generate the age, gender, and rectangle for each detected face. NOTE This feature is also offered by the Azure Face service. See this alternative for more detailed face analysis, including face identification and pose detection. Face detection examples The following example demonstrates the JSON response returned by Computer Vision for an image containing a single human face. { "faces": [ { "age": 23, "gender": "Female", "faceRectangle": { "top": 45, "left": 194, "width": 44, "height": 44 } } ], "requestId": "8439ba87-de65-441b-a0f1-c85913157ecd", "metadata": { "height": 200, "width": 300, "format": "Png" } } The next example demonstrates the JSON response returned for an image containing multiple human faces. { "faces": [ { "age": 11, "gender": "Male", "faceRectangle": { "top": 62, "left": 22, "width": 45, "height": 45 } }, { "age": 11, "gender": "Female", "faceRectangle": { "top": 127, "left": 240, "width": 42, "height": 42 } }, { "age": 37, "gender": "Female", "faceRectangle": { "top": 55, "left": 200, "width": 41, "height": 41 } }, { "age": 41, "gender": "Male", "faceRectangle": { "top": 45, "left": 103, "width": 39, "height": 39 } } ], "requestId": "3a383cbe-1a05-4104-9ce7-1b5cf352b239", "metadata": { "height": 230, "width": 300, "format": "Png" } } Next steps See the Analyze Image reference documentation to learn more about how to use the face detection feature. Detecting image types with Computer Vision 3/12/2019 • 2 minutes to read • Edit Online With the Analyze Image API, Computer Vision can analyze the content type of images, indicating whether an image is clip art or a line drawing. Detecting clip art Computer Vision analyzes an image and rates the likelihood of the image being clip art on a scale of 0 to 3, as described in the following table. VALUE MEANING 0 Non-clip-art 1 Ambiguous 2 Normal-clip-art 3 Good-clip-art Clip art detection examples The following JSON responses illustrates what Computer Vision returns when rating the likelihood of the example images being clip art. { "imageType": { "clipArtType": 3, "lineDrawingType": 0 }, "requestId": "88c48d8c-80f3-449f-878f-6947f3b35a27", "metadata": { "height": 225, "width": 300, "format": "Jpeg" } } { "imageType": { "clipArtType": 0, "lineDrawingType": 0 }, "requestId": "a9c8490a-2740-4e04-923b-e8f4830d0e47", "metadata": { "height": 200, "width": 300, "format": "Jpeg" } } Detecting line drawings Computer Vision analyzes an image and returns a boolean value indicating whether the image is a line drawing. Line drawing detection examples The following JSON responses illustrates what Computer Vision returns when indicating whether the example images are line drawings. { "imageType": { "clipArtType": 2, "lineDrawingType": 1 }, "requestId": "6442dc22-476a-41c4-aa3d-9ceb15172f01", "metadata": { "height": 268, "width": 300, "format": "Jpeg" } } { "imageType": { "clipArtType": 0, "lineDrawingType": 0 }, "requestId": "98437d65-1b05-4ab7-b439-7098b5dfdcbf", "metadata": { "height": 200, "width": 300, "format": "Jpeg" } } Next steps See the Analyze Image reference documentation to learn how to detect image types. Detect domain-specific content 4/19/2019 • 2 minutes to read • Edit Online In addition to tagging and high-level categorization, Computer Vision also supports further domain-specific analysis using models that have been trained on specialized data. There are two ways to use the domain-specific models: by themselves (scoped analysis) or as an enhancement to the categorization feature. Scoped analysis You can analyze an image using only the chosen domain-specific model by calling the Models//Analyze API. The following is a sample JSON response returned by the models/celebrities/analyze API for the given image: { "result": { "celebrities": [{ "faceRectangle": { "top": 391, "left": 318, "width": 184, "height": 184 }, "name": "Satya Nadella", "confidence": 0.99999856948852539 }] }, "requestId": "8217262a-1a90-4498-a242-68376a4b956b", "metadata": { "width": 800, "height": 1200, "format": "Jpeg" } } Enhanced categorization analysis You can also use domain-specific models to supplement general image analysis. You do this as part of high-level categorization by specifying domain-specific models in the details parameter of the Analyze API call. In this case, the 86-category taxonomy classifier is called first. If any of the detected categories have a matching domain-specific model, the image is passed through that model as well and the results are added. The following JSON response shows how domain-specific analysis can be included as the broader categorization analysis. "categories":[ { "name":"abstract_", "score":0.00390625 }, { "name":"people_", "score":0.83984375, "detail":{ "celebrities":[ { "name":"Satya Nadella", "faceRectangle":{ "left":597, "top":162, "width":248, "height":248 }, "confidence":0.999028444 } ], "landmarks":[ { "name":"Forbidden City", "confidence":0.9978346 } ] } } ] detail node in a List the domain-specific models Currently, Computer Vision supports the following domain-specific models: NAME DESCRIPTION celebrities Celebrity recognition, supported for images classified in the people_ category landmarks Landmark recognition, supported for images classified in the outdoor_ or building_ categories Calling the Models API will return this information along with the categories to which each model can apply: { "models":[ { "name":"celebrities", "categories":[ "people_", "人_", "pessoas_", "gente_" ] }, { "name":"landmarks", "categories":[ "outdoor_", "户外_", "屋外_", "aoarlivre_", "alairelibre_", "building_", "建筑_", "建物_", "edifício_" ] } ] } Next steps Learn concepts about categorizing images. Detect color schemes in images 4/19/2019 • 2 minutes to read • Edit Online Computer Vision analyzes the colors in an image to provide three different attributes: the dominant foreground color, the dominant background color, and the set of dominant colors for the image as a whole. Returned colors belong to the set: black, blue, brown, gray, green, orange, pink, purple, red, teal, white, and yellow. Computer Vision also extracts an accent color, which represents the most vibrant color in the image, based on a combination of dominant colors and saturation. The accent color is returned as a hexadecimal HTML color code. Computer Vision also returns a boolean value indicating whether an image is in black and white. Color scheme detection examples The following example illustrates the JSON response returned by Computer Vision when detecting the color scheme of the example image. In this case, the example image is not a black and white image, but the dominant foreground and background colors are black, and the dominant colors for the image as a whole are black and white. { "color": { "dominantColorForeground": "Black", "dominantColorBackground": "Black", "dominantColors": ["Black", "White"], "accentColor": "BB6D10", "isBwImg": false }, "requestId": "0dc394bf-db50-4871-bdcc-13707d9405ea", "metadata": { "height": 202, "width": 300, "format": "Jpeg" } } Dominant color examples The following table shows the returned foreground, background, and image colors for each sample image. IMAGE DOMINANT COLORS Foreground: Black Background: White Colors: Black, White, Green Foreground: Black Background: Black Colors: Black Accent color examples The following table shows the returned accent color, as a hexadecimal HTML color value, for each example image. IMAGE ACCENT COLOR #BB6D10 #C6A205 IMAGE ACCENT COLOR #474A84 Black & white detection examples The following table shows Computer Vision's black and white evaluation in the sample images. IMAGE BLACK & WHITE? true false Next steps Learn concepts about detecting image types. Generating smart-cropped thumbnails with Computer Vision 4/19/2019 • 2 minutes to read • Edit Online A thumbnail is a reduced-size representation of an image. Thumbnails are used to represent images and other data in a more economical, layout-friendly way. The Computer Vision API uses smart cropping, together with resizing the image, to create intuitive thumbnails for a given image. The Computer Vision thumbnail generation algorithm works as follows: 1. Remove distracting elements from the image and identify the area of interest—the area of the image in which the main object(s) appears. 2. Crop the image based on the identified area of interest. 3. Change the aspect ratio to fit the target thumbnail dimensions. Area of interest When you upload an image, the Computer Vision API analyzes it to determine the area of interest. It can then use this region to determine how to crop the image. The cropping operation, however, will always match the desired aspect ratio if one is specified. You can also get the raw bounding box coordinates of this same area of interest by calling the areaOfInterest API instead. You can then use this information to modify the original image however you wish. Examples The generated thumbnail can vary widely depending on what you specify for height, width, and smart cropping, as shown in the following image. The following table illustrates typical thumbnails generated by Computer Vision for the example images. The thumbnails were generated for a specified target height and width of 50 pixels, with smart cropping enabled. IMAGE Next steps Learn about tagging images and categorizing images. THUMBNAIL Recognize printed and handwritten text 5/29/2019 • 3 minutes to read • Edit Online Computer Vision provides a number of services that detect and extract printed or handwritten text that appears in images. This is useful in a variety of scenarios such as note taking, medical records, security, and banking. The following three sections detail three different text recognition APIs, each optimized for different use cases. Read API The Read API detects text content in an image using our latest recognition models and converts the identified text into a machine-readable character stream. It's optimized for text-heavy images (such as documents that have been digitally scanned) and for images with a lot of visual noise. It will determine which recognition model to use for each line of text, supporting images with both printed and handwritten text. The Read API executes asynchronously because larger documents can take several minutes to return a result. The Read operation maintains the original line groupings of recognized words in its output. Each line comes with bounding box coordinates, and each word within the line also has its own coordinates. If a word was recognized with low confidence, that information is conveyed as well. See the Read API reference docs to learn more. NOTE This feature is only available for English text. Image requirements The Read API works with images that meet the following requirements: The image must be presented in JPEG, PNG, BMP, PDF, or TIFF format. The dimensions of the image must be between 50 x 50 and 4200 x 4200 pixels. PDF pages must be 17 x 17 inches or smaller. The file size of the image must be less than 20 megabytes (MB ). Limitations If you are using a free-tier subscription, the Read API will only process the first two pages of a PDF or TIFF document. With a paid subscription, it will process up to 200 pages. Also note that the API will detect a maximum of 300 lines per page. OCR (optical character recognition) API Computer Vision's optical character recognition (OCR ) API is similar to the Read API, but it executes synchronously and is not optimized for large documents. It uses an earlier recognition model but works with more languages; see Language support for a full list of the supported languages. If necessary, OCR corrects the rotation of the recognized text by returning the rotational offset in degrees about the horizontal image axis. OCR also provides the frame coordinates of each word, as seen in the following illustration. See the OCR reference docs to learn more. Image requirements The OCR API works on images that meet the following requirements: The image must be presented in JPEG, PNG, GIF, or BMP format. The size of the input image must be between 50 x 50 and 4200 x 4200 pixels. The text in the image can be rotated by any multiple of 90 degrees plus a small angle of up to 40 degrees. Limitations On photographs where text is dominant, false positives may come from partially recognized words. On some photographs, especially photos without any text, precision can vary depending on the type of image. Recognize Text API NOTE The Recognize Text API is being deprecated in favor of the Read API. The Read API has similar capabilities and is updated to handle PDF, TIFF, and multi-page files. The Recognize Text API is similar to OCR, but it executes asynchronously and uses updated recognition models. See the Recognize Text API reference docs to learn more. Image requirements The Recognize Text API works with images that meet the following requirements: The image must be presented in JPEG, PNG, or BMP format. The dimensions of the image must be between 50 x 50 and 4200 x 4200 pixels. The file size of the image must be less than 4 megabytes (MB ). Limitations The accuracy of text recognition operations depends on the quality of the images. The following factors may cause an inaccurate reading: Blurry images. Handwritten or cursive text. Artistic font styles. Small text size. Complex backgrounds, shadows, or glare over text or perspective distortion. Oversized or missing capital letters at the beginnings of words. Subscript, superscript, or strikethrough text. Next steps Follow the Extract printed text (OCR ) quickstart to implement text recognition in a simple C# app. Detect adult and racy content 2/15/2019 • 2 minutes to read • Edit Online Computer Vision can detect adult material in images so that developers can restrict the display of such images in their software. Content flags are applied with a score between zero and one so that developers can interpret the results according to their own preferences. NOTE This feature is also offered by the Azure Content Moderator service. See this alternative for solutions to more rigorous content moderation scenarios, such as text moderation and human review workflows. Content flag definitions Adult images are defined as those which are pornographic in nature and often depict nudity and sexual acts. Racy images are defined as images that are sexually suggestive in nature and often contain less sexually explicit content than images tagged as Adult. Identify adult and racy content The Analyze API. The Analyze Image method returns two boolean properties, isAdultContent and isRacyContent , in the JSON response of the method to indicate adult and racy content respectively. The method also returns two properties, adultScore and racyScore , which represent the confidence scores for identifying adult and racy content respectively. Next steps Learn concepts about detecting domain-specific content and detecting faces. Use Computer Vision features with the REST API and Java 5/7/2019 • 22 minutes to read • Edit Online This tutorial shows the features of the Azure Cognitive Services Computer Vision REST API. Explore a Java Swing application that uses the Computer Vision REST API to perform optical character recognition (OCR ), create smart-cropped thumbnails, plus detect, categorize, tag, and describe visual features, including faces, in an image. This example lets you submit an image URL for analysis or processing. You can use this open source example as a template for building your own app in Java to use the Computer Vision REST API. This tutorial will cover how to use Computer Vision to: Analyze an image Identify a natural or artificial landmark in an image Identify a celebrity in an image Create a quality thumbnail from an image Read printed text in an image Read handwritten text in an image The Java Swing form application has already been written but has no functionality. In this tutorial, you add the code specific to the Computer Vision REST API to complete the application's functionality. Prerequisites Platform requirements This tutorial has been developed using the NetBeans IDE. Specifically, the Java SE version of NetBeans, which you can download here. Subscribe to Computer Vision API and get a subscription key Before creating the example, you must subscribe to Computer Vision API which is part of Azure Cognitive Services. For subscription and key management details, see Subscriptions. Both the primary and secondary keys are valid to use in this tutorial. Acquire incomplete tutorial project Download the project 1. Go to the Cognitive Services Java Computer Vision Tutorial repository. 2. Click the Clone or download button. 3. Click Download ZIP to download a .zip file of the tutorial project. There is no need to extract the contents of the .zip file because NetBeans imports the project from the .zip file. Import the tutorial project Import the cognitive-services-java-computer-vision-tutorial-master.zip file into NetBeans. 1. In NetBeans, click File > Import Project > From ZIP.... The Import Project(s) from ZIP dialog box appears. 2. In the ZIP File: field, click the Browse button to locate the cognitive-services-java-computer-visiontutorial-master.zip file, then click Open. 3. Click Import from the Import Project(s) from ZIP dialog box. 4. In the Projects panel, expand ComputerVision > Source Packages > . Some versions of NetBeans use src instead of Source Packages > . In that case, expand src. 5. Double-click MainFrame.java to load the file into the NetBeans editor. The Design tab of the MainFrame.java file appears. 6. Click the Source tab to view the Java source code. Build and run the tutorial project 1. Press F6 to build and run the tutorial application. In the tutorial application, click a tab to bring up the pane for that feature. The buttons have empty methods, so they do nothing. At the bottom of the window are the fields Subscription Key and Subscription Region. These fields must be filled with a valid subscription key and the correct region for that subscription key. To obtain a subscription key, see Subscriptions. If you obtained your subscription key from the free trial at that link, then the default region westcentralus is the correct region for your subscription keys. 2. Exit the tutorial application. Add tutorial code to the project The Java Swing application is set up with six tabs. Each tab demonstrates a different function of Computer Vision (analyze, OCR, and so on). The six tutorial sections do not have interdependencies, so you can add one section, all six sections, or any subset. You can add the sections in any order. Analyze an image The Analyze feature of Computer Vision scans an image for more than 2,000 recognizable objects, living things, scenery, and actions. Once the analysis is complete, Analyze returns a JSON object that describes the image with descriptive tags, color analysis, captions, and more. To complete the Analyze feature of the tutorial application, perform the following steps: Add the event handler code for the analyze button The analyzeImageButtonActionPerformed event handler method clears the form, displays the image specified in the URL, then calls the AnalyzeImage method to analyze the image. When AnalyzeImage returns, the method displays the formatted JSON response in the Response text area, extracts the first caption from the JSONObject, and displays the caption and the confidence level that the caption is correct. Copy and paste the following code into the analyzeImageButtonActionPerformed method. NOTE NetBeans won't let you paste to the method definition line ( private void ) or to the closing curly brace of that method. To copy the code, copy the lines between the method definition and the closing curly brace, and paste them over the contents of the method. private void analyzeImageButtonActionPerformed(java.awt.event.ActionEvent evt) { URL analyzeImageUrl; // Clear out the previous image, response, and caption, if any. analyzeImage.setIcon(new ImageIcon()); analyzeCaptionLabel.setText(""); analyzeResponseTextArea.setText(""); // Display the image specified in the text box. try { analyzeImageUrl = new URL(analyzeImageUriTextBox.getText()); BufferedImage bImage = ImageIO.read(analyzeImageUrl); scaleAndShowImage(bImage, analyzeImage); } catch(IOException e) { analyzeResponseTextArea.setText("Error loading Analyze image: " + e.getMessage()); return; } // Analyze the image. JSONObject jsonObj = AnalyzeImage(analyzeImageUrl.toString()); // A return of null indicates failure. if (jsonObj == null) { return; } // Format and display the JSON response. analyzeResponseTextArea.setText(jsonObj.toString(2)); // Extract the text and confidence from the first caption in the description object. if (jsonObj.has("description") && jsonObj.getJSONObject("description").has("captions")) { JSONObject jsonCaption = jsonObj.getJSONObject("description").getJSONArray("captions").getJSONObject(0); if (jsonCaption.has("text") && jsonCaption.has("confidence")) { analyzeCaptionLabel.setText("Caption: " + jsonCaption.getString("text") + " (confidence: " + jsonCaption.getDouble("confidence") + ")."); } } } Add the wrapper for the REST API call The AnalyzeImage method wraps the REST API call to analyze an image. The method returns a JSONObject describing the image, or null if there was an error. Copy and paste the AnalyzeImage method to just underneath the analyzeImageButtonActionPerformed method. /** * Encapsulates the Microsoft Cognitive Services REST API call to analyze an image. * @param imageUrl: The string URL of the image to analyze. * @return: A JSONObject describing the image, or null if a runtime error occurs. */ private JSONObject AnalyzeImage(String imageUrl) { try (CloseableHttpClient httpclient = HttpClientBuilder.create().build()) { // Create the URI to access the REST API call for Analyze Image. String uriString = uriBasePreRegion + String.valueOf(subscriptionRegionComboBox.getSelectedItem()) + uriBasePostRegion + uriBaseAnalyze; URIBuilder builder = new URIBuilder(uriString); // Request parameters. All of them are optional. builder.setParameter("visualFeatures", "Categories,Description,Color,Adult"); builder.setParameter("language", "en"); // Prepare the URI for the REST API call. URI uri = builder.build(); HttpPost request = new HttpPost(uri); // Request headers. request.setHeader("Content-Type", "application/json"); request.setHeader("Ocp-Apim-Subscription-Key", subscriptionKeyTextField.getText()); // Request body. StringEntity reqEntity = new StringEntity("{\"url\":\"" + imageUrl + "\"}"); request.setEntity(reqEntity); // Execute the REST API call and get the response entity. HttpResponse response = httpclient.execute(request); HttpEntity entity = response.getEntity(); // If we got a response, parse it and display it. if (entity != null) { // Return the JSONObject. String jsonString = EntityUtils.toString(entity); return new JSONObject(jsonString); } else { // No response. Return null. return null; } } catch (Exception e) { // Display error message. System.out.println(e.getMessage()); return null; } } Run the Analyze function Press F6 to run the application. Put your subscription key into the Subscription Key field and verify that you are using the correct region in Subscription Region. Enter a URL to an image to analyze, then click the Analyze Image button to analyze an image and see the result. Recognize a landmark The Landmark feature of Computer Vision analyzes an image for natural and artificial landmarks, such as mountains or famous buildings. Once the analysis is complete, Landmark returns a JSON object that identifies the landmarks found in the image. To complete the Landmark feature of the tutorial application, perform the following steps: Add the event handler code for the form button The landmarkImageButtonActionPerformed event handler method clears the form, displays the image specified in the URL, then calls the LandmarkImage method to analyze the image. When LandmarkImage returns, the method displays the formatted JSON response in the Response text area, then extracts the first landmark name from the JSONObject and displays it on the window along with the confidence level that the landmark was identified correctly. Copy and paste the following code into the landmarkImageButtonActionPerformed method. NOTE NetBeans won't let you paste to the method definition line ( private void ) or to the closing curly brace of that method. To copy the code, copy the lines between the method definition and the closing curly brace, and paste them over the contents of the method. private void landmarkImageButtonActionPerformed(java.awt.event.ActionEvent evt) { URL landmarkImageUrl; // Clear out the previous image, response, and caption, if any. landmarkImage.setIcon(new ImageIcon()); landmarkCaptionLabel.setText(""); landmarkResponseTextArea.setText(""); // Display the image specified in the text box. try { landmarkImageUrl = new URL(landmarkImageUriTextBox.getText()); BufferedImage bImage = ImageIO.read(landmarkImageUrl); scaleAndShowImage(bImage, landmarkImage); } catch(IOException e) { landmarkResponseTextArea.setText("Error loading Landmark image: " + e.getMessage()); return; } // Identify the landmark in the image. JSONObject jsonObj = LandmarkImage(landmarkImageUrl.toString()); // A return of null indicates failure. if (jsonObj == null) { return; } // Format and display the JSON response. landmarkResponseTextArea.setText(jsonObj.toString(2)); // Extract the text and confidence from the first caption in the description object. if (jsonObj.has("result") && jsonObj.getJSONObject("result").has("landmarks")) { JSONObject jsonCaption = jsonObj.getJSONObject("result").getJSONArray("landmarks").getJSONObject(0); if (jsonCaption.has("name") && jsonCaption.has("confidence")) { landmarkCaptionLabel.setText("Caption: " + jsonCaption.getString("name") + " (confidence: " + jsonCaption.getDouble("confidence") + ")."); } } } Add the wrapper for the REST API call The LandmarkImage method wraps the REST API call to analyze an image. The method returns a JSONObject describing the landmarks found in the image, or null if there was an error. Copy and paste the LandmarkImage method to just underneath the landmarkImageButtonActionPerformed method. /** * Encapsulates the Microsoft Cognitive Services REST API call to identify a landmark in an image. * @param imageUrl: The string URL of the image to process. * @return: A JSONObject describing the image, or null if a runtime error occurs. */ private JSONObject LandmarkImage(String imageUrl) { try (CloseableHttpClient httpclient = HttpClientBuilder.create().build()) { // Create the URI to access the REST API call to identify a Landmark in an image. String uriString = uriBasePreRegion + String.valueOf(subscriptionRegionComboBox.getSelectedItem()) + uriBasePostRegion + uriBaseLandmark; URIBuilder builder = new URIBuilder(uriString); // Request parameters. All of them are optional. builder.setParameter("visualFeatures", "Categories,Description,Color"); builder.setParameter("language", "en"); // Prepare the URI for the REST API call. URI uri = builder.build(); HttpPost request = new HttpPost(uri); // Request headers. request.setHeader("Content-Type", "application/json"); request.setHeader("Ocp-Apim-Subscription-Key", subscriptionKeyTextField.getText()); // Request body. StringEntity reqEntity = new StringEntity("{\"url\":\"" + imageUrl + "\"}"); request.setEntity(reqEntity); // Execute the REST API call and get the response entity. HttpResponse response = httpclient.execute(request); HttpEntity entity = response.getEntity(); // If we got a response, parse it and display it. if (entity != null) { // Return the JSONObject. String jsonString = EntityUtils.toString(entity); return new JSONObject(jsonString); } else { // No response. Return null. return null; } } catch (Exception e) { // Display error message. System.out.println(e.getMessage()); return null; } } Run the landmark function Press F6 to run the application. Put your subscription key into the Subscription Key field and verify that you are using the correct region in Subscription Region. Click the Landmark tab, enter a URL to an image of a landmark, then click the Analyze Image button to analyze an image and see the result. Recognize celebrities The Celebrities feature of Computer Vision analyzes an image for famous people. Once the analysis is complete, Celebrities returns a JSON object that identifies the Celebrities found in the image. To complete the Celebrities feature of the tutorial application, perform the following steps: Add the event handler code for the celebrities button The celebritiesImageButtonActionPerformed event handler method clears the form, displays the image specified in the URL, then calls the CelebritiesImage method to analyze the image. When CelebritiesImage returns, the method displays the formatted JSON response in the Response text area, then extracts the first celebrity name from the JSONObject and displays the name on the window along with the confidence level that the celebrity was identified correctly. Copy and paste the following code into the celebritiesImageButtonActionPerformed method. NOTE NetBeans won't let you paste to the method definition line ( private void ) or to the closing curly brace of that method. To copy the code, copy the lines between the method definition and the closing curly brace, and paste them over the contents of the method. private void celebritiesImageButtonActionPerformed(java.awt.event.ActionEvent evt) { URL celebritiesImageUrl; // Clear out the previous image, response, and caption, if any. celebritiesImage.setIcon(new ImageIcon()); celebritiesCaptionLabel.setText(""); celebritiesResponseTextArea.setText(""); // Display the image specified in the text box. try { celebritiesImageUrl = new URL(celebritiesImageUriTextBox.getText()); BufferedImage bImage = ImageIO.read(celebritiesImageUrl); scaleAndShowImage(bImage, celebritiesImage); } catch(IOException e) { celebritiesResponseTextArea.setText("Error loading Celebrity image: " + e.getMessage()); return; } // Identify the celebrities in the image. JSONObject jsonObj = CelebritiesImage(celebritiesImageUrl.toString()); // A return of null indicates failure. if (jsonObj == null) { return; } // Format and display the JSON response. celebritiesResponseTextArea.setText(jsonObj.toString(2)); // Extract the text and confidence from the first caption in the description object. if (jsonObj.has("result") && jsonObj.getJSONObject("result").has("celebrities")) { JSONObject jsonCaption = jsonObj.getJSONObject("result").getJSONArray("celebrities").getJSONObject(0); if (jsonCaption.has("name") && jsonCaption.has("confidence")) { celebritiesCaptionLabel.setText("Caption: " + jsonCaption.getString("name") + " (confidence: " + jsonCaption.getDouble("confidence") + ")."); } } } Add the wrapper for the REST API call The CelebritiesImage method wraps the REST API call to analyze an image. The method returns a JSONObject describing the celebrities found in the image, or null if there was an error. Copy and paste the CelebritiesImage method to just underneath the celebritiesImageButtonActionPerformed method. /** * Encapsulates the Microsoft Cognitive Services REST API call to identify celebrities in an image. * @param imageUrl: The string URL of the image to process. * @return: A JSONObject describing the image, or null if a runtime error occurs. */ private JSONObject CelebritiesImage(String imageUrl) { try (CloseableHttpClient httpclient = HttpClientBuilder.create().build()) { // Create the URI to access the REST API call to identify celebrities in an image. String uriString = uriBasePreRegion + String.valueOf(subscriptionRegionComboBox.getSelectedItem()) + uriBasePostRegion + uriBaseCelebrities; URIBuilder builder = new URIBuilder(uriString); // Request parameters. All of them are optional. builder.setParameter("visualFeatures", "Categories,Description,Color"); builder.setParameter("language", "en"); // Prepare the URI for the REST API call. URI uri = builder.build(); HttpPost request = new HttpPost(uri); // Request headers. request.setHeader("Content-Type", "application/json"); request.setHeader("Ocp-Apim-Subscription-Key", subscriptionKeyTextField.getText()); // Request body. StringEntity reqEntity = new StringEntity("{\"url\":\"" + imageUrl + "\"}"); request.setEntity(reqEntity); // Execute the REST API call and get the response entity. HttpResponse response = httpclient.execute(request); HttpEntity entity = response.getEntity(); // If we got a response, parse it and display it. if (entity != null) { // Return the JSONObject. String jsonString = EntityUtils.toString(entity); return new JSONObject(jsonString); } else { // No response. Return null. return null; } } catch (Exception e) { // Display error message. System.out.println(e.getMessage()); return null; } } Run the celebrities function Press F6 to run the application. Put your subscription key into the Subscription Key field and verify that you are using the correct region in Subscription Region. Click the Celebrities tab, enter a URL to an image of a celebrity, then click the Analyze Image button to analyze an image and see the result. Intelligently generate a thumbnail The Thumbnail feature of Computer Vision generates a thumbnail from an image. By using the Smart Crop feature, the Thumbnail feature will identify the area of interest in an image and center the thumbnail on this area, to generate more aesthetically pleasing thumbnail images. To complete the Thumbnail feature of the tutorial application, perform the following steps: Add the event handler code for the thumbnail button The thumbnailImageButtonActionPerformed event handler method clears the form, displays the image specified in the URL, then calls the getThumbnailImage method to create the thumbnail. When getThumbnailImage returns, the method displays the generated thumbnail. Copy and paste the following code into the thumbnailImageButtonActionPerformed method. NOTE NetBeans won't let you paste to the method definition line ( private void ) or to the closing curly brace of that method. To copy the code, copy the lines between the method definition and the closing curly brace, and paste them over the contents of the method. private void thumbnailImageButtonActionPerformed(java.awt.event.ActionEvent evt) { URL thumbnailImageUrl; JSONObject jsonError[] = new JSONObject[1]; // Clear out the previous image, response, and thumbnail, if any. thumbnailSourceImage.setIcon(new ImageIcon()); thumbnailResponseTextArea.setText(""); thumbnailImage.setIcon(new ImageIcon()); // Display the image specified in the text box. try { thumbnailImageUrl = new URL(thumbnailImageUriTextBox.getText()); BufferedImage bImage = ImageIO.read(thumbnailImageUrl); scaleAndShowImage(bImage, thumbnailSourceImage); } catch(IOException e) { thumbnailResponseTextArea.setText("Error loading image to thumbnail: " + e.getMessage()); return; } // Get the thumbnail for the image. BufferedImage thumbnail = getThumbnailImage(thumbnailImageUrl.toString(), jsonError); // A non-null value indicates error. if (jsonError[0] != null) { // Format and display the JSON error. thumbnailResponseTextArea.setText(jsonError[0].toString(2)); return; } // Display the thumbnail. if (thumbnail != null) { scaleAndShowImage(thumbnail, thumbnailImage); } } Add the wrapper for the REST API call The getThumbnailImage method wraps the REST API call to analyze an image. The method returns a BufferedImage that contains the thumbnail, or null if there was an error. The error message will be returned in the first element of the jsonError string array. Copy and paste the following getThumbnailImage method to just underneath the thumbnailImageButtonActionPerformed method. /** * Encapsulates the Microsoft Cognitive Services REST API call to create a thumbnail for an image. * @param imageUrl: The string URL of the image to process. * @return: A BufferedImage containing the thumbnail, or null if a runtime error occurs. In the case * of an error, the error message will be returned in the first element of the jsonError string array. */ private BufferedImage getThumbnailImage(String imageUrl, JSONObject[] jsonError) { try (CloseableHttpClient httpclient = HttpClientBuilder.create().build()) { // Create the URI to access the REST API call to identify celebrities in an image. String uriString = uriBasePreRegion + String.valueOf(subscriptionRegionComboBox.getSelectedItem()) + uriBasePostRegion + uriBaseThumbnail; URIBuilder uriBuilder = new URIBuilder(uriString); // Request parameters. uriBuilder.setParameter("width", "100"); uriBuilder.setParameter("height", "150"); uriBuilder.setParameter("smartCropping", "true"); // Prepare the URI for the REST API call. URI uri = uriBuilder.build(); HttpPost request = new HttpPost(uri); // Request headers. request.setHeader("Content-Type", "application/json"); request.setHeader("Ocp-Apim-Subscription-Key", subscriptionKeyTextField.getText()); // Request body. StringEntity requestEntity = new StringEntity("{\"url\":\"" + imageUrl + "\"}"); request.setEntity(requestEntity); // Execute the REST API call and get the response entity. HttpResponse response = httpclient.execute(request); HttpEntity entity = response.getEntity(); // Check for success. if (response.getStatusLine().getStatusCode() == 200) { // Return the thumbnail. return ImageIO.read(entity.getContent()); } else { // Format and display the JSON error message. String jsonString = EntityUtils.toString(entity); jsonError[0] = new JSONObject(jsonString); return null; } } catch (Exception e) { String errorMessage = e.getMessage(); System.out.println(errorMessage); jsonError[0] = new JSONObject(errorMessage); return null; } } Run the thumbnail function Press F6 to run the application. Put your subscription key into the Subscription Key field and verify that you are using the correct region in Subscription Region. Click the Thumbnail tab, enter a URL to an image, then click the Generate Thumbnail button to analyze an image and see the result. Read printed text (OCR ) The Optical Character Recognition (OCR ) feature of Computer Vision analyzes an image of printed text. After the analysis is complete, OCR returns a JSON object that contains the text and the location of the text in the image. To complete the OCR feature of the tutorial application, perform the following steps: Add the event handler code for the OCR button The ocrImageButtonActionPerformed event handler method clears the form, displays the image specified in the URL, then calls the OcrImage method to analyze the image. When OcrImage returns, the method displays the detected text as formatted JSON in the Response text area. Copy and paste the following code into the ocrImageButtonActionPerformed method. NOTE NetBeans won't let you paste to the method definition line ( private void ) or to the closing curly brace of that method. To copy the code, copy the lines between the method definition and the closing curly brace, and paste them over the contents of the method. private void ocrImageButtonActionPerformed(java.awt.event.ActionEvent evt) { URL ocrImageUrl; // Clear out the previous image, response, and caption, if any. ocrImage.setIcon(new ImageIcon()); ocrResponseTextArea.setText(""); // Display the image specified in the text box. try { ocrImageUrl = new URL(ocrImageUriTextBox.getText()); BufferedImage bImage = ImageIO.read(ocrImageUrl); scaleAndShowImage(bImage, ocrImage); } catch(IOException e) { ocrResponseTextArea.setText("Error loading OCR image: " + e.getMessage()); return; } // Read the text in the image. JSONObject jsonObj = OcrImage(ocrImageUrl.toString()); // A return of null indicates failure. if (jsonObj == null) { return; } // Format and display the JSON response. ocrResponseTextArea.setText(jsonObj.toString(2)); } Add the wrapper for the REST API call The OcrImage method wraps the REST API call to analyze an image. The method returns a JSONObject of the JSON data returned from the call, or null if there was an error. Copy and paste the following OcrImage method to just underneath the ocrImageButtonActionPerformed method. /** * Encapsulates the Microsoft Cognitive Services REST API call to read text in an image. * @param imageUrl: The string URL of the image to process. * @return: A JSONObject describing the image, or null if a runtime error occurs. */ private JSONObject OcrImage(String imageUrl) { try (CloseableHttpClient httpclient = HttpClientBuilder.create().build()) { // Create the URI to access the REST API call to read text in an image. String uriString = uriBasePreRegion + String.valueOf(subscriptionRegionComboBox.getSelectedItem()) + uriBasePostRegion + uriBaseOcr; URIBuilder uriBuilder = new URIBuilder(uriString); // Request parameters. uriBuilder.setParameter("language", "unk"); uriBuilder.setParameter("detectOrientation ", "true"); // Prepare the URI for the REST API call. URI uri = uriBuilder.build(); HttpPost request = new HttpPost(uri); // Request headers. request.setHeader("Content-Type", "application/json"); request.setHeader("Ocp-Apim-Subscription-Key", subscriptionKeyTextField.getText()); // Request body. StringEntity reqEntity = new StringEntity("{\"url\":\"" + imageUrl + "\"}"); request.setEntity(reqEntity); // Execute the REST API call and get the response entity. HttpResponse response = httpclient.execute(request); HttpEntity entity = response.getEntity(); // If we got a response, parse it and display it. if (entity != null) { // Return the JSONObject. String jsonString = EntityUtils.toString(entity); return new JSONObject(jsonString); } else { // No response. Return null. return null; } } catch (Exception e) { // Display error message. System.out.println(e.getMessage()); return null; } } Run the OCR function Press F6 to run the application. Put your subscription key into the Subscription Key field and verify that you are using the correct region in Subscription Region. Click the OCR tab, enter a URL to an image of printed text, then click the Read Image button to analyze an image and see the result. Read handwritten text (handwriting recognition) The Handwriting Recognition feature of Computer Vision analyzes an image of handwritten text. After the analysis is complete, Handwriting Recognition returns a JSON object that contains the text and the location of the text in the image. To complete the Handwriting Recognition feature of the tutorial application, perform the following steps: Add the event handler code for the handwriting button The handwritingImageButtonActionPerformed event handler method clears the form, displays the image specified in the URL, then calls the HandwritingImage method to analyze the image. When HandwritingImage returns, the method displays the detected text as formatted JSON in the Response text area. Copy and paste the following code into the handwritingImageButtonActionPerformed method. NOTE NetBeans won't let you paste to the method definition line ( private void ) or to the closing curly brace of that method. To copy the code, copy the lines between the method definition and the closing curly brace, and paste them over the contents of the method. private void handwritingImageButtonActionPerformed(java.awt.event.ActionEvent evt) { URL handwritingImageUrl; // Clear out the previous image, response, and caption, if any. handwritingImage.setIcon(new ImageIcon()); handwritingResponseTextArea.setText(""); // Display the image specified in the text box. try { handwritingImageUrl = new URL(handwritingImageUriTextBox.getText()); BufferedImage bImage = ImageIO.read(handwritingImageUrl); scaleAndShowImage(bImage, handwritingImage); } catch(IOException e) { handwritingResponseTextArea.setText("Error loading Handwriting image: " + e.getMessage()); return; } // Read the text in the image. JSONObject jsonObj = HandwritingImage(handwritingImageUrl.toString()); // A return of null indicates failure. if (jsonObj == null) { return; } // Format and display the JSON response. handwritingResponseTextArea.setText(jsonObj.toString(2)); } Add the wrapper for the REST API call The HandwritingImage method wraps the two REST API calls needed to analyze an image. Because handwriting recognition is a time consuming process, a two step process is used. The first call submits the image for processing; the second call retrieves the detected text when the processing is complete. After the text is retrieved, the HandwritingImage method returns a JSONObject describing the text and the locations of the text, or null if there was an error. Copy and paste the following HandwritingImage method to just underneath the handwritingImageButtonActionPerformed method. /** * Encapsulates the Microsoft Cognitive Services REST API call to read handwritten text in an image. * @param imageUrl: The string URL of the image to process. * @return: A JSONObject describing the image, or null if a runtime error occurs. */ private JSONObject HandwritingImage(String imageUrl) { try (CloseableHttpClient textClient = HttpClientBuilder.create().build(); CloseableHttpClient resultClient = HttpClientBuilder.create().build()) { // Create the URI to access the REST API call to read text in an image. String uriString = uriBasePreRegion + String.valueOf(subscriptionRegionComboBox.getSelectedItem()) + uriBasePostRegion + uriBaseHandwriting; URIBuilder uriBuilder = new URIBuilder(uriString); // Request parameters. uriBuilder.setParameter("handwriting", "true"); // Prepare the URI for the REST API call. URI uri = uriBuilder.build(); HttpPost request = new HttpPost(uri); // Request headers. request.setHeader("Content-Type", "application/json"); request.setHeader("Ocp-Apim-Subscription-Key", subscriptionKeyTextField.getText()); // Request body. StringEntity reqEntity = new StringEntity("{\"url\":\"" + imageUrl + "\"}"); request.setEntity(reqEntity); // Execute the REST API call and get the response. HttpResponse textResponse = textClient.execute(request); // Check for success. if (textResponse.getStatusLine().getStatusCode() != 202) { // An error occurred. Return the JSON error message. HttpEntity entity = textResponse.getEntity(); String jsonString = EntityUtils.toString(entity); return new JSONObject(jsonString); } String operationLocation = null; // The 'Operation-Location' in the response contains the URI to retrieve the recognized text. Header[] responseHeaders = textResponse.getAllHeaders(); for(Header header : responseHeaders) { if(header.getName().equals("Operation-Location")) { // This string is the URI where you can get the text recognition operation result. operationLocation = header.getValue(); break; } } // // // // // NOTE: The response may not be immediately available. Handwriting recognition is an async operation that can take a variable amount of time depending on the length of the text you want to recognize. You may need to wait or retry this operation. This example checks once per second for ten seconds. JSONObject responseObj = null; int i = 0; do { // Wait one second. Thread.sleep(1000); // Check to see if the operation completed. HttpGet resultRequest = new HttpGet(operationLocation); resultRequest.setHeader("Ocp-Apim-Subscription-Key", subscriptionKeyTextField.getText()); HttpResponse resultResponse = resultClient.execute(resultRequest); HttpEntity responseEntity = resultResponse.getEntity(); if (responseEntity != null) { // Get the JSON response. String jsonString = EntityUtils.toString(responseEntity); responseObj = new JSONObject(jsonString); } } while (i < 10 && responseObj != null && !responseObj.getString("status").equalsIgnoreCase("Succeeded")); // If the operation completed, return the JSON object. if (responseObj != null) { return responseObj; } else { // Return null for timeout error. System.out.println("Timeout error."); return null; } } catch (Exception e) { // Display error message. System.out.println(e.getMessage()); return null; } } Run the handwriting function To run the application, press F6. Put your subscription key into the Subscription Key field and verify that you are using the correct region in Subscription Region. Click the Read Handwritten Text tab, enter a URL to an image of handwritten text, then click the Read Image button to analyze an image and see the result. Next steps In this guide, you used the Computer Vision REST API with Java to test many of the available image analysis features. Next, see the reference documentation to learn more about the APIs involved. Computer Vision REST API Use Computer Vision features with the REST API and JavaScript 5/7/2019 • 18 minutes to read • Edit Online This guide shows the features of the Azure Cognitive Services Computer Vision REST API. Explore a JavaScript application that uses the Computer Vision REST API to perform optical character recognition (OCR ), create smart-cropped thumbnails, plus detect, categorize, tag, and describe visual features, including faces, in an image. This example lets you submit an image URL for analysis or processing. You can use this open source example as a template for building your own JavaScript app to use the Computer Vision REST API. The JavaScript form application has already been written, but has no Computer Vision functionality. In this guide, you add the code specific to the Computer Vision REST API to complete the application's functionality. Prerequisites Platform requirements You can follow the steps in this guide using a simple text editor. Subscribe to Computer Vision API and get a subscription key Before creating the example, you must subscribe to Computer Vision API which is part of the Azure Cognitive Services. For subscription and key management details, see Subscriptions. Both the primary and secondary keys are valid to use in this guide. Acquire incomplete tutorial project Download the project Clone the Cognitive Services JavaScript Computer Vision Tutorial, or download the .zip file and extract it to an empty directory. If you would prefer to use the finished project with all tutorial code added, you can use the files in the Completed folder. Add tutorial code to the project The JavaScript application is set up with six .html files, one for each feature. Each file demonstrates a different function of Computer Vision (analyze, OCR, etc.). The six sections do not have interdependencies, so you can add the tutorial code to one file, all six files, or only a couple of files. And you can add the tutorial code to the files in any order. Analyze an image The Analyze feature of Computer Vision scans an image for thousands of recognizable objects, living things, scenery, and actions. Once the analysis is complete, Analyze returns a JSON object that describes the image with descriptive tags, color analysis, captions, and more. To complete the Analyze feature of the application, perform the following steps: Add the event handler code for the analyze button Open the analyze.html file in a text editor and locate the analyzeButtonClick function near the bottom of the file. The analyzeButtonClick event handler function clears the form, displays the image specified in the URL, then calls the AnalyzeImage function to analyze the image. Copy and paste the following code into the analyzeButtonClick function. function analyzeButtonClick() { // Clear the display fields. $("#sourceImage").attr("src", "#"); $("#responseTextArea").val(""); $("#captionSpan").text(""); // Display the image. var sourceImageUrl = $("#inputImage").val(); $("#sourceImage").attr("src", sourceImageUrl); AnalyzeImage(sourceImageUrl, $("#responseTextArea"), $("#captionSpan")); } Add the wrapper for the REST API call The AnalyzeImage function wraps the REST API call to analyze an image. Upon a successful return, the formatted JSON analysis will be displayed in the specified textarea, and the caption will be displayed in the specified span. Copy and paste the AnalyzeImage function code to just underneath the analyzeButtonClick function. /* Analyze the image at the specified URL by using Microsoft Cognitive Services Analyze Image API. * @param {string} sourceImageUrl - The URL to the image to analyze. * @param {

element} responseTextArea - The text area to display the JSON string returned
*
from the REST API call, or to display the error message if there was
*
an error.
* @param {<span> element} captionSpan - The span to display the image caption.
*/
function AnalyzeImage(sourceImageUrl, responseTextArea, captionSpan) {
// Request parameters.
var params = {
"visualFeatures": "Categories,Description,Color",
"details": "",
"language": "en",
};
// Perform the REST API call.
$.ajax({
url: common.uriBasePreRegion +
$("#subscriptionRegionSelect").val() +
common.uriBasePostRegion +
common.uriBaseAnalyze +
"?" +
$.param(params),
// Request headers.
beforeSend: function(jqXHR){
jqXHR.setRequestHeader("Content-Type","application/json");
jqXHR.setRequestHeader("Ocp-Apim-Subscription-Key",
encodeURIComponent($("#subscriptionKeyInput").val()));
},
type: "POST",
// Request body.
data: '{"url": ' + '"' + sourceImageUrl + '"}',
})
.done(function(data) {
// Show formatted JSON on webpage.
responseTextArea.val(JSON.stringify(data, null, 2));
// Extract and display the caption and confidence from the first caption in the description object.
if (data.description && data.description.captions) {
var caption = data.description.captions[0];
if (caption.text && caption.confidence) {
captionSpan.text("Caption: " + caption.text +
" (confidence: " + caption.confidence + ").");
}
}
})
.fail(function(jqXHR, textStatus, errorThrown) {
// Prepare the error string.
var errorString = (errorThrown === "") ? "Error. " : errorThrown + " (" + jqXHR.status + "): ";
errorString += (jqXHR.responseText === "") ? "" : (jQuery.parseJSON(jqXHR.responseText).message) ?
jQuery.parseJSON(jqXHR.responseText).message : jQuery.parseJSON(jqXHR.responseText).error.message;
// Put the error JSON in the response textarea.
responseTextArea.val(JSON.stringify(jqXHR, null, 2));
// Show the error message.
alert(errorString);
});
}

Run the Analyze function

Save the analyze.html file and open it in a Web browser. Put your subscription key into the Subscription Key
field and verify that you are using the correct region in Subscription Region. Enter a URL to an image to analyze,
then click the Analyze Image button to analyze an image and see the result.
Recognize a landmark
The Landmark feature of Computer Vision analyzes an image for natural and artificial landmarks, such as
mountains or famous buildings. Once the analysis is complete, Landmark returns a JSON object that identifies the
landmarks found in the image.
To complete the Landmark feature of the application, perform the following steps:
Add the event handler code for the landmark button

Open the landmark.html file in a text editor and locate the landmarkButtonClick function near the bottom of
the file.
The landmarkButtonClick event handler function clears the form, displays the image specified in the URL, then
calls the IdentifyLandmarks function to analyze the image. Copy and paste the following code into the
landmarkButtonClick function.
function landmarkButtonClick() {
// Clear the display fields.
$("#sourceImage").attr("src", "#");
$("#responseTextArea").val("");
$("#captionSpan").text("");
// Display the image.
var sourceImageUrl = $("#inputImage").val();
$("#sourceImage").attr("src", sourceImageUrl);
IdentifyLandmarks(sourceImageUrl, $("#responseTextArea"), $("#captionSpan"));
}

Add the wrapper for the REST API call

The IdentifyLandmarks function wraps the REST API call to analyze an image. Upon a successful return, the
formatted JSON analysis will be displayed in the specified textarea, and the caption will be displayed in the
specified span.
Copy and paste the IdentifyLandmarks function code to just underneath the landmarkButtonClick function.

/* Identify landmarks in the image at the specified URL by using Microsoft Cognitive Services
* Landmarks API.
* @param {string} sourceImageUrl - The URL to the image to analyze for landmarks.
* @param {<textarea> element} responseTextArea - The text area to display the JSON string returned
*
from the REST API call, or to display the error message if there was
*
an error.
* @param {<span> element} captionSpan - The span to display the image caption.
*/
function IdentifyLandmarks(sourceImageUrl, responseTextArea, captionSpan) {
// Request parameters.
var params = {
"model": "landmarks"
};
// Perform the REST API call.
$.ajax({
url: common.uriBasePreRegion +
$("#subscriptionRegionSelect").val() +
common.uriBasePostRegion +
common.uriBaseLandmark +
"?" +
$.param(params),
// Request headers.
beforeSend: function(jqXHR){
jqXHR.setRequestHeader("Content-Type","application/json");
jqXHR.setRequestHeader("Ocp-Apim-Subscription-Key",
encodeURIComponent($("#subscriptionKeyInput").val()));
},
type: "POST",
// Request body.
data: '{"url": ' + '"' + sourceImageUrl + '"}',
})
.done(function(data) {
// Show formatted JSON on webpage.
responseTextArea.val(JSON.stringify(data, null, 2));
// Extract and display the caption and confidence from the first caption in the description object.
if (data.result && data.result.landmarks) {
var landmark = data.result.landmarks[0];
if (landmark.name && landmark.confidence) {
captionSpan.text("Landmark: " + landmark.name +
" (confidence: " + landmark.confidence + ").");
}
}
})
.fail(function(jqXHR, textStatus, errorThrown) {
// Prepare the error string.
var errorString = (errorThrown === "") ? "Error. " : errorThrown + " (" + jqXHR.status + "): ";
errorString += (jqXHR.responseText === "") ? "" : (jQuery.parseJSON(jqXHR.responseText).message) ?
jQuery.parseJSON(jqXHR.responseText).message : jQuery.parseJSON(jqXHR.responseText).error.message;
// Put the error JSON in the response textarea.
responseTextArea.val(JSON.stringify(jqXHR, null, 2));
// Show the error message.
alert(errorString);
});
}

Run the landmark function

Save the landmark.html file and open it in a Web browser. Put your subscription key into the Subscription Key
field and verify that you are using the correct region in Subscription Region. Enter a URL to an image to analyze,
then click the Analyze Image button to analyze an image and see the result.
Recognize celebrities
The Celebrities feature of Computer Vision analyzes an image for famous people. Once the analysis is complete,
Celebrities returns a JSON object that identifies the Celebrities found in the image.
To complete the Celebrities feature of the application, perform the following steps:
Add the event handler code for the celebrities button

Open the celebrities.html file in a text editor and locate the celebritiesButtonClick function near the bottom of
the file.
The celebritiesButtonClick event handler function clears the form, displays the image specified in the URL, then
calls the IdentifyCelebrities function to analyze the image. Copy and paste the following code into the
celebritiesButtonClick function.
function celebritiesButtonClick() {
// Clear the display fields.
$("#sourceImage").attr("src", "#");
$("#responseTextArea").val("");
$("#captionSpan").text("");
// Display the image.
var sourceImageUrl = $("#inputImage").val();
$("#sourceImage").attr("src", sourceImageUrl);
IdentifyCelebrities(sourceImageUrl, $("#responseTextArea"), $("#captionSpan"));
}

Add the wrapper for the REST API call

/* Identify celebrities in the image at the specified URL by using Microsoft Cognitive Services
* Celebrities API.
* @param {string} sourceImageUrl - The URL to the image to analyze for celebrities.
* @param {<textarea> element} responseTextArea - The text area to display the JSON string returned
*
from the REST API call, or to display the error message if there was
*
an error.
* @param {<span> element} captionSpan - The span to display the image caption.
*/
function IdentifyCelebrities(sourceImageUrl, responseTextArea, captionSpan) {
// Request parameters.
var params = {
"model": "celebrities"
};
// Perform the REST API call.
$.ajax({
url: common.uriBasePreRegion +
$("#subscriptionRegionSelect").val() +
common.uriBasePostRegion +
common.uriBaseCelebrities +
"?" +
$.param(params),
// Request headers.
beforeSend: function(jqXHR){
jqXHR.setRequestHeader("Content-Type","application/json");
jqXHR.setRequestHeader("Ocp-Apim-Subscription-Key",
encodeURIComponent($("#subscriptionKeyInput").val()));
},
type: "POST",
// Request body.
data: '{"url": ' + '"' + sourceImageUrl + '"}',
})
.done(function(data) {
// Show formatted JSON on webpage.
responseTextArea.val(JSON.stringify(data, null, 2));
// Extract and display the caption and confidence from the first caption in the description object.
if (data.result && data.result.celebrities) {
var celebrity = data.result.celebrities[0];
if (celebrity.name && celebrity.confidence) {
captionSpan.text("Celebrity name: " + celebrity.name +
" (confidence: " + celebrity.confidence + ").");
}
}
})
.fail(function(jqXHR, textStatus, errorThrown) {
// Prepare the error string.
var errorString = (errorThrown === "") ? "Error. " : errorThrown + " (" + jqXHR.status + "): ";
errorString += (jqXHR.responseText === "") ? "" : (jQuery.parseJSON(jqXHR.responseText).message) ?
jQuery.parseJSON(jqXHR.responseText).message : jQuery.parseJSON(jqXHR.responseText).error.message;
// Put the error JSON in the response textarea.
responseTextArea.val(JSON.stringify(jqXHR, null, 2));
// Show the error message.
alert(errorString);
});
}

Run the celebrities function

Save the celebrities.html file and open it in a Web browser. Put your subscription key into the Subscription Key
field and verify that you are using the correct region in Subscription Region. Enter a URL to an image to analyze,
then click the Analyze Image button to analyze an image and see the result.
Intelligently generate a thumbnail
The Thumbnail feature of Computer Vision generates a thumbnail from an image. By using the Smart Crop
feature, the Thumbnail feature will identify the area of interest in an image and center the thumbnail on this area, to
generate more aesthetically pleasing thumbnail images.
To complete the Thumbnail feature of the application, perform the following steps:
Add the event handler code for the thumbnail button

Open the thumbnail.html file in a text editor and locate the thumbnailButtonClick function near the bottom of
the file.
The thumbnailButtonClick event handler function clears the form, displays the image specified in the URL, then
calls the getThumbnail function twice to create two thumbnails, one smart cropped and one without smart crop.
Copy and paste the following code into the thumbnailButtonClick function.
function thumbnailButtonClick() {
// Clear the display fields.
document.getElementById("sourceImage").src = "#";
document.getElementById("thumbnailImageSmartCrop").src = "#";
document.getElementById("thumbnailImageNonSmartCrop").src = "#";
document.getElementById("responseTextArea").value = "";
document.getElementById("captionSpan").text = "";
// Display the image.
var sourceImageUrl = document.getElementById("inputImage").value;
document.getElementById("sourceImage").src = sourceImageUrl;
// Get a smart cropped thumbnail.
getThumbnail (sourceImageUrl, true, document.getElementById("thumbnailImageSmartCrop"),
document.getElementById("responseTextArea"));
// Get a non-smart-cropped thumbnail.
getThumbnail (sourceImageUrl, false, document.getElementById("thumbnailImageNonSmartCrop"),
document.getElementById("responseTextArea"));
}

Add the wrapper for the REST API call

The getThumbnail function wraps the REST API call to analyze an image. Upon a successful return, the thumbnail
will be displayed in the specified img element.
Copy and paste the following getThumbnail function to just underneath the thumbnailButtonClick function.
/* Get a thumbnail of the image at the specified URL by using Microsoft Cognitive Services
* Thumbnail API.
* @param {string} sourceImageUrl URL to image.
* @param {boolean} smartCropping Set to true to use the smart cropping feature which crops to the
*
more interesting area of an image; false to crop for the center
*
of the image.
* @param {<img> element} imageElement The img element in the DOM which will display the thumbnail image.
* @param {<textarea> element} responseTextArea - The text area to display the Response Headers returned
*
from the REST API call, or to display the error message if there was
*
an error.
*/
function getThumbnail (sourceImageUrl, smartCropping, imageElement, responseTextArea) {
// Create the HTTP Request object.
var xhr = new XMLHttpRequest();
// Request parameters.

// Request parameters.
var params = "width=100&height=150&smartCropping=" + smartCropping.toString();
// Build the full URI.
var fullUri = common.uriBasePreRegion +
document.getElementById("subscriptionRegionSelect").value +
common.uriBasePostRegion +
common.uriBaseThumbnail +
"?" +
params;
// Identify the request as a POST, with the URI and parameters.
xhr.open("POST", fullUri);
// Add the request headers.
xhr.setRequestHeader("Content-Type","application/json");
xhr.setRequestHeader("Ocp-Apim-Subscription-Key",
encodeURIComponent(document.getElementById("subscriptionKeyInput").value));
// Set the response type to "blob" for the thumbnail image data.
xhr.responseType = "blob";
// Process the result of the REST API call.
xhr.onreadystatechange = function(e) {
if(xhr.readyState === XMLHttpRequest.DONE) {
// Thumbnail successfully created.
if (xhr.status === 200) {
// Show response headers.
var s = JSON.stringify(xhr.getAllResponseHeaders(), null, 2);
responseTextArea.value = JSON.stringify(xhr.getAllResponseHeaders(), null, 2);
// Show thumbnail image.
var urlCreator = window.URL || window.webkitURL;
var imageUrl = urlCreator.createObjectURL(this.response);
imageElement.src = imageUrl;
} else {
// Display the error message. The error message is the response body as a JSON string.
// The code in this code block extracts the JSON string from the blob response.
var reader = new FileReader();
// This event fires after the blob has been read.
reader.addEventListener('loadend', (e) => {
responseTextArea.value = JSON.stringify(JSON.parse(e.srcElement.result), null, 2);
});
// Start reading the blob as text.
reader.readAsText(xhr.response);
}
}
}
// Execute the REST API call.
xhr.send('{"url": ' + '"' + sourceImageUrl + '"}');
}

Run the thumbnail function

Save the thumbnail.html file and open it in a Web browser. Put your subscription key into the Subscription Key
field and verify that you are using the correct region in Subscription Region. Enter a URL to an image to analyze,
then click the Generate Thumbnails button to analyze an image and see the result.
Read printed text (OCR )
The Optical Character Recognition (OCR ) feature of Computer Vision analyzes an image of printed text. After the
analysis is complete, OCR returns a JSON object that contains the text and the location of the text in the image.
To complete the OCR feature of the application, perform the following steps:

Add the event handler code for the OCR button
Open the ocr.html file in a text editor and locate the ocrButtonClick function near the bottom of the file.
The ocrButtonClick event handler function clears the form, displays the image specified in the URL, then calls the
ReadOcrImage function to analyze the image. Copy and paste the following code into the ocrButtonClick
function.
function ocrButtonClick() {
// Clear the display fields.
$("#sourceImage").attr("src", "#");
$("#responseTextArea").val("");
$("#captionSpan").text("");
// Display the image.
var sourceImageUrl = $("#inputImage").val();
$("#sourceImage").attr("src", sourceImageUrl);
ReadOcrImage(sourceImageUrl, $("#responseTextArea"));
}

Add the wrapper for the REST API call

The ReadOcrImage function wraps the REST API call to analyze an image. Upon a successful return, the
formatted JSON describing the text and the location of the text will be displayed in the specified textarea.
Copy and paste the following ReadOcrImage function to just underneath the ocrButtonClick function.

/* Recognize and read printed text in an image at the specified URL by using Microsoft Cognitive
* Services OCR API.
* @param {string} sourceImageUrl - The URL to the image to analyze for printed text.
* @param {<textarea> element} responseTextArea - The text area to display the JSON string returned
*
from the REST API call, or to display the error message if there was
*
an error.
*/
function ReadOcrImage(sourceImageUrl, responseTextArea) {
// Request parameters.
var params = {
"language": "unk",
"detectOrientation ": "true",
};
// Perform the REST API call.
$.ajax({
url: common.uriBasePreRegion +
$("#subscriptionRegionSelect").val() +
common.uriBasePostRegion +
common.uriBaseOcr +
"?" +
$.param(params),
// Request headers.
beforeSend: function(jqXHR){
jqXHR.setRequestHeader("Content-Type","application/json");
jqXHR.setRequestHeader("Ocp-Apim-Subscription-Key",
encodeURIComponent($("#subscriptionKeyInput").val()));
},
type: "POST",
// Request body.
data: '{"url": ' + '"' + sourceImageUrl + '"}',
})
.done(function(data) {
// Show formatted JSON on webpage.
responseTextArea.val(JSON.stringify(data, null, 2));
})
.fail(function(jqXHR, textStatus, errorThrown) {
// Put the JSON description into the text area.
responseTextArea.val(JSON.stringify(jqXHR, null, 2));
// Display error message.
var errorString = (errorThrown === "") ? "Error. " : errorThrown + " (" + jqXHR.status + "): ";
errorString += (jqXHR.responseText === "") ? "" : (jQuery.parseJSON(jqXHR.responseText).message) ?
jQuery.parseJSON(jqXHR.responseText).message : jQuery.parseJSON(jqXHR.responseText).error.message;
alert(errorString);
});
}

Run the OCR function

Save the ocr.html file and open it in a Web browser. Put your subscription key into the Subscription Key field and
verify that you are using the correct region in Subscription Region. Enter a URL to an image of text to read, then
click the Read Image button to analyze an image and see the result.
Read handwritten text (handwriting recognition)
The Handwriting Recognition feature of Computer Vision analyzes an image of handwritten text. After the analysis
is complete, Handwriting Recognition returns a JSON object that contains the text and the location of the text in the
image.
To complete the Handwriting Recognition feature of the application, perform the following steps:

Add the event handler code for the handwriting button

Open the handwriting.html file in a text editor and locate the handwritingButtonClick function near the bottom
of the file.
The handwritingButtonClick event handler function clears the form, displays the image specified in the URL,
then calls the HandwritingImage function to analyze the image.
Copy and paste the following code into the handwritingButtonClick function.
function handwritingButtonClick() {
// Clear the display fields.
$("#sourceImage").attr("src", "#");
$("#responseTextArea").val("");
// Display the image.
var sourceImageUrl = $("#inputImage").val();
$("#sourceImage").attr("src", sourceImageUrl);
ReadHandwrittenImage(sourceImageUrl, $("#responseTextArea"));
}

Add the wrapper for the REST API call

The ReadHandwrittenImage function wraps the two REST API calls needed to analyze an image. Because
Handwriting Recognition is a time consuming process, a two step process is used. The first call submits the image
for processing; the second call retrieves the detected text when the processing is complete.
After the text is retrieved, the formatted JSON describing the text and the location of the text will be displayed in
the specified textarea.
Copy and paste the following ReadHandwrittenImage function to just underneath the handwritingButtonClick
function.
/* Recognize and read text from an image of handwriting at the specified URL by using Microsoft
* Cognitive Services Recognize Handwritten Text API.
* @param {string} sourceImageUrl - The URL to the image to analyze for handwriting.
* @param {<textarea> element} responseTextArea - The text area to display the JSON string returned
*
from the REST API call, or to display the error message if there was
*
an error.
*/
function ReadHandwrittenImage(sourceImageUrl, responseTextArea) {
// Request parameters.
var params = {
"handwriting": "true",
};
// This operation requires two REST API calls. One to submit the image for processing,
// the other to retrieve the text found in the image.
//
// Perform the first REST API call to submit the image for processing.
$.ajax({
url: common.uriBasePreRegion +
$("#subscriptionRegionSelect").val() +
common.uriBasePostRegion +
common.uriBaseHandwriting +
"?" +
$.param(params),
// Request headers.
beforeSend: function(jqXHR){
jqXHR.setRequestHeader("Content-Type","application/json");
jqXHR.setRequestHeader("Ocp-Apim-Subscription-Key",
encodeURIComponent($("#subscriptionKeyInput").val()));
},

},
type: "POST",
// Request body.
data: '{"url": ' + '"' + sourceImageUrl + '"}',
})
.done(function(data, textStatus, jqXHR) {
// Show progress.
responseTextArea.val("Handwritten image submitted.");
// Note: The response may not be immediately available. Handwriting Recognition is an
// async operation that can take a variable amount of time depending on the length
// of the text you want to recognize. You may need to wait or retry this GET operation.
//
// Try once per second for up to ten seconds to receive the result.
var tries = 10;
var waitTime = 100;
var taskCompleted = false;
var timeoutID = setInterval(function () {
// Limit the number of calls.
if (--tries <= 0) {
window.clearTimeout(timeoutID);
responseTextArea.val("The response was not available in the time allowed.");
return;
}
// The "Operation-Location" in the response contains the URI to retrieve the recognized text.
var operationLocation = jqXHR.getResponseHeader("Operation-Location");
// Perform the second REST API call and get the response.
$.ajax({
url: operationLocation,
// Request headers.
beforeSend: function(jqXHR){
jqXHR.setRequestHeader("Content-Type","application/json");
jqXHR.setRequestHeader("Ocp-Apim-Subscription-Key",
encodeURIComponent($("#subscriptionKeyInput").val()));
},
type: "GET",
})
.done(function(data) {
// If the result is not yet available, return.
if (data.status && (data.status === "NotStarted" || data.status === "Running")) {
return;
}
// Show formatted JSON on webpage.
responseTextArea.val(JSON.stringify(data, null, 2));
// Indicate the task is complete and clear the timer.
taskCompleted = true;
window.clearTimeout(timeoutID);
})
.fail(function(jqXHR, textStatus, errorThrown) {
// Indicate the task is complete and clear the timer.
taskCompleted = true;
window.clearTimeout(timeoutID);
// Display error message.
var errorString = (errorThrown === "") ? "Error. " : errorThrown + " (" + jqXHR.status + "): ";
errorString += (jqXHR.responseText === "") ? "" :
(jQuery.parseJSON(jqXHR.responseText).message) ?
jQuery.parseJSON(jqXHR.responseText).message :

jQuery.parseJSON(jqXHR.responseText).message :
jQuery.parseJSON(jqXHR.responseText).error.message;
alert(errorString);
});
}, waitTime);
})
.fail(function(jqXHR, textStatus, errorThrown) {
// Put the JSON description into the text area.
responseTextArea.val(JSON.stringify(jqXHR, null, 2));
// Display error message.
var errorString = (errorThrown === "") ? "Error. " : errorThrown + " (" + jqXHR.status + "): ";
errorString += (jqXHR.responseText === "") ? "" : (jQuery.parseJSON(jqXHR.responseText).message) ?
jQuery.parseJSON(jqXHR.responseText).message : jQuery.parseJSON(jqXHR.responseText).error.message;
alert(errorString);
});
}

Run the handwriting function

Save the handwriting.html file and open it in a Web browser. Put your subscription key into the Subscription
Key field and verify that you are using the correct region in Subscription Region. Enter a URL to an image of text
to read, then click the Read Image button to analyze an image and see the result.

Next steps
In this guide, you used the Computer Vision REST API with JavaScript to test many of the available image analysis
features. Next, see the reference documentation to learn more about the APIs involved.
Computer Vision REST API

Tutorial: Computer Vision API Python
2/15/2019 • 2 minutes to read • Edit Online

This tutorial shows you how to use the Computer Vision API in Python and how to visualize your results using
popular libraries. You will use Jupyter to run the tutorial. To learn how to get started with interactive Jupyter
notebooks, refer to the Jupyter Documentation.

Prerequisites
Python 2.7+ or 3.5+
pip tool
Jupyter Notebook installed

Open the Tutorial Notebook in Jupyter
1.
2.
3.
4.

Go to the Cognitive Vision Python GitHub repo.
Click on the green button to clone or download the repo.
Open a command prompt and navigate to the folder Cognitive-Vision-Python\Jupyter Notebook.
Ensure you have all the required libraries by running the command
pip install requests opencv-python numpy matplotlib from the command prompt.
5. Start Jupyter by running the command jupyter notebook from the command prompt.
6. In the Jupyter window, click on Computer Vision API Example.ipynb to open the tutorial notebook.

Run the Tutorial
To use this notebook, you will need a subscription key for the Computer Vision API. Visit the Subscription page to
sign up. On the Sign in page, use your Microsoft account to sign in and you will be able to subscribe and get free
keys. After completing the sign-up process, paste your key into the Variables section of the notebook (reproduced
below ). Either the primary or the secondary key will work. Be sure to enclose the key in quotes to make it a string.
You will also need to make sure the

_region

field matches the region that corresponds to your subscription.

# Variables
_region = 'westcentralus' #Here you enter the region of your subscription
_url = 'https://{}.api.cognitive.microsoft.com/vision/v2.0/analyze'.format(_region)
_key = None #Here you have to paste your primary key
_maxNumRetries = 10

When you run the tutorial, you will be able to add images to analyze, both from a URL and from local storage. The
script will display the images and analysis information in the notebook.

Example: How to call the Computer Vision API
4/18/2019 • 5 minutes to read • Edit Online

This guide demonstrates how to call Computer Vision API using REST. The samples are written both in C# using
the Computer Vision API client library, and as HTTP POST/GET calls. We will focus on:
How to get "Tags", "Description" and "Categories".
How to get "Domain-specific" information (celebrities).

Prerequisites
Image URL or path to locally stored image.
Supported input methods: Raw image binary in the form of an application/octet stream or image URL
Supported image formats: JPEG, PNG, GIF, BMP
Image file size: Less than 4MB
Image dimension: Greater than 50 x 50 pixels
In the examples below, the following features are demonstrated:
1. Analyzing an image and getting an array of tags and a description returned.
2. Analyzing an image with a domain-specific model (specifically, "celebrities" model) and getting the
corresponding result in JSON retune.
Features are broken down on:
Option One: Scoped Analysis - Analyze only a given model
Option Two: Enhanced Analysis - Analyze to provide additional details with 86-categories taxonomy

Authorize the API call
Every call to the Computer Vision API requires a subscription key. This key needs to be either passed through a
query string parameter or specified in the request header.
To obtain a free trial key, see Try Cognitive Services. Or, follow the instructions in Create a Cognitive Services
account to subscribe to Computer Vision and get your key.
1. Passing the subscription key through a query string, see below as a Computer Vision API example:
https://westus.api.cognitive.microsoft.com/vision/v2.0/analyze?visualFeatures=Description,Tags&subscription-key=
<Your subscription key>

1. Passing the subscription key can also be specified in the HTTP request header:
ocp-apim-subscription-key: <Your subscription key>

1. When using the client library, the subscription key is passed in through the constructor of VisionServiceClient:
var visionClient = new VisionServiceClient("Your subscriptionKey");

Upload an image to the Computer Vision API service and get back tags,
descriptions and celebrities
The basic way to perform the Computer Vision API call is by uploading an image directly. This is done by sending a

"POST" request with application/octet-stream content type together with the data read from the image. For "Tags"
and "Description", this upload method will be the same for all the Computer Vision API calls. The only difference
will be the query parameters the user specifies.
Here’s how to get "Tags" and "Description" for a given image:
Option One: Get list of "Tags" and one "Description"
POST https://westus.api.cognitive.microsoft.com/vision/v2.0/analyze?
visualFeatures=Description,Tags&subscription-key=<Your subscription key>

using Microsoft.ProjectOxford.Vision;
using Microsoft.ProjectOxford.Vision.Contract;
using System.IO;
AnalysisResult analysisResult;
var features = new VisualFeature[] { VisualFeature.Tags, VisualFeature.Description };
using (var fs = new FileStream(@"C:\Vision\Sample.jpg", FileMode.Open))
{
analysisResult = await visionClient.AnalyzeImageAsync(fs, features);
}

Option Two Get list of "Tags" only, or list of "Description" only:
Ta g s o n l y :

POST https://westus.api.cognitive.microsoft.com/vision/v2.0/tag&subscription-key=<Your subscription key>
var analysisResult = await visionClient.GetTagsAsync("http://contoso.com/example.jpg");
De s c ri p t i o n o n l y :

POST https://westus.api.cognitive.microsoft.com/vision/v2.0/describe&subscription-key=<Your subscription key>
using (var fs = new FileStream(@"C:\Vision\Sample.jpg", FileMode.Open))
{
analysisResult = await visionClient.DescribeAsync(fs);
}

Get domain-specific analysis (celebrities)
Option One: Scoped Analysis - Analyze only a given model
POST https://westus.api.cognitive.microsoft.com/vision/v2.0/models/celebrities/analyze
var celebritiesResult = await visionClient.AnalyzeImageInDomainAsync(url, "celebrities");

For this option, all other query parameters {visualFeatures, details} are not valid. If you want to see all supported
models, use:
GET https://westus.api.cognitive.microsoft.com/vision/v2.0/models
var models = await visionClient.ListModelsAsync();

Option Two: Enhanced Analysis - Analyze to provide additional details with 86-categories taxonomy
For applications where you want to get generic image analysis in addition to details from one or more domainspecific models, we extend the v1 API with the models query parameter.

POST https://westus.api.cognitive.microsoft.com/vision/v2.0/analyze?details=celebrities

When this method is invoked, we will call the 86-category classifier first. If any of the categories match that of a
known/matching model, a second pass of classifier invocations will occur. For example, if "details=all", or "details"
include ‘celebrities’, we will call the celebrities model after the 86-category classifier is called and the result includes
the category person. This will increase latency for users interested in celebrities, compared to Option One.
All v1 query parameters will behave the same in this case. If visualFeatures=categories is not specified, it will be
implicitly enabled.

Retrieve and understand the JSON output for analysis
Here's an example:
{
"tags":[
{
"name":"outdoor",
"score":0.976
},
{
"name":"bird",
"score":0.95
}
],
"description":{
"tags":[
"outdoor",
"bird"
],
"captions":[
{
"text":"partridge in a pear tree",
"confidence":0.96
}
]
}
}

FIELD

TYPE

CONTENT