Google Cloud Vision API | AI Agent Tools

google.com

•

One-click Login

About Google Cloud Vision API

Google Cloud Vision API enables businesses to analyze and interpret visual content such as images, documents, and videos using advanced AI models. It offers features like object detection, optical character recognition (OCR) for printed and handwritten text, facial recognition, logo and landmark detection, and content moderation to filter inappropriate material. By automating these visual recognition tasks, it eliminates manual processing, accelerating workflows and enhancing accuracy. Industries such as e-commerce, healthcare, security, and media leverage the API to improve product tagging, medical image analysis, identity verification, and document digitization. Its integration with Google Cloud and flexible pricing make it accessible for businesses of all sizes to unlock valuable insights from their visual data.

Available Actions

These are the specific actions that AI agents can perform with this tool

Annotate Images

1 input

Run image detection and annotation for a batch of images.

Inputs

requests

A list of individual image annotation requests for this batch.

Asynchronous File Annotation

2 inputs

Perform asynchronous image detection and annotation for files like PDFs.

Inputs

requests

Batch of file annotation requests containing file content and features to detect.

outputConfig

Specifies where to store the results of the annotation.

Cancel Vision Operation

1 input

Request to cancel a long-running Cloud Vision operation

Inputs

name

The name of the operation resource to be cancelled

Get Operation Status

2 inputs

Retrieve the latest state of a long-running operation using its unique name

Inputs

name

The server-assigned name, unique within the service, which identifies the long-running operation.

Authorization

OAuth 2.0 Bearer token for authentication

Project Files Annotation

2 inputs

Perform image detection and annotation for a batch of files in a specified project.

Inputs

parent

The name of the project in which the files reside.

requests

The list of file annotation requests, each containing the file information and features to apply.

About Google Cloud Vision API

Available Actions

These are the specific actions that AI agents can perform with this tool

Annotate Images

1 input

Run image detection and annotation for a batch of images.

Inputs

requests

A list of individual image annotation requests for this batch.

Asynchronous File Annotation

2 inputs

Perform asynchronous image detection and annotation for files like PDFs.

Inputs

requests

Batch of file annotation requests containing file content and features to detect.

outputConfig

Specifies where to store the results of the annotation.

Cancel Vision Operation

1 input

Request to cancel a long-running Cloud Vision operation

Inputs

name

The name of the operation resource to be cancelled

Get Operation Status

2 inputs

Retrieve the latest state of a long-running operation using its unique name

Inputs

name

The server-assigned name, unique within the service, which identifies the long-running operation.

Authorization

OAuth 2.0 Bearer token for authentication

Project Files Annotation

2 inputs

Perform image detection and annotation for a batch of files in a specified project.

Inputs

parent

The name of the project in which the files reside.

requests

The list of file annotation requests, each containing the file information and features to apply.