managed computer vision services

A Feature Comparison of Managed Computer Vision Services

Lucid Content

Reading time: about 4 min

Are you considering the use of a cloud-based, managed computer vision service like Amazon Rekognition or Google Cloud Vision in your application? Do you want to know how the Amazon and Google services compare to one another feature by feature? By cost? By constraint?

Then you’re in the right place.

I compiled this information when I was evaluating Amazon Rekognition for @AircraftML, an aircraft classification system that uses Rekognition to assist in aircraft detection. This information helped me decide which service was right for AircraftML and ultimately led to me choosing Amazon Rekognition for my use case.

To broaden the comparison beyond just Amazon and Google (I was already familiar with both), I included other providers identified in Gartner’s Infrastructure as a Service (IaaS) Magic Quadrant. I left out Alibaba and Oracle because they do not currently offer managed computer vision services.

At the end of this article, I’ve included several examples demonstrating the results obtained from each of the services for tasks performed by AircraftML.

Feature comparison

Providers generally use the same terminology when describing features, but where necessary, I’ve attempted to normalize descriptions for clarity. Features common across all providers are listed at the top of the table, and features unique to single providers are at the bottom. Providers are listed left to right in alphabetical order. feature comparison of managed computer vision services

Service constraints

service constraints comparison table Notes:

  • Amazon does not allow images to be specified by URI. For images larger than 5MB, users would need to first upload to S3, then process with Rekognition.
  • Microsoft only allows URI-based image specification and also restricts image sizes to just 4MB.

Locations and access

locations and access comparison table

Feature costs

feature costs comparison table

  1. US East (N. Virginia)
  2. East US 2 Region

Examples

The examples below represent a sample of the tests I did with each of the services and the results they provided. All of these tests were done using the web consoles found here:

aircraft detection example

Aircraft detection

AircraftML uses Amazon Rekognition to decide whether an image contains an aircraft or not. It does this by requesting image labels from Amazon Rekognition and then inspecting those labels for aircraft-related terms. The example below shows the different labels returned by each of the provider services. aircraft detection comparison table Notes:

  • This table includes only labels returned with greater than 50% confidence by each service.

aircraft detection example

Tail number recognition

AircraftML uses Rekognition to extract aircraft tail numbers from images and augment the aircraft classification process. The example below shows the results of tail number detection using each of the provider services. “Y” indicates that the tail number was correctly identified by the service; “N” means it wasn’t. tail number recognition comparison table Notes:

  • Microsoft UI consistently returned a limit error when images were less than 4MB in size. It turns out there is also a 3200x3200 pixel limit when performing OCR.
  • IBM consistently failed all attempts at tail number recognition. In their defense, this is feature is listed as in “limited beta.”

Pricing example

The following table shows the approximate costs for AircraftML to evaluate two million images per month, including aircraft detection, content filtering, and tail number recognition. pricing comparison table Notes:

  • AircraftML processes nowhere near two million images per month. Most months, it would qualify under each provider’s free tier.

Conclusion

I hope you find this information helpful and that it's useful in your search for a managed computer vision service. Please let me know if you find any errors or omissions, or there are other details that you think might also be helpful to include. Good luck on your next project!

Lucidchart

Lucidchart, a cloud-based intelligent diagramming application, is a core component of Lucid Software's Visual Collaboration Suite. This intuitive, cloud-based solution empowers teams to collaborate in real-time to build flowcharts, mockups, UML diagrams, customer journey maps, and more. Lucidchart propels teams forward to build the future faster. Lucid is proud to serve top businesses around the world, including customers such as Google, GE, and NBC Universal, and 99% of the Fortune 500. Lucid partners with industry leaders, including Google, Atlassian, and Microsoft. Since its founding, Lucid has received numerous awards for its products, business, and workplace culture. For more information, visit lucidchart.com.

Bring your bright ideas to life.

Sign up free

or continue with

Sign in with GoogleSign inSign in with MicrosoftSign inSign in with SlackSign in

Get started

  • Pricing
  • Individual
  • Team
  • Enterprise
  • Contact sales
PrivacyLegal
  • linkedin
  • twitter
  • instagram
  • facebook
  • youtube
  • glassdoor
  • tiktok

© 2024 Lucid Software Inc.