BETA — Сайт у режимі бета-тестування. Можливі помилки та зміни.
UK | EN |
LIVE
Технології 🇺🇸 США

Auge Vision: Advanced Image Analysis Tool Now Accessible Directly from Terminal

Hacker News franze 0 переглядів 2 хв читання

A new command-line utility enables comprehensive image recognition capabilities, including face detection, optical character recognition, and content classification, all executable from the terminal.

The tool demonstrates its functionality through analysis of a historic 1948 photograph from Bell Labs featuring transistor inventors John Bardeen, William Shockley, and Walter Brattain—the three scientists who pioneered the transistor in December 1947. The system successfully identifies all three individuals in the image.

Core Capabilities

When operating in comprehensive mode using the command auge --all bell-labs-transistor.jpg, the application performs multiple simultaneous analyses:

Face Detection

The tool identified 3 faces within the photograph, providing precise bounding box coordinates for each individual. The first face registers at coordinates x=0.273, y=0.700 with dimensions 0.123 by 0.155. The second face appears at x=0.442, y=0.495 measuring 0.119 by 0.149. The third face is positioned at x=0.609, y=0.651 with dimensions 0.129 by 0.162.

Content Classification

The classification engine generated ten distinct labels assessing image content:

  • People: 81% confidence
  • Adult: 81% confidence
  • Clothing: 71% confidence
  • Necktie: 67% confidence
  • Cord: 49% confidence
  • Suit: 49% confidence
  • Structure: 35% confidence
  • Furniture: 35% confidence
  • Table: 34% confidence
  • Tableware: 26% confidence

Text Recognition and Barcode Detection

Optical character recognition analysis returned no text content. Similarly, barcode and QR code scanning detected neither barcodes nor QR codes within the image.

Technical Details

The analysis operates with on-device processing, utilizing version 1.1.0 of the system. Results are provided in raw JSON format containing complete technical specifications for each detection module, enabling integration into broader workflows and applications.

Поділитися

Схожі новини