NVIDIA has officially released its AI Blueprint for Video Search and Summarization (VSS), a major milestone in the evolution of video analytics. Built on the NVIDIA Metropolis platform, this toolkit enables developers to deploy AI agents capable of real-time and archived video analysis—dramatically accelerating insights for manufacturing, smart cities, logistics, media, and more.
Unlocking the Potential of the Most Underutilized Data Source: Video
Video accounts for more than 50% of global data traffic, yet less than 1% is analyzed. NVIDIA’s VSS blueprint changes that by offering tools to build AI agents that can see, search, and summarize video 100x faster than real-time viewing.
Backed by NVIDIA’s latest vision-language models (VLMs) and large language models (LLMs) like VILA, Llama Nemotron, and NeMo Retriever microservices, the blueprint supports advanced techniques such as retrieval-augmented generation (RAG) for faster, smarter results.
Key Capabilities
- Summarize 1-hour video in under 1 minute
- Supports real-time or batch processing of hundreds of streams
- Now deployable on a single NVIDIA A100, H100 GPU, or RTX 6000 PRO at the edge
- Includes speech-to-text for richer context
These capabilities enable enterprises to monitor operations, detect anomalies, and retrieve key video segments with minimal latency—ideal for industries from manufacturing to public safety.
Real-World Impact Across Industries
Pegatron uses VSS-powered AI agents to train staff, analyze production workflows, and improve efficiency—resulting in a 7% reduction in labor costs and a 67% drop in defect rates.
Kaohsiung City, Taiwan, in collaboration with Linker Vision, has deployed VSS agents across 12 departments, improving emergency response and traffic monitoring. The city aims to scale from 30,000 to over 50,000 cameras by 2026, cutting response times by up to 80%.
The NHL uses the VAST InsightEngine built with VSS to scan petabytes of game footage in seconds—automatically clipping, tagging, and assembling highlight reels. In the future, these agents could even generate live game stats and strategy analysis.
Siemens integrated VSS tools into its Industrial Copilot, providing real-time maintenance support on factory floors. This has already boosted productivity by 30%, with the potential to hit 50%.
A Thriving Ecosystem of AI Agent Builders
- Superb AI fast-tracked airport video analytics for Incheon Airport in weeks.
- ITMAX Malaysia is improving city operations in Kuala Lumpur with VSS-powered agents.
- PYLER used VSS to help Samsung and BYD boost ad effectiveness, with 4x increases in click-through rates.
- Fingermark is embedding VSS into Eyecue, its video platform for drive-thru performance in quick-service restaurants.
Try it now: The VSS blueprint is available at build.nvidia.com.
Watch Jensen Huang’s COMPUTEX 2025 keynote and sessions from NVIDIA GTC Taipei to explore more on Physical AI and vision agents.