Computer Vision

Computer vision news covering image recognition, object detection, and visual AI models, as discussed on Hacker News and Reddit.

Articles from the last 30 days

The Waymo World Model: A New Frontier for Autonomous Driving Simulation
01 Friday, February 6, 2026

Waymo has introduced the Waymo World Model, a pioneering generative AI system designed for hyper-realistic autonomous driving simulation. Built upon Google DeepMind's Genie 3, the model moves beyond traditional on-road data by leveraging vast pre-trained world knowledge to simulate rare, long-tail scenarios such as extreme weather or unexpected obstacles. The system features high controllability through language prompts, scene layouts, and driving inputs, allowing for 'what-if' counterfactual testing. Crucially, it generates multimodal outputs including both camera imagery and 4D lidar point clouds, providing a comprehensive training environment for the Waymo Driver. This advancement enhances road safety by preparing the vehicle for complex edge cases long before it encounters them in reality, significantly scaling Waymo's ability to deploy across diverse urban environments.

Sources: Hacker News (1075 pts)

Nano Banana 2: Google's latest AI image generation model
02 Thursday, February 26, 2026

Google introduces Nano Banana 2, a state-of-the-art image model combining the reasoning of Nano Banana Pro with Flash-level speed. It features advanced world knowledge for infographics, precise text rendering, and improved subject consistency. The model integrates SynthID and C2PA credentials for robust provenance and is rolling out across the Gemini app, Search, and Vertex AI.

Sources: Hacker News (546 pts)

Beginning fully autonomous operations with the 6th-generation Waymo driver
03 Thursday, February 12, 2026

Waymo has unveiled its 6th-generation Driver, a fully autonomous system featuring streamlined hardware and custom-designed sensing technology. By integrating high-resolution cameras, lidar, and radar with advanced AI, the system reduces costs while enhancing performance in diverse weather. This scalable architecture is designed for high-volume production across multiple vehicle platforms.

Sources: Hacker News (268 pts)

OpenScan
04 Friday, February 20, 2026

This gallery showcases high-quality 3D reconstructions made with OpenScan hardware. Featuring models such as the Giant Swallowtail and Ammonite, it highlights the versatility of the OpenScan Classic and OpenScan Mini scanners when combined with DSLR cameras and photogrammetry software. The textured meshes demonstrate applications in entomology, paleontology, and digital preservation, using tools such as OpenScanCloud and 3DF Zephyr.

Sources: Hacker News (193 pts)

Computer Vision News & Summaries for Developers | Snapbyte.dev