Feed

Data Science

Discover data science insights covering statistical analysis, visualization, and machine learning workflows. Our digest aggregates pandas notebooks, ML frameworks, and data engineering discussions from developer communities across Hacker News and Reddit.

Articles from the last 30 days

The Singularity will occur on a Tuesday
01Tuesday, February 10, 2026

The Singularity will occur on a Tuesday

The article explores the concept of the 'Singularity' using hyperbolic modeling applied to five key AI progress metrics, including MMLU scores, cost efficiency, and research output. The author argues that while technical metrics like performance and infrastructure appear to follow a linear growth path, the human perception and academic excitement surrounding 'emergent' behaviors are accelerating at a hyperbolic rate toward a vertical asymptote. This mathematical approach predicts a specific 'Singularity' date in 2034. However, the author emphasizes that the 'Social Singularity' is already occurring, manifesting as institutional collapse, labor market disruption, and psychological anxiety. The core takeaway is that the machines are improving at a constant rate, but human franticness and attention are the components actually hitting a singularity point, leading to a breakdown in our collective ability to process and regulate the technology.

Sources:Hacker News1275 pts
Poison Fountain: An Anti-AI Weapon
02Thursday, February 19, 2026

Poison Fountain: An Anti-AI Weapon

The Poison Fountain technique generates vast amounts of subtly incorrect data to protect against unauthorized web scraping. By injecting small errors into code, structured data, and prose, it creates a 'practically endless' stream of adversarial content that degrades the quality of datasets used for training machine learning models and artificial intelligence.

Sources:/r/programming858 pts
Recreating Epstein PDFs from raw encoded attachments
03Wednesday, February 4, 2026

Recreating Epstein PDFs from raw encoded attachments

The Department of Justice (DoJ) release of the Epstein archives has been criticized for numerous technical failures, including poor redaction, broken search functionality, and corrupted encoding. A significant oversight discovered in the dump is the inclusion of raw base64-encoded email attachments. While the DoJ attempted to censor the archives, they inadvertently left pages of hex and base64 string data visible in the document scans. This article explores the technical challenge of reconstructing a PDF attachment (a benefit invitation) from 76 pages of low-quality, OCR-unfriendly Courier New text. The author documents failed attempts using Tesseract and Adobe Acrobat, and provides a partially successful workflow using poppler-utils and AWS Textract. The primary difficulty lies in the phonetic and visual ambiguity of characters like '1' and 'l' within JPEG-compressed scans, presenting a unique digital forensics challenge for the open-source community.

Agent Skills
04Tuesday, February 3, 2026

Agent Skills

Agent Skills represent a significant advancement in autonomous system architecture by providing agents with the necessary procedural knowledge and specific context required for reliable enterprise work. By decoupling capabilities from the core model, Skill authors can build specialized functions once and deploy them across various agent products. This framework enables domain expertise in fields like legal review or data analysis, while also allowing for the creation of repeatable, auditable workflows. Furthermore, it fosters interoperability, letting teams capture organizational intelligence in version-controlled packages that can be loaded on demand to extend an agent's functionality for specific tasks.

Sources:Hacker News501 pts
Is Show HN Dead? No, but It's Drowning
05Tuesday, February 17, 2026

Is Show HN Dead? No, but It's Drowning

Data analysis reveals that Show HN is experiencing a significant surge in submission volume, leading to shorter visibility windows and decreased engagement per post. This trend, dubbed the Sideprocalypse, suggests that indie projects struggle to compete against a noise-heavy landscape, making discovery of high-quality niche projects increasingly difficult.

Sources:Hacker News446 pts
Hard-braking events as indicators of road segment crash risk
06Monday, February 9, 2026

Hard-braking events as indicators of road segment crash risk

Google Research has validated hard-braking events (HBEs) sourced from Android Auto as a proactive 'leading' indicator for road safety. Unlike traditional crash data, which is 'lagging' and statistically sparse, HBEs—defined as forward deceleration exceeding -3m/s²—provide a high-density data stream for traffic safety assessment. By analyzing 10 years of crash data from California and Virginia alongside anonymized HBE measurements, researchers used negative binomial regression to confirm a significant correlation between hard braking and actual crash risk. The HBE signal identified high-risk segments 18 times more frequently than crash reports, even capturing dangers at complex freeway merges before historical data could. This methodology allows transportation agencies to identify hazardous infrastructure early and implement interventions like redesigned signage or signal timing adjustments through the Roads Management Insights platform.

Sources:Hacker News341 pts
“Car Wash” test with 53 models
07Thursday, February 19, 2026

“Car Wash” test with 53 models

The "Car Wash Test" reveals a significant reasoning flaw in 42 out of 53 leading AI models. Most models prioritize distance over the car's presence at the wash, suggesting walking 50 meters. Only five models, including Claude Opus 4.6 and Gemini 3 Pro, consistently passed, highlighting reliability issues in LLM contextual logic compared to human performance.

Sources:Hacker News332 pts
Large-Scale Online Deanonymization with LLMs
08Tuesday, February 24, 2026

Large-Scale Online Deanonymization with LLMs

Researchers demonstrated that LLM agents can perform large-scale deanonymization across platforms like Reddit and Hacker News. By combining reasoning with search and embeddings, these models identify users from unstructured data with high precision. This study highlights significant privacy risks and the increasing feasibility of scalable AI-driven surveillance and people search.

Sources:Hacker News306 pts
The First Fully General Computer Action Model
09Monday, February 23, 2026

The First Fully General Computer Action Model

FDM-1 is a foundation model for computer use trained on an 11-million-hour video dataset. Utilizing a novel video encoder, it processes nearly two hours of video within 1M tokens. By employing an inverse dynamics model for automated labeling, it enables unsupervised internet-scale learning, facilitating long-context tasks in engineering, finance, and CAD beyond previous limitations.

Sources:Hacker News300 pts
Can you reverse engineer our neural network?
10Tuesday, February 24, 2026

Can you reverse engineer our neural network?

Jane Street released a unique 'capture-the-flag' machine learning puzzle featuring a hand-designed neural network with integer weights. Unlike traditional models, it resisted backpropagation and brute force. A student reverse-engineered it using mechanistic interpretability and SAT solvers, discovering it implemented an MD5 hash algorithm. The challenge highlights complex model interpretation and internal logic reconstruction.

Sources:Hacker News273 pts
Audio is the one area small labs are winning
11Thursday, February 12, 2026

Audio is the one area small labs are winning

The article explores the rise of specialized startups like Gradium and Kyutai in the audio AI space. Despite limited funding compared to major labs, these small teams outperform giants through deep domain expertise, innovative full-duplex architectures for real-time conversation, and efficient neural codecs like Mimi, positioning audio as a critical future modality.

Sources:Hacker News269 pts
I traced 3,177 API calls to see what 4 AI coding tools put in the context window
12Thursday, February 19, 2026

I traced 3,177 API calls to see what 4 AI coding tools put in the context window

A developer analyzed 3,177 API calls from four AI coding tools—Claude Opus, Claude Sonnet, Codex, and Gemini—using a custom tool called Context Lens. The study reveals stark differences in context window usage and strategies, ranging from Opus's surgical precision to Gemini's aggressive data consumption of up to 350,000 tokens for a simple bug fix.

Sources:/r/programming250 pts
I converted 2D conventional flight tracking into 3D
13Tuesday, February 17, 2026

I converted 2D conventional flight tracking into 3D

Aeris is a live flight tracking dashboard focused on San Francisco airspace. Leveraging data from OpenSky Network, OpenStreetMap, and MapLibre, it allows users to monitor real-time aircraft movements at various altitudes ranging from 43,000 feet down to ground level. The tool provides a clean, visual representation of aviation traffic for enthusiasts and professionals.

Sources:Hacker News239 pts
Smallest transformer that can add two 10-digit numbers
14Wednesday, February 25, 2026

Smallest transformer that can add two 10-digit numbers

This project tracks the smallest Transformers capable of adding two 10-digit numbers with over 99% accuracy. It distinguishes between trained models (weights learned via algorithms) and hand-coded models (analytically set weights). Currently, the smallest hand-coded model uses only 36 parameters, while the smallest trained model utilizes 311.

Sources:Hacker News223 pts
Acme Weather
15Monday, February 16, 2026

Acme Weather

The creators of Dark Sky have launched Acme Weather, a new app focused on forecast uncertainty. It utilizes multiple data sources to provide a spread of alternate predictions rather than a single guess. Key features include community-driven reports, comprehensive maps, custom notifications, and experimental meteorological tools through Acme Labs.

Sources:Hacker News218 pts
Don't Trust the Salt: AI Summarization, Multilingual Safety, and LLM Guardrails
17Monday, February 16, 2026

Don't Trust the Salt: AI Summarization, Multilingual Safety, and LLM Guardrails

Research into AI summarization reveals critical vulnerabilities in multilingual safety and guardrails. Using techniques like Bilingual Shadow Reasoning, models can bypass ethics policies through non-English hidden prompts. Evaluations show that non-English responses often lack necessary safety disclaimers and factual accuracy compared to English, highlighting significant inconsistencies in global LLM deployment.

Sources:Hacker News201 pts
Where did all the starships go?
18Tuesday, February 3, 2026

Where did all the starships go?

Jonathan, the technical lead for Datawrapper, explores the shifting landscape of speculative fiction by analyzing approximately 210,000 book titles from the Internet Speculative Fiction Database (ISFDB). His research highlights a significant decline in classic science fiction keywords like 'Space', 'Mars', and 'Planet' since their peak in the 1950s and 60s. Conversely, fantasy-related terms such as 'Dragon', 'Magic', and 'Witch' have seen a sharp increase in popularity, particularly following the commercial success of major franchises like Harry Potter and The Lord of the Rings. While genre-agnostic terms like 'War' and 'Death' remain stable, the data suggests that traditional space-themed science fiction has ceded significant commercial territory to fantasy and supernatural genres. This analysis demonstrates how keyword trends in titles can reflect broader cultural shifts and the commercial evolution of literary genres over several decades.

Sources:Hacker News191 pts
Show HN: A real-time strategy game that AI agents can play
19Wednesday, February 25, 2026

Show HN: A real-time strategy game that AI agents can play

LLM Skirmish is a new adversarial benchmark that evaluates LLMs through 1v1 real-time strategy games. Models like Claude Opus 4.5 and GPT 5.2 write Javascript code to compete, testing their in-context learning and coding capabilities across five-round tournaments. Results show Claude leading in ELO, while the benchmark highlights performance variances and context managing issues in models like Gemini 3 Pro.

Sources:Hacker News174 pts
Show HN: A physically-based GPU ray tracer written in Julia
20Monday, February 16, 2026

Show HN: A physically-based GPU ray tracer written in Julia

RayMakie and Hikari introduce a physically-based GPU ray tracing pipeline for the Julia Makie ecosystem. It enables photorealistic rendering, volumetric media, and spectral path tracing directly within research workflows. Supporting AMD, NVIDIA, and CPUs, it eliminates data export needs for fields like climate science, structural biology, and fluid dynamics.

Sources:Hacker News168 pts