Feed

Database

Database technology news covering PostgreSQL, MySQL, NoSQL, query optimization, and data modeling from developer communities.

Articles from the last 30 days

It's 2026, Just Use Postgres
01Monday, February 2, 2026

It's 2026, Just Use Postgres

This article challenges the 'use the right tool for the right job' mantra that often leads developers to manage multiple specialized databases (Elasticsearch, Pinecone, Redis, etc.). Such sprawl creates technical debt, complex security models, and high operational costs. Instead, the author advocates for using PostgreSQL as a unified platform. Innovations like pgvector (vector search), TimeScaleDB (time-series), and pg_textsearch (BM25 search) allow Postgres to match specialized tools' performance using similar algorithms. This simplification is especially critical in the AI era, where maintaining consistent data for agents is easier within a single, atomic environment. The post concludes that 99% of companies can achieve their goals with Postgres alone, reducing architecture fragmentation and potential failure points.

Sources:Hacker News435 pts
The dumbest performance fix ever
02Saturday, January 31, 2026

The dumbest performance fix ever

In this technical narrative, Leónidas Neftalí González Campos recounts a significant optimization challenge involving a REST API built with C# and the ABP framework. The issue centered on a 'Users list' endpoint that appeared to crash but was actually just extremely inefficient, taking over five minutes to complete bulk operations. The root cause was a '2-year-old hotfix' where developers used a foreach loop to await individual database inserts one by one. By bypassing the outdated ABP repository and directly implementing Entity Framework Core's AddRange and SaveChangesAsync methods, the author reduced execution time from five minutes to 300 milliseconds. The story highlights the dangers of technical debt and a 'feature factory' management culture that prioritizes speed over quality, emphasizing that significant gains often come from removing suboptimal practices rather than complex innovations.

Sources:/r/programming422 pts
Bunny Database
03Tuesday, February 3, 2026

Bunny Database

Bunny Database has launched in public preview, offering a managed SQLite-compatible database service designed to bridge the gap between complex self-managed VMs and expensive DBaaS platforms. Built on a fork of libSQL, it features a serverless-like architecture that spins down when idle to save costs while utilizing 41 global regions to minimize latency via data locality. Key features include one-click deployment, SDK support for major languages like TypeScript, Go, and Rust, and a pay-as-you-go pricing model based on actual reads, writes, and storage. The service emphasizes developer experience with integrated metrics, a database editor, and seamless integration with edge computing environments like Bunny Edge Scripting. During its preview phase, the service is free to use, allowing for up to 50 databases per account.

Sources:Hacker News296 pts
Show HN: PgDog – Scale Postgres without changing the app
04Monday, February 23, 2026

Show HN: PgDog – Scale Postgres without changing the app

PgDog is an open-source, high-performance PostgreSQL proxy written in Rust. It offers advanced connection pooling, load balancing, and sharding capabilities. Featuring application-layer routing, health checks, and automatic failover, it supports complex cross-shard queries and online re-sharding. It is designed for scalability on commodity hardware and integrates with Kubernetes, AWS RDS, and Terraform.

Sources:Hacker News286 pts
AliSQL: Alibaba's open-source MySQL with vector and DuckDB engines
05Tuesday, February 3, 2026

AliSQL: Alibaba's open-source MySQL with vector and DuckDB engines

AliSQL is Alibaba's specialized open-source branch of MySQL, specifically forked and optimized for massive-scale production environments. The latest release, version 8.0.44 (LTS), introduces significant enhancements including the native integration of DuckDB as a storage engine, which provides users with lightweight analytical capabilities using standard SQL interfaces. Furthermore, AliSQL now supports enterprise-grade vector storage with up to 16,383 dimensions, utilizing an optimized HNSW algorithm for high-performance Approximate Nearest Neighbor search, making it a powerful tool for AI-driven applications like semantic search. The project roadmap includes ambitious improvements to DDL efficiency, recovery time objectives (RTO), and replication throughput via parallel processing. These features aim to reduce latency and eliminate replication lag, ensuring high availability and stability for modern, data-intensive organizations.

Sources:Hacker News264 pts
PostgreSQL Bloat Is a Feature, Not a Bug
06Monday, February 16, 2026

PostgreSQL Bloat Is a Feature, Not a Bug

This article explores PostgreSQL bloat, explaining how MVCC and the physical storage layer (Pages and Tuples) lead to dead space during UPDATE and DELETE operations. It details the performance impact of accumulated dead tuples and provides practical solutions using VACUUM, REINDEX, and autovacuum tuning to manage database size and efficiency effectively.

Sources:/r/programming250 pts
Sqldef: Idempotent schema management tool for MySQL, PostgreSQL, SQLite
07Sunday, February 1, 2026

Sqldef: Idempotent schema management tool for MySQL, PostgreSQL, SQLite

sqldef is an innovative command-line interface tool designed to streamline database schema management by performing diff operations between two SQL schemas. Unlike traditional migration tools that require managing stateful migration files, sqldef allows developers to update their relational database management systems by comparing a desired SQL schema with the current live state. The tool provides robust support for a wide range of popular databases, including MySQL, MariaDB, TiDB, PostgreSQL, SQL Server, and SQLite3. By generating the necessary DDL statements to reconcile differences, it simplifies the Continuous Integration and deployment lifecycle for backend developers. Additionally, an online demo environment utilizes a WebAssembly build of sqldef, enabling users to experiment with schema comparisons and DDL generation directly within a web browser without local installation.

Sources:Hacker News226 pts
What Is a Database Transaction?
08Sunday, February 22, 2026

What Is a Database Transaction?

This article explores database transactions, explaining atomicity through BEGIN, COMMIT, and ROLLBACK. It compares Postgres's multi-versioning (MVCC) and VACUUM processes against MySQL's undo logs. Additionally, it details isolation levels—Serializable, Repeatable Read, Read Committed, and Read Uncommitted—and how both systems handle concurrent writes using locking or optimistic serialization.

Sources:Hacker News215 pts
Show HN: SQL-tap – Real-time SQL traffic viewer for PostgreSQL and MySQL
09Saturday, February 14, 2026

Show HN: SQL-tap – Real-time SQL traffic viewer for PostgreSQL and MySQL

sql-tap is a real-time SQL traffic viewer composed of a proxy daemon and a TUI client. It intercepts PostgreSQL or MySQL queries using wire protocols to display transactions and performance metrics. Users can run EXPLAIN, analyze execution times, and inspect queries without modifying application code, streaming data via gRPC to an interactive terminal.

Sources:Hacker News210 pts
Zvec: A lightweight, fast, in-process vector database
10Friday, February 13, 2026

Zvec: A lightweight, fast, in-process vector database

Zvec is an open-source, in-process vector database built on Alibaba's Proxima engine. It offers high-performance, low-latency similarity search for dense and sparse vectors. Designed for easy integration via Python or Node.js, it supports hybrid search and multi-vector queries directly within applications without requiring separate server management.

Sources:Hacker News206 pts
TIL: Apple Broke Time Machine Again on Tahoe
11Sunday, February 1, 2026

TIL: Apple Broke Time Machine Again on Tahoe

The author details a systematic failure of macOS Time Machine backups targeting Synology NAS devices following recent Apple software updates. Investigations revealed that Apple changed SMB protocol defaults, specifically regarding mandatory signing and directory caching, which broke compatibility with many NAS configurations. The post provides a technical walkthrough for fixing these issues by editing the local nsmb.conf file on macOS and adjusting SMB settings within the Synology DSM interface. Furthermore, the author explores a more resilient alternative using a Docker-based Samba implementation (mbentley/timemachine) on a Proxmox ZFS environment to bypass proprietary software limitations. The narrative criticizes Apple for its lack of communication regarding breaking changes in critical backup infrastructure and ongoing bugs in its device restoration processes.

Sources:Hacker News182 pts
We gave terabytes of CI logs to an LLM
12Friday, February 27, 2026

We gave terabytes of CI logs to an LLM

Mendral (YC W26) utilizes a SQL interface and ClickHouse to empower AI agents to investigate flaky CI tests. By denormalizing 48 metadata columns per log line, they achieve 35:1 compression and millisecond query latency. This architecture allows agents to autonomously scan billions of log lines, identifying root causes across months of history in seconds.

Sources:Hacker News173 pts
Two Bits Are Better Than One: making bloom filters 2x more accurate
13Monday, February 16, 2026

Two Bits Are Better Than One: making bloom filters 2x more accurate

Floe explains using bloom filters, probabilistic data structures that efficiently speed up SQL queries by filtering non-matching rows before decompression. By implementing a fixed-size 256KB filter with two bits stored in a single uint32, they reduced false positive rates by 2x (11.7% to 5.7%) with negligible performance costs, significantly optimizing database join operations.

Sources:Hacker News172 pts
How we made geo joins 400× faster with H3 indexes
14Thursday, February 5, 2026

How we made geo joins 400× faster with H3 indexes

Geospatial joins, which use spatial predicates like ST_Intersects, often suffer from performance issues at scale due to their quadratic complexity. While conventional joins utilize efficient hash partitioning, spatial predicates lack a clean join key, forcing expensive row-by-row comparisons. Floe addresses this by automatically rewriting queries to utilize H3 indexes, a hexagonal hierarchical tiling system that converts complex geometries into big-integer cell IDs. By representing shapes as sets of H3 cells, the system performs a fast integer equi-join as a pre-filter, followed by an exact spatial recheck to eliminate false positives. This method effectively reduces candidate pairs by up to 99 percent. Benchmark results show that optimizing H3 resolution can lead to speedups of nearly 400x compared to baseline queries, transforming expensive spatial operations into efficient parallel hash joins.

Sources:Hacker News154 pts
A DuckDB-based metabase alternative
15Wednesday, February 18, 2026

A DuckDB-based metabase alternative

Shaper is an open-source, SQL-driven business intelligence tool powered by DuckDB. It allows developers to build analytics dashboards through SQL, supporting Git-based workflows and AI-ready features. Users can embed white-labeled analytics using JS and React SDKs, automate reports, and connect to multiple data sources. It is self-hostable via Docker and licensed under MPL 2.0.

Sources:Hacker News149 pts
The Perils of ISBN
16Sunday, February 15, 2026

The Perils of ISBN

The author critiques the poor UI/UX of GoodReads and explores building a Letterboxd-like alternative for books. The project faces engineering challenges, specifically the lack of a clean 'works-based' canonical database compared to film, as existing APIs like Google Books and OpenLibrary struggle with ISBN fragmentation and data duplication.

Sources:Hacker News125 pts
How I Cut My Google Search Dependence in Half
17Monday, February 9, 2026

How I Cut My Google Search Dependence in Half

The creator of Hister, a self-hosted web history management tool, details how they reduced their dependence on Google Search by 50% in just 1.5 months. The core insight is the distinction between Discovery Search (finding new content) and Recall Search (refinding previously visited content). While search engines like Google are plagued by excessive advertising, SEO manipulation, and privacy concerns, Hister allows users to index the full text of their own browsing history locally. This includes content behind authentication and private networks that global search engines cannot access. By prioritizing speed, privacy, and local full-text indexing, Hister serves as a personal knowledge base that improves over time. The project is open-source under the AGPLv3 license and aims to provide a faster, ad-free alternative for professionals who frequently revisit known documentation or resources.

Sources:Lobsters124 pts
Google Street View in 2026
18Thursday, February 26, 2026

Google Street View in 2026

The author details a workflow for processing Google Street View coverage datasets using a high-performance AMD Ryzen 9 workstation. By leveraging DuckDB, Parquet, and QGIS, the project converts 647 MB of JSON data into a spatially-sorted 85 MB Parquet file to analyze global geospatial patterns and update frequencies from 2003 to 2025.

Sources:Hacker News114 pts
Why the KeePass format should be based on SQLite
19Tuesday, February 24, 2026

Why the KeePass format should be based on SQLite

The article argues for transitioning the KeePass KDBX file format from XML to an SQLite-based structure using SQLCipher. It highlights issues with brittle schemas, storage inefficiencies, and centralized governance. Proponents suggest SQLite would improve performance, enable delta syncs, and facilitate a more democratic development process among community forks.

Sources:Hacker News110 pts
Postgres Postmaster does not scale
20Wednesday, February 4, 2026

Postgres Postmaster does not scale

Recall.ai, a platform for recording and processing millions of virtual meetings, encountered a rare scaling bottleneck within PostgreSQL. Due to the highly synchronized nature of meetings starting on the hour, their EC2 infrastructure faced extreme traffic bursts, leading to connection delays of up to 15 seconds. Investigation revealed that the PostgreSQL postmaster process operates on a single-threaded main loop responsible for spawning and reaping backends. During high connection churn or heavy background worker activity, this loop saturates a single CPU core, delaying new connections. The team mitigated this by enabling Linux huge pages to reduce 'fork' overhead, introducing jitter to stagger EC2 connection times, and limited parallel query bursts. The findings highlight that the primary bottleneck in PostgreSQL scaling is often the single-threaded nature of the postmaster rather than general resource availability.

Sources:Hacker News110 pts