🛠️ Hacker News Tools
Showing 701–720 of 1476 tools from Hacker News
Last Updated
January 18, 2026 at 04:00 PM
Show HN: Solving the cluster 1 problem with vCluster standalone
Hacker News (score: 11)[DevOps] Show HN: Solving the cluster 1 problem with vCluster standalone vcluster is an open source tool for Kubernetes multi tenancy and over the years it has matured to have hosted controlplane virtual cluster, shared virtual clusters but the host cluster problem was always there. With vcluster standalone, you can now create the first cluster also with the same developer experience and consolidate the multiple vendor problem. With this, you can now use vcluster for entire multi tenancy spectrum. Feel free to discuss, happy to answer any questuons.
Show HN: Recall: Give Claude perfect memory with Redis-backed persistent context
Hacker News (score: 27)[Other] Show HN: Recall: Give Claude perfect memory with Redis-backed persistent context Hey HN! I'm José, and I built Recall to solve a problem that was driving me crazy.<p>The Problem: I use Claude for coding daily, but every conversation starts from scratch. I'd explain my architecture, coding standards, past decisions... then hit the context limit and lose everything. Next session? Start over.<p>The Solution: Recall is an MCP (Model Context Protocol) server that gives Claude persistent memory using Redis + semantic search. Think of it as long-term memory that survives context limits and session restarts.<p>How it works: - Claude stores important context as "memories" during conversations - Memories are embedded (OpenAI) and stored in Redis with metadata - Semantic search retrieves relevant memories automatically - Works across sessions, projects, even machines (if you use cloud Redis)<p>Key Features: - Global memories: Share context across all projects - Relationships: Link related memories into knowledge graphs - Versioning: Track how memories evolve over time - Templates: Reusable patterns for common workflows - Workspace isolation: Project A memories don't pollute Project B<p>Tech Stack: - TypeScript + MCP SDK - Redis for storage - OpenAI embeddings (text-embedding-3-small) - ~189KB bundle, runs locally<p>Current Stats: - 27 tools exposed to Claude - 10 context types (directives, decisions, patterns, etc.) - Sub-second semantic search on 10k+ memories - Works with Claude Desktop, Claude Code, any MCP client<p>Example Use Case: I'm building an e-commerce platform. I told Claude once: "We use Tailwind, prefer composition API, API rate limit is 1000/min." Now every conversation, Claude remembers and applies these preferences automatically.<p>What's Next (v1.6.0 in progress): - CI/CD pipeline with GitHub Actions - Docker support for easy deployment - Proper test suite with Vitest - Better error messages and logging<p>Try it:<p>npm install -g @joseairosa/recall # Add to claude_desktop_config.json # Start using persistent memory
Testing a compiler-driven full-stack web framework
Hacker News (score: 21)[Other] Testing a compiler-driven full-stack web framework
Show HN: Baby's first international landline
Show HN (score: 66)[Other] Show HN: Baby's first international landline Hi HN,<p>As a weekend project, I hacked together a physical phone, a Raspberry Pi running Asterisk and Twilio, to let toddlers safely make international calls.<p>I’ve documented the setup in this write-up and published the code + Ansible playbooks on GitHub so others can replicate it.<p>I built this so kids of expats can easily stay in touch with family on other continents.<p>Would love feedback from anyone who’s worked on something similar or tries building this themselves!<p>writeup: <a href="https://wip.tf/posts/telefonefix-building-babys-first-international-landline/" rel="nofollow">https://wip.tf/posts/telefonefix-building-babys-first-intern...</a> github repos: - <a href="https://github.com/nbr23/ansible-role-telefonefix" rel="nofollow">https://github.com/nbr23/ansible-role-telefonefix</a> - <a href="https://github.com/nbr23/allo-wed" rel="nofollow">https://github.com/nbr23/allo-wed</a>
Scaling request logging with ClickHouse, Kafka, and Vector
Hacker News (score: 92)[Other] Scaling request logging with ClickHouse, Kafka, and Vector
Show HN: Oh Yah – Routine management app I built for my sons
Show HN (score: 96)[Other] Show HN: Oh Yah – Routine management app I built for my sons Hi HN! I built Oh Yah! to help my sons (age 7 and 10) stick to daily routines without constant reminders. The core idea: minimal distractions during tasks by locking navigation when a timer is running, plus optional photo-based task completion for accountability<p>Built with React Native/Expo and Firebase. The trickiest part was designing the UX to be simple enough for kids with minimal distractions while giving parents enough control – ended up with a task-definition system that lets parents create weekly schedules with daily toggles instead of duplicating tasks across days<p>It's on the App Store now after a few months of dogfooding with my family. There's a 1-month free trial, then it's subscription-based. Would love feedback from other parents dealing with similar challenges
Database Linting and Analysis for PostgreSQL
Hacker News (score: 12)[Database] Database Linting and Analysis for PostgreSQL
Show HN: Agentic Design Patterns – Python Edition, from the Codex Codebase
Show HN (score: 5)[Other] Show HN: Agentic Design Patterns – Python Edition, from the Codex Codebase While reading Agentic Design Patterns by Antonio Gulli, I wanted to see how these patterns look in real code. I cloned the OpenAI Codex repo (the open-source AI coding assistant that recently trended on HN) — but it was in Rust.<p>So, I used an Cursor to help me extract and translate 18+ agentic patterns from Codex’s codebase into Python. That small experiment turned into a full open-source guide: GitHub: Codex Agentic Patterns <a href="https://github.com/artvandelay/codex-agentic-patterns" rel="nofollow">https://github.com/artvandelay/codex-agentic-patterns</a><p>Each pattern comes with:<p>A short explanation and code sample<p>A runnable exercise and agent snippet<p>A summary of how Codex used the pattern (e.g., prompt chaining, tool orchestration, reflection loops, sandbox escalation)<p>One full working Python agent that ties it all together<p>If you’ve read the agentic design patterns book or explored Codex, this is a bridge between theory and practice — focused on runnable, open examples instead of abstract diagrams.<p>It’s completely free and open-source. Would love feedback, ideas, or even new patterns from your own agent experiments.
Building a JavaScript Runtime from Scratch using C
Hacker News (score: 39)[Other] Building a JavaScript Runtime from Scratch using C
ThalamusDB: Query text, tables, images, and audio
Hacker News (score: 10)[Database] ThalamusDB: Query text, tables, images, and audio
Show HN: Rethinking audit trails in Django (structured and database-free)
Show HN (score: 5)[Other] Show HN: Rethinking audit trails in Django (structured and database-free) We were using django-easy-audit to track CRUD and API events in Django. The problem: every action meant an extra database write, which slowed transactions, increased infra costs, and left us with audit tables that weren’t great for analytics.<p>I built django-activity-audit (PyPI) to fix this: - Extends Django’s logging system with custom AUDIT and API levels. - Captures CRUD + API request/response events as structured JSON logs. - Vector tails the logs and ships them into ClickHouse. - Grafana makes them queryable and visual.<p>This removes the extra DB writes, gives structured data ready for analysis, and keeps costs down.<p>Curious — how are others handling audit logging in Django (or other frameworks)? Do you log it, write it to a DB, or something else entirely?
Show HN: Arc – high-throughput time-series warehouse with DuckDB analytics
Hacker News (score: 10)[Database] Show HN: Arc – high-throughput time-series warehouse with DuckDB analytics Hi HN, I’m Ignacio, founder at Basekick Labs.<p>Over the past months I’ve been building Arc, a time-series data platform designed to combine very fast ingestion with strong analytical queries.<p>What Arc does? Ingest via a binary MessagePack API (fast path), Compatible with Line Protocol for existing tools (Like InfluxDB, I'm ex Influxer), Store data as Parquet with hourly partitions, Query via DuckDB engine using SQL<p>Why I built it:<p>Many systems force you to trade retention, throughput, or complexity. I wanted something where ingestion performance doesn’t kill your analytics.<p>Performance & benchmarks that I have so far.<p>Write throughput: ~1.88M records/sec (MessagePack, untuned) in my M3 Pro Max (14 cores, 36gb RAM) ClickBench on AWS c6a.4xlarge: 35.18 s cold, ~0.81 s hot (43/43 queries succeeded) In those runs, caching was disabled to match benchmark rules; enabling cache in production gives ~20% faster repeated queries<p>I’ve open-sourced the Arc repo so you can dive into implementation, benchmarks, and code. Would love your thoughts, critiques, and use-case ideas.<p>Thanks!
Show HN: We trained an MoE LLM built for developer tasks
Show HN (score: 5)[Other] Show HN: We trained an MoE LLM built for developer tasks
Show HN: Kalendis – Scheduling API (keep your UI, we handle timezones/DST)
Show HN (score: 5)[API/SDK] Show HN: Kalendis – Scheduling API (keep your UI, we handle timezones/DST) Kalendis is an API-first scheduling backend. You keep your UI; we handle the gnarly parts (recurrence, time zones, DST, conflict-safe bookings).<p>What it does:<p>• MCP tool: generates typed clients and API route handlers (Next.js/Express/Fastify/Nest) so you can scaffold calls straight from your IDE/agent tooling.<p>• Availability engine: recurring rules + one-off exceptions/blackouts, returned in a clean, queryable shape.<p>• Bookings: conflict-safe endpoints for creating/updating/canceling slots.<p>Why we built it:<p>We kept rebuilding the same "hard parts" of scheduling: time zones/DST edge cases, recurring availability, conflict-aware booking, etc. We wanted a boring, reliable backend so we could ship product features without adopting a hosted scheduling UI.<p>How it's helped:<p>We stopped re-implementing DST/recurrence math and shipped booking flows faster.<p>One small team (just 2 developers) built a robust booking platform for their business using Kalendis—they kept full control of their UX without spending lots of cycles on scheduling infrastructure.<p>The MCP generator cut the glue code: drop in a typed client or route, call the API, move on.<p>Some tech details:<p>• REST API with ISO-8601 timestamps and IANA time zones<p>• Recurring availability + one-off exceptions (designed to compose cleanly)<p>• Focused scope: users, availability, exceptions, bookings (not a monolithic suite)<p>MCP integration:<p>MCP server exposes tools like generate-frontend-client, generate-backend-client, generate-api-routes, and list-endpoints.<p>Add to your MCP settings:<p>{ "mcpServers": { "kalendis": { "command": "npx", "args": ["-y", "@kalendis/mcp"] } } }<p>How to try it:<p>Create a free account → get an API key. (<a href="https://kalendis.dev" rel="nofollow">https://kalendis.dev</a>)<p>Hit an endpoint:<p>curl -H "x-api-key: $KALENDIS_API_KEY" \ "https://api.kalendis.dev/v1/availability/getAvailability?userId=<user-id>&start=2025-10-07T00:00:00Z&end=2025-10-14T00:00:00Z&includeExceptions=true"<p>What feedback would be most useful:<p>Gaps in the endpoint surface (what's missing for your use case?).<p>Features that would benefit this service integration in your app.<p>MCP generator output—anything you'd want it to emit differently?<p>Happy to answer questions and post example snippets in the thread. Thanks for taking a look!<p>— Dave (Kalendis)
Launch HN: LlamaFarm (YC W22) – Open-source framework for distributed AI
Hacker News (score: 11)[Other] Launch HN: LlamaFarm (YC W22) – Open-source framework for distributed AI Hi HN! We're Rob, Matt, and Rachel from LlamaFarm (<a href="https://llamafarm.dev">https://llamafarm.dev</a>). We're building an open-source AI framework based on a simple belief: the future isn't one massive model in the cloud—it's specialized models running everywhere, continuously fine-tuned from real usage.<p>The problem: We were building AI tools and kept falling into the same trap. AI demos die before production. We built a bunch of AI demos but they were impossible to get to production. It would work perfectly on our laptop, but when we deployed it, something broke, and RAG would degrade. If we were running our own model, it would quickly become out of date. The proof-of-concept that impressed the team couldn't handle real-world data.<p>Our solution: declarative AI-as-code. One YAML defines models, policies, data, evals, and deploy. Instead of one brittle giant, we orchestrate a Mixture of Experts—many small, specialized models you continuously fine-tune from real usage. With RAG for source-grounded answers, systems get cheaper, faster, and auditable.<p>There’s a short demo here: <a href="https://www.youtube.com/watch?v=W7MHGyN0MdQ" rel="nofollow">https://www.youtube.com/watch?v=W7MHGyN0MdQ</a> and a more in-depth one at <a href="https://www.youtube.com/watch?v=HNnZ4iaOSJ4" rel="nofollow">https://www.youtube.com/watch?v=HNnZ4iaOSJ4</a>.<p>Ultimately, we want to deliver a single, signed bundle—models + retrieval + database + API + tests—that runs anywhere: cloud, edge, or air-gapped. No glue scripts. No surprise egress bills. Your data stays in your runtime.<p>We believe that the AI industry is evolving like computing did. Just as we went from mainframes to distributed systems and monolithic apps to microservices, AI is following the same path: models are getting smaller and better. Mixture of Experts is here to stay. Qwen3 is sick. Llama 3.2 runs on phones. Phi-3 fits on edge devices. Domain models beat GPT-5 on specific tasks.<p>RAG brings specialized data to your model: You don't need a 1T parameter model that "knows everything." You need a smart model that can read <i>your</i> data. Fine-tuning is democratizing: what cost $100k last year now costs $500. Every company will have custom models.<p>Data gravity is real: Your data wants to stay where it is: on-prem, in your AWS account, on employee laptops.<p>Bottom line: LlamaFarm turns AI from experiments into repeatable, secure releases, so teams can ship fast.<p>What we have working today: Full RAG pipeline: 15+ document formats, programmatic extraction (no LLM calls needed), vector-database embedding, universal model layer that runs the same code for 25+ providers, automatic failover, cost-based routing; Truly portable: Identical behavior from laptop → datacenter → cloud; Real deployment: Docker Compose works now with Kubernetes basics and cloud templates on the way.<p>Check out our readme/quickstart for easy install instructions: <a href="https://github.com/llama-farm/llamafarm?tab=readme-ov-file#-quickstart-tldr" rel="nofollow">https://github.com/llama-farm/llamafarm?tab=readme-ov-file#-...</a><p>Or just grab a binary for your platform directly from the latest release: <a href="https://github.com/llama-farm/llamafarm/releases/latest" rel="nofollow">https://github.com/llama-farm/llamafarm/releases/latest</a><p>The vision is to be able to run, update, and continuously fine-tune dozens of models across environments with built-in RAG and evaluations, all wrapped in a self-healing runtime. We have an MVP of that today (with a lot more to do!).<p>We’d love to hear your feedback! Think we’re way off? Spot on? Want us to build something for your specific use case? We’re here for all your comments!
Show HN: FizzBee – Formal Model based autonomous testing
Show HN (score: 5)[Testing] Show HN: FizzBee – Formal Model based autonomous testing GitHub: <a href="https://github.com/fizzbee-io/fizzbee-mbt-examples" rel="nofollow">https://github.com/fizzbee-io/fizzbee-mbt-examples</a> Quick Start: <a href="https://fizzbee.io/testing/tutorials/quick-start/" rel="nofollow">https://fizzbee.io/testing/tutorials/quick-start/</a><p>Most developers agree testing is important. At the same time, most developers don’t enjoy writing tests. With AI generating code faster than ever, testing is becoming even more crucial. But even AI-generated tests need review and maintenance, which makes them another burden.<p>I'm introducing another form of autonomous testing - "model-based testing". Instead of writing test cases, you describe expected behavior in a Python-like specification language.<p>The FizzBee model can be: - Verified exhaustively for design bugs (like formal methods). - Mapped to your actual system, automatically generating the tests.<p>This gives you:<p>- No hand-crafted test cases - Automatic testing of concurrent as well as sequential behavior - No cascading test rewrites when behavior changes - No cluttering the SUT with tracing code<p>With FizzBee, you get both design validation (like in formal methods) and automatic test generation, saving time and effort.<p>Currently, only Go is supported. Java and Rust are next and would love to hear which language you’d want supported next.<p>I’d love your feedback!
Show HN: Mix – Open-source multimodal agents SDK
Show HN (score: 5)[API/SDK] Show HN: Mix – Open-source multimodal agents SDK Why we built it: • Claude Code: great for coding, but no video/audio support, localhost only • OpenAI SDK: single-model, no native multimedia tools • Both: no integrated DevTools for debugging agent reasoning<p>So, we built Mix as an alternative for multimodal applications. • Native video/audio/PDF analysis tools (via Gemini for vision, Claude for reasoning) • Multi-model routing instead of single-provider lock-in • One-command Supabase setup for cloud deployment (vs localhost-only) • HTTP architecture that enables visual DevTools alongside agent workflows • Go backend: 50-80% lower memory footprint than Node.js—efficient for concurrent agent sessions. Python and typescript clients are available<p>Example use cases in the demo video: portfolio analyzer that reads Excel and generates charts, YouTube search agent that finds and edits video clips.<p>GitHub: <a href="https://github.com/recreate-run/mix" rel="nofollow">https://github.com/recreate-run/mix</a> Demo video: <a href="https://youtu.be/IwgKt68wQSc" rel="nofollow">https://youtu.be/IwgKt68wQSc</a><p>Would appreciate feedback, especially from folks building multimodal agents.
Stress test for parallel disk i/o using git and pnpm
Hacker News (score: 47)[Other] Stress test for parallel disk i/o using git and pnpm
Show HN: 1-Bit Pixel Art Font Editor
Show HN (score: 5)[Other] Show HN: 1-Bit Pixel Art Font Editor I like making small fonts for Arduino and ESP32 projects with those little SSD1306 OLED screens. So I made this browser-based tool to help design pixel art fonts for use in those embedded projects.<p>It has a preview area that updates when you save your character. It can support fixed-width and variable-width fonts. And when you're done you can export the font in a few different formats.<p>It comes with a bunch of example fonts built-in. Also it's fully client-side code so you can save the webpage to use it offline.<p>If you want some inspiration for fonts, I made a bunch of really small fonts a few years ago: <a href="https://www.moonbench.xyz/projects/tiny-pixel-art-fonts/" rel="nofollow">https://www.moonbench.xyz/projects/tiny-pixel-art-fonts/</a>
Show HN: NewsGoat – A terminal-based RSS reader written in Go
Show HN (score: 6)[CLI Tool] Show HN: NewsGoat – A terminal-based RSS reader written in Go