π οΈ All DevTools
Showing 1–20 of 4373 tools
Last Updated
April 28, 2026 at 08:00 AM
Show HN: AgentSwift β Open-source iOS builder agent
Hacker News (score: 15)[Build/Deploy] Show HN: AgentSwift β Open-source iOS builder agent I'm working on a coding agent for building iOS apps. It's built on openspec and xcodebuildmcp. It's free and open source.
Ted Nyman β High Performance Git
Hacker News (score: 36)[Other] Ted Nyman β High Performance Git
L123: A Lotus 1-2-3βstyle terminal spreadsheet with modern Excel compatibility
Hacker News (score: 61)[Other] L123: A Lotus 1-2-3βstyle terminal spreadsheet with modern Excel compatibility
GitHub is having issues now
Hacker News (score: 214)[Other] GitHub is having issues now
Git-based cache saves 50% on token usage
Hacker News (score: 14)[Other] Git-based cache saves 50% on token usage
GitHub Copilot is moving to usage-based billing
Hacker News (score: 370)[Other] GitHub Copilot is moving to usage-based billing
Show HN: I made a website to clean up recipe websites
Show HN (score: 5)[Other] Show HN: I made a website to clean up recipe websites Hi hn! It has always annoyed me that recipe websites are so cluttered and have so much (in my opinion) useless fluff, so I made a little website to solve that problem! One good thing about SEO for once is that Google has pretty much forced the internet to adopt JSON-LD for recipes so many recipes are directly extractable and for others I have several fall back parsers including an indieweb parser :)<p>If you do find a website that is broken feel free to click the flag button or share it as I would love to fix any edge cases
Show HN: Utilyze β an open source GPU monitoring tool more accurate than nvtop
Hacker News (score: 46)[Monitoring/Observability] Show HN: Utilyze β an open source GPU monitoring tool more accurate than nvtop The standard GPU utilization metric reported by nvidia-smi, nvtop, Weights & Biases, Amazon CloudWatch, Google Cloud Monitoring, and Azure Monitor is highly misleading. It reports the fraction of time that any kernel is running on the GPU, which means a GPU can report 100% utilization even if only a small portion of its compute capacity is actually being used. In practice, we've seen workloads with ~1β10% real compute throughput while dashboards show 100%.<p>This becomes a problem when teams rely on that metric for capacity planning or optimization decisions, it can make underutilized systems look saturated.<p>We're releasing an open-source (Apache 2.0) tool, Utilyze, to measure GPU utilization differently. It samples hardware performance counters and reports compute and memory throughput relative to the hardware's theoretical limits. It also estimates an attainable utilization ceiling for a given workload.<p>GitHub link: <a href="https://github.com/systalyze/utilyze" rel="nofollow">https://github.com/systalyze/utilyze</a><p>We'd love to hear your thoughts!
Tendril β a self-extending agent that builds and registers its own tools
Hacker News (score: 27)[Other] Tendril β a self-extending agent that builds and registers its own tools
Show HN: OSS Agent I built topped the TerminalBench on Gemini-3-flash-preview
Hacker News (score: 158)[Other] Show HN: OSS Agent I built topped the TerminalBench on Gemini-3-flash-preview Scored 65.2% vs google's official 47.8%, and the existing top closed source model Junie CLI's 64.3%.<p>Since there are a lot of reports of deliberate cheating on TerminalBench 2.0 lately (<a href="https://debugml.github.io/cheating-agents/" rel="nofollow">https://debugml.github.io/cheating-agents/</a>), I would like to also clarify a few things<p>1. Absolutely no {agents/skills}.md files were inserted at any point. No cheating mechanisms whatsoever<p>2. The cli agent was run in leaderboard compliant way (no modification of resources or timeouts)<p>3. The full terminal bench run was done using the fully open source version of the agent, no difference between what is on github and what was run.<p>I was originally going to wait for it to land on the leaderboard, but it has been 8 days and the maintainers do not respond unfortunately (there is a large backlog of the pull requests on their HF) so I decided to post anyways.<p>HF PR: <a href="https://huggingface.co/datasets/harborframework/terminal-bench-2-leaderboard/discussions/145" rel="nofollow">https://huggingface.co/datasets/harborframework/terminal-ben...</a><p>It is astounding how much the harness matters, based on this and other experiments I have done.
Show HN: A terminal spreadsheet editor with Vim keybindings
Show HN (score: 5)[Other] Show HN: A terminal spreadsheet editor with Vim keybindings While speccing out this spreadsheet tool, I realized that I never had to think about the keybindings. It all just came naturally from Vim. Normal/insert/visual modes, hjkl navigation, dd/yy/p, :w, :q. The usual muscle memory works.<p>It supports CSV/TSV import and export, and a native .cell format that preserves formulas. The formula engine handles SUM, AVERAGE, COUNT, MIN, MAX, and IF with range references.<p>The codebase is a Cargo workspace: a pure cell-sheet-core library (no TUI dependency) and a cell-sheet-tui crate on top of ratatui. Early days, but it's usable.<p>To try it out: cargo install cell-sheet-tui<p>Feedback of any kind is greatly appreciated!
[Other] Show HN: I built a reference site for the recurring hard problems in software Hi HN, I've been working on this for a while and it was hard to decide when to stop, either on the way information is presented or when to stop with adding entries. It's not meant as a blog, but rather as a reference that keeps growing.<p>Link: <a href="https://thehardparts.dev" rel="nofollow">https://thehardparts.dev</a><p>Currently I've created 4 main section:<p>- Failure Modes: ways project go wrong<p>- Red Flags: early signals that are worth taking seriously<p>- Tech Decisions: common and not so common trade-offs for hard choices<p>- Playbooks - guided approach for situations that repeat<p>I've also focused on creating links between them to show how connected many things are: a red flag usually precedes a failure mode, which might connect to a forced decision, etc.<p>Some entry points to give you an idea:<p>- The Invisible Deadline: a date that exists socially but not explicitly enough to manage honestly<p>- Eveyone Asks The Same Person: when one person becomes the default source of truth<p>- Build a Practical Rollback Strategy: how to build a reliable rollback strategy<p>It has 151 entries across the 4 sections.<p>Curious what you think about the content, format, grouping.
Show HN: Building a SQL analyst agent from scratch
Show HN (score: 5)[Other] Show HN: Building a SQL analyst agent from scratch I built a SQL analyst agent based on a simple idea: generating SQL is not the same as doing analysis.<p>Most text-to-SQL tools stop at producing a query. But real analysis is iterative. You explore the schema, run queries, adjust, and refine.<p>This project tries to model that loop instead of treating a query as the final output.<p>I wrote about the approach, challenges, and tradeoffs here: <a href="https://raminmousavi.dev/blog/building-a-sql-analyst-agent" rel="nofollow">https://raminmousavi.dev/blog/building-a-sql-analyst-agent</a><p>Github repo: <a href="https://github.com/raminious/sql-analyst-ai-agent" rel="nofollow">https://github.com/raminious/sql-analyst-ai-agent</a>
The cost math behind routing Claude Code through Ollama (~90% cut)
Hacker News (score: 13)[Other] The cost math behind routing Claude Code through Ollama (~90% cut)
EvanFlow β A TDD driven feedback loop for Claude Code
Hacker News (score: 20)[Testing] EvanFlow β A TDD driven feedback loop for Claude Code
Show HN: The Unix Magic poster, annotated (updated)
Show HN (score: 49)[Other] Show HN: The Unix Magic poster, annotated (updated) This is a site that maps the references on Gary Overacre's 1980s UNIX Magic poster to short write-ups with sources. I posted an earlier version about a year ago [1]. Since then I rewrote some of the annotations, added deep-linking to individual markers and a frame/sidebar view, gave the site a terminal-style redesign, and fixed historical inaccuracies (daemon etymology, nroff origin, B language vs. Multics, etc.).<p>Contributions and comments welcome; each marker is a GitHub issue.<p>site: <a href="https://unixmagic.net" rel="nofollow">https://unixmagic.net</a><p>[1] <a href="https://news.ycombinator.com/item?id=43019136">https://news.ycombinator.com/item?id=43019136</a>
Show HN: Matrirc β run irssi in 2026, talk to people on Matrix
Show HN (score: 5)[Other] Show HN: Matrirc β run irssi in 2026, talk to people on Matrix This solves no real problem β Element works, there's already a Matrix-to-IRC bridge running on half the FOSS networks, and probably nobody under 30 has opened irssi voluntarily this decade.<p>I wrote it anyway because I miss Esc 4 and clunky window-split commands.<p>Matrirc is a local IRC server that speaks Matrix on the back. Point irssi at localhost:6667, log in with Matrix creds, rooms show up as channels.<p>brew tap pawelb0/tap brew install matrirc
Show HN: AI memory with biological decay (52% recall)
Hacker News (score: 45)[Other] Show HN: AI memory with biological decay (52% recall) Most RAG setups fail because they treat memory like a static filing cabinet. When every transient bug fix or abandoned rule is stored forever, the context window eventually chokes on noise, spiking token costs and degrading the agent's reasoning.<p>This implementation experiments with a biological approach by using the Ebbinghaus forgetting curve to manage context as a living substrate. Memories are assigned a "strength" score where each recall reinforces the data and flattens its decay curve (spaced repetition), while unused data eventually hits a threshold and is pruned.<p>To solve the "logical neighbor" problem where semantic search misses relevant but non-similar nodes, a graph layer is layered over the vector store. Benchmarked against the LoCoMo dataset, this reached 52% Recall@5, nearly double the accuracy of stateless vector stores, while cutting token waste by roughly 84%.<p>Built as a local first MCP server using DuckDB, the hypothesis is that for agents handling long-running projects, "what to forget" is just as critical as "what to remember." I'd be interested to hear if others are exploring non-linear decay or similar biological constraints for context management.<p>GitHub: <a href="https://github.com/sachitrafa/cognitive-ai-memory" rel="nofollow">https://github.com/sachitrafa/cognitive-ai-memory</a>
gastownhall/beads
GitHub Trending[Other] Beads - A memory upgrade for your coding agent