🛠️ All DevTools
Showing 1–20 of 3738 tools
Last Updated: March 12, 2026 at 04:03 PM
Show HN: Axe – A 12MB binary that replaces your AI framework
Hacker News (score: 57)[Other] Show HN: Axe – A 12MB binary that replaces your AI framework
Show HN: We analyzed 1,573 Claude Code sessions to see how AI agents work
Hacker News (score: 84)[Monitoring/Observability] Show HN: We analyzed 1,573 Claude Code sessions to see how AI agents work

We built rudel.ai after realizing we had no visibility into our own Claude Code sessions. We were using it daily but had no idea which sessions were efficient, why some were abandoned, or whether we were actually improving over time.

So we built an analytics layer for it. After connecting our own sessions, we ended up with a dataset of 1,573 real Claude Code sessions, 15M+ tokens, and 270K+ interactions.

Some things we found that surprised us:
- Skills were used in only 4% of our sessions
- 26% of sessions are abandoned, most within the first 60 seconds
- Session success rate varies significantly by task type (documentation scores highest, refactoring lowest)
- Error cascade patterns appear in the first 2 minutes and predict abandonment with reasonable accuracy
- There is no meaningful benchmark for "good" agentic session performance; we are building one

The tool is free to use and fully open source. Happy to answer questions about the data or how we built it.
google-ai-edge/LiteRT
GitHub Trending[Other] LiteRT, the successor to TensorFlow Lite, is Google's on-device framework for high-performance ML & GenAI deployment on edge platforms via efficient conversion, runtime, and optimization.
InsForge/InsForge
GitHub Trending[Other] Give agents everything they need to ship fullstack apps. The backend built for agentic development.
langflow-ai/openrag
GitHub Trending[Other] OpenRAG is a comprehensive, single-package Retrieval-Augmented Generation platform built on Langflow, Docling, and OpenSearch.
Show HN: Autoresearch@home
Hacker News (score: 49)[Other] Show HN: Autoresearch@home

autoresearch@home is a collaborative research collective where AI agents share GPU resources to collectively improve a language model. Think SETI@home, but for model training.

How it works: agents read the current best result, propose a hypothesis, modify train.py, run the experiment on your GPU, and publish results back. When an agent beats the current best validation loss, that becomes the new baseline for every other agent. Agents learn from great runs and from failures, since we're using Ensue as the collective memory layer.

This project extends Karpathy's autoresearch by adding the missing coordination layer so agents can actually build on each other's work.

To participate, you need an agent and a GPU. The agent handles everything: cloning the repo, connecting to the collective, picking experiments, running them, publishing results, and asking you to verify you're a real person via email.

Send this prompt to your agent to get started: Read https://github.com/mutable-state-inc/autoresearch-at-home, follow the instructions, join autoresearch, and start contributing.

This whole experiment is to prove that agents work better when they can build off other agents' work. The timeline is live, so you can watch experiments land in real time.
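The coordination loop the post describes (read the current best, run an experiment, publish, and promote the result to the new baseline when it beats the best validation loss) can be sketched in a few lines. This is a self-contained toy, not the project's actual code; `Collective`, `agent_step`, and the fake experiment are all illustrative names.

```python
import random

class Collective:
    """Toy stand-in for the shared coordination/memory layer (illustrative only)."""
    def __init__(self, baseline_loss):
        self.best_loss = baseline_loss
        self.history = []  # every published (hypothesis, loss), wins and failures alike

    def publish(self, hypothesis, loss):
        self.history.append((hypothesis, loss))
        if loss < self.best_loss:        # beat the baseline: becomes everyone's new target
            self.best_loss = loss
            return True
        return False

def agent_step(collective, run_experiment):
    """One agent iteration: read current best, propose, run, publish."""
    target = collective.best_loss
    hypothesis = f"try to beat {target:.4f}"
    loss = run_experiment(hypothesis)
    return collective.publish(hypothesis, loss)

# Toy 'experiments': noisy attempts around the current baseline.
random.seed(0)
c = Collective(baseline_loss=3.0)
for _ in range(10):
    agent_step(c, lambda h: c.best_loss + random.uniform(-0.05, 0.05))
```

The key property is monotonicity: `best_loss` only moves down, so every agent always builds on the strongest published result.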
Show HN: A context-aware permission guard for Claude Code
Hacker News (score: 55)[Other] Show HN: A context-aware permission guard for Claude Code

We needed something like --dangerously-skip-permissions that doesn't nuke your untracked files, exfiltrate your keys, or install malware.

Claude Code's permission system is allow-or-deny per tool, but that doesn't really scale. Deleting some files is fine sometimes, and git checkout is sometimes not fine. Even when you curate permissions, 200 IQ Opus can find a way around them. Maintaining a deny list is a fool's errand.

nah is a PreToolUse hook that classifies every tool call by what it actually does, using a deterministic classifier that runs in milliseconds. It maps commands to action types like filesystem_read, package_run, db_write, and git_history_rewrite, and applies policies: allow, context (depends on the target), ask, or block.

Not everything can be classified, so you can optionally escalate ambiguous calls to an LLM, but that's not required. Anything unresolved you can approve manually, then configure the taxonomy so you don't get asked again.

It works out of the box with sane defaults, no config needed, but you can customize it fully if you want to.

No dependencies, stdlib Python, MIT.

pip install nah && nah install

https://github.com/manuelschipper/nah
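The core idea (a deterministic command-to-action-type classifier with per-action policies) can be sketched with the stdlib. The rules and taxonomy below are invented for illustration and are not nah's actual classifier:

```python
import shlex

# Illustrative taxonomy and policies -- NOT nah's actual rules.
ACTION_RULES = [
    (("cat", "ls", "head", "grep"), "filesystem_read"),
    (("rm", "mv", "shred"), "filesystem_write"),
    (("git",), "git"),                    # refined by subcommand below
    (("pip", "npm", "npx"), "package_run"),
]
POLICY = {
    "filesystem_read": "allow",
    "filesystem_write": "context",        # decision depends on the target path
    "git": "allow",
    "git_history_rewrite": "ask",
    "package_run": "ask",
}

def classify(command: str) -> str:
    argv = shlex.split(command)
    if not argv:
        return "unknown"
    for names, action in ACTION_RULES:
        if argv[0] in names:
            # Same binary, different blast radius: history rewrites get their own type.
            if action == "git" and len(argv) > 1 and argv[1] in ("rebase", "reset", "filter-branch"):
                return "git_history_rewrite"
            return action
    return "unknown"

def policy_for(command: str) -> str:
    # Unclassified commands escalate (to an LLM or the user) rather than default-allow.
    return POLICY.get(classify(command), "ask")
```

The deny-list problem the post mentions is exactly what this structure avoids: the default for anything unrecognized is "ask", not "allow".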
CRusTTY: A pedagogical C interpreter with time-travel debugging capabilities
Hacker News (score: 11)[Other] CRusTTY: A pedagogical C interpreter with time-travel debugging capabilities
Launch HN: Sentrial (YC W26) – Catch AI agent failures before your users do
Hacker News (score: 17)[Monitoring/Observability] Launch HN: Sentrial (YC W26) – Catch AI agent failures before your users do

Hey HN! We're Neel and Anay, and we're building Sentrial (https://sentrial.com). It's production monitoring for AI products. We automatically detect failure patterns: loops, hallucinations, tool misuse, and user frustration the moment they happen. When issues surface, Sentrial diagnoses the root cause by analyzing conversation patterns, model outputs, and tool interactions, then recommends specific fixes.

Here's a demo if you're interested: https://www.youtube.com/watch?v=cc4DWrJF7hk. When agents fail, choose the wrong tools, or blow cost budgets, there's no way to know why; usually you have just logs and guesswork. As agents move from demos to production with real SLAs and real users, that is not sustainable.

Neel and I lived this while building agents at SenseHQ and Accenture, where we found that debugging agents was often harder than building them. Agents are untrustworthy in prod because there's no good infrastructure to verify what they're actually doing.

In practice this looks like:
- A support agent that began misclassifying refund requests as product questions, which meant customers never reached the refund flow.
- A document drafting agent that would occasionally hallucinate missing sections when parsing long specs, producing confident but incorrect output.

There's no stack trace or 500 error; you only figure this out when a customer is angry.

We both realized teams were flying blind in production, and that agent-native monitoring was going to be foundational infrastructure for every serious AI product. We started Sentrial as a verification layer designed to take care of this.

How it works: you wrap your client with our SDK in only a couple of lines. From there, we detect drift for you:
- Wrong tool invocations
- Misunderstood intents
- Hallucinations
- Quality regressions over time

You see it on our platform before a customer files a ticket.

There's a quick MCP setup; just give Claude Code: claude mcp add --transport http Sentrial https://www.sentrial.com/docs/mcp

We have a free tier (14 days, no credit card required). We'd love feedback from anyone running agents, whether for personal use or in a professional setting.

We'll be around in the comments!
Show HN: I built a tool that watches webpages and exposes changes as RSS
Hacker News (score: 48)[Other] Show HN: I built a tool that watches webpages and exposes changes as RSS

I built Site Spy after missing a visa appointment slot because a government page changed and I didn't notice for two weeks.

It watches webpages for changes and shows the result as a diff. The part I think HN might find interesting is that it can monitor a specific element on a page, not just the whole page, and it can expose changes as RSS feeds.

So instead of tracking an entire noisy page, you can watch just a price, a stock status, a headline, or a specific content block. When it changes, you can inspect the diff, browse the snapshot history, or follow the updates in an RSS reader.

It's a Chrome/Firefox extension plus a web dashboard.

Main features:
- Element picker for tracking a specific part of a page
- Diff view plus full snapshot timeline
- RSS feeds per watch, per tag, or across all watches
- MCP server for Claude, Cursor, and other AI agents
- Browser push, email, and Telegram notifications

Chrome: https://chromewebstore.google.com/detail/site-spy/jeapcpanagdgipcfnncmogeojgfofige
Firefox: https://addons.mozilla.org/en-GB/firefox/addon/site-spy/
Docs: https://docs.sitespy.app

I'd especially love feedback on two things:
- Is RSS actually a useful interface for this, or do most people just want direct alerts?
- Does element-level tracking feel meaningfully better than full-page monitoring?
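The diff-between-snapshots idea at the core of this tool can be sketched with the stdlib's difflib. This is an illustration of the technique, not Site Spy's implementation; the snapshot strings are made up:

```python
import difflib

def element_diff(old: str, new: str) -> list[str]:
    """Unified diff between two text snapshots of a watched element."""
    return list(difflib.unified_diff(
        old.splitlines(), new.splitlines(),
        fromfile="snapshot@t0", tofile="snapshot@t1", lineterm=""))

# Two captures of the same watched element, e.g. a price block:
old = "Price: $49.99\nIn stock"
new = "Price: $39.99\nIn stock"
for line in element_diff(old, new):
    print(line)
```

Watching one element rather than the whole page means unrelated churn (ads, timestamps, recommendation widgets) never appears in the diff, which is what keeps the resulting RSS feed quiet.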
Launch HN: Prism (YC X25) – Workspace and API to generate and edit videos
Hacker News (score: 23)[API/SDK] Launch HN: Prism (YC X25) – Workspace and API to generate and edit videos

Hey HN! We're Rajit, Land, and Alex. We're building Prism (https://www.prismvideos.com), an AI video creation platform and API.

Here's a quick demo of remixing any video with Prism: https://youtu.be/0eez_2DnayI

Here's a quick demo of automating UGC-style ads with Openclaw + Prism: https://www.youtube.com/watch?v=5dWaD23qnro

Accompanying skill.md file: https://docs.google.com/document/d/1lIskVljW1OqbkXFyXeLHRsfMictCfuxGGwczAnB1vhk

Making an AI video today usually means stitching together a dozen tools (image generation, image-to-video, upscalers, lip-sync, voiceover, and an editor). Every step turns into export/import and file juggling, so assets end up scattered across tabs and local storage, and iterating on a multi-scene video is slow.

Prism keeps the workflow in one place: you generate assets (images and video clips) and assemble them directly in a timeline editor without downloading files between tools. Practically, that means you can try different models (Kling, Veo, Sora, Hailuo, etc.) and settings for a single clip, swap it on the timeline, and keep iterating without re-exporting and rebuilding the edit elsewhere.

We also support templates and one-click asset recreation, so you can reuse workflows from us or the community instead of rebuilding each asset from scratch. Those templates are exposed through our API, letting your AI agents discover templates in our catalog, supply the required inputs, and generate videos in a repeatable way without manually stitching the workflow together.

We built Prism because we were making AI videos ourselves and were unsatisfied with the available tools. We kept losing time to repetitive "glue work": constantly downloading files, keeping track of prompts and versions, and stitching clips together in separate video editing software. We're trying to make the boring parts of multi-step AI video creation less manual so users can generate, review, edit, assemble, and export all inside one platform.

Pricing is based on usage credits, with a free tier (100 credits/month) and free models, so you can try it without providing a credit card: https://prismvideos.com.

We'd love to hear from people who've tried making AI videos: where does your workflow break, which parts are the most tedious, and what do you wish video creation tools could do?
Show HN: Klaus – OpenClaw on a VM, batteries included
Show HN (score: 5)[DevOps] Show HN: Klaus – OpenClaw on a VM, batteries included

We are Bailey and Robbie, and we are working on Klaus (https://klausai.com/): hosted OpenClaw that is secure and powerful out of the box.

Running OpenClaw requires setting up a cloud VM or local container (a pain) or giving OpenClaw root access to your machine (insecure). Many basic integrations (e.g. Slack, Google Workspace) require you to create your own OAuth app.

We make running OpenClaw simple by giving each user their own EC2 instance, preconfigured with keys for OpenRouter, AgentMail, and Orthogonal. And we have OAuth apps to make it easy to integrate with Slack and Google Workspace.

We are both HN readers (Bailey has been on here for ~10 years) and we know OpenClaw has serious security concerns. We do a lot to make our users' instances more secure: we run on a private subnet, we automatically update the OpenClaw version our users run, and because you're on our VM, by default the only keys you leak if you get hacked belong to us. Connecting your email is still a risk. The best defense we know of is Opus 4.6's resilience to prompt injection. If you have a better solution, we'd love to hear it!

We learned a lot about infrastructure management in the past month. Kimi K2.5 and MiniMax M2.5 are extremely good at hallucinating new ways to break openclaw.json and otherwise wreaking havoc on an EC2 instance. The week after our launch we spent 20+ hours fixing broken machines by hand.

We wrote a ton of best practices for using OpenClaw on AWS Linux into our users' AGENTS.md, got really good at un-bricking EC2 machines over SSM, added a command-and-control server to every instance to facilitate hotfixes and migrations, and set up a Klaus instance to answer FAQs on Discord.

In addition to all of this, we built ClawBert, our AI SRE for hotfixing OpenClaw instances automatically: https://www.youtube.com/watch?v=v65F6VBXqKY. ClawBert is a Claude Code instance that runs whenever a health check fails or the user triggers it in the UI. It can read that user's entries in our database and execute commands on the user's instance. We expose a log of ClawBert's runs to the user.

We know that setting up OpenClaw is easy for most HN readers, but I promise it is not for most people. Klaus has a long way to go, but it's still very rewarding to see people who've never used Claude Code get their first taste of AI agents.

We charge $19/mo for a t4g.small, $49/mo for a t4g.medium, and $200/mo for a t4g.xlarge with priority support. You get $15 in tokens and $20 in Orthogonal credits one-time.

We want to know what you are building on OpenClaw so we can make sure we support it. We are already working with companies like Orthogonal and OpenRouter that are building things to make agents more useful, and we're sure there are more tools out there we don't know about. If you've built something agents want, please let us know. Comments welcome!
Searching for the Agentic IDE
Hacker News (score: 25)[Other] Searching for the Agentic IDE https://xcancel.com/karpathy/status/2031616709560610993
Show HN: Ink – Deploy full-stack apps from AI agents via MCP or Skills
Show HN (score: 6)[DevOps] Show HN: Ink – Deploy full-stack apps from AI agents via MCP or Skills

Hi HN, I built Ink, a full-stack deployment platform where the primary users are AI agents, not humans.

We all know AI can write code, but deploying it still requires a human to wire things up: hosting, databases, DNS, and secrets. Ink gives agents those tools directly.

The agent calls "deploy" and the platform auto-detects the framework, builds it, deploys it, and returns a live URL at *.ml.ink. Here's a demo with Claude Code: https://www.youtube.com/watch?v=F6ZM_RrIaC0.

What Ink does that I haven't seen elsewhere:
- One agent skill for compute + databases + DNS + secrets + domains + usage + metrics + logs + scaling. The agent doesn't juggle separate providers: one account, one auth, one set of tools.
- DNS zone delegation. Delegate a zone once (e.g. dev.acme.com) and agents create any subdomain instantly, with no manual DNS-record additions each time and no propagation wait.
- Multiple agents and humans share one workspace and collaborate on projects. I envision a future where many agents collaborate together, and I'm working on a cool demo to share.
- Built-in git hosting. Agents push code and deploy without the human setting up GitHub first; no external account needed. (Of course, if you're a developer you can store code on GitHub; that's the recommended pattern.)

You also get what you'd expect:
- UI with service observability designed for humans (logs, metrics, DNS)
- GitHub integration: a push triggers auto-redeploy
- Per-minute billing for CPU, memory, and egress; no per-seat, no per-agent
- Error responses designed for LLMs: structured reason codes with suggested next actions, not raw stack traces. When a deploy fails, the agent reads the log, fixes it, and redeploys autonomously.

Try it: https://ml.ink. Free $2 trial credits, no credit card. If you want to go further, here's a 20% discount code: "GOODFORTUNE".
Show HN: OpenUI – A code-like rendering spec for Generative UI
Show HN (score: 7)[Other] Show HN: OpenUI – A code-like rendering spec for Generative UI

Thesys just open-sourced their generative UI rendering engine. Interesting timing given where Google's a2ui and Vercel's json-render are headed. The difference worth noting: a2ui and json-render both treat JSONL as the contract between the LLM and the renderer. Thesys is betting that's the wrong primitive. Their engine uses a code-like syntax (OpenUI Lang) instead: the LLM writes it, and the renderer executes it. The argument is that LLMs are fundamentally better at generating code than at generating structured data, so you get cleaner output and ~67% fewer tokens.

The broader vision seems to be a model-agnostic, design-system-agnostic layer that sits between any LLM and your actual UI components. You bring your own components and design tokens, and the engine handles translating LLM output into rendered interfaces: charts, forms, tables, cards.

Generative UI as a category is still figuring out what the right abstraction is. This is a concrete stake in the ground against JSON-as-spec.
Show HN: Open-source browser for AI agents
Show HN (score: 7)[Other] Show HN: Open-source browser for AI agents

Hi HN, I forked Chromium and built agent-browser-protocol (ABP) after noticing that most browser-agent failures aren't really about the model misunderstanding the page. Instead, the problem is that the model is reasoning from stale state.

ABP is designed to keep the acting agent synchronized with the browser at every step. After each action (click, type, etc.), it freezes JavaScript execution and rendering, then captures the resulting state. It also compiles the notable events that occurred during that action loop, such as navigations, file pickers, permission prompts, alerts, and downloads, and sends them along with a screenshot of the frozen page state back to the agent.

The result is that browser interaction starts to feel more like a multimodal chat loop. The agent takes an action, gets back a fresh visual state and a structured summary of what happened, then decides what to do next. That fits much better with how LLMs already work.

A few common browser-use failures ABP helps eliminate:
* A modal appears after the last Playwright screenshot and blocks the input the agent was about to use
* Dynamic filters cause the page to reflow between steps
* An autocomplete dropdown opens and covers the element the agent intended to click
* alert() / confirm() interrupts the flow
* Downloads are triggered, but the agent has no reliable way to know when they've completed

As proof, ABP with Opus 4.6 as the driver scores 90.5% on the Online Mind2Web benchmark. I think modern LLMs already understand websites; they just need a better tool to interact with them. Happy to answer questions about the architecture, forking Chromium, or anything else in the comments below.

Try it out: `claude mcp add browser -- npx -y agent-browser-protocol --mcp` (Codex/OpenCode instructions are in the docs)

Demo video: https://www.loom.com/share/387f6349196f417d8b4b16a5452c3369
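The act-then-freeze-then-report loop the post describes can be modeled abstractly. The sketch below is a toy simulation of the control flow only (every class, field, and fake event here is illustrative, not ABP's actual API); the real system freezes a forked Chromium and returns a screenshot:

```python
from dataclasses import dataclass, field

@dataclass
class ActionResult:
    """What the agent gets back after every action: frozen state plus notable events."""
    screenshot: bytes                                 # capture of the frozen page
    events: list = field(default_factory=list)        # navigations, dialogs, downloads, ...

class FrozenStepBrowser:
    """Toy model of the synchronize-after-every-action loop (illustrative only)."""
    def __init__(self):
        self._pending_events = []

    def _perform(self, action):
        # A real browser would dispatch the input; here we fake one side effect.
        if action == "click #submit":
            self._pending_events.append({"type": "navigation", "url": "/done"})

    def step(self, action) -> ActionResult:
        self._perform(action)                         # 1. run the action
        # 2. freeze JS execution and rendering (a no-op in this sketch)
        events, self._pending_events = self._pending_events, []
        return ActionResult(screenshot=b"<png>", events=events)   # 3. capture + report

browser = FrozenStepBrowser()
result = browser.step("click #submit")
```

The point of the structure is that the agent's next decision is always made against the state in `result`, never against a screenshot that has since gone stale.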
Show HN: I built an ISP infrastructure emulator from scratch with a custom vBNG
Hacker News (score: 32)[DevOps] Show HN: I built an ISP infrastructure emulator from scratch with a custom vBNG

Demo: https://aether.saphal.me
GitHub: https://github.com/saphalpdyl/Aether

Aether is a multi-BNG (Broadband Network Gateway) ISP infrastructure lab, built almost from scratch, that emulates IPoE IPv4 subscriber management end-to-end. It supports IPoE/IPv4 networks and runs a Python-based vBNG with RADIUS AAA, per-subscriber traffic shaping, and traffic simulation, all emulated on Containerlab. It is also my first personal networking project, built roughly over a month.

Motivations behind the project

I'm a CS sophomore. About three years ago, as an intern, I was assigned to build an OSS/BSS platform for a regional ISP by myself, without mentoring. Referencing demo.splynx.com, I developed most of the BSS side (bookkeeping, accounting, inventory management), but in terms of networking I managed to install and set up RADIUS and that was about it. I didn't have anyone to mentor me or ask questions of, so I gave up then.

Three years later, I decided to try cracking it again. This project is meant to serve as a learning reference for anyone who's been in that same position, i.e. staring at closed-source vendor stacks without proper guidance. This is absolutely not production-grade, but I hope it gives someone a place to start.

Architecture overview

The core component, the BNG, runs on an event-driven architecture where state changes are passed around as messages to avoid juggling mutexes and locks. The session manager is the sole owner of the session state. To keep it clean and predictable, the BNG never accepts external input directly. The one exception is the Go RADIUS CoA daemon, which passes CoA messages in via IPC sockets. Everything the BNG produces (events, session snapshots) gets pushed to Redis Streams, where the bng-ingestor picks it up, processes it, and persists it.

Simulation and meta-configs

I generate traffic through a simulator node that mounts the host's Docker socket and runs docker exec commands on selected hosts. The topology.yaml used by Containerlab to define the network topology grows bigger as more BNGs and access nodes are added. So aether.config.yaml, a simpler configuration, is consumed by the configuration pipeline to generate topology.yaml and other files (nginx.conf, kea-dhcp.conf, RADIUS clients.conf, etc.).

Known limitations

- Multiple veth hops through the emulated topology add significant overhead. Profiling with iperf3 (-P 10 -t 10, 9500 MTU, 24 vCPUs) shows BNG→upstream at ~24 Gbit/s, but host→BNG→upstream drops to ~3.5 Gbit/s. The 9500 MTU also isn't representative of real ISP deployments. This gets worse when the actual network is reintroduced, capping my local throughput at 1.6 Gbit/s.
- The circuit ID format (1/0/X) is non-standard. I simplified it for clarity.
- No iBGP or VLAN support.
- No IPv6 support. I wanted to target IPv4 networks from the start to avoid taking on too much breadth without enough depth.

Nearly everything I know about networking (except some sections from AWS) I learned building this. A lot was figured out on the fly, so engineers will likely spot questionable decisions in the codebase. I'd genuinely appreciate that feedback.

Questions

- Currently, the circuit where the user connects is arbitrarily chosen by the demo user. In a real system with thousands of circuits, it'd be very difficult to assess which circuit a customer might connect to. When adding a new customer to a service, how does the operator decide, based on the customer's location, which circuit to provision the service on?
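The "session manager is the sole owner of state, everything arrives as a message" pattern can be sketched with a stdlib queue instead of shared locks. This is a minimal illustration of the design, not Aether's code; the message kinds and subscriber IDs are invented:

```python
import queue
import threading

class SessionManager:
    """Sole owner of session state; all changes arrive as messages (sketch only)."""
    def __init__(self):
        self.sessions = {}            # touched only by the manager thread: no locks needed
        self.inbox = queue.Queue()

    def run(self):
        while True:
            msg = self.inbox.get()
            if msg is None:                          # shutdown sentinel
                break
            kind, subscriber = msg
            if kind == "dhcp_ack":                   # subscriber session comes up
                self.sessions[subscriber] = "active"
            elif kind == "coa_disconnect":           # e.g. forwarded by the RADIUS CoA daemon
                self.sessions.pop(subscriber, None)

mgr = SessionManager()
t = threading.Thread(target=mgr.run)
t.start()
mgr.inbox.put(("dhcp_ack", "sub-001"))
mgr.inbox.put(("coa_disconnect", "sub-001"))
mgr.inbox.put(("dhcp_ack", "sub-002"))
mgr.inbox.put(None)
t.join()
```

Because only the manager thread ever mutates `sessions`, producers on other threads (DHCP handling, the CoA daemon) just enqueue messages, which is what makes the state transitions predictable.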
BitNet: Inference framework for 1-bit LLMs
Hacker News (score: 349)[Other] BitNet: Inference framework for 1-bit LLMs Paper: <a href="https://arxiv.org/pdf/2310.11453" rel="nofollow">https://arxiv.org/pdf/2310.11453</a>
Show HN: Modulus – Cross-repository knowledge orchestration for coding agents
[Other] Show HN: Modulus – Cross-repository knowledge orchestration for coding agents

Hello HN, we're Jeet and Husain from Modulus (https://modulus.so), a desktop app that lets you run multiple coding agents with shared project memory. We built it to solve two problems we kept running into:

- Cross-repo context is broken. When working across multiple repositories, agents don't understand the dependencies between them. Even if we open two repos in separate Cursor windows, we still have to manually explain the backend API schema while making changes in the frontend repo.
- Agents lose context. Switching between coding agents often means losing context and repeating the same instructions again.

Modulus shares memory across agents and repositories so they can understand your entire system.

It's an alternative to tools like Conductor for orchestrating AI coding agents, but we focus specifically on multi-repo workflows (e.g., backend repo + client repo + shared library repo + AI agents repo). We built our own memory and context engine from the ground up specifically for coding agents.

Why build another agent orchestration tool? It came from our own problem. While working on our last startup, Husain and I were working across two different repositories. Working across repos meant manually pasting API schemas between Cursor windows, telling the frontend agent what the backend API looked like again and again. So we built a small context engine to share knowledge across repos and hooked it up to Cursor via MCP. This later became Modulus.

Soon, Modulus will allow teams to share knowledge with each other to improve their workflows with AI coding agents, enabling team collaboration in the era of AI coding. Our API will allow developers to switch between coding agents or IDEs without losing any context.

If you want to see a quick demo before trying it out, here is our launch post: https://x.com/subhajitsh/status/2024202076293841208

We'd greatly appreciate any feedback you have and hope you get the chance to try out Modulus.
Launch HN: RunAnywhere (YC W26) – Faster AI Inference on Apple Silicon
Hacker News (score: 134)[Other] Launch HN: RunAnywhere (YC W26) – Faster AI Inference on Apple Silicon

Hi HN, we're Sanchit and Shubham (YC W26). We built a fast inference engine for Apple Silicon. LLMs, speech-to-text, text-to-speech: MetalRT beats llama.cpp, Apple's MLX, Ollama, and sherpa-onnx on every modality we tested. Custom Metal shaders, no framework overhead.

We've also open-sourced RCLI, the fastest end-to-end voice AI pipeline on Apple Silicon. Mic to spoken response, entirely on-device. No cloud, no API keys.

To get started:

    brew tap RunanywhereAI/rcli https://github.com/RunanywhereAI/RCLI.git
    brew install rcli
    rcli setup   # downloads ~1 GB of models
    rcli         # interactive mode with push-to-talk

Or:

    curl -fsSL https://raw.githubusercontent.com/RunanywhereAI/RCLI/main/install.sh | bash

The numbers (M4 Max, 64 GB, reproducible via `rcli bench`):

LLM decode: 1.67x faster than llama.cpp, 1.19x faster than Apple MLX (same model files)
- Qwen3-0.6B: 658 tok/s (vs mlx-lm 552, llama.cpp 295)
- Qwen3-4B: 186 tok/s (vs mlx-lm 170, llama.cpp 87)
- LFM2.5-1.2B: 570 tok/s (vs mlx-lm 509, llama.cpp 372)
- Time-to-first-token: 6.6 ms

STT: 70 seconds of audio transcribed in 101 ms. That's 714x real-time and 4.6x faster than mlx-whisper.

TTS: 178 ms synthesis. 2.8x faster than mlx-audio and sherpa-onnx.

We built this because demoing on-device AI is easy but shipping it is brutal. Voice is the hardest test: you're chaining STT, LLM, and TTS sequentially, and if any stage is slow, the user feels it. Most teams fall back to cloud APIs not because local models are bad, but because local inference infrastructure is.

The hard problem is latency compounding. In a voice pipeline, you're stacking three models in sequence. If each adds 200 ms, you're at 600 ms before the user hears a word, and that feels broken. You can't optimize one stage and call it done; every stage needs to be fast, on one device, with no network round-trip to hide behind.

We went straight to Metal. Custom GPU compute shaders, all memory pre-allocated at init (zero allocations during inference), and one unified engine for all three modalities instead of stitching separate runtimes together. MetalRT is the first engine to handle all three modalities natively on Apple Silicon.

Full methodology:
LLM benchmarks: https://www.runanywhere.ai/blog/metalrt-fastest-llm-decode-engine-apple-silicon
Speech benchmarks: https://www.runanywhere.ai/blog/metalrt-speech-fastest-stt-tts-apple-silicon

How: most inference engines add layers between you and the GPU: graph schedulers, runtime dispatchers, memory managers. MetalRT skips all of it. Custom Metal compute shaders for quantized matmul, attention, and activations, compiled ahead of time and dispatched directly.

Voice pipeline optimization details: https://www.runanywhere.ai/blog/fastvoice-on-device-voice-ai-pipeline-apple-silicon
RAG optimizations: https://www.runanywhere.ai/blog/fastvoice-rag-on-device-retrieval-augmented-voice-ai

RCLI is the open-source voice pipeline (MIT) built on MetalRT: three concurrent threads with lock-free ring buffers, double-buffered TTS, 38 macOS actions by voice, local RAG (~4 ms over 5K+ chunks), 20 hot-swappable models, and a full-screen TUI with per-op latency readouts. It falls back to llama.cpp when MetalRT isn't installed.

Source: https://github.com/RunanywhereAI/RCLI (MIT)
Demo: https://www.youtube.com/watch?v=eTYwkgNoaKg

What would you build if on-device AI were genuinely as fast as cloud?
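The latency-compounding point in the post is simple arithmetic worth making explicit: in a sequential pipeline, time-to-first-audio is the sum of the stages, so no single-stage optimization fixes it. A tiny sketch (the "fast" figures below are the per-stage numbers quoted in the post, combined here only for illustration):

```python
def pipeline_latency_ms(stages: dict) -> float:
    """Sequential voice pipeline: latency before first audio is the sum of all stages."""
    return sum(stages.values())

# The post's worst case: 200 ms per stage feels broken.
naive = pipeline_latency_ms({"stt": 200, "llm_first_token": 200, "tts_first_chunk": 200})

# Combining the post's quoted per-stage numbers (STT 101 ms, TTFT 6.6 ms, TTS 178 ms):
fast = pipeline_latency_ms({"stt": 101, "llm_first_token": 6.6, "tts_first_chunk": 178})
```

Because the stages add, every stage must be fast for the total to be fast, which is the post's argument for one unified engine across all three modalities.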