Show HN: Ocrisp, One-Click RAG Implementation, Simple and Portable

Show HN (score: 7)
Found: October 01, 2025
ID: 1659

Description

Other
Show HN: Ocrisp, One-Click RAG Implementation, Simple and Portable

More from Show

Show HN: Timeplus Proton 3.0 – First vectorized streaming SQL engine

Show HN: Timeplus Proton 3.0 – First vectorized streaming SQL engine Two years after open sourcing Proton, we&#x27;re releasing v3.0 which brings enterprise-grade streaming capabilities to the open source version: full connectivity, processing, and routing in a single binary with zero dependencies.<p>Key features:<p>First vectorized streaming SQL engine in modern C++ with JIT compilation<p>High-throughput, low-latency, high-cardinality processing End-to-end streaming: ETL, joins, aggregation, alerts, and tasks<p>Native connectors: Kafka, Redpanda, Pulsar, ClickHouse, Splunk, Elastic, MongoDB, S3, Iceberg<p>Native Python UDF&#x2F;UDAF support to support your AI&#x2F;ML work loads<p>The same performance we&#x27;ve proven in large enterprise deployments is now available in the community edition.<p>Would love feedback from anyone working with streaming data or looking for Flink&#x2F;ksqlDB alternatives.

Show HN: Clink – Bring your own CLI Agents, Ship instantly

Show HN: Clink – Bring your own CLI Agents, Ship instantly Clink lets you use the coding agents you already pay for (Claude Code, Codex CLI, Gemini CLI, Z.ai GLM) to build → live-preview → ship apps in an isolated container.<p>No token purchases, no extra cost for coding. Just link your existing Claude&#x2F;OpenAI&#x2F;Gemini account and start building and deploying instantly.<p>Why we built this:<p>Claude Code is our go-to for coding, but it lacked preview + deploy capabilities. We didn&#x27;t want to pay Lovable again just for that.<p>Different agents excel at different tasks - Claude Code for versatility, Codex for complex work, GLM for speed. We needed one platform to leverage them all.<p>CLI agents offer more freedom than traditional web builders. We wanted to unlock their full potential with proper dev tooling.<p>What it does:<p>• Prompt → Build → Live → Deploy - The fastest path from idea to live website. Deploy for free.<p>• BYO Subscription - Use your existing plans efficiently (Claude Code $20 = 10x Lovable $25 usage, GLM $3 = 3x Claude Code $20)<p>DEV Mode (Beta):<p>• Multi-stack support - Build with Node, Python, Go, Rust and deploy containers to public URLs instantly<p>• Repo imports - Upgrade and deploy your existing projects across any stack<p>Links:<p>• Clink: <a href="https:&#x2F;&#x2F;clink.new" rel="nofollow">https:&#x2F;&#x2F;clink.new</a><p>• OSS origin (Claudable, ~2.8k): <a href="https:&#x2F;&#x2F;github.com&#x2F;opactorai&#x2F;Claudable" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;opactorai&#x2F;Claudable</a><p>We&#x27;d love any feedback, bug reports, or stack requests - we iterate fast and read every comment.

Show HN: We packaged an MCP server inside Chromium

Show HN: We packaged an MCP server inside Chromium Hey HN, we just shipped a browser with an inbuilt MCP server!<p>We&#x27;re a YC startup (S24) building BrowserOS — an open‑source Chromium fork. We&#x27;re a privacy‑first alternative to the new wave of AI browsers like Dia, Perplexity Comet. Since launching ~3 months ago, the #1 request has been to expose our browser as an MCP server.<p>-- Google beat us to launch with chrome-devtools-mcp (solid product btw), which lets you build&#x2F;debug web apps by connecting Chrome to coding assistants. But we wanted to take this a step further: we packaged the MCP server directly into our browser binary. That gives three advantages:<p>1. MCP server setup is super simple — no npx install, no starting Chrome with CDP flags, you just download the BrowserOS binary.<p>2. with our browser&#x27;s inbuilt MCP server, AI agents can interact using your logged‑in sessions (unlike chrome-devtools-mcp which starts a fresh headless instance each time)<p>3. our MCP server also exposes new APIs from Chromium&#x27;s C++ core to click, type, and draw bounding boxes on a webpage. Our APIs are also not CDP-based (Chrome Debug Protocol) and have robust anti-bot detection.<p>-- Few example use cases for BrowserOS-mcp are:<p>a) *Frontend development with Claude Code*: instead of screenshot‑pasting, claude-code gets WYSIWYG access. It can write code, take a screenshot, check console logs, and fix issues in one agentic sweep. Since it has your sessions, it can do QA stuff like &quot;test the auth flow with my Google Sign‑In.&quot; Here&#x27;s a video of claude-code using browserOS to improve the css styling with back-and-forth checking: <a href="https:&#x2F;&#x2F;youtu.be&#x2F;vcSxzIIkg_0" rel="nofollow">https:&#x2F;&#x2F;youtu.be&#x2F;vcSxzIIkg_0</a><p>b) *Use as an agentic browser:* You can install BrowserOS-mcp in claude-code or Claude Desktop and do things like form-filling, extraction, multi-step agentic tasks, etc. It honestly works better than Perplexity Comet! Here&#x27;s a video of claude-code opening top 5 hacker news posts and summarizing: <a href="https:&#x2F;&#x2F;youtu.be&#x2F;rPFx_Btajj0" rel="nofollow">https:&#x2F;&#x2F;youtu.be&#x2F;rPFx_Btajj0</a><p>-- *How we packaged MCP server inside Chromium binary*: We package the server as a Bun binary and expose MCP tools over HTTP instead of stdio (to support multiple sessions). And we have a BrowserOS controller installed as an extension at the application layer which the MCP server connects to over WebSocket to control the browser. Here&#x27;s a rough architecture diagram: <a href="https:&#x2F;&#x2F;dub.sh&#x2F;browseros-mcp-diag" rel="nofollow">https:&#x2F;&#x2F;dub.sh&#x2F;browseros-mcp-diag</a><p>-- *How to install and use it:* We put together a short guide here: <a href="https:&#x2F;&#x2F;git.new&#x2F;browseros-mcp" rel="nofollow">https:&#x2F;&#x2F;git.new&#x2F;browseros-mcp</a><p>Our vision is to reimagine the browser as an operating system for AI agents, and packaging an MCP server directly into it is a big unlock for that!<p>I&#x27;ll be hanging around all day, would love to get your feedback and answer any questions!

Show HN: Code First CDC from Postgres to ClickHouse with MooseStack

Show HN: Code First CDC from Postgres to ClickHouse with MooseStack

Show HN: Local Browser AI

Show HN: Local Browser AI Using the new Prompt API to build a free open source privacy-first chat extension that runs on Chrome or Edge on Linux, Mac, Windows and ChromeOS

Show HN: Tangled – Git collaboration built on AT Protocol

Show HN: Tangled – Git collaboration built on AT Protocol

Show HN: Enhance – A Terminal UI for GitHub Actions

Show HN: Enhance – A Terminal UI for GitHub Actions I&#x27;m very excited to share what I&#x27;ve been working on lately!<p>Introducing ENHANCE, a terminal UI for GitHub Actions that lets you easily see and interact with your PRs checks.<p>It&#x27;s available under a sponsorware model. Get more info on the site:<p>-&gt; <a href="https:&#x2F;&#x2F;gh-dash.dev&#x2F;enhance" rel="nofollow">https:&#x2F;&#x2F;gh-dash.dev&#x2F;enhance</a><p>This is an attempt to make my OSS development something sustainable. Happy to hear feedback about the model as well as the tool! Cheers!

Show HN: Simple WhatsApp API (Open Source)

Show HN: Simple WhatsApp API (Open Source) Had a Lot of issues with existing APIs especially with Attachment support. Hence built own Whatsapp API.<p>Currently only Send Messages

Show HN: Resrap – A Parser but in Reverse

Show HN: Resrap – A Parser but in Reverse I built Resrap, a Go package that takes a grammar in ABNF format and generates infinitely long sequences of syntactically correct code...either completely randomly or with seeds for a deterministic generation.<p>ABNF is a modified version of EBNF(<a href="https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;Extended_Backus%E2%80%93Naur_form" rel="nofollow">https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;Extended_Backus%E2%80%93Naur_f...</a>) I made for this project, basically means you can specify when generating code 20% of lines will be if{} blocks and 50% will be while{} blocks which allows for more natural code generation, plus support for infinite generation of code.<p>It’s very fast...it generated ~40 million tokens of C syntax in about 26 seconds on my laptop and supports multithreading which actually saw boosts in performance since its very easy to parallelize.<p>I originally made this for a typing-test project (I didn’t want to store code snippets manually), but it turned out to be useful in other contexts too, like: - Stress-testing parsers and linters - Creating non-copyrighted “lorem ipsum” code for tech demos - Generating those endless “hacker” code scenes you see in movies<p>Curious what other cool things people might do with it!<p>Github: <a href="https:&#x2F;&#x2F;github.com&#x2F;osdc&#x2F;Resrap" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;osdc&#x2F;Resrap</a> Website: <a href="https:&#x2F;&#x2F;resrap.osdc.dev&#x2F;" rel="nofollow">https:&#x2F;&#x2F;resrap.osdc.dev&#x2F;</a>

Show HN: Free developer-first OneNote alternative

Show HN: Free developer-first OneNote alternative Hey Everyone, been working on a note-taking app called janta (Just Another Note Taking App) the past few months.<p>You can try it out at app.janta.dev (you will be redirected to app.janta.dev&#x2F;canvas&#x2F;temporary, which is the locally-stored whiteboard you can access)<p>I felt limited with OneNote, Excalidraw, and other infinite-canvas style apps, so I built an alternative. You have access to code-editors, Desmos graphs, and rich text editors (SlateJS). This is because the canvas is designed in a way that allows web components to exist on the same layer as pen-strokes, so you can annotate code, circle points-of-inflection, and programmatically generate graphs using matplotlib.pyplot!<p>This is a beta release, and feedback would be awesome!

Show HN: One API for all your SMTPs

Show HN: One API for all your SMTPs Hi HN,<p>I host my own mail infrastructure and don&#x27;t rely on third-party tools to send my mails. But in a microservice architecture, managing all the SMTP configs quickly becomes tiresome. After a while I&#x27;d always have my credentials and settings scattered in application-properties, bash scripts and environment files.<p>So I&#x27;ve build Brieferl, an app where you can add your SMTP servers and send emails through a single API with a simple JSON payload. You also get logs of when&#x2F;where emails were sent and HTML previews of messages.<p>I am interested in what you guys think about this. You can create a free account with just your email, add a SMTP server + API key and start sending. (There is no upsell, or paid plan yet)<p>This is super early, it works and I made it actually just for myself but a friend told me he‘d also love to use this, so I thought why not ask other developers what they think.<p>Is this something you’d use? Are there some features that would it more valuable for you? Or is this something only I’ve felt?

Show HN: Verdent – AI coding agent that plans, tests, and ships

Show HN: Verdent – AI coding agent that plans, tests, and ships Hi HN!<p>We just launched Verdent, an AI coding agent that actually plans before it codes, tests its own work, and delivers production-ready results without you having to clean up after it.<p>tl;dr Check out the website [1] and read the story [2]<p>- The problem: Current AI tools dump code and leave you to debug. You ask for &quot;build a login endpoint&quot; and get a passwordless JWT mess that doesn&#x27;t fit your project. We&#x27;ve talked to hundreds of engineers who are frustrated with having to micromanage AI output.<p>- What we built: Verdent works more like a senior engineer. It asks clarifying questions first, breaks down complex tasks, writes the code, then runs its own tests and fixes bugs until everything works. We call it the Plan → Code → Verify loop.<p>Two ways to use it: VS Code extension for staying close to the code, or Verdent Deck desktop app for handling bigger multi-component tasks in parallel.<p>- Background: I was Head of Algorithms at TikTok and spent 9 years as Chief Technical Architect at Baidu before this. Building those massive recommendation systems taught us that you need hundreds of specialized models working together, not just wrapping an LLM. We applied the same system engineering approach to AI coding.<p>- Try it: Available now, paid plans start at $19&#x2F;month. We&#x27;re a small team so no free tier, but we&#x27;d rather you experience what autonomous coding can actually do than get frustrated with a limited version.<p>- The bigger picture: We see AI coding as just the start of a larger shift where AI becomes the orchestrator across the entire software lifecycle, not just a typing assistant.<p>What do you think?<p>What tasks would you actually trust an AI agent to handle end-to-end?<p>[1] <a href="https:&#x2F;&#x2F;www.verdent.ai&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.verdent.ai&#x2F;</a> [2] <a href="https:&#x2F;&#x2F;thenewstack.io&#x2F;tiktoks-ex-algorithm-chief-launches-verdent-ai-coding-tool&#x2F;" rel="nofollow">https:&#x2F;&#x2F;thenewstack.io&#x2F;tiktoks-ex-algorithm-chief-launches-v...</a>

Show HN: We built our own technology radar

Show HN: We built our own technology radar A technology radar helps you monitor and assess emerging trends systematically. Depending on the context a “trend” can be technical, business-related or social.<p>While such as a technology radar is traditionally reserved for big companies, we’ve found it’s just as valuable for us as a small startup, especially since AI is evolving rapidly (AI is one of our main technologies). The radar helps us track fringe-innovation (like the latest agentic framework form Github) and assess tech beyond the hype.<p>We built it with our platform (Researchly), but in the post I have also included an N8N workflow and data schema for a do-it-yourself version.<p>The front-end was inspired by: <a href="https:&#x2F;&#x2F;github.com&#x2F;zalando&#x2F;tech-radar" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;zalando&#x2F;tech-radar</a>

Show HN: VectorLiteDB – a vector DB for local dev, like SQLite but for vectors

Show HN: VectorLiteDB – a vector DB for local dev, like SQLite but for vectors I built [VectorLiteDB (<a href="https:&#x2F;&#x2F;github.com&#x2F;vectorlitedb&#x2F;vectorlitedb" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;vectorlitedb&#x2F;vectorlitedb</a>)<p>— a simple, embedded vector database that stores everything in a single file, just like SQLite.<p>The problem:<p>If you’re a developer building AI apps, you usually have two choices for vector search<p>- Set up a server (e.g. Chroma, Weaviate) - Use a cloud service (e.g. Pinecone)<p>That works for production, but it’s overkill when you just want to:<p>- Quickly prototype with embeddings - Run offline without cloud dependencies - Keep your data portable in a single file<p>The inspiration was *SQLite* during development — simple, local, and reliable.<p>The solution:<p>So I built VectorLiteDB<p>- Single-file, embedded, no server - Stores vectors + metadata, persists to disk - Supports cosine &#x2F; L2 &#x2F; dot similarity - Works offline, ~100ms for 10K vectors - Perfect for local RAG, prototyping or personal AI memory<p>Feedback on both the tool and the approach would be really helpful.<p>- Is this something that would be useful - Use cases you’d try this for

Show HN: Datadef.io – Canvas for data lineage and metadata management

Show HN: Datadef.io – Canvas for data lineage and metadata management Hi HN, I’ve been working on <a href="https:&#x2F;&#x2F;datadef.io" rel="nofollow">https:&#x2F;&#x2F;datadef.io</a>, a tool to help data team (engineer, architect, project manager) make sense of their data universe.<p>The problem:<p>- Data models (dbt, SQL, warehouses) often grow into a tangled mess of tables, joins, and undocumented assumptions. - Lineage is either scattered across tools or missing entirely. - Documentation is usually an afterthought (and gets outdated fast).<p>Datadef.io aims to fix that by providing:<p>- Interactive canvas to map tables, relationships, and indicators. - Automatic lineage visualization to trace dependencies. - Metadata management: define table&#x2F;column-level details, ownership, and KPIs. - AI-generated documentation that stays in sync with your models. - Export&#x2F;share features so asset managers, analysts, and other teams don’t get lost in spreadsheets or PDFs.<p>It’s still early, and I’d love feedback from the HN community. In particular:<p>What’s missing for you in lineage&#x2F;metadata&#x2F;documentation tools?<p>How would you want to integrate a tool like this into your workflow (dbt, Databricks, Power BI, etc.)?<p>I’d really appreciate your thoughts, feature requests, and criticism.<p>Thanks!

Show HN: Fast Tor Onion Service vanity address generator

Show HN: Fast Tor Onion Service vanity address generator Hello,<p>I&#x27;ve built the tool to generate vanity Tor Onion Service addresses:<p><pre><code> $ onion-vanity-address allium Found allium... in 12s after 558986486 attempts (48529996 attempts&#x2F;s) --- hostname: alliumdye3it7ko4cuftoni4rlrupuobvio24ypz55qpzjzpvuetzhyd.onion hs_ed25519_public_key: PT0gZWQyNTUxOXYxLXB1YmxpYzogdHlwZTAgPT0AAAAC1ooweCbRP6ncFQs3NRyK40fRwaodrmH572D8py+tCQ== hs_ed25519_secret_key: PT0gZWQyNTUxOXYxLXNlY3JldDogdHlwZTAgPT0AAAAQEW4Rhot7oroPaETlAEG3GPAntvJ1agF2c7A2AXmBW3WqAH0oUZ1hySvvZl3hc9dSAIc49h1UuCPZacOWp4vQ </code></pre> The tool checks ~45&#x27;000&#x27;000 keys per second on a laptop which is ~2x faster than widely-used mkp224o <a href="https:&#x2F;&#x2F;github.com&#x2F;cathugger&#x2F;mkp224o" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;cathugger&#x2F;mkp224o</a><p>I&#x27;ve explained key performance difference here <a href="https:&#x2F;&#x2F;github.com&#x2F;AlexanderYastrebov&#x2F;onion-vanity-address?tab=readme-ov-file#the-fastest-search-algorithm" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;AlexanderYastrebov&#x2F;onion-vanity-address?t...</a><p>Would love your feedback, thanks!

Show HN: Shellcast.tv – Stream your vibe coding

Show HN: Shellcast.tv – Stream your vibe coding

Show HN: rm-safely – A shell alias that moves files to trash instead of deleting

Show HN: rm-safely – A shell alias that moves files to trash instead of deleting I made rm-safely, a simple shell wrapper that moves files to trash instead of permanently deleting them. It prevents accidental deletions from autocomplete mishaps or hasty rm -rf commands.<p>Should work as a drop-in replacement for rm but safer.<p>Would appreciate any feedback!

Show HN: We built an open-source alternative to expensive pair programming apps

Show HN: We built an open-source alternative to expensive pair programming apps My friend and I grew frustrated with the high cost of existing pair programming tools, and of course of grainy screens when we used Huddle or similar tools.<p>We believe core developer collaboration shouldn&#x27;t be locked behind an expensive subscription.<p>So for the past year we spent our nights and weekend building Hopp, an open-source alternative.<p>We would love your feedback and we are here to answer any and all questions.

Show HN: AI Agent in Jupyter – Runcell

Show HN: AI Agent in Jupyter – Runcell I build runcell, an AI Agent in Jupyter Lab. It can understand context (data, charts, code, etc) in your jupyterlab and write code for you.<p>Runcell has built-in tools that can edit or execute cells, read&#x2F;write files, search web, etc.<p>Comparing with AI IDE like cursor, runcell focus on building context for code agent in jupyter environment, which means the agent can understand different types of information in jupyter notebook, access kernel state, edit&#x2F;execute specific cells instead of handling jupyter as static ipynb file.<p>Comparing with jupyter ai, runcell is more like an agent instead of a chatbot. It have access to lots of tools to work and take actions by its own.<p>You can use runcell with simple &quot;pip install runcell&quot; to start.<p>Any comments and suggestions are welcome.

Show HN: I built AI that turns 4 hours of financial analysis into 30 seconds

Show HN: I built AI that turns 4 hours of financial analysis into 30 seconds I built Duebase AI to solve a problem I kept running into in fintech - analyzing UK company financial health takes forever. The process usually goes: download PDFs from Companies House → manually extract data to spreadsheets → calculate ratios → interpret trends. Takes 3-4 hours per company and requires serious financial expertise. The technical challenge: Companies House filings are messy. Inconsistent formats, complex accounting structures, missing data, and you need to understand UK accounting standards to make sense of it all. My approach:<p>Parse 15M+ UK company records from Companies House API Built ML models to extract and normalize financial data from varied filing formats Created scoring algorithms that weight liquidity, profitability, leverage, and growth trends Generate 1-5 health scores with explanations in plain English<p>What it does:<p>Instant financial analysis of any UK company (30 seconds vs 4 hours) Real-time monitoring with alerts for new filings&#x2F;director changes Risk detection that catches declining trends early No financial background needed to understand results<p>The hardest part was handling the data inconsistencies - UK companies file in different formats, use various accounting frameworks, and often have incomplete information. Had to build a lot of data cleaning and normalization logic. Currently focused on the UK market since I know the regulatory landscape well, but the approach could work for other countries with similar public filing systems. Link: <a href="https:&#x2F;&#x2F;duebase.com" rel="nofollow">https:&#x2F;&#x2F;duebase.com</a>

Show HN: Mixing Deterministic Codegen with LLM Codegen for Client SDKs

Show HN: Mixing Deterministic Codegen with LLM Codegen for Client SDKs Hi HN, I’m Patrick. Elias, Kevin, and I are building Sideko (<a href="https:&#x2F;&#x2F;sideko.dev" rel="nofollow">https:&#x2F;&#x2F;sideko.dev</a>), a new type of code generator for building and maintaining API client SDKs from OpenAPI specs.<p>Our approach differs significantly from traditional SDK generators in that we use structured pattern matching queries to create and update the code. Other SDK generators use templates, which overwrite custom changes and produce code that looks machine generated.<p>We’ve mixed in LLM codegen by creating this workflow: Run deterministic codegen to establish the SDK structure. Let LLMs enhance specific components where adaptability adds value and include agent rules files that enforce consistency and correctness with type checking and integration tests against mock servers. The system will retain the LLM edits, while the rest of the SDK is automatically maintained by the deterministic generator (keeping it in sync with the API). LLMs can edit most the files (see python rules and typescript rules).<p>You can try it out from your terminal: Install: npm install -g @sideko&#x2F;cli Login: sideko login Initialize: sideko sdk init Prompt: “Add a new function that…”<p>Check out the repo for more details: <a href="https:&#x2F;&#x2F;github.com&#x2F;Sideko-Inc&#x2F;sideko" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;Sideko-Inc&#x2F;sideko</a> We’d love to hear your thoughts!

Show HN: Okapi – a metrics engine based on open data formats

Show HN: Okapi – a metrics engine based on open data formats Hi All I wanted to share an early preview of Okapi an in-memory metrics engine that also integrates with existing datalakes. Modern software systems produce a mammoth amount of telemetry. While we can discuss whether or not this is necessary, we can all agree that it happens.<p>Most metrics engines today use proprietary formats to store data and don’t use disaggregated storage and compute. Okapi changes that by leveraging open data formats and integrating with existing data lakes. This makes it possible to use standard OLAP tools like Snowflake, Databricks, DuckDB or even Jupyter &#x2F; Polars to run analysis workflows (such as anomaly detection) while avoiding vendor lock-in in two ways - you can bring your own workflows and have a swappable compute engine. Disaggregation also reduces Ops burden of maintaining your own storage and the compute engine can be scaled up and down on demand.<p>Not all data can reside in a data-lake&#x2F;object store though - this doesn’t work for recent data. To ease realtime queries Okapi first writes all metrics data to an in memory store and reads on recent data are served from this store. Metrics are rolled up as they arrive which helps ease memory pressure. Metrics are held in-memory for a configurable retention period after which it gets shipped out to object storage&#x2F;datalake (currently only Parquet export is supported). This allows fast reads on recent data while offloading query-processing for older data. On benchmarks queries on in-memory data finish in under a millisecond while having write throughput of ~280k samples per second. On a real deployment, there’d be network delays so YMMV.<p>Okapi it is still early — feedback, critiques, and contributions welcome. Cheers !

Show HN: Code-snippets for developing eBPF Programs

Show HN: Code-snippets for developing eBPF Programs When developing eBPF-programs, we need to figure correct program-section SEC() and program-context.<p>Similary while creating eBPF-maps, we need to add certain fields such as; map-type, key&#x2F;values, map_options etc..<p>If you’re like me, you probably end up digging through documentation or browsing open-source projects just to piece this together every time.<p>So, I created a vscode-extension to help with these repetitive tasks.<p>Try it out and do share your feedback. I hope you like it.<p>Thanks !

Show HN: I integrated Ollama into Excel to run local LLMs

Show HN: I integrated Ollama into Excel to run local LLMs I built an Excel add-in that connects to Ollama, so you can run local LLMs like Llama3 directly inside Excel. I call it XLlama.<p>You can use it like a regular formula: =XLlamaPrompt(&quot;Is Excel a database&quot;) or run it on an entire range.<p>No API calls. No cloud. No subscriptions. Everything runs locally.<p>It’s useful for quick tasks like extracting names, emails, or phone numbers from text, or for doing light data analysis without leaving Excel.<p>Would love feedback, especially from people who use Excel daily.

Show HN: Embeddable -build interactive experiences you can drop into any website

Show HN: Embeddable -build interactive experiences you can drop into any website Hi HN, I’m a co‑founder of Embeddable AI.<p>After struggling to add interactive AI experiences to Wix, Shopify, Webflow, WordPress sites, I built this tool to let marketers build chatbots, quizzes or assistants and embed them anywhere with a snippet.<p>Built in React&#x2F;TypeScript front end and Node.js logic engine. It loads fast and works across CMS platforms.<p>I’d love feedback from builders and marketers on use cases, missing features, or integration ideas.

Show HN: Using DSPy to enrich a dataset of the Nobel laureate network

Show HN: Using DSPy to enrich a dataset of the Nobel laureate network I&#x27;ve been working a fair bit with DSPy lately, and I did some work in combining the benefits of vector search and LLMs (via a DSPy pipeline) to disambiguate records with a high degree of accuracy to help enrich a dataset. The blog post shows how this approach scales well, is very cost-effective and super concise - all it takes is &lt; 100 lines of DSPy code and it all runs async.<p>The code to reproduce is in this repo if anyone&#x27;s interested (all tools are 100% free and open source, and the methodology will work with open weight LLMs too). <a href="https:&#x2F;&#x2F;github.com&#x2F;kuzudb&#x2F;dspy-kuzu-demo" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;kuzudb&#x2F;dspy-kuzu-demo</a>

Show HN: WhiteLightning – ultra-lightweight ONNX text classifiers trained w LLMs

Show HN: WhiteLightning – ultra-lightweight ONNX text classifiers trained w LLMs Hey HN,<p>We’re Volodymyr and Volodymyr—two developers from Ukraine building WhiteLightning. It’s a tool that turns large LLMs (Claude 4, Grok 4, GPT-4o via OpenRouter) into tiny ONNX text classifiers that run anywhere—even on drones at the edge.<p>Why we built this: Many developers want custom models (spam filters, sentiment analysis, PII detection, moderation tools), but don’t want to deal with constant API calls or deploy heavy models in production.<p>How it works: WhiteLightning uses LLMs to generate training data and distills it into KB-sized ONNX models you can run on any device and in any language. Just describe your task in a sentence, grab the ONNX model, and run it locally—Python, JS, Rust, Java, Swift, C++, you name it.<p>Try it instantly in your browser: <a href="https:&#x2F;&#x2F;whitelightning.ai&#x2F;playground.html" rel="nofollow">https:&#x2F;&#x2F;whitelightning.ai&#x2F;playground.html</a><p>Code &amp; docs: <a href="https:&#x2F;&#x2F;github.com&#x2F;Inoxoft&#x2F;whitelightning">https:&#x2F;&#x2F;github.com&#x2F;Inoxoft&#x2F;whitelightning</a><p>Community model library: <a href="https:&#x2F;&#x2F;github.com&#x2F;Inoxoft&#x2F;whitelightning-model-library">https:&#x2F;&#x2F;github.com&#x2F;Inoxoft&#x2F;whitelightning-model-library</a><p>We’d love your feedback—what works, what doesn’t, and what to improve.

Show HN: A tool for complete WebSocket traffic control

Show HN: A tool for complete WebSocket traffic control I built a Chrome extension that acts as a WebSocket proxy, allowing real-time monitoring, message simulation, and traffic interception. Think &quot;Proxyman for WebSockets&quot; but integrated into Chrome DevTools.<p>Key features: Real-time WebSocket monitoring and message capture Send custom messages in both directions (client ↔ server) Block incoming&#x2F;outgoing messages for testing Background monitoring (captures connections even when DevTools is closed) Why: I was debugging a WebSocket chat app and needed better tools than browser DevTools. Existing solutions required external proxies or were too basic.<p>Tech: Injects proxy script to intercept WebSocket constructor, React + Vite UI, Chrome DevTools API integration, MIT licensed. Perfect for debugging WebSocket apps, testing error scenarios, reverse engineering APIs, and QA testing real-time features.<p>Links:<p>GitHub: <a href="https:&#x2F;&#x2F;github.com&#x2F;law-chain-hot&#x2F;websocket-devtools">https:&#x2F;&#x2F;github.com&#x2F;law-chain-hot&#x2F;websocket-devtools</a><p>YouTube Demo: <a href="https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=L64x__1xORQ" rel="nofollow">https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=L64x__1xORQ</a><p>Would love feedback from developers who work with WebSockets regularly!

Show HN: I made a web app for structured podcast summaries

Show HN: I made a web app for structured podcast summaries Hey HN,<p>I follow a lot of podcasts and the episodes are often 2-3 hours long, so I made a web app that gives me a structured podcast summary with applicable habits and recommendations.<p>My goal isn’t to discourage you from listening to the podcast, but rather to help you decide whether the episode is worthwhile and to provide notes.<p>The endgame is to give you a personal feed of podcasts that you follow, maybe even deliver it to your inbox or via RSS.<p>My tech stack is quite simple - React Router(v7), node.js and PostgreSQL with Redis. I&#x27;m using OpenAI API to generate the summary from the episode transcript.

Show HN: MCP Jetpack – The easiest way to get started with MCP in Cursor

Show HN: MCP Jetpack – The easiest way to get started with MCP in Cursor My friends and I built MCP Jetpack which makes it really easy to start using MCP. It’s an MCP server that automatically finds and executes the right tools needed to accomplish your task without having to manage MCP servers for each service (GitHub, Linear, Atlassian, Notion etc.). Once you add it to Cursor (or any other AI app with MCP support), Cursor instantly gets access to a growing library of remote MCP servers without any extra setup. For services that require authentication, you will be asked to login the first time you ask your AI to interact with that service.<p>Two problems we are trying to solve:<p>Friction - Normally if you want to give Cursor access to GitHub, you have to install the right MCP server and login before you can use GitHub with Cursor’s chat. With MCP Jetpack, you can ask Cursor to list your GitHub issues, and it will automatically execute the right tool behind the scenes to accomplish your task. For services that require authentication, you will be asked to login the first time you interact with the service. However, it all happens within the Cursor chat so you never have to context switch and fiddle with Cursor’s settings.<p>Tool Limits - Cursor warns you if you have more than 50 MCP tools installed as it says having more will degrade performance. However, just installing the GitHub MCP server itself adds 74 MCP tools. With MCP Jetpack, you get access to GitHub, Atlassian and 15 other services with just two tools: “FindTool” and “ExecTool”.<p>Here are the 17 services we support today: GitHub, Atlassian, Canva, Linear, Notion, Intercom, Monday.com, Neon, PayPal, Hugging Face, Sentry, Square, Webflow, Wix, Cloudflare Docs, Cloudflare AI Gateway, Cloudflare Workers Bindings.<p>We’ll continue to add more services as companies launch remote MCP servers. If yours isn’t listed and you’d like it to be added, please email us at team@mcpjetpack.com.<p>MCP Jetpack is in alpha so please let us know if you run into any problems or have any feedback - thanks!

Show HN: Mark 1.0, a notation that unifies JSON, HTML, JSX, XML, YAML, and more

Show HN: Mark 1.0, a notation that unifies JSON, HTML, JSX, XML, YAML, and more Author of Mark Notation here.<p>Mark is a unified notation for both object and markup data, combining the best of JSON, HTML, and XML with a clean syntax and succinct data model.<p>I&#x27;m glad to announce the 1.0 Release of Mark. This release is just the start of a long journey to make web a better platform to store and exchange data.<p>Your feedback welcome!

Show HN: GitGuard - Painless GitHub PR Automations

Show HN: GitGuard - Painless GitHub PR Automations Hey HN,<p>Every team I&#x27;ve been on has cobbled together some sort of combination of GitHub branch protections and custom scripts to make sure that PRs conform to organization policies and best practices.<p>Things like:<p>- When {X} file is changed, require review from team {Y}<p>- When a new db migration is added, ensure that a special set of tests pass<p>- Require multiple approvals when the PR is very large<p>- Add a special label to PRs that include breaking changes<p>- Allow emergencies &#x2F; hotfixes to break glass and bypass all of the above<p>Most teams tend to start out with a little script running in GitHub actions to enforce all of these policies but it tends to get out of hand and become hard to maintain. PRs that should require scrutiny slip through the cracks, and others that should be allowed through are unnecessarily blocked.<p>That&#x27;s why I made GitGuard (<a href="https:&#x2F;&#x2F;gitguard.dev&#x2F;" rel="nofollow">https:&#x2F;&#x2F;gitguard.dev&#x2F;</a>)<p>GitGuard lets you write and maintain these policies in a custom DSL so simple it looks like pseudocode. The policies are checked on every single PR nearly instantly (no need to wait for a GitHub actions runner) and the results are reported in plain english.<p>Right now policies can make simple assertions about PR metadata and take some stateful actions (adding labels, requesting review) but I&#x27;d love to hear more from HN about how GitGuard could be even more useful.

Show HN: HN v0.2.1 – Native macOS app for reading HN

Show HN: HN v0.2.1 – Native macOS app for reading HN Hey HN folks,<p>Last week I released the first public version of 120 HN — a fully native Hacker News client for macOS. This week, I&#x27;m excited to share version 0.2.1 with some essential updates:<p>The app now remembers window size, position, and sidebar state between sessions<p>Improved AI summaries: more concise, relevant with link back to the comments, easier to read with short paragraph form.<p>Minor tweaks to navigation and layout for a smoother experience<p>Sneak peek: local LLM support is coming in the next release.<p>As always, the app is free to use. Any suggestions or comments would be appreciated.<p>Thanks for checking it out.

Show HN: Timep – a next-gen profiler and flamegraph-generator for bash code

Show HN: Timep – a next-gen profiler and flamegraph-generator for bash code timep is a TIME Profiler for bash code that will give you an accurate per-command execution time breakdown of any bash script or function.<p>Unlike other profilers, timep also recovers and hierarchally records metadata on subshell and function nesting, allowing it to recreate the full call-stack tree for the bash code being profiled. If you call timep with the `--flame` flag, it will automatically generate a flamegraph .svg image where each block represents the wall-clock time spent on a particular command (top level) or its parent subshells&#x2F;functions (all the other levels).<p>Using timep is simple - just source the timep.bash file then add timep before whatever you want to profile. You do not need to change in the code being profiled - timep handles everything for you. Example usage:<p><pre><code> . .&#x2F;timep.bash timep someFunc timep -flame someScript &lt;inputFile </code></pre> timep will generate 2 profiles for you: one showing each individual command (with full subshell&#x2F;function nesting chains), and one that combines repeated loops commands into a count + total runtime line with minimal &quot;extra&quot; metadata.<p>See the github README for more info on the available flags and output profile specifics.<p>timep works by cramming all the timing instrumentation logic into a DEBUG trap that roughly does the following:<p>1. record end timestamp for previous command 2. compare current state to state saved in variables last DEBUG trap to determine what sort of command is happening. e.g., if BASH_SUBSHELL increased then we know we just entered a subshell or background fork. 3. once we know what type of command is happening, generate a log line for the previous command (now that we have its end time 4. save current state in various variables (for use next debug trap) 5. record start time for the next command<p>then after the profiled code is done running, timep post-processes the logs to produce the final profile

Show HN: c0admin – A terminal-based AI assistant for Linux sysadmins

Show HN: c0admin – A terminal-based AI assistant for Linux sysadmins I made a small CLI tool called `c0admin`. Runs locally using your own Gemini API key. No signup, no server, no tracking.<p>You just run `c0admin` in your terminal, and it gives you suggestions interactively.

Show HN: Ncrypt – Query encrypted files privately with FHE

Show HN: Ncrypt – Query encrypted files privately with FHE Hey HN,<p>We&#x27;re building ncrypt, an open-source encrypted file manager that allows you to store, manage, and privately query your files using fully homomorphic encryption (FHE). This project originally started as a simple SFTP-like CLI for my personal S3 buckets which I used to send and retrieve encrypted files and have more granular control over key rotation.<p>As the number of files that I was storing grew, file discovery started to become a problem, and I found myself frequently having to download and decrypt files to inspect their contents. Rather than leaving them unencrypted in S3 and therefore easier to search, I started looking into the concept of searching over encrypted data using fully homomorphic encryption. This led me to Zama&#x27;s concrete-python library (<a href="https:&#x2F;&#x2F;github.com&#x2F;zama-ai&#x2F;concrete">https:&#x2F;&#x2F;github.com&#x2F;zama-ai&#x2F;concrete</a>), which provides a simple Python interface for performing FHE operations.<p>FHE is notoriously slow, so rather than trying to search over entire files I focused on a more tractable problem, indexing and searching over file metadata (summaries, keywords, embeddings, etc) which was small enough to make search practical. While still not fast compared to traditional file management tools, ncrypts search performance is decent if you keep directory sizes relatively small (under 25 files), and most of the heavy lifting happens during metadata extraction, not at search time.<p>The two types of encrypted queries we currently support are keyword search and cosine similarity search over vector embeddings, which are generated using user-specified huggingface models. Ncrypt currently supports metadata extraction for text, image, and audio files. Check out our code and give it a try at <a href="https:&#x2F;&#x2F;github.com&#x2F;ncryptai&#x2F;ncrypt">https:&#x2F;&#x2F;github.com&#x2F;ncryptai&#x2F;ncrypt</a>.<p>We love feedback!

Show HN: Clai - Vendor agnostic Claude Code/Gemini CLI written in Go

Show HN: Clai - Vendor agnostic Claude Code/Gemini CLI written in Go

Show HN: Reactylon – Open-source framework for building 3D/XR apps with React

Show HN: Reactylon – Open-source framework for building 3D/XR apps with React

No other tools from this source yet.