Show HN: Dataset Explorer – Free tool to search any public datasets
Show HN (score: 6)Description
--
Finding the right dataset shouldn't be this hard.
Millions of high-quality datasets exist across Kaggle, data.gov, and other platforms, but discovering the ones you actually need feels like searching for a needle in a haystack.
Whether it's seasonality trends, weather patterns, holiday data, tech layoffs, currency rates, political content, or geo information – the perfect dataset is out there, but buried under poor search functionality.
That's why we built the dataset-explorer – a completely free tool that lets you search for datasets using natural language across multiple platforms.
Just describe what you want to analyze, and it uses Perplexity, scraping (Firecrawl), and other tools behind the scenes to surface relevant datasets.
Instead of manually browsing through categories or dealing with limited search filters, you can simply ask "show me tech layoff data from the past 5 years" and get preview of multiple datasets.
Quick demo: I analyzed tech layoffs from 2020-2025 and uncovered some striking insights:
- 2023 was brutal – 264K layoffs (the peak year)
- Post-IPO companies led the cuts – responsible for 58% of all layoffs
- Hardware hit hardest – with Intel leading the charge
- January 2023 = worst month ever – 89K people lost their jobs in just 30 days
Once you find your dataset, you can analyze it completely free on Hunch . Try it yourself and let us know we can improve it for you.
Data explorer - https://hunch.dev/data-explorer
Demo link - https://screen.studio/share/bLnYXAvZ
More from Show
Show HN: Ragnerock, an AI data analysis tool
Show HN: Ragnerock, an AI data analysis tool Hi HN, I’m Matt Mahowald, and together with my cofounder John, we’re launching the public beta of Ragnerock today.<p>As a data scientist, you spend the majority of your time wrangling data. Even though you might have a set of techniques and tricks you like to use, how exactly you treat a particular source of data tends to be fairly bespoke, so you end up writing custom logic each time.<p>Ragnerock was born from the observation that modern LLMs can be used to automate a lot of the grunt work involved in this process, while still allowing for fully customizable pipelines. What’s more, by leveraging techniques like constrained decoding, it’s possible to provide a unified query interface regardless of the data source - bridging raw data sources like text and images with your existing structured data living in your databases.<p>Ragnerock has four main components:<p>- A workflow designer that lets you build LLM-driven data processing and analysis pipelines<p>- A job orchestration layer that runs those workflows<p>- A query interface which lets you inspect the results of those workflows with plain SQL<p>- A notebook system which is 100% API-compatible with Jupyter and runs on your existing kernels, so you can easily pull data into your existing environments and analyses<p>Ragnerock also supports bring-your-own AI (OpenAI, Anthropic, and Google APIs), databases, and blob storage, so you can join with your existing datasets and have all outputs flow to your data lake. We’re particularly excited about our web crawling feature, which allows you to scrape websites and trigger workflows on updates: for example, you might point Ragnerock at your favorite blog and run a workflow to assess posts for topics and sentiment.<p>You can try it out at <a href="https://www.ragnerock.com" rel="nofollow">https://www.ragnerock.com</a> ; no credit card needed and the first 20 hours of compute are free. It’s an early-stage product so we’re especially interested in feedback.<p>Happy to answer any questions - John and I will be around in the comments today.
No other tools from this source yet.