HTML table extractor
- ID
- 2157
- Status
- summarized
- Published
- 30 Jun 2026, 7:38 AM
- Fetched
- 01 Jul 2026, 1:52 AM
- Provider
- Simon Willison
- Category
- developer-ai
- Original URL
- https://simonwillison.net/2026/Jun/29/html-table-extractor/
- Source URL
- https://simonwillison.net/atom/everything/
Summary
- Score
- 7.8
- Created
- 01 Jul 2026, 1:53 AM
- Tags
- Audience
- developersvibe_coders
What happened
Simon Willison built a free web tool that extracts any HTML table you paste from a browser into clean Markdown, CSV, TSV, or JSON. It now also integrates Wikipedia’s open CORS API, letting you search a page and pull tables directly without manual copy-paste. The tool is part of his growing suite of paste-conversion utilities for developers.
Why it matters
Saves developers hours of manual scraping or regex hell when grabbing tabular data from docs, reports, or wikis. For vibe coders and AI agent builders, this instantly turns messy web tables into structured, LLM-ready formats like Markdown or JSON—reducing friction for data pipelines and prototypes.
Discussion angle
Pair this tool with an LLM that accepts JSON or Markdown as context—you can go from a Wikipedia table to a fully summarized report or a structured dataset for your database project in minutes. Even ask: what local Malaysian open data portals could benefit from this kind of zero-friction table extraction?