AI Weekly Malaysia

Back to items Summaries

HTML table extractor

ID
2157
Status
summarized
Published
30 Jun 2026, 7:38 AM
Fetched
01 Jul 2026, 1:52 AM
Provider
Simon Willison
Category
developer-ai
Original URL
https://simonwillison.net/2026/Jun/29/html-table-extractor/
Source URL
https://simonwillison.net/atom/everything/

Summary

Score
7.8
Created
01 Jul 2026, 1:53 AM
Tags
Audience
developersvibe_coders

What happened

Simon Willison built a free web tool that extracts any HTML table you paste from a browser into clean Markdown, CSV, TSV, or JSON. It now also integrates Wikipedia’s open CORS API, letting you search a page and pull tables directly without manual copy-paste. The tool is part of his growing suite of paste-conversion utilities for developers.

Why it matters

Saves developers hours of manual scraping or regex hell when grabbing tabular data from docs, reports, or wikis. For vibe coders and AI agent builders, this instantly turns messy web tables into structured, LLM-ready formats like Markdown or JSON—reducing friction for data pipelines and prototypes.

Discussion angle

Pair this tool with an LLM that accepts JSON or Markdown as context—you can go from a Wikipedia table to a fully summarized report or a structured dataset for your database project in minutes. Even ask: what local Malaysian open data portals could benefit from this kind of zero-friction table extraction?

Top