Open dataset

Every number on our insights, directory, and technology pages comes from this dataset. Free under CC BY 4.0 — credit borahlabs.us and you can use it in your own research, articles, or client reports.

Regenerated April 19, 2026Refreshed weekly

Methodology

Each record is built from a full diagnostic crawl: Lighthouse / PageSpeed, SSL inspection, Wappalyzer-style tech detection, structured-data extraction, and social-profile discovery. We require a minimum of 10 completed diagnostics per bucket before any aggregate is published — thinner segments return a 404 on the SEO-export API so they never surface here.

Every field is an aggregate across the sample. We never expose individual business names, domains, or screenshots in this dataset.

Attribution

Use the data however you like — internal reports, blog posts, tools, pitch decks. Just credit borahlabs.us as the source with a link back.

Example citation: "Based on data from Borah Labs' US small-business website benchmarks, 2026."

Released under CC BY 4.0.