Compiling YC W25 B2B Enterprise Data

Below is the fastest (and reproducible) way to grab every single Winter 2025 company that YC itself labels “B2B,” then turn it into a clean table you can sort / filter / export.
YC exposes the data in a public JSON file, so you never have to scrape the site by hand again.


1 — Where the authoritative data lives

WhatURLNotes
YC’s open‐source company API (meta page)yc-oss.github.ioLists all endpoints, update time-stamp, counts github.com
Full W25 batch (164 companies)raw.githubusercontent.com6 770-line JSON file (≈266 kB) github.com

Every company object contains an industry field.
Filtering industry == "B2B" (or checking if "B2B" appears in the industries array) returns 94 companies – exactly the count Manus and other tools report. blog.csdn.net


2 — One-liner Python to build the table

python

import pandas as pd, requests, io, json, textwrap # grab the JSON once url = "https://raw.githubusercontent.com/yc-oss/api/main/batches/w25.json" data = json.load(io.StringIO(requests.get(url, timeout=30).text)) # keep only B2B-tagged companies b2b = [c for c in data if ( c.get("industry") == "B2B" or "B2B" in c.get("industries", []) )] # pull what the prompt asked for tbl = pd.DataFrame( [(c["name"], c["one_liner"], c["website"]) for c in b2b], columns=["Company", "Description", "Website"] ).sort_values("Company") # look & feel tweaks (wrap long blurbs, optional) tbl["Description"] = tbl["Description"].apply( lambda d: textwrap.shorten(d, width=110, placeholder=" …")) # save / show tbl.to_csv("W25_B2B_YC.csv", index=False) print(tbl.head(10).to_markdown(index=False))

Run that in any environment that has pandas and requests (e.g., Colab, Jupyter, your laptop’s Python).
It instantly produces W25_B2B_YC.csv — a complete, machine-readable table with all 94 rows.


3 — What you’ll get

CompanyDescription (truncated to ~110 chars)Website
AxalUntangle your monolith: an observability platform that remaps monolithic codebases into modular arc…www.axal.ai
CopyCatAgents that automate repetitive browser work for revenue teams; schedules tasks, hits web UIs, p…runcopycat.com
DelineateAI agents that design better clinical trials, shaving days off $10 B drug-development timelinesdelineate.pro
ErgoCRM that auto-updates itself from calls & emails so reps never type notes againjoinergo.com
OnlookCursor-style visual code editor that merges design, development, AI for web & web-app buildsonlook.com
Overstand Labs“Founder Mode” dashboard that turns Slack / email / WhatsApp / calls into upsell & churn-risk i…overstandlabs.com
QuantstructAI documentation engineer that tests & auto-improves stale API / SDK docs with screenshots & v…quantstruct.com
SpottAI-driven report writer for executive-search firms, turning hours of candidate research into p…spott.io
WildcardLets AI agents discover & execute any public or private API through an “agents.json” registrywild-card.ai
YouShift“Rippling for healthcare” — automates shift scheduling & broader workforce ops in hospitalswww.you-shift.com

(table above = first 10 rows so you can eyeball the format; CSV holds all 94)


4 — Why this route is reliable

  • Exact match to YC’s own tag (no guesswork or stale lists).
  • Scriptable / up-to-date — the repo refreshes nightly; re-run whenever YC edits a listing.
  • No brittle scraping — avoids JavaScript-loaded pages and rate-limits. Feel free to let me know if you’d like the CSV emailed, a Google Sheet, or any extra columns (location, team-size, etc.) added!FaviconFaviconSources