Introduction
Yosoi: You Only Scrape Once (iteratively). The core idea: pay for LLM-based scraping once per domain, then scrape that domain forever using the cached selectors with no further AI costs.
Yosoi takes a pipelined approach, where every selector is treated as a cache. When a site redesigns and cached selectors go stale, Yosoi detects the breakage, re-discovers only the fields that changed, and merges the fresh selectors back into the selector cache. You pay for AI discovery again only on that partial diff, not from scratch. Over time, the cost of keeping selectors current trends toward zero as layouts stabilize.
Most web scraping tools make you choose between cost and reliability. LLM-per-request APIs are accurate but expensive at scale. Regex heuristics are cheap but brittle. Yosoi takes a different approach: use an LLM once per domain to discover selectors, cache the result, and scrape indefinitely at near-zero cost.
Technical Notes
Pydantic △ is the foundation for contract definitions and data validation. You declare what you want to extract as a typed Pydantic model. Yosoi handles the rest.
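A contract is just a typed Pydantic model. A minimal sketch, assuming a hypothetical `Product` contract (the field names and the raw-data shape here are illustrative, not Yosoi's actual API):

```python
from pydantic import BaseModel

# Hypothetical data contract: declare the fields you want extracted.
class Product(BaseModel):
    title: str
    price: float
    in_stock: bool

# Pydantic validates and coerces raw scraped strings into typed fields.
raw = {"title": "Widget", "price": "19.99", "in_stock": "true"}
product = Product(**raw)
print(product.price)  # 19.99
```

The contract doubles as documentation of the extraction target: anything the model declares, the pipeline must produce, and anything that fails validation is surfaced immediately.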
Async native. The scraping pipeline is built on asyncio ○ from the ground up. scrape() is an async generator; process_urls() is fully non-blocking.
Concurrent background workers. Pipeline.process_urls() dispatches URLs as independent tasks via a TaskIQ◑ in-memory broker, distributing work across a configurable worker pool. Discovery and extraction can run in parallel with no external queue required.
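The worker-pool shape can be illustrated with plain asyncio (TaskIQ's in-memory broker provides the same pattern without an external queue); everything below is an illustration of the pattern, not Yosoi's internals:

```python
import asyncio

async def worker(name: str, queue: asyncio.Queue, results: list) -> None:
    # Each worker is an independent task pulling URLs until cancelled.
    while True:
        url = await queue.get()
        try:
            await asyncio.sleep(0)  # placeholder for fetch + extract
            results.append((name, url))
        finally:
            queue.task_done()

async def process_urls(urls: list[str], workers: int = 3) -> list:
    queue: asyncio.Queue = asyncio.Queue()
    for url in urls:
        queue.put_nowait(url)
    results: list = []
    tasks = [asyncio.create_task(worker(f"w{i}", queue, results)) for i in range(workers)]
    await queue.join()  # block until every queued URL has been processed
    for t in tasks:
        t.cancel()      # workers loop forever; shut them down once drained
    await asyncio.gather(*tasks, return_exceptions=True)
    return results

results = asyncio.run(process_urls([f"https://example.com/{i}" for i in range(5)]))
print(len(results))
```

The pool size bounds concurrency the same way a configurable worker count does in the real pipeline: five URLs, three workers, no external queue process.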
How It Works
- You provide a URL and a data contract (a typed Pydantic model describing what to extract)
- Yosoi fetches the page and sends it to an LLM
- The LLM identifies stable selectors for each field in the contract
- Selectors are validated and stored in .yosoi/selectors/
- Future scrapes use the cached selectors directly, with no LLM call needed
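The discover-once, cache-forever flow can be sketched as follows. The cache file layout and function names here are assumptions for illustration; only the `.yosoi/selectors/` location comes from the docs above, and the LLM call is stubbed:

```python
import json
from pathlib import Path

CACHE_DIR = Path(".yosoi/selectors")  # cache location described above

def discover_selectors(html: str, fields: list[str]) -> dict[str, str]:
    # Stand-in for the LLM call: the only paid step in the flow.
    return {field: f"css-selector-for-{field}" for field in fields}

def selectors_for(domain: str, html: str, fields: list[str]) -> dict[str, str]:
    path = CACHE_DIR / f"{domain}.json"
    if path.exists():
        return json.loads(path.read_text())       # cache hit: no LLM call
    selectors = discover_selectors(html, fields)  # cache miss: pay once
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(selectors))
    return selectors

first = selectors_for("example.com", "<html>...</html>", ["title", "price"])
second = selectors_for("example.com", "<html>...</html>", ["title", "price"])
print(first == second)  # True: second call served from cache
```

Every scrape after the first resolves to a file read, which is where the near-zero marginal cost comes from.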
FAQs
What is Yosoi?
You Only Scrape Once (iteratively). Instead of calling an LLM on every scrape or relying on flaky regex heuristics, Yosoi discovers selectors automatically the first time and caches them. You pay once and extract at near-zero cost from then on. When selectors go stale, Yosoi re-discovers only the broken fields, not the whole contract, so costs stay minimal even as sites change.
How does Yosoi work?
You point it at a URL. Yosoi fetches the HTML, sends it to an LLM to identify stable CSS selectors, validates them, and caches the result. Every subsequent scrape of that domain uses the cached selectors directly with no LLM involved.
Why was Yosoi built?
For internal use at Cascading Labs. Every other API or LLM-per-request approach would have cost hundreds of thousands of dollars at scale. Yosoi was the only economically viable path, and we figured others had the same problem.
Does Yosoi work with JavaScript-rendered sites?
Yosoi fetches static HTML. For sites that require JavaScript to render content, use a headless browser (Playwright, Puppeteer) to render the page first, then pass the HTML to Yosoi. A native DOM fetcher is planned for our beta in mid-2026, but is not yet stable.
What happens if a site redesigns its layout?
Some cached selectors will likely break. Yosoi detects stale selectors at scrape time and runs partial rediscovery — only the broken fields are sent back to the LLM. The fresh selectors are merged into the existing cache. You can also force a full rediscovery by deleting the domain’s file from .yosoi/selectors/ or passing --force.
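A minimal sketch of the stale-detection and partial-rediscovery merge. The selector check and the rediscovery step are stubs for illustration; Yosoi's real validation logic differs:

```python
cached = {"title": "h1.old-title", "price": "span.price", "stock": "div.stock"}

def selector_matches(selector: str, html: str) -> bool:
    # Stub: treat a selector as valid if its class name still appears
    # in the page. Real validation actually runs the selector.
    return selector.split(".")[-1] in html

def rediscover(fields: list[str]) -> dict[str, str]:
    # Stand-in for the LLM call, scoped to the broken fields only.
    return {field: f"new-selector-for-{field}" for field in fields}

html_after_redesign = (
    '<h1 class="hero-title"></h1><span class="price"></span><div class="stock"></div>'
)

stale = [f for f, sel in cached.items() if not selector_matches(sel, html_after_redesign)]
fresh = rediscover(stale)       # only the broken fields go back to the LLM
merged = {**cached, **fresh}    # fresh selectors overwrite stale ones
print(stale)
```

In this example only `title` broke, so only one field is re-discovered while `price` and `stock` keep their cached selectors untouched.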
Which LLM provider gives the best results?
Results are generally consistent across providers. Models with larger context windows and higher parameter counts tend to perform better. We like models hosted on Groq or OpenRouter for simplicity, but more than 25 providers are supported for convenience.
References
△ Pydantic. Pydantic Services Inc. Data validation library for Python. https://docs.pydantic.dev/
○ asyncio. Python Software Foundation. Asynchronous I/O framework in the Python standard library. https://docs.python.org/3/library/asyncio.html
◑ TaskIQ. TaskIQ Contributors. Distributed task queue for Python with asyncio support, used internally as the broker for concurrent URL processing. https://taskiq-python.github.io/