Add three new scripts for Purple Carrot recipe pipeline:
- discover_current_menu.py: fetches this week's active menu slugs from
/plant-based-recipes using requests (server-rendered HTML, no JS needed).
Accumulates slugs across weekly runs for building a recipe corpus over time.
- discover_slugs_categories.py: crawls recipe-category listing pages with
?page=N pagination to discover historical slug inventory. Note: category
archive slugs (past menu items) 404 when scraped live; only use for
identifying currently-featured recipes per category.
- scrape_live.py: updated with --slugs-from flag (load slug inventory from
any parquet, not just the default Wayback one) and fresh-context-per-slug
pattern to bypass Cloudflare session-level bot detection (which fires on
the 2nd+ request in a shared browser context).
Discovery: the live site only renders full ingredient/instruction content for
recipes currently on the active weekly menu. 23/23 current menu recipes
scraped successfully (100% hit rate vs ~1% for archived slugs).