Browse: audit and enrich domain keyword lists against actual corpus distribution #123
Reference: Circuit-Forge/kiwi#123
Problem
app/services/recipe/browser_domains.py contains keyword lists for all four browse domains (cuisine, meal_type, dietary, main_ingredient). These were written as best guesses before the corpus was fully analyzed, and they carry a prominent warning in the file header. The audit of these lists against the corpus has not yet happened. The meal_type domain (tracked separately in #122) has demonstrated the problem: categories appear empty in the UI because their keywords do not match the corpus vocabulary.
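A minimal sketch of the kind of check this audit implies: count how many corpus values each category's keywords match and flag categories with no matches. The structure of the keyword lists (a dict of category to keywords), the case-insensitive substring matching, and the sample data are all assumptions for illustration; the real layout of browser_domains.py and the real matching logic may differ.

```python
def audit_keywords(domain_keywords, corpus_values):
    """Count how many corpus values each category's keywords match.

    domain_keywords: dict mapping category name -> list of keywords
                     (hypothetical shape, not taken from browser_domains.py).
    corpus_values:   iterable of category/tag strings from the corpus.
    Matching is a case-insensitive substring test (an assumption).
    """
    lowered = [value.lower() for value in corpus_values]
    hits = {}
    for category, keywords in domain_keywords.items():
        hits[category] = sum(
            1
            for value in lowered
            if any(keyword.lower() in value for keyword in keywords)
        )
    return hits


def empty_categories(hits):
    """Return the categories whose keywords matched nothing, sorted by name."""
    return sorted(category for category, count in hits.items() if count == 0)


# Hypothetical sample data, only to show the spelling-mismatch failure mode.
SAMPLE_KEYWORDS = {
    "italian": ["italian", "pasta"],
    "latin american": ["latin american"],
}
SAMPLE_CORPUS = ["Italian Recipes", "Pasta Shapes", "Latin-American", "Desserts"]

hits = audit_keywords(SAMPLE_KEYWORDS, SAMPLE_CORPUS)
print(hits)
print(empty_categories(hits))
```

In this sample the hyphenated corpus tag "Latin-American" is missed by the keyword "latin american", which is the sort of spelling or compound-tag mismatch the audit is meant to surface.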
Scope
All four domains need to be audited:
cuisine
Verify that the italian, mexican, asian, american, mediterranean, indian, european, and latin american category keywords match how the corpus actually categorizes those recipes. The corpus may use different spellings or compound tags.
meal_type
Tracked in #122 — near-empty categories confirmed. Fix required urgently.
dietary
Keywords like vegetarian, vegan, and gluten-free may match if the corpus uses dietary tags, but coverage may be low if the corpus does not tag dietary restrictions explicitly. This domain may need inference from ingredient lists rather than keyword matching.
main_ingredient
Keywords like chicken, beef, pork, pasta, and vegetables are likely to have reasonable overlap with corpus category values, but this should be verified.
Tasks
result_count per browse)
Notes
The browser_telemetry table (migration 020) captures result_count per domain/category/page. Query it to see which categories users are hitting that return nothing.
pyr0ball referenced this issue 2026-04-27 11:38:25 -07:00
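The telemetry query suggested in the notes could look roughly like the sketch below. It assumes a SQLite database and assumes the browser_telemetry table has columns named domain, category, page, and result_count; the issue only states that result_count is captured per domain/category/page, so the actual column names and schema in migration 020 may differ. The in-memory table and its rows are made up for the demonstration.

```python
import sqlite3

# Assumed column names; adjust to the real schema from migration 020.
ZERO_RESULT_QUERY = """
SELECT domain, category, COUNT(*) AS hits
FROM browser_telemetry
WHERE result_count = 0
GROUP BY domain, category
ORDER BY hits DESC, domain, category
"""


def zero_result_categories(conn):
    """Return (domain, category, hits) rows for browses that returned nothing."""
    return conn.execute(ZERO_RESULT_QUERY).fetchall()


# Self-contained demo against an in-memory table with made-up rows.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE browser_telemetry "
    "(domain TEXT, category TEXT, page INTEGER, result_count INTEGER)"
)
conn.executemany(
    "INSERT INTO browser_telemetry VALUES (?, ?, ?, ?)",
    [
        ("meal_type", "brunch", 1, 0),
        ("meal_type", "brunch", 1, 0),
        ("cuisine", "italian", 1, 24),
        ("dietary", "vegan", 1, 0),
    ],
)

rows = zero_result_categories(conn)
for row in rows:
    print(row)
```

If later pages can legitimately return zero results once a category is exhausted, restricting the query to the first page would avoid counting those as empty categories; how pages are numbered is not stated in the issue.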