karyotype-databaseslisted

Use when working with karyotype, chromosome number, sex-system data, or any of the lab's curated species databases (Coleoptera, Diptera, Amphibia, Mammalia, Polyneoptera, Drosophila karyotypes; Tree of Sex; CURES; Epistasis; Tau). Covers how to resolve resource keys via require_data_resource, what each database contains, and safe read/write patterns.
coleoguy/tealc · ★ 0 · Data & Documents · score 68

Install: claude install-skill coleoguy/tealc

# Karyotype & Species Databases ## Overview The Blackmon Lab maintains a suite of curated comparative genomics databases, all publicly hosted at `coleoguy.github.io/data/`. Every database is registered in `data/known_sheets.json` and accessed via the `require_data_resource(key)` tool. **Hard rule: call `require_data_resource(key)` before writing any R or Python code that reads a lab database.** The tool either returns `OK|<path>` (local CSV/JSON) or `OK|<sheet_id>` (Google Sheet). Use the returned string verbatim in the generated code. If it returns `ERROR|...`, STOP — do not emit analysis code. Tell Heath what is missing and ask him to supply the path or Sheet ID. --- ## The require_data_resource Protocol ```python # Example call result = require_data_resource("coleoptera_karyotypes") # Returns one of: # "OK|/Users/blackmon/Desktop/GitHub/coleoguy.github.io/data/karyotypes-coleoptera.csv" # "ERROR|key 'coleoptera_karyotypes' not in known_sheets.json. Available keys: ..." # "ERROR|resource 'tree_of_sex' is registered but not yet configured ..." ``` On `OK|<path>`: - For local CSV: use the absolute path directly in `read.csv(<path>)` or `pd.read_csv("<path>")` - For Google Sheet: pass the ID to the Sheets API or `read_sheet(sheet_id, range)` On `ERROR|...`: - Do NOT write any analysis code - Report the error to Heath verbatim - Ask whether he wants to supply a corrected path/ID or create a fresh registration --- ## Database Catalog ### Karyotype Databases (ins