spacy-ner

Solid

spaCy NER model training and entity extraction for conversational AI

AI & Automation 1,160 stars 71 forks Updated today MIT

Install

View on GitHub

Quality Score: 94/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
49
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# spaCy NER Skill ## Capabilities - Train custom spaCy NER models - Configure entity extraction pipelines - Design annotation schemas - Implement entity linking - Set up model evaluation - Deploy efficient NER inference ## Target Processes - entity-extraction-slot-filling - chatbot-design-implementation ## Implementation Details ### spaCy Components 1. **NER**: Named Entity Recognition 2. **EntityLinker**: Link to knowledge bases 3. **EntityRuler**: Rule-based matching 4. **SpanCategorizer**: Overlapping entities ### Training Configuration - config.cfg setup - Training data format (spaCy v3) - Augmentation strategies - Evaluation metrics ### Configuration Options - Base model selection (en_core_web_*) - Custom entity types - Training parameters - GPU acceleration - Model packaging ### Best Practices - Quality annotation data - Balance entity types - Use prodigy for annotation - Regular model evaluation ### Dependencies - spacy - spacy-transformers (optional)

Details

Author
a5c-ai
Repository
a5c-ai/babysitter
Created
4 months ago
Last Updated
today
Language
JavaScript
License
MIT

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Solid

entity-memory-extraction

Entity and fact extraction for user profiling and personalization

1,160 Updated today
a5c-ai
AI & Automation Solid

rasa-nlu-integration

Rasa NLU pipeline configuration and training for intent and entity extraction

1,160 Updated today
a5c-ai
Data & Documents Listed

cohere-v2-python

Master Cohere v2 Chat API with Python, specializing in entity extraction using JSON Schema mode for structured outputs. Use when extracting entities from text, building data extraction pipelines, implementing NER systems, or requiring validated JSON responses from LLMs.

335 Updated today
aiskillstore
AI & Automation Solid

entity-optimizer

Build entity presence in Knowledge Graph, Wikidata, AI systems for brand recognition and citations. 实体优化/知识图谱

1,941 Updated 2 weeks ago
aaron-he-zhu
AI & Automation Listed

context-engineering

Build and query a token-efficient, provenance-tracked company brain across code, docs, and connector streams. Five capabilities ship as one skill: multi-source indexer (14 languages via tree-sitter AST + markdown heading trees, schema-versioned, incremental); Source ABC (connector contract — connectors live elsewhere, never in this skill); EntityStore (three-tier raw/events/wiki layer with full provenance, semantic-shift consolidation, drift/split/merge auditor); depth-aware packer (2 levels — full / pointer, 95% budget utilization, multi-hop reasoning, query-as-lens reranking, anti-hallucination filters); MCP server. Anabasis `find-links` reference implementation. Use when: an agent needs broad file awareness within a token budget, building or querying a wiki/EntityStore over code + human knowledge, extracting features from a repo (code-to-knowledge), packing entity pages with provenance, multi-hop reasoning across linked concepts, indexing a workspace or GitHub repo. Do NOT use for: single-file reads, when

1 Updated 3 days ago
VictorGjn