scikit-bio

Solid

Biological data toolkit. Sequence analysis, alignments, phylogenetic trees, diversity metrics (alpha/beta, UniFrac), ordination (PCoA), PERMANOVA, FASTA/Newick I/O, for microbiome analysis.

AI & Automation · 2,210 stars · 164 forks · Updated 1 week ago · Apache-2.0

Quality Score: 91/100

Stars (20%): 100
Recency (20%): 90
Frontmatter (20%): 70
Documentation (15%): 100
Issue Health (10%): 50
License (10%): 100
Description (5%): 100

Skill Content

# scikit-bio

## Overview

scikit-bio is a comprehensive Python library for working with biological data. Apply this skill for bioinformatics analyses spanning sequence manipulation, alignment, phylogenetics, microbial ecology, and multivariate statistics.

## When to Use This Skill

This skill should be used when the user:

- Works with biological sequences (DNA, RNA, protein)
- Needs to read/write biological file formats (FASTA, FASTQ, GenBank, Newick, BIOM, etc.)
- Performs sequence alignments or searches for motifs
- Constructs or analyzes phylogenetic trees
- Calculates diversity metrics (alpha/beta diversity, UniFrac distances)
- Performs ordination analysis (PCoA, CCA, RDA)
- Runs statistical tests on biological/ecological data (PERMANOVA, ANOSIM, Mantel)
- Analyzes microbiome or community ecology data
- Works with protein embeddings from language models
- Needs to manipulate biological data tables

## Core Capabilities

### 1. Sequence Manipulation

Work with biological sequences using specialized classes for DNA, RNA, and protein data.

**Key operations:**

- Read/write sequences from FASTA, FASTQ, GenBank, EMBL formats
- Sequence slicing, concatenation, and searching
- Reverse complement, transcription (DNA→RNA), and translation (RNA→protein)
- Find motifs and patterns using regex
- Calculate distances (Hamming, k-mer based)
- Handle sequence quality scores and metadata

**Common patterns:**

```python
import skbio

# Read sequences from file
seq = skbio.DNA.read('inpu...
```
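Because the preview above is cut off before any operation is shown, here is a minimal plain-Python sketch of what three of the listed operations compute: reverse complement, DNA→RNA transcription, and a normalized Hamming distance. It deliberately does not call scikit-bio; the helper names `reverse_complement`, `transcribe`, and `hamming_fraction` are hypothetical and chosen for illustration only. It also assumes upper-case, unambiguous `ACGT` input. In scikit-bio itself, the equivalent work is understood to be done by methods on the `DNA` class such as `reverse_complement()` and `transcribe()`.

```python
# Illustrative only: plain-Python versions of operations listed under
# "Key operations". These helpers are hypothetical, not scikit-bio's API.

# Translation table mapping each unambiguous DNA base to its complement.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Complement every base, then reverse the strand."""
    return dna.translate(_COMPLEMENT)[::-1]


def transcribe(dna: str) -> str:
    """DNA -> RNA on the coding strand: replace thymine with uracil."""
    return dna.replace("T", "U")


def hamming_fraction(a: str, b: str) -> float:
    """Fraction of positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("Hamming distance requires equal-length sequences")
    if not a:
        return 0.0
    mismatches = sum(x != y for x, y in zip(a, b))
    return mismatches / len(a)


if __name__ == "__main__":
    seq = "GATTACA"
    print(reverse_complement(seq))
    print(transcribe(seq))
    print(hamming_fraction(seq, "GACTATA"))
```

The sketch works on bare strings so it stays dependency-free. A real analysis would normally use scikit-bio's sequence classes, which also validate the alphabet and carry metadata and quality scores.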

Details

Author: foryourhealth111-pixel
Repository: foryourhealth111-pixel/Vibe-Skills
Created: 3 months ago
Last Updated: 1 week ago
Language: Python
License: Apache-2.0

Similar Skills

Semantically similar based on skill content, not just the same category

DevOps & Infrastructure Solid

scikit-bio

Biological data toolkit. Sequence analysis, alignments, phylogenetic trees, diversity metrics (alpha/beta, UniFrac), ordination (PCoA), PERMANOVA, FASTA/Newick I/O, for microbiome analysis.

26,817 Updated today
K-Dense-AI

DevOps & Infrastructure Solid

scikit-bio

Biological data toolkit. Sequence analysis, alignments, phylogenetic trees, diversity metrics (alpha/beta, UniFrac), ordination (PCoA), PERMANOVA, FASTA/Newick I/O, for microbiome analysis.

27,705 Updated today
davila7

DevOps & Infrastructure Listed

scikit-bio

Biological data toolkit. Sequence analysis, alignments, phylogenetic trees, diversity metrics (alpha/beta, UniFrac), ordination (PCoA), PERMANOVA, FASTA/Newick I/O, for microbiome analysis.

335 Updated today
aiskillstore

AI & Automation Featured

biopython

Biopython is a comprehensive set of freely available Python tools for biological computation. It provides functionality for sequence manipulation, file I/O, database access, structural bioinformatics, phylogenetics, and many other bioinformatics tasks.

39,350 Updated today
sickn33

Data & Documents Listed

biopython-sequence-analysis

Biopython sequence analysis: parse FASTA/FASTQ/GenBank/GFF (SeqIO), NCBI Entrez (esearch/efetch/elink), remote/local BLAST, pairwise/MSA alignment (PairwiseAligner, MUSCLE/ClustalW), phylogenetic trees (Phylo). Use for gene family studies, phylogenomics, comparative genomics, NCBI pipelines. For PCR/restriction/cloning use biopython-molecular-biology; for SAM/BAM use pysam.

186 Updated 4 days ago
jaechang-hits