star-rna-seq-aligner

Solid

Splice-aware RNA-seq aligner producing sorted BAM and splice junction tables. Builds genome index, runs two-pass alignment for better junctions. Outputs sorted BAM, junctions (SJ.out.tab), stats (Log.final.out), optional gene counts. Use Salmon for fast pseudoalignment; STAR when a BAM is needed for variant calling, IGV, or ENCODE pipelines.

AI & Automation · 284 stars · 26 forks · Updated 3 days ago · License: NOASSERTION


Quality Score: 82/100


Skill Content

# STAR: Spliced RNA-seq Aligner

## Overview

STAR (Spliced Transcripts Alignment to a Reference) aligns RNA-seq reads to a genome in a splice-aware manner, identifying novel and annotated splice junctions in a single pass. It generates coordinate-sorted BAM files compatible with samtools, IGV, deeptools, and GATK. STAR's 2-pass mode re-aligns reads using junctions discovered in the first pass, improving sensitivity for novel splice sites. With `--quantMode GeneCounts`, STAR simultaneously produces gene-level read count tables without requiring a separate featureCounts or HTSeq step.

## When to Use

- Aligning bulk RNA-seq reads to a reference genome when downstream tools require a BAM file (variant calling, visualization, deeptools)
- Running ENCODE-compliant RNA-seq pipelines that mandate genome alignment
- Discovering novel splice junctions and alternative splicing events in the dataset
- Generating gene count tables alongside BAM alignment in a single step with `--quantMode GeneCounts`
- Processing long reads or reads with high mismatch rates by tuning `--outFilterMismatchNmax`
- Use **Salmon** instead when you only need transcript/gene quantification and do not need a BAM file; Salmon is 20–50× faster

## Prerequisites

- **Software**: STAR ≥ 2.7.0 (conda or compiled binary)
- **Reference files**: genome FASTA + GTF annotation (same assembly)
- **RAM**: 30–32 GB for human/mouse genome index; 8–16 GB for smaller genomes
- **Disk**: ~25 GB for human genome index, ~5–10 ...
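The index-then-align workflow described in the Overview can be sketched as two STAR invocations. The Python below only assembles the argument lists and does not run STAR. The helper names (`build_index_cmd`, `build_align_cmd`, `expected_outputs`), the file paths, and the default thread count and read length are illustrative assumptions, not part of the skill; the STAR flags themselves are standard options from the STAR manual.

```python
import shlex


def build_index_cmd(genome_dir, fasta, gtf, read_length=100, threads=8):
    """Argument list for building a STAR genome index (hypothetical helper)."""
    return [
        "STAR",
        "--runMode", "genomeGenerate",
        "--genomeDir", str(genome_dir),
        "--genomeFastaFiles", str(fasta),
        "--sjdbGTFfile", str(gtf),
        # The STAR manual suggests read length minus 1 for the overhang.
        "--sjdbOverhang", str(read_length - 1),
        "--runThreadN", str(threads),
    ]


def build_align_cmd(genome_dir, fastqs, out_prefix, threads=8, gzipped=True):
    """Argument list for 2-pass alignment with sorted BAM and gene counts."""
    cmd = [
        "STAR",
        "--runMode", "alignReads",
        "--genomeDir", str(genome_dir),
        "--readFilesIn", *[str(f) for f in fastqs],
        "--outSAMtype", "BAM", "SortedByCoordinate",
        "--twopassMode", "Basic",      # re-align using first-pass junctions
        "--quantMode", "GeneCounts",   # also write ReadsPerGene.out.tab
        "--outFileNamePrefix", str(out_prefix),
        "--runThreadN", str(threads),
    ]
    if gzipped:
        # Assumes GNU zcat is on PATH; some systems need "gunzip -c" instead.
        cmd += ["--readFilesCommand", "zcat"]
    return cmd


def expected_outputs(out_prefix):
    """Files STAR writes for the align command above, keyed by role."""
    p = str(out_prefix)
    return {
        "bam": p + "Aligned.sortedByCoord.out.bam",
        "junctions": p + "SJ.out.tab",
        "stats": p + "Log.final.out",
        "gene_counts": p + "ReadsPerGene.out.tab",
    }


if __name__ == "__main__":
    # Placeholder paths; substitute your own reference and reads.
    print(shlex.join(build_index_cmd("star_index", "genome.fa", "genes.gtf")))
    print(shlex.join(build_align_cmd(
        "star_index", ["s1_R1.fastq.gz", "s1_R2.fastq.gz"], "results/s1_")))
```

To execute a command rather than print it, pass the list to `subprocess.run(cmd, check=True)` on a machine where STAR is installed and the memory requirements listed under Prerequisites are met.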

Details

Author: jaechang-hits
Repository: jaechang-hits/SciAgent-Skills
Created: 5 months ago
Last Updated: 3 days ago
Language: Python
License: NOASSERTION

Bundled in these plugins

sciagent-skills

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Solid

bwa-mem2-dna-aligner

Fast short-read DNA aligner for WGS/WES/ChIP-seq. 2× faster BWA-MEM successor; outputs SAM/BAM with read group headers for GATK. Primary plus supplementary records for chimeric reads. Use STAR for RNA-seq splice-aware alignment; Bowtie2 is a comparable alternative.

284 Updated 3 days ago

jaechang-hits

AI & Automation Solid

samtools-bam-processing

CLI toolkit for SAM/BAM/CRAM: sort, index, convert, filter, QC alignments. Core commands: view, sort, index, flagstat, stats, depth, markdup, merge. Required between alignment and variant/peak calling. Use pysam for Python-native BAM access; deeptools for normalized coverage tracks.

284 Updated 3 days ago

jaechang-hits

AI & Automation Featured

bulk-rnaseq

End-to-end bulk RNA-seq orchestrator — takes raw FASTQ reads through QC and trimming (FastQC, fastp/Trim Galore), alignment and quantification (STAR, Salmon, featureCounts), assembles a gene-level counts matrix, then hands off to differential expression (pydeseq2), pathway/GSEA enrichment (pathway-enrichment), and publication figures (scientific-visualization). Use whenever the user has bulk RNA-seq reads or quant output and wants a complete, reproducible differential-expression workflow — e.g. "analyze my RNA-seq", "FASTQ to DESeq2", "run nf-core/rnaseq", "STAR/Salmon quantification", "build a counts matrix for DESeq2", or "go from reads to differentially expressed genes and enriched pathways". Routes between an nf-core/rnaseq (Nextflow) path and a standalone STAR/Salmon path, and covers experimental design, strandedness, and QC gates. For single-cell RNA-seq use the scanpy skill instead.

31,883 Updated today

K-Dense-AI