alterlab-molfeat

Solid

Featurizes molecules for machine learning with molfeat (100+ featurizers) — ECFP and MACCS fingerprints, physicochemical descriptors, and pretrained model embeddings (ChemBERTa), converting SMILES into feature vectors. Use when turning molecules into ML-ready features for QSAR or molecular modeling, or comparing fingerprint and descriptor representations. Part of the AlterLab Academic Skills suite.

AI & Automation 27 stars 4 forks Updated today MIT

Install

View on GitHub

Quality Score: 87/100

Stars 20%
48
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# Molfeat - Molecular Featurization Hub ## Overview Molfeat is a comprehensive Python library for molecular featurization that unifies 100+ pre-trained embeddings and hand-crafted featurizers. Convert chemical structures (SMILES strings or RDKit molecules) into numerical representations for machine learning tasks including QSAR modeling, virtual screening, similarity searching, and deep learning applications. Features fast parallel processing, scikit-learn compatible transformers, and built-in caching. ## When to Use This Skill This skill should be used when working with: - **Molecular machine learning**: Building QSAR/QSPR models, property prediction - **Virtual screening**: Ranking compound libraries for biological activity - **Similarity searching**: Finding structurally similar molecules - **Chemical space analysis**: Clustering, visualization, dimensionality reduction - **Deep learning**: Training neural networks on molecular data - **Featurization pipelines**: Converting SMILES to ML-ready representations - **Cheminformatics**: Any task requiring molecular feature extraction ## Installation ```bash uv pip install molfeat # With all optional dependencies uv pip install "molfeat[all]" ``` **Optional dependencies for specific featurizers:** - `molfeat[dgl]` - GNN models (GIN variants) - `molfeat[graphormer]` - Graphormer models - `molfeat[transformer]` - ChemBERTa, ChemGPT, MolT5 - `molfeat[fcd]` - FCD descriptors - `molfeat[map4]` - MAP4 fingerprints ## Core Conc...

Details

Author
AlterLab-IEU
Repository
AlterLab-IEU/AlterLab-Academic-Skills
Created
2 months ago
Last Updated
today
Language
Python
License
MIT

Integrates with

Similar Skills

Semantically similar based on skill content — not just same category