nnsight-remote-interpretability

Featured

Provides guidance for interpreting and manipulating neural network internals using nnsight with optional NDIF remote execution. Use when needing to run interpretability experiments on massive models (70B+) without local GPU resources, or when working with any PyTorch architecture.

AI & Automation 27,984 stars 2901 forks Updated today MIT

Install

View on GitHub

Quality Score: 99/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
100
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# nnsight: Transparent Access to Neural Network Internals nnsight (/ɛn.saɪt/) enables researchers to interpret and manipulate the internals of any PyTorch model, with the unique capability of running the same code locally on small models or remotely on massive models (70B+) via NDIF. **GitHub**: [ndif-team/nnsight](https://github.com/ndif-team/nnsight) (730+ stars) **Paper**: [NNsight and NDIF: Democratizing Access to Foundation Model Internals](https://arxiv.org/abs/2407.14561) (ICLR 2025) ## Key Value Proposition **Write once, run anywhere**: The same interpretability code works on GPT-2 locally or Llama-3.1-405B remotely. Just toggle `remote=True`. ```python # Local execution (small model) with model.trace("Hello world"): hidden = model.transformer.h[5].output[0].save() # Remote execution (massive model) - same code! with model.trace("Hello world", remote=True): hidden = model.model.layers[40].output[0].save() ``` ## When to Use nnsight **Use nnsight when you need to:** - Run interpretability experiments on models too large for local GPUs (70B, 405B) - Work with any PyTorch architecture (transformers, Mamba, custom models) - Perform multi-token generation interventions - Share activations between different prompts - Access full model internals without reimplementation **Consider alternatives when:** - You want consistent API across models → Use **TransformerLens** - You need declarative, shareable interventions → Use **pyvene** - You're training SAEs → Use...

Details

Author
davila7
Repository
davila7/claude-code-templates
Created
11 months ago
Last Updated
today
Language
Python
License
MIT

Integrates with

Similar Skills

Semantically similar based on skill content — not just same category