nnsight-remote-interpretability

Featured

Provides guidance for interpreting and manipulating neural network internals using nnsight with optional NDIF remote execution. Use when needing to run interpretability experiments on massive models (70B+) without local GPU resources, or when working with any PyTorch architecture.

AI & Automation 27,984 stars 2901 forks Updated today MIT

Install

View on GitHub

Quality Score: 99/100

Stars 20%

100

Recency 20%

100

Frontmatter 20%

Documentation 15%

100

Issue Health 10%

License 10%

100

Description 5%

100

Skill Content

# nnsight: Transparent Access to Neural Network Internals nnsight (/ɛn.saɪt/) enables researchers to interpret and manipulate the internals of any PyTorch model, with the unique capability of running the same code locally on small models or remotely on massive models (70B+) via NDIF. **GitHub**: [ndif-team/nnsight](https://github.com/ndif-team/nnsight) (730+ stars) **Paper**: [NNsight and NDIF: Democratizing Access to Foundation Model Internals](https://arxiv.org/abs/2407.14561) (ICLR 2025) ## Key Value Proposition **Write once, run anywhere**: The same interpretability code works on GPT-2 locally or Llama-3.1-405B remotely. Just toggle `remote=True`. ```python # Local execution (small model) with model.trace("Hello world"): hidden = model.transformer.h[5].output[0].save() # Remote execution (massive model) - same code! with model.trace("Hello world", remote=True): hidden = model.model.layers[40].output[0].save() ``` ## When to Use nnsight **Use nnsight when you need to:** - Run interpretability experiments on models too large for local GPUs (70B, 405B) - Work with any PyTorch architecture (transformers, Mamba, custom models) - Perform multi-token generation interventions - Share activations between different prompts - Access full model internals without reimplementation **Consider alternatives when:** - You want consistent API across models → Use **TransformerLens** - You need declarative, shareable interventions → Use **pyvene** - You're training SAEs → Use...

Details

Author: davila7
Repository: davila7/claude-code-templates
Created: 11 months ago
Last Updated: today
Language: Python
License: MIT

Integrates with

Anthropic · AI

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Solid