INDEX

Explanations

scientific/legal documents

np_max-act · gemini-2.0-flash

The neuron lights up on low-frequency, multi-syllable “content” tokens—i.e. uncommon or domain-specific words rather than common function words.

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 müdür

-0.07

 Vietnamese

-0.07

 tuple

-0.07

سون

-0.06

head

-0.06

 lesson

-0.06

Protocol

-0.06

 tuto

-0.06

_LOCK

-0.06

 classroom

-0.06

POSITIVE LOGITS

 onCancelled

0.07

�

0.06

AQ

0.06

.SOCK

0.06

TransparentColor

0.06

 fingertips

0.06

"+↵

0.06

рования

0.06

�

0.06

 aerospace

0.06

Activations Density 0.181%

scientific/legal documents

The neuron lights up on low-frequency, multi-syllable “content” tokens—i.e. uncommon or domain-specific words rather than common function words.

No Comments

No Known Activations

scientific/legal documents

The neuron lights up on low-frequency, multi-syllable “content” tokens—i.e. uncommon or domain-specific words rather than common function words.

No Comments

No Known Activations