INDEX

Explanations

points

np_max-act · gemini-2.0-flash

The neuron specializes in spotting numeric/statistical tokens—counts, figures, and other number-related words in sports‐stat contexts.

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Toe

-0.06

.number

-0.06

 belle

-0.06

 Anna

-0.06

 Teil

-0.06

.fields

-0.06

gé

-0.06

 Grande

-0.06

(),"

-0.06

Between

-0.06

POSITIVE LOGITS

학년도

0.09

erseniz

0.07

ाधन

0.07

ATEG

0.06

 clutter

0.06

imité

0.06

沒

0.06

 ΑΠ

0.06

scripts

0.06

恶

0.06

Activations Density 0.006%

points

The neuron specializes in spotting numeric/statistical tokens—counts, figures, and other number-related words in sports‐stat contexts.

No Comments

No Known Activations

points

The neuron specializes in spotting numeric/statistical tokens—counts, figures, and other number-related words in sports‐stat contexts.

No Comments

No Known Activations