INDEX

Explanations

punctuation

np_max-act · gemini-2.0-flash

This neuron activates on numerical tokens containing decimal points (floating-point numbers).

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ON

-0.07

 crashing

-0.07

hora

-0.06

_topics

-0.06

 Rent

-0.06

.";↵↵

-0.06

')}}">↵

-0.06

.coordinates

-0.06

 अम

-0.06

 ск

-0.06

POSITIVE LOGITS

 juicy

0.06

 gerne

0.06

$template

0.06

 krás

0.06

 luaL

0.06

 kolo

0.06

đo

0.06

ुं

0.06

 відпов

0.06

 vlastní

0.06

Activations Density 0.087%

punctuation

This neuron activates on numerical tokens containing decimal points (floating-point numbers).

No Comments

No Known Activations

punctuation

This neuron activates on numerical tokens containing decimal points (floating-point numbers).

No Comments

No Known Activations