
Explanations

ir

np_max-act · gemini-2.0-flash

This neuron responds to mentions of people or person-like entities—proper names (e.g. “NAME_1”) and human/AI references.

oai_token-act-pair · o4-mini Triggered by @xinyanhu8


Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.11.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted


Negative Logits

"_ord": -0.07
"(ord": -0.07
"motion": -0.07
" дит": -0.07
" Rotation": -0.07
"#ga": -0.07
" embracing": -0.07
" eski": -0.07
" getSource": -0.06
"sse": -0.06

Positive Logits

" ภาษ": 0.07
"KD": 0.06
"Deploy": 0.06
" раб": 0.06
" Saints": 0.06
"-react": 0.06
"зем": 0.06
"-Speed": 0.06
" разі": 0.06
" cere": 0.06
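The logit lists above are commonly read as the tokens a feature most suppresses or promotes when its decoder direction is projected through the model's unembedding. The following is only a minimal sketch of that idea, assuming a plain dot product with no normalization; the dashboard's exact computation is not stated here, and `logit_effects`, `top_and_bottom`, and the toy vectors are hypothetical.

```python
def logit_effects(decoder_dir, unembed):
    """Dot the feature's decoder direction with each token's unembedding vector.

    decoder_dir: list of floats (hypothetical feature direction).
    unembed: dict mapping token string -> list of floats (hypothetical).
    """
    return {
        tok: sum(d * u for d, u in zip(decoder_dir, vec))
        for tok, vec in unembed.items()
    }


def top_and_bottom(effects, k):
    """Return the k most positive and the k most negative (token, value) pairs."""
    ranked = sorted(effects.items(), key=lambda kv: kv[1])
    positive = ranked[-k:][::-1]  # largest first
    negative = ranked[:k]         # smallest first
    return positive, negative


# Toy two-dimensional example, not real model weights.
toy_dir = [1.0, 0.0]
toy_unembed = {"a": [0.5, 1.0], "b": [-0.25, 2.0], "c": [0.0, 3.0]}
pos, neg = top_and_bottom(logit_effects(toy_dir, toy_unembed), k=1)
print(pos, neg)
```

In this toy case only the first coordinate matters, so "a" is the most promoted token and "b" the most suppressed.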

Activations Density 0.125%
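An activation density of 0.125% corresponds to the feature firing on roughly 1 in 800 token positions. A minimal sketch of how such a figure could be computed, assuming density means the fraction of sampled positions whose activation exceeds zero; the dashboard's actual threshold and sample are not stated here, and `activation_density` and the toy data are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions where the activation exceeds `threshold`."""
    if not activations:
        raise ValueError("need at least one activation value")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical toy data: 800 token positions, the feature fires on exactly one.
toy_activations = [0.0] * 799 + [3.2]
print(f"{activation_density(toy_activations):.3%}")
```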


No Known Activations