INDEX

Explanations

Database type/parameters

np_max-act · gemini-2.0-flash

The neuron activates on numeric literals (especially floating-point numbers) in code or config text.

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

eve

-0.07

 cathedral

-0.06

 Nietzsche

-0.06

 Yaş

-0.05

atar

-0.05

.getNum

-0.05

singleton

-0.05

 singleton

-0.05

-topic

-0.05

Dar

-0.05

POSITIVE LOGITS

нул

0.07

 }],↵

0.07

Adjust

0.07

ในว

0.07

]>

0.07

 kappa

0.07

 incredibly

0.07

 جن

0.06

ují

0.06

 proficiency

0.06

Activations Density 0.001%

Database type/parameters

The neuron activates on numeric literals (especially floating-point numbers) in code or config text.

No Comments

No Known Activations