
Explanations

for
np_max-act · gemini-2.0-flash

legal disclaimers and licensing information in software documentation.
oai_token-act-pair · gpt-4o-mini, triggered by @xinyanhu8

The neuron activates on numeric literals with decimal points (floating-point numbers).
oai_token-act-pair · o4-mini, triggered by @xinyanhu8


Configuration: andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1
Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.11.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted


Negative Logits

legate    -0.07
 wireType    -0.06
Aj    -0.06
,tp    -0.06
اجات    -0.06
 Guerrero    -0.06
DataTask    -0.06
"]↵    -0.06
any    -0.06
�    -0.06

Positive Logits

 Volkswagen    0.07
 aktiv    0.07
 integrate    0.07
calc    0.07
crime    0.07
 Hart    0.07
It    0.06
COOKIE    0.06
 improvement    0.06
.compile    0.06

Activations Density 0.001%


No Known Activations