INDEX

Explanations

-

np_max-act · gemini-2.0-flash

This neuron detects user‐interface navigation cues such as clickable pagination or “Back/Next” controls and other bracketed menu/action markers.

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.11.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted
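
To make the configuration more concrete, here is a minimal sketch of how a single feature's activation could be computed from the residual stream read at blocks.11.hook_resid_post. It assumes that "standard" refers to a ReLU sparse autoencoder and that the decoder bias is subtracted before encoding; the page states neither, and implementations differ on the second point. The function name and the 4-dimensional toy vectors are hypothetical and are not values from this SAE.

```python
def feature_activation(resid, w_enc, b_enc, b_dec):
    """Activation of one SAE feature under an assumed ReLU encoder.

    resid : residual-stream vector at the hook point (toy size here)
    w_enc : this feature's encoder weights, same length as resid
    b_enc : this feature's encoder bias (scalar)
    b_dec : decoder bias, subtracted before encoding (an assumption)
    """
    centered = [x - b for x, b in zip(resid, b_dec)]
    pre_activation = sum(w * c for w, c in zip(w_enc, centered)) + b_enc
    return max(0.0, pre_activation)  # ReLU keeps only positive evidence


# Toy inputs, purely illustrative.
resid = [1.0, -2.0, 0.5, 3.0]
b_dec = [0.0, 0.0, 0.5, 1.0]
w_enc = [0.5, 0.25, 1.0, 1.0]

active = feature_activation(resid, w_enc, b_enc=-1.0, b_dec=b_dec)
silent = feature_activation(resid, w_enc, b_enc=-5.0, b_dec=b_dec)
```

With a strongly negative encoder bias the feature stays at zero for most inputs, which is how a feature under this assumed architecture ends up firing on only a small fraction of tokens.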

Negative Logits

 Bulgarian    -0.07
olulu    -0.07
ूछ    -0.07
듯    -0.07
ANDLE    -0.06
Ukr    -0.06
oter    -0.06
 rotated    -0.06
 Sciences    -0.06
 greens    -0.06

Positive Logits

+"]    0.07
:self    0.07
 enterprise    0.07
DBG    0.06
 Clan    0.06
_FRONT    0.06
/".$    0.06
dál    0.06
дн    0.06
_Private    0.06
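
Logit lists like the two above are commonly obtained by projecting a feature's decoder direction through the model's unembedding matrix and keeping the most positive and most negative tokens. The sketch below illustrates that ranking step on toy data; it is an assumption about how such lists are typically built, not a description of this dashboard's pipeline. The function names, the 2-dimensional vectors, and the three-token vocabulary are hypothetical.

```python
def logit_effects(decoder_dir, unembed):
    """Dot the feature's decoder direction with each token's unembedding vector."""
    return {
        token: sum(d * u for d, u in zip(decoder_dir, vec))
        for token, vec in unembed.items()
    }


def top_and_bottom(effects, k):
    """Return the k most positive and the k most negative (token, logit) pairs."""
    ranked = sorted(effects.items(), key=lambda kv: kv[1], reverse=True)
    most_positive = ranked[:k]
    most_negative = ranked[-k:][::-1]  # most negative first
    return most_positive, most_negative


# Toy decoder direction and unembedding rows, purely illustrative.
decoder_dir = [1.0, 0.0]
unembed = {"a": [0.5, 1.0], "b": [-0.25, 2.0], "c": [0.0, 3.0]}

effects = logit_effects(decoder_dir, unembed)
positive, negative = top_and_bottom(effects, k=1)
```

When every listed value is as small as the ones here (at most 0.07 in magnitude), the feature pushes output logits only weakly, so the token lists say little about its meaning on their own.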

Activations Density 0.008%
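
For a sense of scale, the short calculation below converts the density figure into more intuitive numbers. It assumes that the density is the fraction of tokens on which the feature is active, which the page does not define explicitly, and it reuses the 1,024-token context size from the configuration.

```python
density_percent = 0.008          # "Activations Density" from the dashboard
context_size = 1024              # "Context Size" from the configuration

density = density_percent / 100  # convert percent to a fraction of tokens

# Average number of tokens between activations.
tokens_per_activation = 1 / density

# Expected number of active tokens in one full context window.
expected_per_context = context_size * density

print(f"about 1 activation per {round(tokens_per_activation):,} tokens")
print(f"about {expected_per_context:.3f} activations per {context_size}-token context")
```

Under that reading, the feature fires roughly once every 12,500 tokens, so most 1,024-token contexts contain no activation at all.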

No Known Activations
