INDEX

Explanations

Names and locations

np_max-act · gemini-2.0-flash

This neuron activates on personal names (proper names of individuals) in the text.

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

opro

-0.07

_SHARE

-0.06

 situación

-0.06

オ

-0.06

SPE

-0.06

iframe

-0.06

Seriously

-0.06

abella

-0.06

ší

-0.06

 plaza

-0.06

POSITIVE LOGITS

 akıl

0.06

剑

0.06

 CHANGE

0.06

 Rodgers

0.06

ponent

0.06

MOVED

0.06

.dense

0.06

 ขนาด

0.06

lux

0.06

 mohli

0.06

Activations Density 0.187%

Names and locations

This neuron activates on personal names (proper names of individuals) in the text.

No Comments

No Known Activations