INDEX

Explanations

Blizzard/Warcraft

np_max-act · gemini-2.0-flash

The neuron flags mentions of Blizzard-related proper names, especially the company name “Blizzard” and its game titles (e.g. “Warcraft”).

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 जगह

-0.07

 مجموع

-0.06

.SetKeyName

-0.06

 CharSet

-0.06

stants

-0.06

(QL

-0.06

ียนบ

-0.06

 کاربرد

-0.06

.presentation

-0.06

popular

-0.06

POSITIVE LOGITS

 Scout

0.06

 málo

0.06

 Auto

0.06

 bieten

0.06

Clinical

0.06

 scouts

0.06

sic

0.06

 maç

0.06

ีฟ

0.06

/as

0.06

Activations Density 0.002%

Blizzard/Warcraft

The neuron flags mentions of Blizzard-related proper names, especially the company name “Blizzard” and its game titles (e.g. “Warcraft”).

No Comments

No Known Activations

Blizzard/Warcraft

The neuron flags mentions of Blizzard-related proper names, especially the company name “Blizzard” and its game titles (e.g. “Warcraft”).

No Comments

No Known Activations