Explanations

phrases related to political conspiracy/organizations, mental conditions and storytelling terms

oai_token-act-pair · gemini-2.0-flash

State

np_max-act · gemini-2.0-flash

Configuration

google/gemma-scope-2b-pt-transcoders/layer_12/width_16k/average_l0_6

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

16,384

Data Type

float32

Hook Name

blocks.12.ln2.hook_normalized

Architecture

jumprelu_transcoder

Context Size

1,024

Dataset

monology/pile-uncopyrighted
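The jumprelu_transcoder architecture listed above uses a JumpReLU activation: a pre-activation passes through unchanged only when it exceeds a per-feature threshold, and is zeroed otherwise. The following is a minimal sketch of that activation alone, not the transcoder's actual code. The function name, the threshold values, and the sample inputs are hypothetical and chosen only for illustration.

```python
import numpy as np

def jumprelu(pre_acts: np.ndarray, threshold: np.ndarray) -> np.ndarray:
    """Keep each pre-activation only if it is strictly above its threshold."""
    return np.where(pre_acts > threshold, pre_acts, 0.0)

# Hypothetical values for three features (not taken from the real transcoder).
pre_acts = np.array([0.2, 1.5, -0.3])
threshold = np.array([0.5, 0.5, 0.5])

acts = jumprelu(pre_acts, threshold)
print(acts)
```

Unlike a plain ReLU, a small positive pre-activation (0.2 here) is suppressed, which is one reason such features can fire on few tokens.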

Negative Logits

Token             Logit
aarrggbb          -0.64
 Vrbo             -0.45
'/',              -0.44
 vielmehr         -0.44
                  -0.43
versa             -0.42
ViewImports       -0.41
<bos>             -0.41
 Schicht          -0.41
бираем            -0.41

Positive Logits

Token             Logit
mybatisplus       0.75
IntoConstraints   0.64
 Мексичка         0.61
alyptus           0.59
Sucesor           0.59
DeleteBehavior    0.58
 Meksiku          0.53
 Vikipedi         0.52
 THOUGH           0.52
[:-               0.52
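Logit lists like the ones above are commonly produced by projecting a feature's decoder vector through the model's unembedding matrix and reading off the most negative and most positive entries. Whether this dashboard computes them exactly this way is an assumption. The sketch below uses a toy four-token vocabulary and made-up matrices, and the helper name top_logit_tokens is hypothetical.

```python
import numpy as np

def top_logit_tokens(decoder_vec, unembed, vocab, k=3):
    """Return the k most negative and k most positive (token, logit) pairs."""
    logits = decoder_vec @ unembed            # shape: (vocab_size,)
    order = np.argsort(logits)                # ascending by logit
    negative = [(vocab[i], float(logits[i])) for i in order[:k]]
    positive = [(vocab[i], float(logits[i])) for i in order[::-1][:k]]
    return negative, positive

# Toy data, not taken from the real model: 2-dim decoder vector, 4-token vocab.
vocab = ["a", "b", "c", "d"]
decoder_vec = np.array([1.0, 0.0])
unembed = np.array([[0.5, -0.2, 0.1, -0.7],
                    [9.0,  9.0, 9.0,  9.0]])

negative, positive = top_logit_tokens(decoder_vec, unembed, vocab, k=2)
print("negative:", negative)
print("positive:", positive)
```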

Activations Density 1.623%
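To put the density figure in context, it can be combined with the dashboard prompt counts listed under Configuration. The arithmetic below assumes that density means the fraction of dashboard tokens on which the feature has a nonzero activation; that definition is not stated on this page, so the result is only a rough estimate.

```python
# Figures copied from the page.
num_prompts = 24_576
tokens_per_prompt = 128
density = 0.01623  # 1.623%

total_tokens = num_prompts * tokens_per_prompt

# Assumption: density = active tokens / total dashboard tokens.
approx_active = round(total_tokens * density)

print(f"total tokens: {total_tokens:,}")
print(f"approx. active tokens: {approx_active:,}")
```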
