Explanations

structured code comments or documentation

oai_token-act-pair · gpt-4o-mini

symbols followed by 'summary' or 'eval'

np_acts-logits-general · gemini-2.5-flash-lite

Top Features by Cosine Similarity

Comparing With GEMMA-2-9B-IT @ 9-gemmascope-res-131k

Configuration

google/gemma-scope-9b-it-res/layer_9/width_131k/average_l0_121

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

131,072

Data Type

float32

Hook Name

blocks.9.hook_resid_post

Hook Layer

9

Architecture

jumprelu

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Activation Function

relu
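The configuration above lists both jumprelu (Architecture) and relu (Activation Function). As a reminder of how the two functions differ, here is a minimal NumPy sketch. The pre-activation values and per-feature thresholds are hypothetical and are not taken from this SAE; only the general JumpReLU definition (pass a value through unchanged when it exceeds a threshold, otherwise output zero) is assumed.

```python
import numpy as np

def relu(z):
    # Standard ReLU: clamp negative pre-activations to zero.
    return np.maximum(z, 0.0)

def jumprelu(z, theta):
    # JumpReLU: keep a pre-activation unchanged only where it exceeds
    # its per-feature threshold theta; output zero everywhere else.
    return np.where(z > theta, z, 0.0)

# Hypothetical pre-activations and thresholds for four features.
pre_acts = np.array([-0.5, 0.2, 0.9, 1.5], dtype=np.float32)
theta = np.array([0.0, 0.5, 0.5, 2.0], dtype=np.float32)

relu_out = relu(pre_acts)
jump_out = jumprelu(pre_acts, theta)
```

With these toy values, ReLU keeps every positive entry, while JumpReLU keeps only the entry that clears its threshold.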

Negative Logits

Token          Logit
" unless"      -0.36
" Solve"       -0.36
"tab"          -0.34
"Solve"        -0.33
" للاسماء"     -0.32
" Stirn"       -0.32
" fils"        -0.32
" isolate"     -0.32
" Sotto"       -0.31
" fille"       -0.31

Positive Logits

Token                 Logit
"%%%%%%%%"            1.55
"%%%%%%%%%%%%"        1.54
"%%%"                 1.53
"%%%%%%%"             1.47
"%%%%%"               1.47
"%%%%%%%%%"           1.42
"%%%%%%"              1.41
"%%%%%%%%%%"          1.39
"%%%%"                1.34
"%%%%%%%%%%%%%%%%"    0.89
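The page does not state how the logit scores above are derived. One common convention for listings of this kind is to project a feature's decoder direction through the model's unembedding matrix and sort the resulting per-token values. The sketch below illustrates that convention on random toy matrices; the sizes, the variable names, and the assumption that this dashboard follows the same convention are all hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy sizes, far smaller than the real model (assumption for illustration).
d_model, vocab_size = 8, 20

# Hypothetical decoder direction for one feature and a toy unembedding matrix.
decoder_direction = rng.normal(size=d_model)
unembed = rng.normal(size=(d_model, vocab_size))

# Per-token logit effect of writing the feature direction into the residual stream.
logit_effects = decoder_direction @ unembed

# Token ids sorted from most negative to most positive effect.
order = np.argsort(logit_effects)
top_negative = order[:3]
top_positive = order[::-1][:3]
```

Under this reading, the tokens with the largest positive values are those the feature pushes the model toward predicting, and the most negative ones are those it pushes away from.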

Activations Density 0.008%
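To give the density figure a sense of scale, the arithmetic below combines it with the dashboard prompt counts listed earlier (24,576 prompts of 128 tokens each). This is a rough sketch that assumes density means the fraction of tokens on which the feature is active and that it was measured over that prompt set; the page confirms neither, and 0.008% is itself a rounded value.

```python
# Figures taken from the dashboard configuration above.
n_prompts = 24_576
tokens_per_prompt = 128
density_percent = 0.008  # rounded value as displayed

# Total number of tokens across the dashboard prompts.
total_tokens = n_prompts * tokens_per_prompt

# Approximate number of tokens on which the feature would be active,
# under the assumptions stated in the lead-in.
approx_active_tokens = total_tokens * density_percent / 100

print(f"total tokens: {total_tokens:,}")
print(f"approx. active tokens: {approx_active_tokens:.0f}")
```

Under these assumptions the feature would be active on only a few hundred of roughly three million tokens, which is consistent with a very sparse feature.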

No Known Activations