INDEX

Explanations

attends to the token "not" from variations of uncertainty or negation

oai_attention-head · gpt-4o-mini Triggered by @bot

New Auto-Interp

Top Features by Cosine Similarity

Comparing With GEMMA-2-2B @ 0-gemmascope-att-16k

Configuration

google/gemma-scope-2b-pt-att/layer_0/width_16k/average_l0_104

Prompts (Dashboard)

36,864 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

16,384

Data Type

float32

Hook Name

blocks.0.attn.hook_z

Hook Layer

Architecture

jumprelu

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Activation Function

relu

Embeds

PlotsExplanationShow Test FieldDefault Test Text

IFrame

Link

Not in Any Lists

No Comments

Head Attr Weights

0:0.05

1:0.55

2:0.04

3:0.03

4:0.05

5:0.07

6:0.04

7:0.14

Negative Logits

tôi

-0.30

Tetapi

-0.30

Mvh

-0.29

not

-0.28

󠁧

-0.28

actionMode

-0.27

榄

-0.27

Viki

-0.26

adays

-0.26

Kesimpulan

-0.26

POSITIVE LOGITS

oredCriteria

0.35

msgSender

0.31

“

0.28

”

0.28

…

0.28

 sauvages

0.27

前

0.27

WithIOException

0.27

antMatchers

0.27

Activations Density 0.362%

attends to the token "not" from variations of uncertainty or negation

No Comments

No Known Activations