INDEX

Explanations

attends to tokens expressing the concept of "already" from later tokens that indicate a prior occurrence or established state

oai_attention-head · gpt-4o-mini Triggered by @bot

New Auto-Interp

Configuration

google/gemma-scope-9b-pt-att/layer_0/width_16k/average_l0_61

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

16,384

Data Type

float32

Hook Name

blocks.0.attn.hook_z

Hook Layer

Architecture

jumprelu

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Activation Function

relu

Embeds

PlotsExplanationShow Test FieldDefault Test Text

IFrame

Link

Not in Any Lists

No Comments

Head Attr Weights

0:0.02

1:0.02

2:0.02

3:0.04

4:0.02

5:0.02

6:0.08

7:0.07

8:0.01

9:0.01

10:0.01

11:0.07

12:0.02

13:0.24

14:0.26

15:0.02

Negative Logits

 antidesliz

-0.28

 vecino

-0.27

 almofada

-0.27

 razonable

-0.26

 monasterio

-0.26

 createSlice

-0.25

 telefónica

-0.25

 húmedo

-0.25

 tombé

-0.25

 Absicht

-0.25

POSITIVE LOGITS

cola

0.33

cid

0.33

 cime

0.32

 Modifier

0.31

sure

0.31

 Dire

0.31

 jsPsych

0.31

 georg

0.31

cir

0.31

bite

0.31

Activations Density 0.078%

attends to tokens expressing the concept of "already" from later tokens that indicate a prior occurrence or established state

No Comments

No Known Activations