INDEX

Explanations

mentions of specific names or titles

oai_token-act-pair · gpt-3.5-turbo Triggered by @bot

New Auto-Interp

Configuration

jbloom/Gemma-2b-IT-Residual-Stream-SAEs/gemma_2b_it_blocks.12.hook_resid_post_16384

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

HuggingFaceFW/fineweb

Features

16,384

Data Type

float32

Hook Name

blocks.12.hook_resid_post

Hook Layer

Architecture

standard

Context Size

1,024

Dataset

Skylion007/openwebtext

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ethene

-0.71

ethane

-0.64

toluene

-0.56

aniline

-0.54

acetate

-0.52

perus

-0.52

 earnestness

-0.51

ltä

-0.51

lamino

-0.50

Iné

-0.50

POSITIVE LOGITS

RY

0.97

DY

0.97

 dovr

0.95

hy

0.92

hy

0.90

Hy

0.90

Hy

0.90

LY

0.89

Dy

0.89

HY

0.88

Activations Density 0.279%

mentions of specific names or titles

No Comments

No Known Activations