INDEX

Explanations

complex or technical terms

oai_token-act-pair · gpt-3.5-turbo Triggered by @bot

New Auto-Interp

Configuration

jbloom/Gemma-2b-IT-Residual-Stream-SAEs/gemma_2b_it_blocks.12.hook_resid_post_16384

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

HuggingFaceFW/fineweb

Features

16,384

Data Type

float32

Hook Name

blocks.12.hook_resid_post

Hook Layer

Architecture

standard

Context Size

1,024

Dataset

Skylion007/openwebtext

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Siga

-0.55

Desen

-0.52

Filosof

-0.49

Significado

-0.49

Obra

-0.48

Quais

-0.46

História

-0.45

Nesta

-0.45

dataclass

-0.45

tslib

-0.45

POSITIVE LOGITS

exp

1.06

EXP

1.00

Exp

0.99

exp

0.94

Exp

0.92

EXP

0.88

 expon

0.85

xp

0.83

 exponents

0.81

 popoli

0.81

Activations Density 0.138%

complex or technical terms

No Comments

No Known Activations