INDEX

Explanations

steps or instructions for a technical process, likely related to electronics or mechanics

oai_token-act-pair · gpt-3.5-turbo Triggered by @bot

New Auto-Interp

Configuration

jbloom/Gemma-2b-IT-Residual-Stream-SAEs/gemma_2b_it_blocks.12.hook_resid_post_16384

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

HuggingFaceFW/fineweb

Features

16,384

Data Type

float32

Hook Name

blocks.12.hook_resid_post

Hook Layer

Architecture

standard

Context Size

1,024

Dataset

Skylion007/openwebtext

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Carcinogenicity

-0.55

quité

-0.51

 Baillargeon

-0.50

marle

-0.49

ectoria

-0.49

tonode

-0.49

 Bourgoin

-0.48

 Leiter

-0.47

atars

-0.47

 Feier

-0.46

POSITIVE LOGITS

pop

1.35

Pop

1.29

 pops

1.26

 popping

1.23

POP

1.22

 intersper

1.19

 popped

1.18

 encomp

1.17

Pop

1.17

 emphat

1.17

Activations Density 0.073%

steps or instructions for a technical process, likely related to electronics or mechanics

No Comments

No Known Activations