Explanations

terms related to legal or formal conduct and procedures

oai_token-act-pair · gpt-3.5-turbo Triggered by @bot

Configuration

jbloom/Gemma-2b-IT-Residual-Stream-SAEs/gemma_2b_it_blocks.12.hook_resid_post_16384

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

HuggingFaceFW/fineweb

Features

16,384

Data Type

float32

Hook Name

blocks.12.hook_resid_post

Hook Layer

12

Architecture

standard

Context Size

1,024

Dataset

Skylion007/openwebtext

Activation Function

relu
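
The configuration above describes a "standard" architecture with a relu activation. As a rough illustration only, the forward pass of such a sparse autoencoder might look like the sketch below. The function names, the toy weights, and the choice to subtract the decoder bias before encoding are assumptions made for this example; they are not taken from the listed SAE, which has 16,384 features.

```python
def matvec(vec, mat):
    # Multiply a length-n vector by an n x m matrix (list of rows).
    return [sum(vec[i] * mat[i][j] for i in range(len(vec)))
            for j in range(len(mat[0]))]

def sae_forward(x, W_enc, b_enc, W_dec, b_dec):
    # Assumed convention: center the input with the decoder bias first.
    centered = [xi - bi for xi, bi in zip(x, b_dec)]
    pre = [p + b for p, b in zip(matvec(centered, W_enc), b_enc)]
    acts = [max(0.0, p) for p in pre]  # relu keeps only positive pre-activations
    recon = [r + b for r, b in zip(matvec(acts, W_dec), b_dec)]
    return acts, recon

# Hypothetical toy weights: residual width 2, 3 features.
W_enc = [[1.0, -1.0, 0.0], [0.0, 1.0, -1.0]]
b_enc = [0.0, 0.0, 0.0]
W_dec = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]
b_dec = [0.0, 0.0]

acts, recon = sae_forward([2.0, 1.0], W_enc, b_enc, W_dec, b_dec)
print(acts)   # feature activations for one residual-stream vector
print(recon)  # reconstruction of that vector
```

In this toy case only one of the three features fires, which is the kind of sparsity the relu is meant to produce.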

Negative Logits

Token       Logit
"Wat"       -0.46
"eyes"      -0.44
"Wat"       -0.43
"paz"       -0.43
"Più"       -0.43
"Spring"    -0.43
" Jake"     -0.42
"Buck"      -0.42
"Jake"      -0.42
"page"      -0.41

Positive Logits

Token            Logit
"conducted"      1.18
" conduct"       1.16
"conduct"        1.10
" Conduct"       1.10
" CONDUCT"       1.09
" conducts"      1.08
" conducted"     1.06
" Conducted"     1.05
" conducting"    1.02
" Conducting"    0.99
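
Logit lists like the ones above are commonly obtained by projecting a feature's decoder direction through the model's unembedding matrix and ranking the vocabulary by the resulting score. The sketch below assumes that method; the page itself does not state how its values were computed. The helper `top_logit_effects`, the two-dimensional vectors, and all weights are hypothetical, and only the token strings are borrowed from the lists.

```python
def top_logit_effects(decoder_dir, unembed, vocab, k=2):
    # Score each token as the dot product of the decoder direction
    # with that token's column of the unembedding matrix.
    scores = [sum(d * unembed[i][j] for i, d in enumerate(decoder_dir))
              for j in range(len(vocab))]
    ranked = sorted(zip(vocab, scores), key=lambda pair: pair[1])
    most_negative = ranked[:k]
    most_positive = ranked[::-1][:k]
    return most_negative, most_positive

# Hypothetical toy values, not the real Gemma-2b-IT weights.
vocab = [" conduct", "conducted", "Wat", "eyes"]
decoder_dir = [1.0, -0.5]
unembed = [
    [1.0, 0.9, -0.2, -0.3],
    [-0.2, -0.6, 0.4, 0.1],
]

negative, positive = top_logit_effects(decoder_dir, unembed, vocab)
print("most promoted:", [tok for tok, _ in positive])
print("most suppressed:", [tok for tok, _ in negative])
```

Tokens with and without a leading space are separate vocabulary entries, which is why both "conduct" and " conduct" can appear in the same list.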

Activations Density 0.103%
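
For a sense of scale, the density can be turned into an approximate token count. This sketch assumes the density is the fraction of dashboard tokens (24,576 prompts of 128 tokens each) on which the feature is active; the page does not define the figure, and the reported percentage is rounded, so the result is only an estimate.

```python
n_prompts = 24_576
tokens_per_prompt = 128
density = 0.103 / 100  # 0.103% expressed as a fraction

total_tokens = n_prompts * tokens_per_prompt
# Assumption: density = active tokens / total dashboard tokens.
approx_active = round(total_tokens * density)

print(total_tokens)
print(approx_active)
```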

No Known Activations