Explanations

Phrases containing the word "also"; the feature also appears to fire on specific informational details from varied content sources.

oai_token-act-pair · gpt-3.5-turbo · Triggered by @bot

Configuration

jbloom/Gemma-2b-IT-Residual-Stream-SAEs/gemma_2b_it_blocks.12.hook_resid_post_16384

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

HuggingFaceFW/fineweb

Features

16,384

Data Type

float32

Hook Name

blocks.12.hook_resid_post

Hook Layer

12

Architecture

standard

Context Size

1,024

Dataset

Skylion007/openwebtext

Activation Function

relu
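
The configuration above describes a "standard" sparse autoencoder with a ReLU activation reading from blocks.12.hook_resid_post. As a rough sketch of what that means for a single feature, the snippet below computes one feature's activation from one residual-stream vector. The function and variable names are hypothetical, the toy vector has 4 dimensions rather than the model's real width, and subtracting the decoder bias before encoding is an assumption (some SAE implementations skip that step).

```python
def relu(value):
    """Rectified linear unit: negative pre-activations become zero."""
    return max(0.0, value)


def encode_feature(x, w_enc_col, b_enc, b_dec):
    """Activation of a single SAE feature for one residual-stream vector.

    x         : residual-stream vector (list of floats)
    w_enc_col : encoder weights for this feature, same length as x
    b_enc     : encoder bias for this feature (scalar)
    b_dec     : decoder bias, same length as x (assumed to be subtracted first)
    """
    centered = [xi - bi for xi, bi in zip(x, b_dec)]
    pre_activation = sum(c * w for c, w in zip(centered, w_enc_col)) + b_enc
    return relu(pre_activation)


# Toy values, chosen only for illustration.
x = [1.0, 0.0, 2.0, -1.0]
w_enc_col = [0.5, 0.1, 0.25, 0.0]
b_dec = [0.0, 0.0, 0.0, 0.0]
b_enc = -0.5

active = encode_feature(x, w_enc_col, b_enc, b_dec)
inactive = encode_feature([-v for v in x], w_enc_col, b_enc, b_dec)
```

Here the first input clears the bias and yields a positive activation, while the negated input is clipped to zero by the ReLU; that clipping is what makes most features inactive on most tokens.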

Negative Logits

Token        Logit
" karton"    -0.88
" utop"      -0.78
" sement"    -0.77
" territo"   -0.76
" hunde"     -0.74
" silikon"   -0.73
" lemp"      -0.71
" solidar"   -0.71
" kosme"     -0.70
" keramik"   -0.69

Positive Logits

Token        Logit
"also"       0.80
" also"      0.78
" ALSO"      0.71
"también"    0.67
"Also"       0.66
" também"    0.65
" Also"      0.62
" también"   0.62
" også"      0.60
" juga"      0.59

Activations Density 0.251%
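
Activation density is usually the fraction of tokens on which a feature has a nonzero activation. The sketch below shows that calculation and, assuming the 0.251% figure was measured over the dashboard prompts listed in the configuration (24,576 prompts of 128 tokens each), estimates how many tokens that corresponds to. The function name and the toy activation list are hypothetical.

```python
def activation_density(activations):
    """Fraction of tokens on which the feature fires (activation > 0)."""
    active = sum(1 for a in activations if a > 0)
    return active / len(activations)


# Toy example: the feature fires on 1 of 4 tokens.
toy_density = activation_density([0.0, 0.0, 0.5, 0.0])

# Rough scale implied by the page's numbers (assumes density is over dashboard tokens).
n_tokens = 24_576 * 128
approx_active_tokens = round(n_tokens * 0.251 / 100)
```

Under that assumption the feature would be active on roughly 7,900 of about 3.1 million tokens.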

No Known Activations