INDEX

Explanations

references to specific provinces, locations, and events

oai_token-act-pair · gpt-3.5-turbo Triggered by @bot

Configuration

jbloom/Gemma-2b-IT-Residual-Stream-SAEs/gemma_2b_it_blocks.12.hook_resid_post_16384

Prompts (Dashboard): 24,576 prompts, 128 tokens each
Dataset (Dashboard): HuggingFaceFW/fineweb
Features: 16,384
Data Type: float32
Hook Name: blocks.12.hook_resid_post
Hook Layer: 12
Architecture: standard
Context Size: 1,024
Dataset: Skylion007/openwebtext
Activation Function: relu
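As a rough illustration of what a "standard" architecture with a relu activation usually means, the sketch below runs a toy sparse autoencoder forward pass in plain Python. The function names, the toy weights, and the choice to subtract the decoder bias before encoding are assumptions made for illustration. They are not taken from this SAE's actual implementation or weights.

```python
# Minimal sketch of a ReLU sparse autoencoder (SAE) forward pass.
# Toy sizes only: 2-dimensional input, 3 features. All names are hypothetical.

def relu(values):
    """Elementwise max(0, x)."""
    return [max(0.0, v) for v in values]

def matvec(matrix, vector):
    """Multiply a row-major matrix by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]

def sae_encode(x, w_enc, b_enc, b_dec):
    """Feature activations: relu(W_enc @ (x - b_dec) + b_enc).

    Subtracting b_dec first is a common convention, assumed here.
    """
    centered = [xi - bi for xi, bi in zip(x, b_dec)]
    pre_acts = [p + b for p, b in zip(matvec(w_enc, centered), b_enc)]
    return relu(pre_acts)

def sae_decode(features, w_dec, b_dec):
    """Reconstruction: sum_i features[i] * w_dec[i] + b_dec."""
    out = list(b_dec)
    for f, direction in zip(features, w_dec):
        for j, d in enumerate(direction):
            out[j] += f * d
    return out

# Toy weights, one row per feature (made up for this example).
W_ENC = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]
W_DEC = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]
B_ENC = [0.0, 0.0, 0.0]
B_DEC = [0.0, 0.0]

features = sae_encode([2.0, -1.0], W_ENC, B_ENC, B_DEC)
reconstruction = sae_decode(features, W_DEC, B_DEC)
```

In this toy case only the first feature fires, which is the sparsity the relu is meant to produce. In the SAE described above, the feature vector would have 16,384 entries, one per feature.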

Negative Logits

" chong"       -1.29
"kyo"          -1.28
" guarante"    -1.26
"XIA"          -1.26
" bangkok"     -1.26
" increa"      -1.20
" chrysler"    -1.20
" seoul"       -1.19
" chande"      -1.19
" eyel"        -1.18

Positive Logits

"antd"         0.82
"Bei"          0.68
"mybatisplus"  0.65
" China"       0.65
"Bei"          0.64
" Chinese"     0.63
"Dazu"         0.62
"China"        0.62
"Xinhua"       0.61
"Chinese"      0.60
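Logit lists like the ones above are commonly obtained by projecting a feature's decoder direction through the model's unembedding matrix and ranking tokens by the resulting score. The sketch below shows that ranking on a toy vocabulary. The function name, the vectors, and the scores are invented for illustration, and the page itself does not state how its values were computed.

```python
# Hedged sketch: rank tokens by the dot product of a feature's decoder
# direction with each token's unembedding vector. All numbers are toy values.

def logit_effects(decoder_direction, unembed_rows, vocab):
    """Return (token, score) pairs sorted from most positive to most negative."""
    scores = [
        sum(d * u for d, u in zip(decoder_direction, row))
        for row in unembed_rows
    ]
    return sorted(zip(vocab, scores), key=lambda pair: pair[1], reverse=True)

# Hypothetical 2-dimensional example. Leading spaces in tokens are significant.
VOCAB = [" China", " seoul", "the"]
UNEMBED = [[0.5, 0.1], [-1.0, 0.2], [0.0, 0.3]]
DECODER_DIRECTION = [1.0, 0.0]

ranked = logit_effects(DECODER_DIRECTION, UNEMBED, VOCAB)
top_token = ranked[0][0]
bottom_token = ranked[-1][0]
```

The head of the ranked list corresponds to a "positive logits" column and the tail to a "negative logits" column.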

Activations Density 0.254%
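Activation density is usually read as the fraction of tokens on which the feature fires. The sketch below computes that fraction for a toy list and then estimates how many tokens 0.254% would be, on the assumption (not stated on the page) that the density was measured over the 24,576 dashboard prompts of 128 tokens each.

```python
# Hedged sketch: activation density as the share of tokens with activation > 0.

def activation_density(activations):
    """Fraction of entries that are strictly positive."""
    return sum(1 for a in activations if a > 0) / len(activations)

# Toy example: one active token out of four.
toy_density = activation_density([0.0, 1.5, 0.0, 0.0])

# Assumption: density is measured over the dashboard prompts listed above.
total_tokens = 24_576 * 128
approx_active_tokens = round(total_tokens * 0.254 / 100)
```

Under that assumption the feature would be active on roughly 8,000 of about 3.1 million tokens.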

No Known Activations