Explanations

locations or objects related to dens

oai_token-act-pair · gpt-3.5-turbo Triggered by @bot

Configuration

jbloom/Gemma-2b-IT-Residual-Stream-SAEs/gemma_2b_it_blocks.12.hook_resid_post_16384

Prompts (Dashboard): 24,576 prompts, 128 tokens each
Dataset (Dashboard): HuggingFaceFW/fineweb
Features: 16,384
Data Type: float32
Hook Name: blocks.12.hook_resid_post
Hook Layer:
Architecture: standard
Context Size: 1,024
Dataset: Skylion007/openwebtext
Activation Function: relu
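As a rough illustration of what the "standard" architecture with a relu activation computes, the sketch below encodes one residual-stream vector into feature activations. It assumes the common form in which the decoder bias is subtracted from the input before the encoder is applied; whether this particular SAE does so is not stated on this page. The function name sae_encode and the tiny toy weights are hypothetical. The real SAE would have 16,384 features and read from blocks.12.hook_resid_post.

```python
def relu(values):
    """Element-wise ReLU on a list of floats."""
    return [max(0.0, v) for v in values]


def sae_encode(x, W_enc, b_enc, b_dec):
    """Feature activations for a single residual-stream vector.

    x      : list of d_model floats
    W_enc  : d_model rows, each a list of d_sae floats
    b_enc  : list of d_sae floats
    b_dec  : list of d_model floats (assumed to be subtracted before encoding)
    """
    centered = [xi - bi for xi, bi in zip(x, b_dec)]
    pre_acts = [
        sum(centered[i] * W_enc[i][j] for i in range(len(centered))) + b_enc[j]
        for j in range(len(b_enc))
    ]
    return relu(pre_acts)


# Toy example: d_model = 2, d_sae = 3 (illustrative numbers, not real weights).
x = [1.0, -2.0]
W_enc = [[1.0, 0.0, -1.0],
         [0.0, 1.0, 1.0]]
b_enc = [0.0, 0.5, 0.0]
b_dec = [0.0, 0.0]

acts = sae_encode(x, W_enc, b_enc, b_dec)
print(acts)
```

Only the first toy feature has a positive pre-activation here, so it is the only one that fires; the other two are clipped to zero by the ReLU.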

Negative Logits

"raud"         -0.48
" jenem"       -0.45
" familiari"   -0.42
"chwitz"       -0.42
" Vicksburg"   -0.40
" Baird"       -0.39
" Saratoga"    -0.39
" bootstra"    -0.38
" coordinate"  -0.38
"COORD"        -0.38

Positive Logits

"DEN"      1.24
"Den"      1.21
"Den"      1.12
"den"      1.10
"DEN"      1.02
"den"      0.92
" Dens"    0.92
" susun"   0.86
" tanong"  0.84
" dens"    0.82
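Token lists like the ones above are commonly produced by projecting a feature's decoder direction through the model's unembedding matrix and ranking the resulting per-token values. The sketch below shows that ranking step under that assumption; this page does not say how its values were computed. The helper names, the toy vocabulary, and the toy weights are all hypothetical and are not the real Gemma weights.

```python
def logit_effects(decoder_dir, W_U):
    """Project a decoder direction (d_model floats) through an
    unembedding matrix W_U (d_model rows x vocab columns)."""
    vocab_size = len(W_U[0])
    return [
        sum(decoder_dir[i] * W_U[i][t] for i in range(len(decoder_dir)))
        for t in range(vocab_size)
    ]


def top_and_bottom(effects, tokens, k):
    """Return the k most promoted and k most suppressed tokens."""
    ranked = sorted(zip(tokens, effects), key=lambda pair: pair[1], reverse=True)
    top = ranked[:k]
    bottom = ranked[-k:][::-1]  # most negative first
    return top, bottom


# Toy vocabulary and weights, for illustration only.
tokens = ["den", " dens", "raud", " jenem"]
decoder_dir = [1.0, 0.5]
W_U = [[1.0, 0.8, -0.4, -0.3],
       [0.2, 0.1, -0.2, -0.3]]

effects = logit_effects(decoder_dir, W_U)
top, bottom = top_and_bottom(effects, tokens, k=2)
print("promoted:  ", [tok for tok, _ in top])
print("suppressed:", [tok for tok, _ in bottom])
```

Leading spaces in token strings are kept on purpose, since a token with and without a leading space is a different vocabulary entry.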

Activations Density: 0.081%
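To give the density figure a sense of scale, the short calculation below converts it into an approximate token count. It assumes the density is measured over the dashboard prompt set (24,576 prompts of 128 tokens each), which this page does not state explicitly.

```python
# Assumption: density is the fraction of dashboard tokens on which the
# feature has a non-zero activation.
n_prompts = 24_576
tokens_per_prompt = 128
density = 0.081 / 100  # 0.081% expressed as a fraction

total_tokens = n_prompts * tokens_per_prompt
approx_active_tokens = round(total_tokens * density)

print(f"total tokens:         {total_tokens:,}")
print(f"approx active tokens: {approx_active_tokens:,}")
```

Under that assumption the feature would fire on roughly 2,500 of about 3.1 million tokens.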

No Known Activations