INDEX

Explanations

mentions of legal and political issues along with notable events or actions related to social or public figures

oai_token-act-pair · gpt-3.5-turbo

words that indicate a strong personal attachment or possession, often seen by the presence of personal pronouns or possessive adjectives

oai_token-act-pair · gpt-3.5-turbo Triggered by @danbraun

New Auto-Interp

Configuration

neuronpedia/gpt2-small__res_scefr-ajt/6-res_scefr-ajt

Prompts (Dashboard)

12,288 prompts, 128 tokens each

Dataset (Dashboard)

Skylion007/openwebtext

Features

46,080

Data Type

torch.float32

Hook Point

blocks.6.hook_resid_pre

Architecture

standard

Context Size

128

Dataset

apollo-research/Skylion007-openwebtext-tokenizer-gpt2

Hook Point Layer

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 guiActiveUn

-0.07

é¾

-0.07

ĳå£«

-0.06

elsius

-0.06

oÄŁ

-0.06

 Azerb

-0.06

ĪĴ

-0.06

©¶æ

-0.06

ĵĺ

-0.06

¿½

-0.05

POSITIVE LOGITS

the

0.08

0.07

↵

0.07

and

0.07

in

0.07

to

0.07

is

0.07

of

0.07

Activations Density 3.390%

mentions of legal and political issues along with notable events or actions related to social or public figures

words that indicate a strong personal attachment or possession, often seen by the presence of personal pronouns or possessive adjectives

No Comments

No Known Activations

mentions of legal and political issues along with notable events or actions related to social or public figures

words that indicate a strong personal attachment or possession, often seen by the presence of personal pronouns or possessive adjectives

No Comments

No Known Activations