Explanations

sexual arousal and explicit content

This neuron detects explicit pornographic content, especially graphic sexual actions, body parts, and erotic terminology.

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Negative Logits

Token (quoted to show leading spaces)    Logit

" Keim"      -1.21
" this"      -1.17
" summer"    -1.11
"ּוֹ"          -1.10
"וּ"          -1.08
" people"    -1.06
"it"         -1.03
" бампер"    -0.99
"</em>"      -0.98
"fin"        -0.95

Positive Logits

Token (quoted to show leading spaces)    Logit

"久しぶりの"     1.34
"ׇ"            1.32
"Spesifikasi"  1.24
" сахара"      1.24
"ヌーピー"       1.18
" abriu"       1.16
" белье"       1.16
" homicidio"   1.13
" држа"        1.13
"ṝ"            1.13

Activations Density 0.151%
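
To give the density figure a sense of scale, the sketch below combines it with the prompt configuration listed above. It rests on two assumptions the page does not state: that the density was measured over these same 24,576 prompts, and that it means the fraction of token positions where the neuron has a non-zero activation. The function and constant names are illustrative only.

```python
# Figures copied from the Configuration and Activations Density lines.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY = 0.151 / 100  # 0.151% expressed as a fraction


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total token positions covered by the dashboard prompts."""
    return num_prompts * tokens_per_prompt


def estimated_active_tokens(total: int, density: float) -> int:
    """Rough count of token positions where the neuron fires.

    Assumption: density is the fraction of token positions with a
    non-zero activation, measured over the same prompts.
    """
    return round(total * density)


total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = estimated_active_tokens(total, ACTIVATION_DENSITY)
print(f"total tokens: {total:,}")
print(f"estimated active tokens: {active:,}")
```

If those assumptions hold, the neuron would fire on only a few thousand of roughly three million token positions, which fits a narrowly selective feature.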