INDEX

Explanations

not real or fake

The neuron responds to words that signal something is false, a hoax, prank, conspiracy, or otherwise not genuine.

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

 Stupid

-1.05

 praticamente

-1.04

 ゆう

-1.02

 プール

-0.99

 īpa

-0.96

 wonderfully

-0.94

 装修

-0.93

uresh

-0.92

 adorned

-0.92

 soooo

-0.91

POSITIVE LOGITS

 just

1.53

 only

1.40

 merely

1.38

 csak

1.20

 hanya

1.17

 только

1.16

 simplemente

1.13

 лишь

1.09

only

1.05

ϳ

1.05

Activations Density 0.035%