INDEX

Explanations

disguise

The neuron activates on words referring to disguise or camouflage.

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

-3.34

-2.66

荦

-2.42

釤

-2.31

-2.30

-2.27

 auft

-2.25

嵛

-2.25

-2.22

-2.19

POSITIVE LOGITS

’

2.56

 voegen

2.56

狆

2.53

 vertellen

2.36

 trekken

2.23



2.19

 voeren

2.16

 Ejecutivo

2.13

SocketChannel

2.13

 voelen

2.11

Activations Density 0.004%