Explanations

states or consequences

The neuron detects words and phrases denoting bureaucratic or political actions, errors, or maneuvers.

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Negative Logits

Token              Logit
"probably"         -1.43
" Irlanda"         -1.32
" probablement"    -1.31
" marea"           -1.31
" wahrscheinlich"  -1.28
" pirata"          -1.27
" regalar"         -1.24
" sûrement"        -1.23
"gether"           -1.19
" durer"           -1.18

Positive Logits

Token               Logit
"or"                2.64
" more"             1.88
"as"                1.87
" with"             1.73
" some"             1.60
"veja"              1.40
" profundos"        1.38
" svært"            1.35
" considerazione"   1.34
" administrativos"  1.34
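
The logit lists above rank the vocabulary tokens whose output logits the neuron pushes down or up the most. The page does not say how these values are computed. The sketch below assumes a common approach: project the neuron's output direction through the model's unembedding matrix, then sort the resulting per-token effects. The function names, the toy weights, and the three-token vocabulary are all hypothetical and are not taken from the dashboard.

```python
def logit_effects(output_direction, unembedding):
    """Project an output direction onto the vocabulary.

    output_direction: list of floats, one per hidden dimension (hypothetical).
    unembedding: list of rows, one per hidden dimension; each row holds one
                 weight per vocabulary token (hypothetical layout).
    Returns one logit effect per vocabulary token.
    """
    vocab_size = len(unembedding[0])
    return [
        sum(d * row[v] for d, row in zip(output_direction, unembedding))
        for v in range(vocab_size)
    ]


def top_and_bottom(effects, vocab, k):
    """Return the k most positive and k most negative (token, effect) pairs."""
    ranked = sorted(zip(vocab, effects), key=lambda pair: pair[1])
    positive = ranked[-k:][::-1]  # largest effect first
    negative = ranked[:k]         # most negative effect first
    return positive, negative


# Toy example with made-up numbers, for illustration only.
toy_vocab = ["or", " more", "probably"]
toy_direction = [1.0, -1.0]
toy_unembedding = [
    [2.0, 0.0, -1.0],
    [0.0, 1.0, 0.5],
]

effects = logit_effects(toy_direction, toy_unembedding)
positive, negative = top_and_bottom(effects, toy_vocab, k=1)
```

Under this reading, a token with a large positive value becomes more likely when the neuron fires, and a token with a large negative value becomes less likely.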

Activations Density 0.319%
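
To put the density figure in context, the sketch below works out how many tokens it would correspond to. It makes two assumptions the page does not state: that density means the fraction of tokens on which the neuron has a nonzero activation, and that it is measured over the prompt set listed under Configuration (24,576 prompts of 128 tokens each). The helper `activation_density` is hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds `threshold`.

    activations: list of prompts, each a list of per-token activation values.
    This definition is an assumption, not something the dashboard specifies.
    """
    total = 0
    active = 0
    for prompt in activations:
        for value in prompt:
            total += 1
            if value > threshold:
                active += 1
    return active / total if total else 0.0


# Figures copied from the dashboard.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
DENSITY = 0.319 / 100  # 0.319%

TOTAL_TOKENS = NUM_PROMPTS * TOKENS_PER_PROMPT
# Approximate number of tokens with a nonzero activation, under the
# assumptions stated above.
approx_active_tokens = round(TOTAL_TOKENS * DENSITY)
```

Under these assumptions the neuron fires on roughly 10,000 of about 3.1 million tokens, which makes it a sparse feature.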