INDEX

Explanations

rebel, rebellion, defiant attitude

The neuron strongly activates on terms expressing nonconformity or rebellion.

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

罒

-0.92

alogy

-0.89

čila

-0.89

celotti

-0.87

 régl

-0.87

 مساب

-0.84

 ſen

-0.84

keyup

-0.84

amkeit

-0.82

 morons

-0.82

POSITIVE LOGITS

 rebel

3.59

 rebellion

3.23

 rebellious

3.20

rebel

2.98

 rebels

2.91

 Rebel

2.53

Rebel

2.44

 defiance

2.22

 Rebellion

2.22

 revolt

2.22

Activations Density 0.094%