INDEX
Explanations
rules, bans, regulations
The neuron fires on phrases describing government-imposed restrictions or bans on using an app on official devices.
Negative Logits
exists      -0.06
>Total      -0.06
Super       -0.06
"W          -0.06
ArrayOf     -0.06
brief       -0.05
网          -0.05
ETYPE       -0.05
.radians    -0.05
Fly         -0.05
Positive Logits
стак          0.07
reservation   0.07
prolifer      0.07
owers         0.07
.Butter       0.07
amento        0.06
uckland       0.06
direct        0.06
_PE           0.06
Russell       0.06
Activations Density: 0.006%