Explanations
references to obedience and compliance with authority
Negative Logits
Token       Logit
Lomb        -0.16
.infinity   -0.15
olio        -0.15
elly        -0.15
Coal        -0.15
ãĤ          -0.15
/Table      -0.15
ÑĦÑĢан     -0.14
coal        -0.14
acob        -0.14
Positive Logits
Token       Logit
inth        0.16
fully       0.14
.ls         0.14
ná          0.14
Kra         0.14
istra       0.14
amus        0.13
Å¥          0.13
arsers      0.13
ews         0.13
Activations Density: 0.061%