INDEX
Explanations
phrases related to compliance and non-compliance in a legal or authoritarian context
New Auto-Interp
Negative Logits
Taj
-0.15
uars
-0.15
è±
-0.14
891
-0.14
³
-0.14
abox
-0.14
inin
-0.14
phinx
-0.14
éľĩ
-0.14
iaux
-0.13
POSITIVE LOGITS
ναν
0.15
INDOW
0.15
ETY
0.14
brook
0.14
brtc
0.14
otel
0.14
Rubin
0.14
رÙģ
0.14
BCH
0.14
Ped
0.14
Activations Density 0.027%