Explanations
phrases related to authority and compliance
Negative Logits
Token       Logit
itemprop    -0.16
BITS        -0.16
SPA         -0.15
illon       -0.15
spa         -0.15
osta        -0.15
ese         -0.15
forman      -0.15
-Clause     -0.14
kelig       -0.14
Positive Logits
Token       Logit
Gent        0.17
anka        0.17
authority   0.16
坊          0.15
руб         0.15
gent        0.15
wil         0.15
ught        0.14
orders      0.14
rial        0.14
Activations Density 0.443%