INDEX
Explanations
phrases related to the activation or deactivation of systems or devices
New Auto-Interp
Negative Logits
ënten
-0.44
atrici
-0.43
erweise
-0.43
leven
-0.42
ulity
-0.41
…
-0.39
[
-0.39
тивні
-0.39
co
-0.39
mentaux
-0.39
POSITIVE LOGITS
المعيارى
1.01
ConstraintMaker
0.89
Monfieur
0.78
Cæsar
0.76
AndEndTag
0.75
QMetaType
0.75
myſelf
0.74
فريبيس
0.73
Италијани
0.72
MenuView
0.71
Activations Density 0.283%