INDEX
Explanations
mentions of specific individuals in leadership positions
New Auto-Interp
Negative Logits
erved
-0.48
Shared
-0.48
CWE
-0.48
Par
-0.47
logo
-0.47
dolo
-0.45
ilan
-0.45
abord
-0.45
Army
-0.45
lores
-0.44
POSITIVE LOGITS
personalmente
0.90
personally
0.88
persönlich
0.85
GenerationType
0.79
himſelf
0.77
himself
0.76
UnifiedTopology
0.70
principalTable
0.70
RTDA
0.69
CppMethod
0.68
Activations Density 0.378%