Explanations
mentions of individuals or entities associated with leadership roles or titles
Negative Logits
Token    Logit
aliz     -0.17
ovice    -0.16
APTER    -0.16
al       -0.16
eca      -0.15
zung     -0.15
tring    -0.15
iang     -0.15
arov     -0.15
alis     -0.15
Positive Logits
Token    Logit
olution  0.24
arez     0.23
antage   0.21
ãĤ©      0.21
à¥įह     0.19
erson    0.19
ins      0.18
illage   0.18
olumes   0.18
antages  0.18
Activations Density: 0.047%