INDEX
Explanations
mentions of geographical locations and countries
Negative Logits
PERT        -0.18
alink       -0.16
Bow         -0.16
agan        -0.15
oda         -0.15
ption       -0.15
param       -0.15
iete        -0.15
Wy          -0.15
blockade    -0.15
Positive Logits
TRACE       0.16
VERSE       0.15
venile      0.14
vince       0.14
culus       0.14
Cord        0.14
reuse       0.14
occasion    0.14
NAV         0.14
212         0.13
Activations Density 0.020%