INDEX
Explanations
words related to geographical locations or proper nouns
terms associated with conflicts or issues in specific regions, particularly geopolitical references
New Auto-Interp
Negative Logits
panic
-0.69
roup
-0.66
lio
-0.62
porter
-0.57
Gaul
-0.57
gress
-0.56
oshenko
-0.55
irlf
-0.54
INTON
-0.51
tti
-0.51
POSITIVE LOGITS
ciating
0.61
anu
0.57
art
0.55
unden
0.53
ruct
0.53
iable
0.52
ruption
0.51
zhen
0.51
unpop
0.50
predec
0.50
Activations Density 0.229%