INDEX
Explanations
instances of specific geographical terms or names in the text
New Auto-Interp
Negative Logits
elo
-0.18
γή
-0.17
ledon
-0.17
IDI
-0.16
elib
-0.16
eli
-0.16
ingly
-0.15
uien
-0.14
jt
-0.14
nex
-0.14
POSITIVE LOGITS
pread
0.24
spread
0.22
spread
0.20
Spread
0.19
Freed
0.17
Spread
0.17
spreading
0.16
gons
0.16
Loren
0.16
spre
0.15
Activations Density 0.007%