Explanations
mentions of regions or locations, especially countries
occurrences of the word "the" in various contexts
Negative Logits
Token    Logit
Luffy    -0.74
fy       -0.70
raped    -0.70
vg       -0.64
fully    -0.63
_-       -0.63
BIL      -0.63
Goku     -0.63
LY       -0.61
uala     -0.61
Positive Logits
Token        Logit
globe        1.38
country      1.25
nation       1.10
world        1.08
continent    1.01
province     0.98
region       0.97
Midwest      0.96
spectrum     0.95
countryside  0.93
Activations Density 0.159%
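
The density figure above is most naturally read as the fraction of sampled token positions on which the feature fires at all. The sketch below illustrates that reading only; the function name `activation_density`, the `threshold` parameter, and the sample values are hypothetical and are not taken from the dashboard or its tooling.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of sampled tokens with a
    non-zero (above-threshold) activation for this feature.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Made-up activations: the feature fires on 3 of 1000 token positions.
sample = [0.0] * 997 + [0.4, 1.2, 2.5]
density = activation_density(sample)
print(f"Activations Density {density:.3%}")
```

Under this reading, a density of 0.159% would mean the feature is active on roughly 16 of every 10,000 sampled tokens.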