INDEX
Explanations
countries
references to countries and their contextual issues
New Auto-Interp
Negative Logits
rious
-0.77
hement
-0.71
henko
-0.63
arsity
-0.63
Nicarag
-0.60
letes
-0.59
zes
-0.59
Afee
-0.58
Strength
-0.58
Dalai
-0.58
POSITIVE LOGITS
classrooms
0.91
where
0.79
today
0.77
illegally
0.75
alone
0.72
airspace
0.71
abouts
0.69
jails
0.68
countryside
0.66
folklore
0.65
Activations Density 0.256%