INDEX
Explanations
terms related to refugees and immigration
occurrences of the term "huge"
Negative Logits
Token        Logit
istically    -0.99
istic        -0.73
icum         -0.65
mac          -0.64
Skydragon    -0.63
izes         -0.63
mates        -0.61
behold       -0.60
netflix      -0.57
stown        -0.57
Positive Logits
Token        Logit
llo          0.87
mble         0.86
xual         0.84
lli          0.84
es           0.80
Polo         0.77
olitan       0.77
yre          0.75
hire         0.75
rique        0.74
Activations Density 0.103%