INDEX
Explanations
"The" followed by US entities
Negative Logits
degree        0.44
others        0.44
described     0.43
directly      0.43
willingly     0.43
impress       0.41
postulated    0.41
molte         0.41
malice        0.40
아니고        0.40
Positive Logits
Fédération    0.50
Wochschr      0.45
skies         0.45
Archdiocese   0.45
United        0.44
United        0.43
Organización  0.42
rollout       0.42
мире          0.41
الأمريكية     0.40
Activations Density: 0.004%