INDEX
Explanations
references to travel and geographic locations
New Auto-Interp
Negative Logits
Jamaica
-0.17
Jama
-0.17
Hawai
-0.16
Hawaiian
-0.16
Trinidad
-0.15
México
-0.15
Hawaii
-0.14
wort
-0.14
Pu
-0.14
Ring
-0.14
POSITIVE LOGITS
Moz
0.37
Angola
0.32
moz
0.29
MOZ
0.28
Ang
0.28
Ang
0.27
Cab
0.26
MPL
0.26
Portuguese
0.26
_ang
0.25
Activations Density 0.015%