INDEX
Explanations
locations or directions mentioned in a document
New Auto-Interp
Negative Logits
TED
-0.84
endi
-0.77
staking
-0.76
ATURE
-0.74
ulous
-0.71
atis
-0.70
natureconservancy
-0.70
hement
-0.70
wcsstore
-0.69
ration
-0.69
POSITIVE LOGITS
hemisphere
1.12
side
1.08
western
1.07
most
1.06
Africa
1.04
Hemisphere
1.03
Asia
1.03
ward
0.98
coast
0.97
Side
0.96
Activations Density 2.725%