INDEX
Explanations
directional words related to geographic locations
directional words related to movement
New Auto-Interp
Negative Logits
nutshell
-0.68
iqueness
-0.63
peppers
-0.62
ollar
-0.62
urrection
-0.61
ggles
-0.61
ettle
-0.61
ulous
-0.60
popup
-0.60
uracy
-0.60
POSITIVE LOGITS
toward
1.07
towards
1.02
wards
0.98
WARD
0.88
ward
0.85
unnoticed
0.83
stairs
0.79
into
0.78
travel
0.78
wind
0.76
Activations Density 0.146%