INDEX
Explanations
words related to distance or progression away from a point
phrases indicating increasing distances or separations
New Auto-Interp
Negative Logits
LOT
-0.62
kes
-0.61
urated
-0.60
Swap
-0.58
TS
-0.57
RY
-0.57
ASP
-0.56
eson
-0.56
pport
-0.55
Stick
-0.53
POSITIVE LOGITS
away
1.30
inland
1.21
apart
1.15
into
1.10
away
1.08
upstream
1.04
distances
1.02
downstream
1.01
forward
1.00
Away
0.96
Activations Density 0.060%