INDEX
Explanations
paths and directions expressed in a narrative context
sequences of conjunctions and prepositions indicating movement or direction
New Auto-Interp
Negative Logits
Adapt
-0.66
dozen
-0.64
Respons
-0.63
Effective
-0.63
kes
-0.62
icult
-0.62
eal
-0.62
zman
-0.60
UF
-0.59
unique
-0.59
POSITIVE LOGITS
into
1.78
onto
1.78
INTO
1.53
into
1.40
Into
1.39
toward
1.28
onward
1.27
thence
1.25
onwards
1.22
towards
1.19
Activations Density 0.249%