INDEX
Explanations
references to movement downwards
occurrences of the word "down."
Negative Logits
cius      -0.68
ificent   -0.66
anooga    -0.65
ista      -0.63
#$        -0.62
ament     -0.62
ature     -0.61
Pros      -0.58
ctic      -0.58
gadget    -0.58
POSITIVE LOGITS
stairs    1.37
graded    1.14
stairs    1.09
grading   1.07
LOAD      1.07
river     1.00
grades    0.96
hill      0.95
pour      0.91
played    0.90
Activations Density 0.041%