INDEX
Explanations
prepositions
instances of the word "down," indicating decreases or reductions
New Auto-Interp
Negative Logits
9999
-0.65
tein
-0.65
://
-0.64
#$
-0.63
ature
-0.63
âģ
-0.63
ĸļ
-0.61
Huck
-0.60
cius
-0.60
inka
-0.59
POSITIVE LOGITS
stairs
1.26
graded
1.14
grading
1.05
grades
0.94
river
0.88
pour
0.86
hill
0.85
stairs
0.85
LOAD
0.85
fall
0.81
Activations Density 0.056%