INDEX
Explanations
instances of the word "down" in sentences
instances of the phrase "sat down."
New Auto-Interp
Negative Logits
velt
-0.77
[+
-0.65
âģ
-0.65
Gs
-0.65
Carbuncle
-0.63
/
-0.62
Antar
-0.61
ugh
-0.59
Whitman
-0.59
aha
-0.59
POSITIVE LOGITS
stairs
1.21
hill
0.86
LOAD
0.81
pour
0.75
dates
0.75
stairs
0.74
ash
0.71
stem
0.71
cloth
0.71
grading
0.70
Activations Density 0.029%