INDEX
Explanations
phrases where the word "up" is emphasized or repeated
instances of the word "up."
Negative Logits
³³³³³³³³³³³³³³³³          -0.65
________________________  -0.65
blinded                   -0.63
segregated                -0.61
num                       -0.60
debtor                    -0.60
Manson                    -0.59
âĸ¬âĸ¬                    -0.59
\\\\\\\\\\\\\\\\          -0.59
mileage                   -0.59
Positive Logits
dates   1.43
stairs  1.26
olicy   1.17
dating  1.03
grades  1.03
graded  1.01
icult   1.00
rising  0.95
odcast  0.94
grade   0.94
Activations Density 0.027%