INDEX
Explanations
occurrences of the word "up" and its variations
"up" preceding "to"
New Auto-Interp
Negative Logits
DoubleQuotes
-0.88
jsonwebtoken
-0.83
متعلقه
-0.82
Efq
-0.81
itſelf
-0.80
pleaſure
-0.79
faſt
-0.77
themſelves
-0.74
neſs
-0.72
ſtill
-0.72
POSITIVE LOGITS
down
0.78
down
0.68
Down
0.67
Down
0.63
DOWN
0.58
up
0.55
Up
0.51
Up
0.50
downs
0.48
up
0.47
Activations Density 0.060%