INDEX
Explanations
terms related to upcoming events
the word "Up" in various contexts
Negative Logits
vain        -0.76
plain       -0.72
judgment    -0.69
recess      -0.68
sarc        -0.68
disgust     -0.68
sanct       -0.67
deserts     -0.67
bore        -0.66
medi        -0.66
Positive Logits
Up          3.49
up          2.03
UP          1.99
Down        1.82
ups         1.63
Up          1.60
Upgrade     1.34
DOWN        1.33
Forward     1.30
UP          1.22
Activations Density: 0.010%