INDEX
Explanations
words related to upward movement or direction
occurrences of the word "up."
New Auto-Interp
Negative Logits
âĸ¬âĸ¬
-0.65
mia
-0.59
Emanuel
-0.58
士
-0.58
Synopsis
-0.58
understatement
-0.58
scapego
-0.57
keyword
-0.56
Closure
-0.56
¯¯¯¯
-0.55
POSITIVE LOGITS
stairs
1.10
rights
1.02
river
0.97
stage
0.89
stairs
0.87
raised
0.86
ris
0.84
np
0.81
ornia
0.81
erd
0.80
Activations Density 0.077%