INDEX
Explanations
phrases or words related to being straightforward or directly stated
the word "straight" in various contexts
New Auto-Interp
Negative Logits
Lauder
-0.72
adle
-0.70
è¦ļéĨĴ
-0.68
orage
-0.68
healthy
-0.65
onics
-0.64
akings
-0.64
Vu
-0.62
Kard
-0.60
Memories
-0.60
POSITIVE LOGITS
straight
0.96
straight
0.93
\\\\\\\\
0.85
Stra
0.83
Straight
0.83
lined
0.79
bent
0.77
forward
0.74
ibur
0.73
away
0.72
Activations Density 0.007%