INDEX
Explanations
phrases related to introspection or reflection
instances of the verb "look" in various contexts
Negative Logits
accompan          -0.70
icipated          -0.61
unfocusedRange    -0.61
Palestin          -0.60
ä¹                -0.60
shown             -0.59
collaps           -0.58
estern            -0.57
uala              -0.57
weather           -0.57
Positive Logits
favorably         0.86
forward           0.80
ahead             0.77
inward            0.76
elsewhere         0.76
upon              0.76
toward            0.75
foolish           0.74
backward          0.73
kindly            0.72
Activations Density: 0.050%