Explanations
phrases urging the reader to investigate or examine something
repeated instances of the verb "look."
Negative Logits
theless     -0.66
Cele        -0.65
cffff       -0.58
cape        -0.57
âĹ¼         -0.56
CENT        -0.56
kson        -0.55
ä¹          -0.55
Skill       -0.55
Palestin    -0.55
Positive Logits
ahead       0.96
izons       0.85
favorably   0.79
suspic      0.77
ression     0.77
ãĤ¶         0.76
forward     0.72
into        0.71
ocene       0.69
closely     0.69
Activations Density 0.073%
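
The source does not say how the density figure is defined. A common reading, assumed here, is the fraction of sampled token positions on which the feature's activation is above zero. The sketch below illustrates that reading only; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and do not come from the page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    feature fires. The source page does not confirm this definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 7 of which have a non-zero activation.
sample = [0.0] * 9993 + [1.2, 0.4, 3.1, 0.9, 2.2, 0.1, 5.0]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.073% would mean the feature fires on roughly 7 of every 10,000 tokens, which is consistent with a feature tied to a narrow lexical pattern.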