INDEX
Explanations
instances where the action of looking or searching for something is mentioned
instances of the word "look" and its variations
New Auto-Interp
Negative Logits
âĹ¼
-0.68
theless
-0.66
Failure
-0.65
RIC
-0.63
Els
-0.63
accompan
-0.61
kson
-0.61
rots
-0.59
CENT
-0.57
versive
-0.57
POSITIVE LOGITS
closely
0.97
behind
0.85
around
0.84
inside
0.84
elsewhere
0.83
carefully
0.82
ahead
0.79
up
0.79
deeper
0.78
into
0.78
Activations Density 0.052%