INDEX
Explanations
verbs related to investigating or examining
instances of the verbs "look" and "looking"
New Auto-Interp
Negative Logits
âĹ¼
-0.67
theless
-0.65
Ranked
-0.64
panic
-0.64
understatement
-0.60
Own
-0.59
Osw
-0.59
hugs
-0.59
âĢ¢âĢ¢
-0.59
CENT
-0.57
POSITIVE LOGITS
into
0.91
suspic
0.78
forward
0.77
INTO
0.76
izons
0.74
unsuccessfully
0.74
closely
0.73
diligently
0.68
ression
0.67
oward
0.66
Activations Density 0.050%