Explanations
phrases indicating the act of examining or reviewing something closely
Negative Logits
Appearance     -0.25
Appearance     -0.24
appearance     -0.23
appearances    -0.22
appearance     -0.22
appearing      -0.20
Appears        -0.20
appears        -0.20
appeared       -0.19
appear         -0.19
Positive Logits
look           0.67
look           0.56
Look           0.53
Look           0.49
LOOK           0.48
_look          0.47
looks          0.45
.look          0.45
looked         0.42
looking        0.40
Activations Density 0.051%