INDEX
Explanations
expressions of gratitude and enthusiasm related to teamwork and community involvement
New Auto-Interp
Negative Logits
don
-0.15
iston
-0.14
isted
-0.14
λι
-0.14
öh
-0.14
æľī人
-0.14
avigation
-0.13
beer
-0.13
DON
-0.13
etwork
-0.13
POSITIVE LOGITS
look
0.71
looking
0.58
Look
0.57
look
0.55
looks
0.53
Look
0.51
Looking
0.50
LOOK
0.48
.look
0.46
looking
0.46
Activations Density 0.123%