INDEX
Explanations
words related to sports, games, and competition
expressions related to confidence and performance in sports
New Auto-Interp
Negative Logits
Citiz
-0.68
DATA
-0.67
obos
-0.66
bris
-0.63
sels
-0.62
horrors
-0.59
ãĥ©ãĥ³
-0.58
scant
-0.57
NAME
-0.56
bidder
-0.55
POSITIVE LOGITS
.''
1.35
."
1.25
laughs
1.21
.�
1.14
,''
1.09
.
1.08
because
1.06
,"
1.03
.'
1.01
[
0.98
Activations Density 0.420%