INDEX
Explanations
proper nouns and specific names related to sports events and personalities
New Auto-Interp
Negative Logits
ibar
-0.14
úp
-0.14
vens
-0.14
Sutton
-0.14
roc
-0.14
ilon
-0.14
ún
-0.14
há
-0.13
fern
-0.13
hta
-0.13
POSITIVE LOGITS
himself
0.19
brothers
0.18
-san
0.15
isser
0.15
Brothers
0.15
ová
0.15
ripple
0.15
Ñıв
0.15
sisters
0.14
Tw
0.14
Activations Density 0.636%