INDEX
Explanations
mentions of specific sports events and achievements
Negative Logits
ÏĦεί      -0.15
Ïİνα      -0.14
Popular   -0.14
ottage    -0.14
legen     -0.13
wid       -0.13
_OBJC     -0.13
åĩī       -0.13
ÑĢедиÑĤ  -0.13
VICE      -0.13
Positive Logits
ạch       0.17
ington    0.15
visiting  0.15
kea       0.15
Heal      0.14
INGTON    0.14
unc       0.14
ilos      0.14
ant       0.14
ç¶        0.14
Activations Density: 0.088%