INDEX
Explanations
mentions of medals and awards in sports contexts
New Auto-Interp
Negative Logits
airo
-0.18
itesse
-0.14
ilty
-0.14
gebung
-0.14
ä¿Ĭ
-0.14
Garland
-0.14
ORMAL
-0.14
taÅŁ
-0.14
alarından
-0.14
orris
-0.13
POSITIVE LOGITS
Wolfe
0.15
yz
0.15
iless
0.14
anan
0.14
ibri
0.14
ocl
0.14
sung
0.14
ersiz
0.14
ustral
0.14
incible
0.13
Activations Density 0.021%