INDEX
Explanations
names related to sports teams and their locations
Negative Logits
ä¹Ļ       -0.17
onna      -0.16
onne      -0.16
inne      -0.15
ÑĨÑı      -0.14
ÑĮÑı      -0.14
Reports   -0.14
tsky      -0.13
ç¶        -0.13
orm       -0.13
Positive Logits
ichel     0.16
å¹²       0.16
acha      0.16
usat      0.15
ery       0.14
oj        0.14
iper      0.14
ach       0.14
ayo       0.14
iones     0.14
Activations Density 0.020%
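
Several tokens in the logit lists (for example ä¹Ļ, å¹² and ÑĨÑı) look like byte-level BPE vocabulary strings rather than readable text. The page does not name the tokenizer, so the following is only a minimal sketch under the assumption that it uses the GPT-2 style byte-to-character table; the helper names `bytes_to_unicode` and `decode_token` are chosen here for illustration.

```python
def bytes_to_unicode():
    """Assumed GPT-2 style table: each byte 0..255 maps to a printable stand-in character."""
    # Bytes that are already printable keep their own code point.
    keep = list(range(33, 127)) + list(range(161, 173)) + list(range(174, 256))
    table = {b: chr(b) for b in keep}
    # Every remaining byte is assigned the next free code point from 256 upward.
    offset = 0
    for b in range(256):
        if b not in table:
            table[b] = chr(256 + offset)
            offset += 1
    return table


# Inverse table: stand-in character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token):
    """Turn a vocabulary string back into text; incomplete UTF-8 becomes U+FFFD."""
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ä¹Ļ", "å¹²", "ÑĨÑı", "ç¶", "onna"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

If the assumption holds, ä¹Ļ decodes to 乙, å¹² to 干, and ÑĨÑı to ця, while ç¶ is only a fragment of a multi-byte character and cannot be decoded on its own. Plain ASCII tokens such as onna come back unchanged.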