INDEX
Explanations
names of sports teams and geographical locations
New Auto-Interp
Negative Logits
osu
-0.17
osphere
-0.17
ôme
-0.15
Aub
-0.15
idar
-0.15
Hague
-0.14
Wak
-0.14
Tra
-0.14
Hus
-0.14
Doll
-0.14
POSITIVE LOGITS
San
0.32
San
0.29
SAN
0.26
san
0.24
SAN
0.22
SA
0.21
Spurs
0.21
Сан
0.20
_san
0.20
san
0.18
Activations Density 0.010%