INDEX
Explanations
references to major sporting events, particularly in football (soccer)
New Auto-Interp
Negative Logits
itur
-0.17
agua
-0.17
ugas
-0.15
istem
-0.15
&o
-0.15
ughters
-0.14
itten
-0.14
okin
-0.14
administr
-0.14
pus
-0.14
POSITIVE LOGITS
cxx
0.18
SError
0.15
lav
0.15
mac
0.14
Tro
0.14
гÑĢÑĥз
0.14
ì¡
0.13
vw
0.13
aggio
0.13
llen
0.13
Activations Density 0.044%