INDEX
Explanations
references to football as a sport
New Auto-Interp
Negative Logits
enda
-0.15
844
-0.15
acea
-0.15
.Modules
-0.14
Hitch
-0.14
iale
-0.14
еÑģÑĮ
-0.14
adele
-0.14
á»Ļp
-0.14
ura
-0.14
POSITIVE LOGITS
er
0.27
s
0.22
ers
0.22
ing
0.20
erman
0.19
/base
0.19
erland
0.17
ered
0.17
erre
0.16
/Base
0.16
Activations Density 0.018%