INDEX
Explanations
references to baseball statistics and game outcomes
New Auto-Interp
Negative Logits
incentiv
-0.81
Whilst
-0.67
Whilst
-0.66
impactful
-0.62
Amongst
-0.60
menuStrip
-0.59
kasarigan
-0.58
-0.57
namelijk
-0.57
Datuak
-0.56
POSITIVE LOGITS
muß
0.58
idéia
0.52
daß
0.50
läßt
0.49
<=",
0.48
Schluß
0.47
Moslem
0.45
faßt
0.44
>=",
0.43
mußte
0.43
Activations Density 0.399%