INDEX
Explanations
dates and scores related to sports games
instances of the word "in" signifying location or context
New Auto-Interp
Negative Logits
issance
-0.74
emis
-0.72
ect
-0.71
10000
-0.70
76561
-0.69
NOW
-0.68
channelAvailability
-0.65
Versions
-0.65
essor
-0.64
emporary
-0.64
POSITIVE LOGITS
overtime
1.03
front
1.00
ked
0.95
favor
0.93
unison
0.91
emph
0.91
spite
0.90
heartbreaking
0.87
Week
0.85
Round
0.84
Activations Density 0.102%