INDEX
Explanations
statements or descriptions using the word "a" or "an" followed by a noun
the letter 'a'
New Auto-Interp
Negative Logits
Allied
-0.71
Atkins
-0.70
EVs
-0.69
Abbey
-0.62
reports
-0.61
advis
-0.60
Abortion
-0.60
Everton
-0.60
Oscar
-0.59
infield
-0.59
POSITIVE LOGITS
usterity
1.13
uras
1.10
vertisement
1.10
merce
1.08
ria
1.06
rouse
1.03
ird
0.95
hem
0.94
unts
0.93
ctors
0.90
Activations Density 0.149%