INDEX
Explanations
phrases expressing uncertainty or inclusivity
Negative Logits
isse       -0.75
aters      -0.72
olis       -0.72
arie       -0.72
arde       -0.71
ciating    -0.71
etr        -0.71
auri       -0.71
és         -0.70
agos       -0.69
Positive Logits
else            0.86
whatsoever      0.85
soever          0.83
amount          0.75
number          0.71
extrad          0.70
floats          0.69
whether         0.67
rested          0.67
circumstance    0.67
Activations Density 0.584%