Explanations
verbs expressing certainty or expectation
assertions of certainty or affirmation
Negative Logits
basically     -0.85
supposedly    -0.84
apparently    -0.81
essentially   -0.80
allegedly     -0.73
arently       -0.73
finally       -0.73
obviously     -0.72
purportedly   -0.72
presumably    -0.70
Positive Logits
underestimated   0.72
eds              0.71
racted           0.69
onen             0.67
gged             0.66
ciplinary        0.64
preferable       0.64
ented            0.62
underest         0.62
ounters          0.62
Activations Density: 0.324%