INDEX
Explanations
contradictions in statements
present tense and past tense forms of the verb "to be"
New Auto-Interp
Negative Logits
doms
-0.90
ESE
-0.81
osate
-0.72
ievers
-0.72
glers
-0.71
Edit
-0.68
IFE
-0.67
wake
-0.65
inav
-0.65
Congratulations
-0.65
POSITIVE LOGITS
worth
0.86
meant
0.85
certainly
0.82
also
0.80
downright
0.79
nonetheless
0.79
necessary
0.75
nt
0.75
inevitable
0.75
achievable
0.75
Activations Density 0.332%