INDEX
Explanations
phrases indicating doubt or uncertainty
instances of the word "would."
New Auto-Interp
Negative Logits
DN
-0.62
Kag
-0.61
Case
-0.61
Kis
-0.61
âĢ¢âĢ¢âĢ¢âĢ¢
-0.61
notes
-0.60
Building
-0.59
SIG
-0.59
marker
-0.57
DOC
-0.57
POSITIVE LOGITS
be
1.01
prefer
0.94
nt
0.89
qualify
0.86
tolerate
0.86
gladly
0.84
allow
0.83
itate
0.83
consider
0.83
suffice
0.82
Activations Density 0.149%