INDEX
Explanations
future actions or intentions described using the word "will."
future intentions expressed through the word "will."
New Auto-Interp
Negative Logits
bies
-0.75
elled
-0.72
amped
-0.66
REE
-0.63
uria
-0.60
uman
-0.60
Creat
-0.60
constituted
-0.59
ential
-0.58
endar
-0.58
POSITIVE LOGITS
gladly
1.22
admit
1.22
reiterate
1.08
assume
1.02
summarize
1.02
confess
0.99
paraph
0.99
explain
0.98
presume
0.97
concede
0.95
Activations Density 0.111%