INDEX
Explanations
adverbs that express certainty or confidence
the word "certainly" and its variations in context
New Auto-Interp
Negative Logits
glers
-0.80
ulative
-0.76
lins
-0.76
gencies
-0.75
bucks
-0.71
lay
-0.70
uese
-0.69
ú
-0.67
ollen
-0.66
endar
-0.65
POSITIVE LOGITS
deline
0.72
deserved
0.71
torped
0.71
qualifies
0.71
exagger
0.70
distinguish
0.70
benefited
0.70
appreci
0.68
distinguished
0.68
behaved
0.67
Activations Density 0.021%