INDEX
Explanations
instances where someone is ready or prepared to do something
instances of the word "willing."
New Auto-Interp
Negative Logits
alien
-0.78
Anthem
-0.78
ORGE
-0.71
gran
-0.68
ECH
-0.67
arrow
-0.67
adish
-0.67
older
-0.67
APH
-0.66
ogg
-0.66
POSITIVE LOGITS
willing
1.04
theless
0.91
unwilling
0.90
willingly
0.82
incent
0.79
gladly
0.77
terday
0.76
lend
0.76
unres
0.75
willingness
0.75
Activations Density 0.013%