INDEX
Explanations
phrases indicating readiness or willingness to take action
New Auto-Interp
Negative Logits
AMA
-0.78
VERTISEMENT
-0.76
adish
-0.71
aples
-0.69
MRI
-0.66
eatures
-0.64
uo
-0.63
ophone
-0.61
Gems
-0.61
tein
-0.58
POSITIVE LOGITS
enough
0.82
nesses
0.75
ptin
0.74
to
0.72
gladly
0.72
tc
0.68
going
0.68
trusting
0.68
tarian
0.67
£ı
0.66
Activations Density 0.043%