INDEX
Explanations
instances where someone is likely or willing to do something
phrases expressing willingness or tendency towards actions or opinions
Negative Logits
soDeliveryDate  -0.68
Mek             -0.67
illin           -0.65
Nak             -0.64
oufl            -0.64
CHA             -0.63
mson            -0.63
original        -0.62
apeake          -0.61
arer            -0.61
Positive Logits
inclined   1.01
irection   0.86
toward     0.86
ĺħ         0.86
towards    0.83
¿½         0.76
thereto    0.73
leaning    0.73
igible     0.73
Ń·         0.71
Activations Density 0.034%
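
To make the density figure concrete, here is a minimal sketch, assuming "Activations Density" means the fraction of tokens on which the feature has a nonzero activation (the dump itself does not define the term). The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and chosen only to reproduce the 0.034% figure.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: density is counted per token, with any activation above
    the threshold treated as "active".
    """
    if not activations:
        raise ValueError("need at least one activation")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 tokens, 34 of which activate the feature.
sample = [0.0] * 99_966 + [1.5] * 34
density = activation_density(sample)
print(f"{density:.3%}")  # 0.034%
```

Under that reading, a density of 0.034% corresponds to roughly 34 active tokens per 100,000, which suggests a sparse feature that fires only on a narrow set of contexts.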