INDEX
Explanations
phrases expressing hypothetical intentions or suggestions
New Auto-Interp
Negative Logits
Xuan
-0.68
Kag
-0.63
Chal
-0.62
Brill
-0.58
è¦ļéĨĴ
-0.57
rylic
-0.56
Cairo
-0.56
{*-0.56
rpm
-0.56
Nile
-0.55
POSITIVE LOGITS
be
0.94
gladly
0.93
dearly
0.91
ideally
0.91
prefer
0.90
characterize
0.85
doubtless
0.80
ordinarily
0.80
qualify
0.79
ĸļ
0.79
Activations Density 0.137%