INDEX
Explanations
phrases related to commands or instructions
conversational phrases that express conditions or suggestions
New Auto-Interp
Negative Logits
Demand
-0.76
ichen
-0.74
ãĤ¼ãĤ¦ãĤ¹
-0.72
Ö¼
-0.69
hab
-0.69
hoe
-0.68
Attempts
-0.68
leneck
-0.67
moil
-0.67
sbm
-0.67
POSITIVE LOGITS
congr
1.47
congratulations
1.31
remember
1.16
beware
1.15
apologies
1.08
sorry
1.07
note
1.06
check
1.04
thank
1.04
hey
1.04
Activations Density 0.202%