Explanations
conversational phrases that indicate agreement or acknowledgment
Negative Logits
Roskov         -0.96
saites         -0.78
BoxFit         -0.77
Cæsar          -0.73
Cordialement   -0.73
abyrinth       -0.72
marquis        -0.72
genheim        -0.72
endpush        -0.72
epa            -0.71
Positive Logits
Well           1.05
Well           0.99
well           0.74
hey            0.71
WELL           0.68
well           0.66
technically    0.65
Wells          0.65
уж             0.64
Pues           0.64
Activations Density 0.032%