INDEX
Explanations
expressions of agreement or affirmation in conversation
New Auto-Interp
Negative Logits
Amis
-0.62
े
-0.60
Tur
-0.59
yns
-0.58
ężczy
-0.57
Morin
-0.57
Treff
-0.56
acar
-0.56
acaktır
-0.55
:///
-0.54
POSITIVE LOGITS
Yeah
1.53
Yeah
1.52
YEAH
1.47
yeah
1.45
yeah
1.38
YEAH
1.31
Yea
0.98
eah
0.98
outta
0.97
GONNA
0.93
Activations Density 0.035%