INDEX
Explanations
acknowledgment or agreement
New Auto-Interp
Negative Logits
say
1.20
Say
1.16
saying
1.14
mengatakan
1.13
Say
1.10
Saying
1.08
说
1.05
said
0.99
說
0.98
says
0.98
POSITIVE LOGITS
yep
1.03
alright
0.99
hmm
0.98
yeah
0.97
okay
0.97
wow
0.96
OK
0.96
ok
0.94
oh
0.93
surely
0.92
Activations Density 0.013%