INDEX
Explanations
expressions that indicate trust or reliance on the speaker's credibility
New Auto-Interp
Negative Logits
wap
-0.16
mens
-0.15
akra
-0.15
wan
-0.15
may
-0.14
aqu
-0.14
[System
-0.14
íĥģ
-0.14
atform
-0.14
quist
-0.14
POSITIVE LOGITS
engkap
0.19
ONGL
0.17
none
0.16
.Solid
0.15
contrary
0.15
enaire
0.15
oven
0.15
CLU
0.15
ermann
0.14
_cmos
0.14
Activations Density 0.029%