INDEX
Explanations
expressions of wonder or disbelief
New Auto-Interp
Negative Logits
Milo
0.85
subtiliter
0.84
खाते
0.84
starch
0.83
약간
0.83
altos
0.82
shows
0.81
Isn
0.81
tingling
0.81
△
0.80
POSITIVE LOGITS
contador
1.01
ற்றி
0.91
mamy
0.85
ский
0.85
industrielle
0.84
呃
0.83
ﷺ
0.82
विक्रय
0.82
лися
0.80
bygg
0.80
Activations Density 0.003%