INDEX
Negative Logits
ndar
-0.09
ด้าน
-0.08
độc
-0.08
flora
-0.08
))).
-0.08
Pis
-0.08
conv
-0.08
toque
-0.08
Retriever
-0.07
Known
-0.07
POSITIVE LOGITS
感谢
0.13
grateful
0.13
thank
0.12
thankful
0.11
thrilled
0.11
excited
0.11
regret
0.11
നന്ദ
0.11
wish
0.11
disappointed
0.11
Activations Density 0.050%