INDEX
Negative Logits
trả
-0.07
hậu
-0.06
رشد
-0.06
edula
-0.06
getc
-0.06
seq
-0.06
shoot
-0.06
patriotism
-0.06
erne
-0.06
993
-0.06
POSITIVE LOGITS
SAME
0.07
ad
0.06
className
0.06
xx
0.06
dominant
0.06
useEffect
0.06
Rot
0.06
Ze
0.06
by
0.06
entropy
0.06
Activations Density 0.044%