INDEX
Negative Logits
ără
-0.08
്റ
-0.08
neighbor
-0.08
massa
-0.08
패
-0.08
_ins
-0.07
juven
-0.07
law
-0.07
অপর
-0.07
armo
-0.07
POSITIVE LOGITS
骗局
0.09
detailing
0.09
elaborado
0.09
elaborate
0.08
detail
0.08
erder
0.08
sophist
0.08
lokaci
0.08
Stored
0.08
التفاصيل
0.08
Activations Density 0.016%