INDEX
Negative Logits
dryness
0.83
सख्ती
0.73
सलाह
0.71
dialog
0.69
orang
0.69
cough
0.69
consultation
0.68
porridge
0.68
dry
0.67
consultant
0.67
POSITIVE LOGITS
起動
0.83
habitable
0.81
馭
0.80
분류
0.80
variously
0.79
เตอร์
0.78
parcialmente
0.78
accuse
0.78
غيل
0.77
<unused1>
0.76
Activations Density 0.080%