INDEX
Negative Logits
ไปด้วย
0.96
reacts
0.95
pake
0.91
behaves
0.88
pacc
0.85
dotato
0.84
낌
0.83
nyeri
0.83
passwd
0.83
detects
0.82
POSITIVE LOGITS
년대
0.78
ーム
0.72
ו
0.71
Ent
0.70
Rod
0.69
SizePolicy
0.69
خل
0.68
बचें
0.68
esco
0.67
рас
0.67
Activations Density 0.066%