Negative Logits
ၞ             0.41
伸縮          0.39
印刷          0.38
persons       0.38
genc          0.36
injurious     0.36
পুনঃ          0.36
televisions   0.36
奢侈          0.36
physicians    0.36
Positive Logits
சர்           0.43
chatID        0.42
Explicit      0.39
Bundle        0.38
Explicit      0.38
Cer           0.37
theria        0.37
Esp           0.36
Against       0.36
pooled        0.36
Activations Density: 0.001%