INDEX
Negative Logits
हमले
0.94
ને
0.93
Butterfly
0.90
cusson
0.86
ટી
0.84
entraî
0.83
Severe
0.81
Wen
0.80
Attack
0.78
Violence
0.78
POSITIVE LOGITS
sanitized
1.35
checklists
1.27
belongings
1.24
accurate
1.23
personable
1.22
didactic
1.21
infographics
1.21
refreshments
1.20
receipts
1.19
상세
1.19
Activations Density 0.001%