INDEX
Negative Logits
injuries
0.66
dasar
0.66
consequences
0.64
symptoms
0.64
convertible
0.61
instability
0.60
Pontiac
0.60
(`
0.60
instabilities
0.59
depende
0.59
POSITIVE LOGITS
ającym
0.70
蜮
0.68
atim
0.64
أنها
0.63
ब्स
0.63
佬
0.63
ated
0.62
أنه
0.60
관한
0.59
zero
0.59
Activations Density 0.001%