INDEX
NEGATIVE LOGITS
whose        0.43
man          0.43
home         0.43
reported     0.41
household    0.39
malicious    0.39
network      0.38
cujo         0.38
malware      0.38
nicknames    0.37
POSITIVE LOGITS
!”           0.51
SizedBox     0.46
周围         0.46
atoti        0.45
sonuc        0.45
setelah      0.44
jälkeen      0.44
铣           0.43
чений        0.43
。”          0.42
Activations Density 0.003%