INDEX
NEGATIVE LOGITS
Token        Value
crime        1.02
phishing     0.95
corruption   0.94
does         0.93
On           0.89
Corruption   0.88
allegiance   0.87
corrupt      0.87
loyalty      0.87
investment   0.86
POSITIVE LOGITS
Token        Value
graphicx     1.22
amsmath      1.11
cmath        1.01
fonts        0.97
amssymb      0.95
Mathematics  0.93
affichage    0.93
字体         0.92
itsyn        0.89
amsfonts     0.88
Activations Density: 0.004%