INDEX
Negative Logits
dfx
-0.93
Ethics
-0.79
UGE
-0.78
ELD
-0.77
gamer
-0.76
INAL
-0.73
Editorial
-0.72
ODE
-0.70
Marijuana
-0.69
REAM
-0.69
POSITIVE LOGITS
anza
1.23
iton
1.21
uses
1.10
gey
1.08
itors
1.06
neau
1.06
isson
1.02
etooth
1.02
Bon
1.00
nie
0.99
Activations Density 8.902%