INDEX
Negative Logits
ãĥīãĥ©ãĤ´ãĥ³
-0.72
xon
-0.67
DAQ
-0.62
Magikarp
-0.59
AAF
-0.58
CHO
-0.58
rall
-0.57
mater
-0.57
ATING
-0.56
GGGG
-0.56
POSITIVE LOGITS
erent
1.11
ees
1.08
ords
1.01
iculty
1.00
irms
0.98
luent
0.96
mpeg
0.96
reys
0.95
rey
0.95
rites
0.94
Activations Density 0.015%