NEGATIVE LOGITS
hugs            -0.80
Blasio          -0.71
Hunts           -0.69
BILITIES        -0.68
Unemployment    -0.68
salaries        -0.66
FX              -0.66
rones           -0.66
onyms           -0.66
Barker          -0.65
POSITIVE LOGITS
disc            3.90
discs           2.70
Disc            2.51
Disc            1.85
disc            1.81
disk            1.50
disks           1.35
Disk            1.19
Disk            1.14
disk            1.05
Activations Density 0.034%