INDEX
Negative Logits
hovah
-0.47
ľ
-0.44
inary
-0.43
uania
-0.42
urgy
-0.41
acebook
-0.40
anian
-0.40
ogun
-0.40
uitive
-0.39
Adin
-0.39
POSITIVE LOGITS
way
0.56
boro
0.49
gate
0.48
agher
0.47
Papers
0.46
eln
0.46
forth
0.45
leaf
0.45
plot
0.44
WAY
0.43
Activations Density 11.276%