INDEX
Negative Logits
Meaning
-0.61
anium
-0.61
assed
-0.57
Nicaragua
-0.57
dylib
-0.56
formerly
-0.54
rongh
-0.54
Tenth
-0.53
harm
-0.53
Himself
-0.53
POSITIVE LOGITS
forth
0.78
DragonMagazine
0.77
SPONSORED
0.75
onwards
0.74
,
0.68
abouts
0.68
itiz
0.67
kward
0.66
oresc
0.65
isson
0.64
Activations Density 0.120%