INDEX
Explanations
punctuation marks and negative sentiment in the text
empty or placeholder content
New Auto-Interp
Negative Logits
manif
-0.66
shove
-0.65
standing
-0.65
decon
-0.61
scissors
-0.61
compar
-0.60
prest
-0.60
clen
-0.59
amorph
-0.57
goodness
-0.57
POSITIVE LOGITS
=-=-=-=-=-=-=-=-
0.93
webkit
0.91
=-=-=-=-
0.91
Advertisement
0.89
CLUS
0.72
Updated
0.71
HOU
0.69
immune
0.69
Updated
0.69
COL
0.69
Activations Density 0.018%