Explanations
words related to critique and criticism
intensifiers and descriptors related to extreme or significant conditions and qualities
Negative Logits
etsk          -0.82
anwhile       -0.79
caption       -0.74
grate         -0.73
ersen         -0.72
Downloadha    -0.71
zl            -0.67
pherd         -0.66
ebus          -0.66
idav          -0.66
Positive Logits
proportions   0.76
votes         0.69
ravity        0.68
minster       0.68
lif           0.66
isers         0.65
lat           0.61
activity      0.61
origin        0.59
mods          0.59
Activations Density: 0.565%