Explanations
words that convey a lack of value or significance
terms related to futility and worthlessness
Negative Logits
Token     Logit
orthy     -0.83
ebus      -0.77
annis     -0.75
asio      -0.74
avia      -0.74
artney    -0.70
PI        -0.66
uana      -0.64
arthy     -0.64
aeper     -0.63
Positive Logits
Token         Logit
waste         0.98
useless       0.83
wastes        0.81
idiots        0.80
worthless     0.78
glers         0.77
filler        0.77
meaningless   0.76
soever        0.75
ãĤŃ           0.74
Activations Density: 0.028%