INDEX
Explanations
words related to a decrease or reduction in something
words indicating depletion or reduction
Negative Logits
sw        -0.69
covers    -0.64
SW        -0.59
Astron    -0.59
ples      -0.58
lights    -0.57
serv      -0.56
aly       -0.56
Newman    -0.55
nex       -0.55
Positive Logits
hift      1.01
ometimes  0.99
hirt      0.98
heet      0.93
imentary  0.90
omething  0.88
peak      0.87
anamo     0.84
pread     0.84
terday    0.82
Activations Density: 0.138%