INDEX
Explanations
words related to rapid growth or increase
terms related to growth, decline, and various forms of deterioration
New Auto-Interp
Negative Logits
inaction
-0.80
spoilers
-0.75
brackets
-0.70
acron
-0.69
similarities
-0.68
Reviewer
-0.68
surn
-0.66
spoiler
-0.66
existential
-0.66
captive
-0.66
POSITIVE LOGITS
uate
1.22
itates
1.14
uates
1.13
itate
1.08
ighed
1.07
inate
1.07
rouse
1.06
ulate
1.05
ize
1.02
ulates
1.01
Activations Density 0.209%