INDEX
Explanations
words related to a decrease or reduction in something
references to decreasing values or quantities
New Auto-Interp
Negative Logits
nox
-0.74
haps
-0.74
Orig
-0.68
iasco
-0.66
Äĩ
-0.65
Carnage
-0.64
ebook
-0.64
fiction
-0.64
istry
-0.63
wald
-0.63
POSITIVE LOGITS
lower
3.40
lower
2.67
Lower
2.46
Lower
2.38
higher
2.27
upper
2.12
higher
1.95
lowest
1.87
Higher
1.85
lowered
1.82
Activations Density 0.014%