INDEX
Explanations
statements related to an increase in quantity or intensity
instances of the word "increased"
New Auto-Interp
Negative Logits
Bio
-0.78
puted
-0.74
Impossible
-0.74
Tycoon
-0.72
Mole
-0.72
verse
-0.70
Geneva
-0.68
isters
-0.68
Outs
-0.68
heim
-0.67
POSITIVE LOGITS
xual
1.04
subsequ
1.04
tremend
1.01
confir
0.96
millenn
0.94
awareness
0.93
unintention
0.93
eleph
0.92
responsiveness
0.91
carbohyd
0.90
Activations Density 0.019%