INDEX
Explanations
mentions of the word "positivity" or words related to a positive outlook
occurrences of the term "pos" and related variations, indicating a focus on positioning or status
New Auto-Interp
Negative Logits
loo
-0.90
HAEL
-0.76
Hearts
-0.70
clad
-0.68
Clarkson
-0.67
stall
-0.66
oats
-0.66
ARD
-0.64
OPS
-0.64
Continental
-0.63
POSITIVE LOGITS
itional
1.30
itions
1.27
idon
1.17
itory
1.14
itivity
1.10
itor
1.09
itives
1.07
pos
1.04
itionally
1.01
sel
0.98
Activations Density 0.021%