INDEX
Explanations
themes related to consumer health, marketing practices, and societal issues surrounding choices and accountability
New Auto-Interp
Negative Logits
alone
-0.22
ãģłãģij
-0.21
often
-0.21
sometimes
-0.20
themselves
-0.19
à¹Ģà¸Ńà¸ĩ
-0.18
often
-0.17
Alone
-0.16
itself
-0.16
souvent
-0.16
POSITIVE LOGITS
except
0.48
except
0.46
Except
0.40
Except
0.39
_except
0.36
imaginable
0.35
кÑĢоме
0.31
including
0.28
including
0.27
Including
0.27
Activations Density 0.459%