INDEX
Explanations
mentions of specific brands or companies
words related to products or meal replacements
New Auto-Interp
Negative Logits
STON
-0.77
side
-0.75
BILITIES
-0.74
CHR
-0.66
CTV
-0.65
iets
-0.65
STER
-0.65
jack
-0.64
head
-0.63
Heist
-0.62
POSITIVE LOGITS
ropy
1.15
ral
1.11
ucky
1.07
ertain
0.98
rification
0.97
inel
0.96
imental
0.96
acles
0.94
acion
0.92
iment
0.90
Activations Density 0.048%