INDEX
Explanations
references to food brands, specifically related to cheese snacks
New Auto-Interp
Negative Logits
independents
-0.84
specificity
-0.81
ufact
-0.77
hypert
-0.75
vp
-0.74
DRAG
-0.74
ONSORED
-0.73
GGGG
-0.73
lif
-0.71
acute
-0.71
POSITIVE LOGITS
ecake
1.00
eman
0.97
leader
0.95
eda
0.91
eta
0.88
da
0.86
ilee
0.86
Redditor
0.85
Book
0.85
GUI
0.85
Activations Density 0.761%