INDEX
Explanations
mentions of the restaurant chain "Pizza Hut"
mentions of fast food chains or related establishments
New Auto-Interp
Negative Logits
rogram
-0.75
lying
-0.71
penet
-0.70
param
-0.70
diffuse
-0.69
fuzz
-0.69
irlf
-0.69
ngth
-0.68
probing
-0.67
Utt
-0.67
POSITIVE LOGITS
Stores
1.37
Restaur
1.26
Cola
1.17
Store
1.15
stores
1.10
Foods
1.07
Restaurant
1.06
supermarkets
1.05
Shopping
1.04
supermarket
1.03
Activations Density 0.277%