INDEX
Explanations
references to bottled water and canned food
references to bottled and canned food products
Negative Logits
nesota    -0.99
lihood    -0.92
rn        -0.83
lde       -0.81
alter     -0.79
ffield    -0.79
xual      -0.76
hov       -0.75
cloth     -0.74
holes     -0.74
Positive Logits
bottled   0.93
Catal     0.79
kamp      0.70
オ        0.70
bananas   0.69
ificate   0.68
water     0.68
ック      0.67
アル      0.66
peanuts   0.66
Activations Density 0.019%