INDEX
Explanations
names of companies and specific events or locations
mentions of specific brands, notably "Nestlé," and references to the holiday "Easter."
New Auto-Interp
Negative Logits
istic
-0.83
istically
-0.77
istics
-0.61
ist
-0.60
¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯
-0.60
ights
-0.58
554
-0.57
nom
-0.57
mith
-0.56
xxxxxxxx
-0.56
POSITIVE LOGITS
lé
1.43
dale
1.10
led
0.98
lings
0.98
lies
0.94
Bunny
0.90
lements
0.89
ding
0.89
Eggs
0.88
ea
0.86
Activations Density 0.091%