INDEX
Explanations
mentions or references to the word "Cadbury"
references to the brand "Cadbury."
Negative Logits
Token       Logit
DEFENSE     -0.80
ãĥĥãĥĪ      -0.76
çīĪ         -0.76
edited      -0.70
ebin        -0.69
ãģį         -0.68
ãĥ£         -0.68
itably      -0.67
20439       -0.67
âĶĢâĶĢ      -0.67
Positive Logits
Token       Logit
Cad         0.96
ante        0.89
imar        0.82
arette      0.82
eteria      0.81
illian      0.79
rite        0.77
Publishers  0.77
emonium     0.77
uda         0.76
Activations Density: 0.013%