INDEX
Explanations
mentions of the brand "Lego"
references to the Lego brand
Negative Logits
suppress       -0.81
suppressed     -0.81
Shir           -0.81
suppressing    -0.78
suppression    -0.69
sickness       -0.67
counseling     -0.66
tab            -0.65
disp           -0.65
stricken       -0.65
Positive Logits
Lego           4.02
LEGO           3.48
Minecraft      1.72
Barbie         1.53
Minecraft      1.49
Brick          1.43
Transformers   1.40
Toys           1.40
Disneyland     1.37
Catalog        1.33
Activations Density 0.033%