INDEX
Explanations
references to the luxury car brand Mercedes-Benz
references to the Mercedes-Benz brand
Negative Logits
umbn        -0.94
ingo        -0.75
orescent    -0.74
ally        -0.70
ANK         -0.70
aji         -0.70
orate       -0.68
rely        -0.67
agnar       -0.67
alloween    -0.67
Positive Logits
Benz        1.26
cedes       1.21
Benz        1.07
Mercedes    0.90
manship     0.80
lapt        0.76
seiz        0.68
dealership  0.67
Audi        0.67
wagon       0.65
Activations Density 0.004%