INDEX
Explanations
references to vehicles, specifically Volkswagen (VW) cars
mentions of specific car brands, particularly Volkswagen and its models
New Auto-Interp
Negative Logits
nces
-0.81
ttp
-0.79
xual
-0.76
yip
-0.72
mond
-0.71
olulu
-0.71
thodox
-0.70
buquerque
-0.69
hemy
-0.69
efeated
-0.67
POSITIVE LOGITS
Polo
1.01
wagen
0.94
beetle
0.85
Volkswagen
0.81
Beetle
0.79
VW
0.77
dioxide
0.74
dealership
0.72
Golf
0.70
Volvo
0.67
Activations Density 0.003%