INDEX
Explanations
references to brand names, especially those related to the automotive industry
references to specific car brands and models, particularly Volkswagen and Audi
Negative Logits
Token     Logit
nces      -1.06
ned       -1.00
esty      -0.85
xual      -0.78
ns        -0.77
stery     -0.75
llular    -0.75
forth     -0.72
ny        -0.71
mits      -0.70
Positive Logits
Token     Logit
Polo      0.85
geist     0.73
merch     0.68
Pascal    0.68
glass     0.67
chio      0.65
ogl       0.65
obook     0.64
CHAT      0.61
ively     0.61
Activations Density: 0.084%