INDEX
Explanations
references to specific automobile brands and their characteristics
New Auto-Interp
Negative Logits
IsContent
-0.59
liflower
-0.55
propOrder
-0.54
WEBPACK
-0.52
lasso
-0.51
kimdir
-0.51
déput
-0.50
stället
-0.50
INSTRUMENT
-0.50
exchanger
-0.50
POSITIVE LOGITS
brand
0.94
brands
0.93
branded
0.82
brands
0.73
branded
0.73
manufacturer
0.73
marchio
0.72
brand
0.70
manufacture
0.69
ブランド
0.69
Activations Density 0.256%