INDEX
Explanations
mentions of the automobile brand "Honda"
mentions of the brand Honda and its models
New Auto-Interp
Negative Logits
ablishment
-0.93
Seym
-0.86
acular
-0.85
asure
-0.81
orial
-0.80
byter
-0.78
vironment
-0.77
ngth
-0.77
tle
-0.75
yip
-0.74
POSITIVE LOGITS
Civic
1.03
Accord
0.88
Motor
0.81
Honda
0.80
Odyssey
0.75
Karin
0.73
Rouse
0.70
uana
0.69
asaki
0.68
ity
0.68
Activations Density 0.020%