INDEX
Explanations
phrases related to vehicle features
New Auto-Interp
Negative Logits
umas
-0.15
assium
-0.15
imon
-0.15
寿
-0.14
aupt
-0.14
eldig
-0.14
iras
-0.14
.Metro
-0.14
Ħĸ
-0.14
Affero
-0.13
POSITIVE LOGITS
exterior
0.23
safety
0.23
cabin
0.22
Safety
0.21
interior
0.21
driver
0.20
Exterior
0.20
Driver
0.20
hatch
0.18
cab
0.18
Activations Density 0.040%