INDEX
Explanations
mentions of cars and automotive-related terms
New Auto-Interp
Negative Logits
Tiefen
-0.85
)");
-0.85
]")]
-0.83
Radu
-0.80
tals
-0.79
]").
-0.79
'\\;'
-0.76
{}{}-0.75
hydrauli
-0.75
}".
-0.75
POSITIVE LOGITS
car
1.70
Car
1.58
Car
1.54
car
1.53
cars
1.52
CAR
1.45
Cars
1.40
CAR
1.40
Cars
1.33
cars
1.33
Activations Density 0.089%