INDEX
Explanations
references to specific car models and manufacturers
Negative Logits
Karel       -0.16
ikan        -0.15
roid        -0.15
FromArray   -0.15
Belg        -0.15
uant        -0.14
imus        -0.14
rael        -0.14
rysler      -0.14
Galaxy      -0.14
Positive Logits
911         0.29
Porsche     0.27
Cay         0.27
Carr        0.26
orsche      0.24
carrera     0.22
718         0.21
Tay         0.19
Petersburg  0.17
Pan         0.17
Activations Density 0.009%