Explanations
mentions of specific car models and their details
Negative Logits
Token     Logit
Manit     -0.23
Ħĸ        -0.17
Ñİ        -0.15
tele      -0.15
lover     -0.14
jni       -0.14
~>        -0.14
Dul       -0.14
meteor    -0.14
atron     -0.14
Positive Logits
Token     Logit
Pri       0.28
Toyota    0.28
Toyota    0.26
Toy       0.25
Lexus     0.24
Pri       0.24
Hil       0.22
toy       0.20
Tacoma    0.20
Cam       0.20
Activations Density: 0.025%