INDEX
Explanations
names of car brands and related entities
New Auto-Interp
Negative Logits
swire
-0.16
cca
-0.16
sville
-0.15
cı
-0.15
ship
-0.14
luk
-0.14
cn
-0.14
————————
-0.13
.hs
-0.13
OrCreate
-0.13
POSITIVE LOGITS
-Benz
0.44
Benz
0.36
benz
0.25
ben
0.24
Mer
0.22
mer
0.21
merc
0.19
enz
0.19
Merc
0.19
Mercedes
0.18
Activations Density 0.001%