INDEX
Explanations
references to vehicles and accidents involving them
Negative Logits
κή       -0.15
çŃĭ      -0.15
scape    -0.15
иÑĤов    -0.14
tplib    -0.14
ennis    -0.13
ptime    -0.13
roph     -0.13
arget    -0.13
sink     -0.13
Positive Logits
ve          0.31
sides       0.28
care        0.25
fis         0.23
jack        0.22
sk          0.22
ram         0.22
sw          0.22
overturn    0.21
rear        0.21
Activations Density: 0.034%