INDEX
Explanations
occurrences of accidents and associated damages
New Auto-Interp
Negative Logits
annel
-0.17
ham
-0.16
illa
-0.15
Mess
-0.15
ariant
-0.15
acs
-0.14
inspace
-0.14
keit
-0.14
agency
-0.14
anel
-0.14
POSITIVE LOGITS
ghi
0.17
545
0.15
allo
0.15
lick
0.14
galement
0.14
éģł
0.14
wnd
0.14
unge
0.14
ģ
0.13
yal
0.13
Activations Density 0.028%