INDEX
Explanations
phrases related to safety and health
New Auto-Interp
Head Attr Weights
0:0.05
1:0.04
2:0.07
3:0.06
4:0.03
5:0.04
6:0.05
7:0.18
8:0.07
9:0.06
10:0.06
11:0.23
Negative Logits
Ukrain
-2.62
DW
-2.41
Jakarta
-2.24
Omaha
-2.15
owder
-2.14
Mechdragon
-2.13
mill
-2.12
Malays
-2.04
Islamabad
-2.03
Moroccan
-2.02
POSITIVE LOGITS
oresc
2.62
\'
2.42
\">
2.25
guiActiveUn
2.23
iosis
2.17
actionDate
2.15
commuting
2.12
"}
2.07
\'
2.05
'>
2.04
Activations Density 0.003%