INDEX
Explanations
words related to safety and security measures or concepts, especially regarding potential risks or threats
regulatory terms and concepts related to safety measures
New Auto-Interp
Head Attr Weights
0:0.04
1:0.03
2:0.29
3:0.06
4:0.18
5:0.05
6:0.04
7:0.03
8:0.05
9:0.09
10:0.06
11:0.03
Negative Logits
エル
-1.29
turb
-1.18
pheus
-1.16
Struggle
-1.16
Ukrain
-1.15
chrom
-1.14
��
-1.13
Trin
-1.13
ILCS
-1.13
��
-1.12
POSITIVE LOGITS
iland
1.45
Secure
1.44
WARE
1.40
allas
1.27
PLIED
1.26
Safety
1.26
anding
1.25
illance
1.21
ournal
1.20
Catalog
1.19
Activations Density 0.002%