Explanations
terms and phrases related to safety measures and standards
Negative Logits
Token         Logit
adeloupe      -0.76
Jackman       -0.74
TypeDef       -0.69
bint          -0.66
direta        -0.66
maș           -0.66
              -0.65
mik           -0.65
république    -0.65
isolado       -0.64
Positive Logits
Token         Logit
afety         1.07
Safety        1.06
safety        1.05
Safety        1.05
SAFETY        0.91
SAFETY        0.91
Security      0.85
security      0.84
safety        0.81
enumi         0.80
Activations Density 0.068%
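
The density figure above can be read as the share of sampled tokens on which the feature fires. The page does not define the term, so the sketch below assumes that reading; the function name `activation_density`, the zero threshold, and the sample data are hypothetical and are not taken from the feature's real activations.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which the
    feature is active. The dashboard itself does not state this definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Synthetic data only: 68 active tokens out of 100,000 sampled tokens.
sample = [1.0] * 68 + [0.0] * (100_000 - 68)
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.068% corresponds to roughly 68 active tokens per 100,000, which fits a feature that responds only to a narrow set of safety-related tokens.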