INDEX
Explanations
words related to safety
references to safety concerns and regulations
Negative Logits
eric      -0.90
dx        -0.88
issance   -0.86
yss       -0.82
igs       -0.77
eta       -0.75
naire     -0.73
eds       -0.73
quart     -0.72
nant      -0.72
Positive Logits
safety        1.07
ailability    0.85
valve         0.79
oided         0.78
safety        0.77
practition    0.77
Þ             0.76
precautions   0.75
precaution    0.74
hazards       0.72
Activations Density: 0.021%