INDEX
Explanations
phrases related to ladder safety and ladder-related instructions
warnings or instructions regarding safety and caution
Negative Logits
Token      Logit
]."        -0.90
).[        -0.84
'."        -0.83
.'"        -0.82
.).        -0.77
."[        -0.77
anwhile    -0.67
".[        -0.67
)."        -0.64
.")        -0.60
Positive Logits
Token      Logit
||         0.75
?          0.71
?",        0.65
·          0.62
nor        0.60
or         0.60
/          0.58
OR         0.58
\'         0.57
&&         0.56
Activations Density 1.990%