Explanations
references to ropes and tying mechanisms in contexts involving potential harm or danger
Negative Logits
########.      -0.68
MaterialApp    -0.68
logits         -0.62
noqa           -0.61
GeoNames       -0.61
Nuovo          -0.60
nugget         -0.60
candlestick    -0.60
Thine          -0.59
eminence       -0.59
Positive Logits
rope           0.86
ropes          0.78
Rope           0.78
Rope           0.75
strings        0.67
corda          0.67
dây            0.65
Cord           0.64
rope           0.62
camp           0.62
Activations Density 0.050%