INDEX
Explanations
safety tips and instructions
instructions or advice related to safety and preparation
New Auto-Interp
Negative Logits
john
-0.71
ylan
-0.66
descendants
-0.65
Today
-0.63
ocaust
-0.63
upon
-0.62
Founder
-0.62
vanished
-0.61
"}
-0.60
purportedly
-0.60
POSITIVE LOGITS
yourself
1.07
yourselves
0.90
carefully
0.86
beforehand
0.86
consistency
0.86
Yourself
0.85
spacing
0.83
minimize
0.81
wisely
0.80
spaced
0.79
Activations Density 0.722%