INDEX
Explanations
safety guidelines related to physical spaces
references to non-consensual acts and safety warnings regarding physical proximity and objects
Negative Logits
renaissance      -0.58
Indie            -0.57
wow              -0.57
Merlin           -0.56
Strategy         -0.55
badass           -0.54
Prototype        -0.53
Startup          -0.52
found            -0.52
Blueprint        -0.51
Positive Logits
etc              0.86
unaccompanied    0.79
genitals         0.77
menstru          0.74
oneself          0.72
inappropriately  0.72
prohibited       0.70
preferring       0.70
spouses          0.70
adultery         0.70
Activations Density 0.822%