INDEX
Explanations
phrases related to prohibitions or restrictions
phrases indicating negation or absence
New Auto-Interp
Negative Logits
Reloaded
-0.81
Reborn
-0.72
"$:/
-0.70
Royale
-0.69
vind
-0.68
restored
-0.68
chapter
-0.66
Fork
-0.66
gypt
-0.63
Unch
-0.63
POSITIVE LOGITS
brainer
1.31
strings
1.16
repeat
1.13
exc
1.11
reply
1.09
smoking
1.08
contact
1.08
platform
1.05
matter
1.03
notice
1.02
Activations Density 0.016%