INDEX
Explanations
phrases related to commands or directives
the presence of quoted statements or dialogues
Negative Logits
whipping        -0.65
fabric          -0.63
skirts          -0.62
Vaugh           -0.62
harness         -0.62
reprint         -0.61
Odin            -0.61
folded          -0.61
ール            -0.61
railing         -0.60
Positive Logits
yss             0.94
Cause           0.93
Mech            0.87
',              0.84
-'              0.82
cause           0.82
affer           0.79
Interstitial    0.78
Too             0.76
Marginal        0.75
Activations Density: 0.040%