INDEX
Explanations
HTML tags and related syntax in the text
New Auto-Interp
Negative Logits
ãĥ£
-0.80
Reborn
-0.77
ModLoader
-0.77
Ferr
-0.67
bitters
-0.66
exerc
-0.66
Sapphire
-0.64
virginity
-0.64
derog
-0.64
Winchester
-0.63
POSITIVE LOGITS
!--
1.33
img
1.18
span
1.12
iframe
1.08
div
1.04
meta
0.95
html
0.95
story
0.84
><
0.84
br
0.82
Activations Density 0.008%