INDEX
Explanations
numerical sequences in the format "1_9" or "1_10"
repeated placeholder tokens, indicating the presence of segments marked by specific formatting or structure rather than content
New Auto-Interp
Negative Logits
fences
-0.72
boundaries
-0.69
modelling
-0.69
scenes
-0.68
bent
-0.64
aesthetics
-0.64
thinkable
-0.64
awaru
-0.64
leading
-0.64
hygiene
-0.63
POSITIVE LOGITS
Password
1.30
125
1.14
Corinthians
1.12
st
1.10
120
1.09
½
0.95
000000
0.95
123
0.94
128
0.91
RM
0.86
Activations Density 0.052%