INDEX
Explanations
punctuation marks and parentheses in the text
Closing parenthesis
closing parentheses and brackets
New Auto-Interp
Negative Logits
Dol
-0.71
Thy
-0.67
dol
-0.66
osh
-0.64
shl
-0.64
ers
-0.64
∆
-0.64
hhhhhhhh
-0.63
ínez
-0.63
fulness
-0.63
POSITIVE LOGITS
}))
1.36
])
1.24
]")]
1.20
}))
1.16
'])
1.16
"]))
1.14
})]
1.13
"))
1.12
})
1.12
']))
1.12
Activations Density 0.821%