Explanations
conversational/opinionated text
instances of direct address in dialogue—especially the word "you" and character-name mentions used to speak to or about the interlocutor.
Negative Logits
Coc       -0.07
nao       -0.07
งน        -0.07
dıktan    -0.07
untlet    -0.06
että      -0.06
`         -0.06
�         -0.06
_even     -0.06
nakne     -0.06
Positive Logits
invoke       0.07
obbies       0.07
adversity    0.07
disconnect   0.06
ugl          0.06
离           0.06
_cores       0.06
SEEK         0.06
ffic         0.06
exploits     0.06
Activations Density: 0.068%