INDEX
Explanations
instances of direct speech or quotations within the text
New Auto-Interp
Negative Logits
Detach
-0.15
kou
-0.15
oko
-0.14
ãĥ¯ãĥ¼
-0.13
cuckold
-0.13
Jeh
-0.13
.annot
-0.13
Wheeler
-0.13
unks
-0.13
Jacques
-0.13
POSITIVE LOGITS
ÙĪØ£ÙĨ
0.18
æŁ
0.15
Edu
0.14
/Instruction
0.14
igy
0.14
755
0.14
Hizmet
0.14
alam
0.14
267
0.14
473
0.14
Activations Density 0.254%