INDEX
Explanations
occurrences of quotation marks in the text
llbracket bracketed sections
New Auto-Interp
Negative Logits
?”
-0.55
”
-0.53
nha
-0.52
”?
-0.48
anan
-0.48
Shady
-0.47
anen
-0.47
suspens
-0.47
”“
-0.46
?”
-0.46
POSITIVE LOGITS
'''
1.55
'''
1.27
'''
1.13
''')
0.86
''''
0.80
('''0.74
''',
0.63
''');
0.59
transfieras
0.57
mstyle
0.57
Activations Density 0.059%