INDEX
Explanations
punctuation, specifically commas and colons, as well as structural elements in text indicating separation or lists
New Auto-Interp
Negative Logits
']}
-1.01
"]}
-1.00
"]:
-0.93
})]
-0.88
"}>
-0.88
>}
-0.87
?>">
-0.87
"]))
-0.86
)}>
-0.84
)">
-0.83
POSITIVE LOGITS
élev
0.80
↵↵↵↵↵↵
0.77
↵↵↵↵↵↵↵
0.74
Verſ
0.72
Abar
0.72
Reſ
0.72
Chriſt
0.71
Jody
0.68
,,,,
0.68
itſelf
0.68
Activations Density 0.013%