INDEX
Explanations
references to citations, questions, or instructions in a document
punctuation marks, specifically closing parentheses and brackets
New Auto-Interp
Negative Logits
cra
-0.79
anyl
-0.76
©¶æ
-0.71
thrott
-0.70
hoard
-0.67
ĪĴ
-0.66
crate
-0.63
computing
-0.63
deliber
-0.63
transition
-0.62
POSITIVE LOGITS
Anyway
1.19
Similarly
0.92
Interestingly
0.92
Conversely
0.92
Therefore
0.90
Originally
0.87
<|endoftext|>
0.87
Alternatively
0.87
********************************
0.86
↵Âł
0.86
Activations Density 0.113%