INDEX
Explanations
sequences that resemble timestamps or coded data patterns
punctuation marks, particularly colons
New Auto-Interp
Negative Logits
mathemat
-0.76
ratified
-0.70
fatally
-0.68
steen
-0.67
bounded
-0.64
GBT
-0.63
paralyzed
-0.63
incent
-0.63
Chambers
-0.62
vier
-0.62
POSITIVE LOGITS
</
0.79
)</
0.77
addons
0.73
Holo
0.70
\">
0.70
/>
0.70
Display
0.69
)\
0.69
antage
0.68
ãĢį
0.67
Activations Density 0.023%