INDEX
Explanations
Elements related to programming functions and code execution
Tokens after punctuation marks
Technical terms and archaic spellings
Negative Logits
Token      Logit
<eos>      -0.59
none       -0.41
and        -0.41
NOPQRST    -0.40
with       -0.40
IND        -0.39
just       -0.38
so         -0.37
means      -0.36
zak        -0.36
Positive Logits
Token          Logit
незавершена    1.17
itſelf         0.94
myſelf         0.88
contextLoads   0.87
ſelf           0.83
Monfieur       0.82
ſche           0.80
pleaſure       0.80
NUMX           0.79
хьтан          0.79
Activations Density 0.672%
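
As a rough illustration of how a density figure like the one above can be read, here is a minimal sketch. It assumes that activation density means the fraction of sampled token positions on which the feature's activation exceeds a threshold. The page does not state this definition, and the function name, the threshold, and the toy data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: density = (number of active positions) / (total positions).
    This is an illustrative definition, not one taken from the page.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 1000 token positions, 7 of which are active.
toy_activations = [0.0] * 993 + [1.5] * 7
density = activation_density(toy_activations)
print(f"{density:.3%}")
```

Under this reading, a density of 0.672% would mean the feature is active on roughly 7 of every 1,000 sampled tokens.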