INDEX
Explanations
the beginning of various paragraphs or sections in the text (denoted by the <bos> token)
New Auto-Interp
Negative Logits
RegressionTest
-0.94
IVEREF
-0.80
Monfieur
-0.77
Theſe
-0.74
myſelf
-0.73
GEBURTSDATUM
-0.72
springfox
-0.71
Efq
-0.71
brities
-0.71
&___
-0.70
POSITIVE LOGITS
\{\\0.57
translation
0.56
also
0.46
PhpStorm
0.46
séjours
0.45
jednocześnie
0.45
rest
0.45
pañol
0.44
पू
0.43
involved
0.43
Activations Density 0.006%