INDEX
Explanations
instances of sentence punctuation and structural markers, particularly periods and parentheses
questionnaire and xkcd discussions
New Auto-Interp
Negative Logits
lenker
-0.74
parsedMessage
-0.71
ագրություններ
-0.69
disambiguazione
-0.68
protoimpl
-0.67
poffe
-0.66
queſta
-0.66
-0.65
ButterKnife
-0.65
rrggbb
-0.64
POSITIVE LOGITS
<bos>
0.47
mathbb
0.45
.
0.43
.
0.40
<0xE2>
0.40
strict
0.39
quase
0.38
s
0.37
large
0.36
0.35
Activations Density 0.002%