INDEX
Explanations
punctuation marks, particularly periods and commas
Text after colons or dashes
introductions and summaries
New Auto-Interp
Negative Logits
firstly
-0.80
entweder
-0.77
either
-0.77
Firstly
-0.74
一是
-0.74
nämlich
-0.72
either
-0.71
Specifically
-0.69
particularly
-0.69
specifically
-0.68
POSITIVE LOGITS
Bref
1.19
etc
1.16
etc
1.15
Etc
1.13
Etc
1.08
tudo
1.06
словом
1.06
bref
1.05
总之
1.03
这一切
1.02
Activations Density 0.254%