INDEX
Explanations
punctuation marks and delimiters
New Auto-Interp
Negative Logits
similarly
-0.77
同じく
-0.72
Similarly
-0.71
secondly
-0.69
firstly
-0.66
同様に
-0.65
entweder
-0.64
subsequent
-0.64
either
-0.64
Similarly
-0.63
POSITIVE LOGITS
tudo
1.27
etc
1.12
Bref
1.08
etc
1.08
allemaal
1.08
Etc
1.07
总之
1.07
などなど
1.05
这一切
1.05
Etc
1.02
Activations Density 0.293%