INDEX
Explanations
punctuation or sentence-ending marks within the text
periods and sentence endings
New Auto-Interp
Negative Logits
she
-1.73
she
-1.50
we
-1.32
he
-1.30
She
-1.14
they
-1.13
we
-1.12
they
-1.10
We
-1.03
He
-1.03
POSITIVE LOGITS
wendigkeit
0.52
élev
0.51
cipais
0.50
lotz
0.50
🎰
0.48
Odkazy
0.48
Portály
0.48
Legende
0.48
ovem
0.48
abbé
0.47
Activations Density 0.455%