INDEX
Explanations
specific references to possessive pronouns or articles in German text
New Auto-Interp
Negative Logits
en
-0.67
кровь
-0.61
serez
-0.61
zéro
-0.58
solchen
-0.58
ember
-0.58
Deserializer
-0.58
hancer
-0.56
avancé
-0.56
in
-0.56
POSITIVE LOGITS
Das
1.25
Das
1.17
das
1.16
DAS
1.08
het
1.05
Het
1.04
DAS
1.03
脚注の使い方
0.89
Het
0.88
Athene
0.82
Activations Density 0.027%