Explanations
the presence of the pronoun "tu" in various contexts
Negative Logits
Token          Logit
Baillargeon    -0.75
tu             -0.74
tual           -0.60
TextAlign      -0.58
详细信息        -0.57
Köszönöm       -0.57
AndEndTag      -0.56
+#+#           -0.56
axel           -0.56
---*/          -0.55
Positive Logits
Token              Logit
tu                 2.47
printStackTrace    0.69
مشين               0.68
tuo                0.66
tufted             0.65
tua                0.65
tú                 0.60
الحره              0.60
referrerpolicy     0.60
вай                0.56
Activations Density: 0.004%