INDEX
    Explanations

    specific French pronouns and articles

    New Auto-Interp
    Negative Logits
     propOrder
    -1.07
     varandra
    -0.80
     colorés
    -0.79
    -0.79
     wikipagina
    -0.78
    SharedDtor
    -0.76
     HasFactory
    -0.72
     creș
    -0.72
     ویکی‌پدیای
    -0.71
     дописавши
    -0.71
    POSITIVE LOGITS
    The
    1.21
     The
    1.11
     THE
    1.09
    THE
    1.06
    the
    0.92
     the
    0.86
    rethe
    0.83
    enthe
    0.81
    sthe
    0.79
     Die
    0.79
    Act Density 0.089%

    No Known Activations