INDEX
    Explanations

    narrative text

    Negative Logits
     vigueur      -0.08
    ool           -0.08
    aaa           -0.07
    nu            -0.07
     din          -0.07
     cir          -0.07
     November     -0.07
    tte           -0.07
    ance          -0.07
    igan          -0.07
    Positive Logits
     Oeste        0.10
    builtin       0.08
                  0.08
     Alone        0.08
     تصنيع        0.08
     "=           0.08
     oefeningen   0.08
     hierover     0.08
     بذلك         0.08
     మాత్రమే      0.08
    Activation Density: 0.185%

    No Known Activations