INDEX
    Explanations

    Code and administration

    New Auto-Interp
    Negative Logits
    aaro
    -0.08
    artsen
    -0.08
     mily
    -0.08
     Multif
    -0.08
    akata
    -0.08
    Production
    -0.08
     Corinthians
    -0.08
    -0.08
    IEC
    -0.07
     briefs
    -0.07
    POSITIVE LOGITS
     intimid
    0.08
    0.08
     WOM
    0.07
     fists
    0.07
     trem
    0.07
    0.07
     prak
    0.07
     foment
    0.07
    Н
    0.07
     ottenere
    0.07
    Act Density 0.000%

    No Known Activations