INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Henrik
    -0.08
    .car
    -0.07
    АР
    -0.07
    3
    -0.06
    nds
    -0.06
    aid
    -0.06
    -focused
    -0.06
     odpowied
    -0.06
    ;color
    -0.06
    Longrightarrow
    -0.06
    POSITIVE LOGITS
     medio
    0.07
    ycopg
    0.07
     clases
    0.06
     municipalities
    0.06
    ToAdd
    0.06
     المغ
    0.06
     Had
    0.06
     devez
    0.06
     saison
    0.06
     adolescente
    0.06
    Act Density 0.002%

    No Known Activations