INDEX
    Explanations
    Negative Logits
    Token             Logit
    " doux"           -0.09
    "-|"              -0.08
    " lấy"            -0.08
    " traff"          -0.08
    " soit"           -0.08
    " Pleasure"       -0.08
    " Praia"          -0.08
    " Billing"        -0.07
    " lingerie"       -0.07
    " Trilogy"        -0.07
    Positive Logits
    Token             Logit
    "emoc"            0.08
    " stewardship"    0.07
    " potvr"          0.07
    " bestätigen"     0.07
    " eus"            0.07
    " Ezek"           0.07
    "Instantiate"     0.07
    " edu"            0.07
    "edic"            0.07
    " administering"  0.07
    Act Density 0.001%

    No Known Activations