INDEX
    Explanations

    references to research studies and reports

    New Auto-Interp
    Negative Logits
     utop
    -1.05
     palet
    -0.95
     ideolog
    -0.93
     santiago
    -0.90
     logis
    -0.90
     loto
    -0.88
     sement
    -0.86
     psychiat
    -0.85
     gend
    -0.84
     palab
    -0.84
    POSITIVE LOGITS
    jątk
    0.74
    Dziękuję
    0.70
     bénéficiaire
    0.67
    durante
    0.64
    Materiał
    0.63
    jusqu
    0.63
     Dijo
    0.61
    Dzięki
    0.61
    ypeł
    0.61
    pueden
    0.61
    Act Density 0.195%

    No Known Activations