    Negative Logits
    -0.08
     الأصل
    -0.08
    予定
    -0.08
    .conf
    -0.08
     πλέον
    -0.07
     syd
    -0.07
     الأص
    -0.07
     DI
    -0.07
     사항
    -0.07
    .PARAM
    -0.07
    Positive Logits
     Nada
    0.09
     atuação
    0.08
    zenie
    0.07
     denotes
    0.07
     imaginative
    0.07
    rolley
    0.07
     произ
    0.07
    ễn
    0.07
    bbc
    0.07
    ,to
    0.07
    Act Density 0.008%

    No Known Activations