INDEX
    Explanations

    phrases indicating location or position in relation to other elements

    New Auto-Interp
    Negative Logits
     ujednoznacz
    -0.74
    gyű
    -0.57
     manna
    -0.57
     brancas
    -0.57
     kasarigan
    -0.57
     cascada
    -0.57
     Lingkungan
    -0.56
    ValueStyle
    -0.56
     mijne
    -0.55
     vrijwilli
    -0.55
    POSITIVE LOGITS
    انگلیسی
    0.71
     Nev
    0.71
     Biel
    0.67
    ән
    0.67
    日在
    0.66
     Bres
    0.64
    قایناق‌لار
    0.63
     chó
    0.62
    OrBuilder
    0.62
    anin
    0.62
    Act Density 0.035%

    No Known Activations