INDEX
    Explanations

    terms related to authority and governance

    New Auto-Interp
    Negative Logits
    ancellable
    -0.16
     Nem
    -0.15
    åĩĿ
    -0.15
    ERM
    -0.15
    ninger
    -0.14
    uddle
    -0.14
    æİĽ
    -0.14
    tplib
    -0.14
    åĿĤ
    -0.14
    æŁĦ
    -0.14
    POSITIVE LOGITS
    468
    0.17
    FER
    0.16
     neighboring
    0.15
    469
    0.15
    anko
    0.14
    467
    0.14
    lez
    0.14
     lekker
    0.14
    ÑĭÑĪ
    0.13
    mazon
    0.13
    Act Density 0.019%

    No Known Activations