INDEX
    Explanations

    words and phrases related to authority or governance

    Negative Logits
     sel
    -0.14
    stable
    -0.14
    ignon
    -0.14
    xe
    -0.13
    ести
    -0.13
    otr
    -0.13
    636
    -0.13
    оказ
    -0.13
    .AutoScale
    -0.13
    ɵ
    -0.13
    Positive Logits
    errat
    0.16
    ossa
    0.16
     Byl
    0.15
    굴
    0.15
     Ecc
    0.14
    abela
    0.14
    λω
    0.14
    اع
    0.14
    anse
    0.14
     Incontri
    0.14
    Activation Density: 0.026%

    No Known Activations