INDEX
    Explanations

    names and titles of political figures

    New Auto-Interp
    Negative Logits
    égor
    -0.07
     voks
    -0.06
    主任
    -0.06
    igel
    -0.06
     ЧаÑģ
    -0.06
    _rewrite
    -0.06
    ktop
    -0.06
    dee
    -0.06
    jun
    -0.06
    æ¡Ī
    -0.06
    POSITIVE LOGITS
    vik
    0.07
    UnitOfWork
    0.07
    noinspection
    0.06
     deter
    0.06
    bones
    0.06
    ÏĨι
    0.06
     GHC
    0.06
     enc
    0.06
    ,exports
    0.06
    Enc
    0.06
    Act Density 0.023%

    No Known Activations