    Explanations

    Biblical/mythological figures

    Negative Logits
    Token          Logit
    "_↵"           -0.07
    "amins"        -0.06
    " Ocak"        -0.06
    " obras"       -0.06
    "нообраз"      -0.06
    "aternity"     -0.06
    " lick"        -0.06
    " +↵"          -0.06
    "_fix"         -0.06
    "iot"          -0.06
    Positive Logits
    Token          Logit
    " Vince"       0.07
    " Colum"       0.07
    "つぶ"         0.07
    "_WEB"         0.06
    " OCC"         0.06
    " doğrudan"    0.06
    "_hw"          0.06
    "(Number"      0.06
    " ARR"         0.06
    " Федера"      0.06
    Activation Density: 0.051%

    No Known Activations