INDEX
    Explanations

    expressions of knowledge or awareness

    New Auto-Interp
    Negative Logits
     McCart
    -0.16
    ayah
    -0.15
    emailer
    -0.15
    иÑĤом
    -0.14
    agua
    -0.14
    Ïĥκε
    -0.14
    opis
    -0.14
    iland
    -0.14
     недел
    -0.13
    .me
    -0.13
    POSITIVE LOGITS
    anko
    0.15
    463
    0.15
    462
    0.15
    ضر
    0.14
    EDGE
    0.14
    lich
    0.14
    antic
    0.13
    ÏĨα
    0.13
    rang
    0.13
    ])]
    0.13
    Act Density 0.057%

    No Known Activations