INDEX
    Explanations

    names of people, places, or organizations

    New Auto-Interp
    Negative Logits
    agg
    -0.15
    jin
    -0.14
    عÙĨ
    -0.14
    ounder
    -0.14
    å¾³
    -0.13
     takson
    -0.13
    볨
    -0.13
    rocess
    -0.13
    ruz
    -0.13
     Champion
    -0.13
    POSITIVE LOGITS
    asio
    0.16
    dma
    0.15
    aru
    0.15
    UGIN
    0.15
    enstein
    0.15
    Äįást
    0.15
    levation
    0.14
    piler
    0.14
    antor
    0.14
    arine
    0.14
    Act Density 0.139%

    No Known Activations