INDEX
    Explanations

    signs of numerical data or quantifiable terms

    New Auto-Interp
    Negative Logits
    pus
    -0.17
    pone
    -0.16
    UGIN
    -0.16
     MetroFramework
    -0.16
    Ñĸон
    -0.15
    agog
    -0.14
     åŁ
    -0.14
    IEWS
    -0.14
    ená
    -0.13
     spol
    -0.13
    POSITIVE LOGITS
     contributions
    0.16
     ethical
    0.16
    contrib
    0.15
    Contrib
    0.15
    rol
    0.15
    trl
    0.15
    åį·
    0.15
     Contrib
    0.15
     contributors
    0.15
     Contributors
    0.15
    Act Density 0.012%

    No Known Activations