INDEX
    Explanations

    instances of high numerical values representing importance or significance

    Negative Logits
    Ctl         -0.17
    illon       -0.16
    pard        -0.16
     milano     -0.15
    ãĤº         -0.15
    efa         -0.15
    auf         -0.14
    ÑĢел        -0.14
    .codehaus   -0.14
    cret        -0.14
    Positive Logits
    akan        0.18
     ìĤ¬íķŃ     0.17
    aste        0.16
    å¾          0.14
    ASTE        0.14
                0.14
    /Area       0.14
    uben        0.14
     Vaugh      0.14
    Advisor     0.13
    Activation Density: 0.051%

    No Known Activations