INDEX
    Explanations

    numerical values and percentages related to research data

    Negative Logits
    Token            Logit
    "AVE"            -0.17
    " Shib"          -0.14
    " Yen"           -0.14
    ".html"          -0.14
    "iale"           -0.14
    "loor"           -0.14
    "lingen"         -0.14
    "shan"           -0.14
    "LOPT"           -0.14
    "ATCH"           -0.14
    Positive Logits
    Token            Logit
    "Occurred"       0.15
    "ÙĪØ¹"           0.14
    "ustain"         0.14
    "orget"          0.14
    "uger"           0.14
    "ç"              0.14
    "_verification"  0.14
    "elight"         0.14
    "Ā"              0.13
    "xfff"           0.13
    Activation Density: 0.001%

    No Known Activations