INDEX
    Explanations

    instances of numerical data and specific proper nouns

    Negative Logits
    uder        -0.17
    ucas        -0.16
    eldon       -0.15
    ÃŃÅĻ        -0.15
    èĹį         -0.15
     wid        -0.15
    394         -0.15
     inf        -0.15
     widest     -0.14
     bin        -0.14
    Positive Logits
    assel       0.15
    modelName   0.15
    uli         0.14
     drafting   0.14
    rics        0.14
    aler        0.14
    TEX         0.14
    (^          0.14
    anca        0.14
    ormsg       0.14
    Act Density 0.118%

    No Known Activations