INDEX
    Explanations

    references to unnecessary or problematic expenditures and their implications

    New Auto-Interp
    Negative Logits
    eki
    -0.16
    ahrain
    -0.16
    ubb
    -0.15
    247
    -0.14
    iek
    -0.14
    ãĥ³ãĥ
    -0.14
    lse
    -0.14
    hle
    -0.14
     æ³
    -0.14
    rong
    -0.14
    POSITIVE LOGITS
     needs
    0.39
     needed
    0.38
     need
    0.36
    needs
    0.35
    need
    0.35
     Needed
    0.33
    needed
    0.33
     Needs
    0.33
     NEED
    0.31
    éľĢè¦ģ
    0.31
    Act Density 0.194%

    No Known Activations