INDEX
    Explanations

    normal way then specific alternative

    New Auto-Interp
    Negative Logits
    ingle
    -0.09
    abl
    -0.09
    phin
    -0.09
    .ErrorCode
    -0.08
     Bacon
    -0.08
     precinct
    -0.08
    quiz
    -0.08
     trib
    -0.08
     Harden
    -0.08
    urg
    -0.08
    POSITIVE LOGITS
     normal
    0.24
    normal
    0.19
     Normal
    0.17
    æŃ£å¸¸
    0.17
     regular
    0.16
     conventional
    0.16
     standard
    0.16
    Normal
    0.15
     response
    0.15
    .normal
    0.13
    Act Density 0.051%

    No Known Activations