INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    natureconservancy
    -0.65
    xtap
    -0.64
    conn
    -0.63
    ĵĺ
    -0.63
    =-=-=-=-=-=-=-=-
    -0.61
    Scope
    -0.60
    essim
    -0.59
    ModLoader
    -0.58
    ĻĤ
    -0.58
    ãĤ©
    -0.57
    POSITIVE LOGITS
    forth
    0.82
    rimination
    0.67
    warts
    0.64
    lict
    0.64
     Forth
    0.63
    oche
    0.63
    adder
    0.63
     Warwick
    0.63
    hyde
    0.63
    arb
    0.62
    Act Density 5.301%

    No Known Activations