INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Bast
    -0.07
     dynamics
    -0.07
     Toilet
    -0.06
     plugin
    -0.06
     Audio
    -0.06
     dere
    -0.06
     tout
    -0.06
     Ev
    -0.06
     django
    -0.06
     negativity
    -0.06
    POSITIVE LOGITS
     ##↵
    0.07
    %
    ↵
    0.07
    };
    ↵
    0.07
    }↵
    0.06
    //---------------------------------------------------------------------------↵↵
    0.06
    UEL
    0.06
     +
    ↵
    0.06
    SHORT
    0.06
    stile
    0.06
    outdir
    0.06
    Act Density 0.029%

    No Known Activations