INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    <bos>
    -1.24
    -0.84
    <?
    -0.75
    public
    -0.73
     enable
    -0.71
    /*
    -0.69
    enable
    -0.69
    ണ്ട
    -0.68
     enhance
    -0.67
    /**
    -0.66
    POSITIVE LOGITS
     increa
    2.12
     affor
    2.10
     maneu
    2.06
     impra
    2.05
     Juf
    2.00
     philanth
    1.89
     accla
    1.88
     wherea
    1.85
     guarante
    1.83
     inev
    1.83
    Act Density 0.096%

    No Known Activations