INDEX
    Explanations

    expressions of personal enthusiasm or preference

    New Auto-Interp
    Negative Logits
    <bos>
    -1.82
    protected
    -0.68
    /**
    -0.67
    var
    -0.64
     became
    -0.64
     appeared
    -0.63
     become
    -0.63
    enumerate
    -0.63
    public
    -0.62
     becomes
    -0.62
    POSITIVE LOGITS
     Minang
    1.38
     increa
    1.36
     reluct
    1.35
     impra
    1.31
     cytoplas
    1.30
     swarovski
    1.28
     affor
    1.27
     quoique
    1.27
     disreg
    1.25
     maneu
    1.25
    Act Density 0.291%

    No Known Activations