    Explanations

    mentions of trends in various contexts

    Negative Logits
    Token                 Logit
    "<bos>"               -3.44
    (token not shown)     -1.11
    "public"              -0.79
    "/***\n"              -0.77
    "/*"                  -0.76
    "\n"                  -0.76
    "<?"                  -0.72
    "protected"           -0.69
    "via"                 -0.66
    " put"                -0.66
    Positive Logits
    Token                 Logit
    " affor"               1.84
    " bandung"             1.80
    " maneu"               1.78
    " Minang"              1.76
    " increa"              1.74
    " strick"              1.66
    " jaya"                1.66
    " Khart"               1.64
    " guarante"            1.62
    " reluct"              1.61
    Activation Density: 0.090%

    No Known Activations