INDEX
    Explanations

    numerical representations such as years, figures, and statistics

    New Auto-Interp
    Negative Logits
    <bos>
    -0.90
    -0.70
    
    
    -0.68
    ുറ
    -0.62
    <?
    
    -0.60
    <?
    -0.59
    //{
    
    -0.57
    -0.56
     may
    -0.56
    ਿੱ
    -0.55
    POSITIVE LOGITS
     maneu
    1.76
     increa
    1.59
     depic
    1.51
     stockholm
    1.49
     suscep
    1.45
     effe
    1.45
     emphat
    1.44
     disreg
    1.44
     guarante
    1.42
     inev
    1.41
    Act Density 0.128%

    No Known Activations