INDEX
    Explanations

    phrases indicating multiple instances or occurrences of something

    New Auto-Interp
    Negative Logits
    vest
    -1.00
    asta
    -0.93
    ittens
    -0.91
    NER
    -0.90
    bas
    -0.89
    amen
    -0.89
    Reviewer
    -0.89
    oper
    -0.88
    roit
    -0.88
    istan
    -0.87
    POSITIVE LOGITS
     hundred
    1.99
     thousand
    1.76
     dozen
    1.69
     iterations
    1.36
     occasions
    1.28
     times
    1.21
    teenth
    1.18
     months
    1.17
     aspects
    1.15
    dozen
    1.14
    Act Density 0.812%

    No Known Activations