INDEX
    Explanations

    phrases related to evaluation or comparison

    phrases indicating criteria or expectations related to performance or evaluations

    Negative Logits
    alion        -0.91
    itional      -0.70
    alysed       -0.69
    gradation    -0.69
    orah         -0.67
    gered        -0.66
    oche         -0.65
    rection      -0.64
    orage        -0.64
    ackets       -0.63
    Positive Logits
     considering       0.83
     rookies           0.72
     someone           0.72
     understatement    0.71
     rookie            0.68
     sandwic           0.62
     fledgling         0.62
     hindsight         0.62
     oneself           0.60
     modern            0.59
    Activation Density: 0.314%

    No Known Activations