INDEX
    Explanations

    phrases related to past states of being or situations that have changed over time

    phrases indicating a transformation or transition from one state to another

    New Auto-Interp
    Negative Logits
     Retrieved
    -0.82
    2018
    -0.74
    wake
    -0.70
     Cosponsors
    -0.69
    update
    -0.69
     2018
    -0.68
     Extend
    -0.68
     Recall
    -0.67
    "]=>
    -0.66
     Whereas
    -0.66
    POSITIVE LOGITS
     unthinkable
    1.24
     taboo
    1.04
     unimaginable
    1.02
     innocuous
    1.00
     harmless
    0.97
     dormant
    0.96
     regarded
    0.92
    thinkable
    0.92
     unheard
    0.90
     timid
    0.87
    Act Density 0.267%

    No Known Activations