INDEX
    Explanations

    adverbs that denote rarity or infrequency

    phrases that include the word "rarely."

    New Auto-Interp
    Negative Logits
     Destruction
    -0.72
    oÄŁ
    -0.70
    utenberg
    -0.68
    hang
    -0.68
     Submission
    -0.67
    uid
    -0.67
    pour
    -0.66
    andi
    -0.66
    arta
    -0.65
     Melt
    -0.64
    POSITIVE LOGITS
    theless
    1.32
    entimes
    1.11
    icably
    0.98
     dime
    0.86
    epad
    0.86
     bothered
    0.83
    etheless
    0.81
    icable
    0.78
     seen
    0.75
     occas
    0.75
    Act Density 0.012%

    No Known Activations