INDEX
    Explanations

    phrases indicating uncertainty about future outcomes

    instances of the phrase "to be."

    Negative Logits
    Token               Logit
    "hops"              -0.65
    " Advertisement"    -0.59
    " hamm"             -0.59
    " nets"             -0.56
    " simultane"        -0.56
    " jails"            -0.54
    "pointers"          -0.54
    " Boots"            -0.53
    " suspended"        -0.53
    " banned"           -0.52
    Positive Logits
    Token               Logit
    "ggles"             0.98
    "asty"              0.94
    "othy"              0.86
    "pless"             0.85
    "psy"               0.83
    "ads"               0.76
    "fu"                0.75
    "asted"             0.74
    "lling"             0.74
    "adies"             0.73
    Activation Density: 0.312%

    No Known Activations