INDEX
    Explanations

    phrases indicating attentiveness or monitoring

    New Auto-Interp
    Negative Logits
    çͰ
    -0.77
    Half
    -0.70
     disadvant
    -0.66
    é¾įå¥ij士
    -0.66
    VIDIA
    -0.64
     pathology
    -0.61
    Compat
    -0.61
    ibly
    -0.60
    ļé
    -0.60
    IER
    -0.59
    POSITIVE LOGITS
     tabs
    0.72
     inbox
    0.71
     forwarded
    0.69
     Updates
    0.66
    readable
    0.66
    lists
    0.63
    facebook
    0.62
    uum
    0.62
     scrolling
    0.60
     reader
    0.60
    Act Density 0.021%

    No Known Activations