INDEX
    Explanations

    phrases describing specific events or situations

    New Auto-Interp
    Negative Logits
    ding
    -0.71
    SPONSORED
    -0.70
    agin
    -0.70
    kaya
    -0.67
    whatever
    -0.66
    zzi
    -0.65
    aking
    -0.64
    åĤ
    -0.63
    udder
    -0.63
    \\\\\\\\
    -0.63
    POSITIVE LOGITS
    soever
    1.23
     asked
    1.10
     confronted
    1.06
     pressed
    1.00
     faced
    0.95
     contacted
    0.87
    ce
    0.86
     questioned
    0.85
     viewed
    0.81
     approached
    0.75
    Act Density 0.084%

    No Known Activations