INDEX
    Explanations

    references to specific names or titles

    proper nouns, specifically names related to brands, movies, or places

    New Auto-Interp
    Negative Logits
    ";
    -0.56
    SPONSORED
    -0.56
     ______
    -0.54
     ��������
    -0.52
     Shutterstock
    -0.51
    advertisement
    -0.51
    )",
    -0.51
     behalf
    -0.49
    .....
    -0.49
    .):
    -0.49
    POSITIVE LOGITS
     assures
    0.92
     has
    0.92
     insists
    0.91
     hasn
    0.90
     agrees
    0.90
     believes
    0.90
     knows
    0.89
     intends
    0.88
     recommends
    0.88
     expects
    0.86
    Act Density 0.694%

    No Known Activations