INDEX
    Explanations

    proper nouns or names

    specific names and references to individuals or groups

    New Auto-Interp
    Negative Logits
    hap
    -0.90
    pell
    -0.66
    ivably
    -0.59
    oven
    -0.57
    advertisement
    -0.55
    2020
    -0.55
    lov
    -0.54
     versus
    -0.54
     endif
    -0.53
    088
    -0.52
    POSITIVE LOGITS
    taboola
    0.67
    tro
    0.66
     scrut
    0.65
     wont
    0.60
     Tata
    0.57
     ',
    0.57
    tarians
    0.57
    ',
    0.56
     Swed
    0.56
    enum
    0.55
    Act Density 0.396%

    No Known Activations