    Explanations

    proper nouns, particularly names of people related to movies or media

    Negative Logits
    Token         Logit
    "202"         -0.27
    " macOS"      -0.25
    " ðŁĶ"        -0.24
    "https"       -0.24
    "ðĿ"          -0.24
    "ðŁĴ"         -0.23
    " ðŁij"       -0.23
    " https"      -0.23
    "ðŁ"          -0.23
    "ðŁĺ"         -0.23

    Positive Logits
    Token         Logit
    " TMZ"         0.30
    " Lindsay"     0.26
    " VH"          0.25
    " cele"        0.25
    " pap"         0.23
    " Celebrity"   0.23
    " MTV"         0.23
    " Paris"       0.22
    " Perez"       0.22
    " Radar"       0.21

    Activation Density: 0.063%

    No Known Activations