INDEX
    Explanations

    mentions of specific names, likely related to social media accounts or individuals

    proper nouns and names related to individuals or organizations

    New Auto-Interp
    Negative Logits
    zos
    -0.90
     Rez
    -0.89
     Jonas
    -0.86
     Pok
    -0.86
     Zen
    -0.83
    zn
    -0.83
    zo
    -0.82
    ten
    -0.82
     Eisen
    -0.81
    Pie
    -0.75
    POSITIVE LOGITS
    iler
    0.87
    av
    0.85
    taboola
    0.84
    ave
    0.84
    aver
    0.83
    BB
    0.81
    aving
    0.80
    CRIP
    0.80
    cle
    0.78
    iling
    0.77
    Act Density 0.589%

    No Known Activations