INDEX
    Explanations

    LGBTQ crisis support talk

    Negative Logits
     mitral  0.43
    जीलैंड  0.42
     birdseye  0.41
     malef  0.41
     dailySales  0.39
     centrif  0.39
     بیماری  0.39
     frauen  0.38
    护理  0.38
     analisi  0.38
    Positive Logits
    бята  0.36
    Talk  0.33
    ու  0.33
    एनजी  0.32
     스스로  0.32
     Talk  0.32
     бесе  0.32
     LGBT  0.31
    Chat  0.31
     mixture  0.31
    Act Density 0.005%

    No Known Activations