INDEX
    Explanations

    mentions of songs or song-related words

    references to music and songs

    New Auto-Interp
    Negative Logits
     Inqu
    -0.69
    ategory
    -0.68
    amily
    -0.66
    aples
    -0.66
    agons
    -0.65
     srfAttach
    -0.64
    achev
    -0.64
    iencies
    -0.61
    alsh
    -0.61
     Dhabi
    -0.60
    POSITIVE LOGITS
    writer
    1.58
    writers
    1.55
    stress
    1.52
    writing
    1.50
     lyrics
    1.46
    bird
    1.45
     lyric
    1.31
    birds
    1.28
     sung
    1.06
     songs
    1.05
    Act Density 0.039%

    No Known Activations