INDEX
    Explanations

    instances of the word "broken" and its variations, indicating a focus on themes of damage or distress

    New Auto-Interp
    Negative Logits
     اÙĦاØŃ
    -0.16
    æ¹¾
    -0.16
    ropa
    -0.16
    नल
    -0.15
    .synthetic
    -0.14
    IFI
    -0.14
    ellen
    -0.14
    azzo
    -0.14
    rex
    -0.14
    shint
    -0.14
    POSITIVE LOGITS
    -hearted
    0.28
     broken
    0.28
    broken
    0.28
    heart
    0.28
    Broken
    0.24
     Broken
    0.23
     promises
    0.21
     pieces
    0.21
    -down
    0.20
     promise
    0.20
    Act Density 0.029%

    No Known Activations