INDEX
    Explanations

    mentions of the term 'Murd*er' with different variations, along with some related terms like 'Daredevil'

    occurrences of the word "Murderer" and references to the show "Daredevil."

    New Auto-Interp
    Negative Logits
     conditioning
    -0.71
     placement
    -0.71
     Gemini
    -0.67
     corrective
    -0.65
     magnification
    -0.64
    zed
    -0.62
    condition
    -0.62
     selective
    -0.62
    eq
    -0.62
    graded
    -0.62
    POSITIVE LOGITS
     Murd
    1.37
    stead
    0.91
    erer
    0.90
    erers
    0.89
    erest
    0.81
    aby
    0.80
    aredevil
    0.78
    anth
    0.77
    luaj
    0.77
    ings
    0.76
    Act Density 0.013%

    No Known Activations