INDEX
    Explanations

    words related to acts of violence or killing

    occurrences of the word "slaughter."

    New Auto-Interp
    Negative Logits
     Ronaldo
    -0.64
     Ric
    -0.62
     Richards
    -0.61
     Crom
    -0.61
     Princ
    -0.60
     Sovereign
    -0.60
    Scot
    -0.60
    âĵĺ
    -0.59
     retention
    -0.59
    aucas
    -0.59
    POSITIVE LOGITS
     Slaughter
    1.17
     slaughter
    1.15
    houses
    1.08
    \\\\\\\\
    1.01
    edIn
    0.93
    ãĥ¼ãĤ¯
    0.92
    house
    0.91
    quished
    0.90
    fest
    0.88
     slaughtered
    0.84
    Act Density 0.012%

    No Known Activations