INDEX
    Explanations

    mentions of the word "Gun" in the text

    New Auto-Interp
    Negative Logits
     Strawberry
    -0.75
    ç«
    -0.71
     Archdemon
    -0.69
     Hort
    -0.67
     Dragons
    -0.67
     Virtue
    -0.64
     coli
    -0.63
     Humph
    -0.60
     Advis
    -0.59
    uthor
    -0.59
    POSITIVE LOGITS
    powder
    1.74
    nery
    1.32
    smith
    1.23
    ner
    1.20
    ners
    1.16
    nar
    1.14
    ning
    1.12
    fight
    1.11
    boats
    1.11
    fighters
    1.11
    Act Density 0.032%

    No Known Activations