INDEX
    Explanations

    instances of the word "gone" and its derivatives

    Negative Logits
    Ïģιά              -0.07
    acia              -0.07
    .scalablytyped    -0.07
    rieg              -0.07
    omaly             -0.07
    roz               -0.07
    ummer             -0.07
     киÑĢ            -0.07
    alis              -0.07
    ars               -0.06
    Positive Logits
    fish              0.06
    gone              0.06
    /un               0.06
     typo             0.06
    alion             0.06
     Gone             0.06
    .habbo            0.06
    vez               0.05
    @g                0.05
    idges             0.05
    Act Density 0.003%

    No Known Activations