INDEX
    Explanations

    mentions of specific symbols or characters

    instances of the word "didn't" or its variations

    New Auto-Interp
    Negative Logits
     warp
    -0.67
     pyramid
    -0.67
     chained
    -0.66
     Lancaster
    -0.66
     Rampage
    -0.65
     Hats
    -0.65
     Pony
    -0.63
     draped
    -0.62
     Scarlet
    -0.62
     Yor
    -0.61
    POSITIVE LOGITS
    ï¸ı
    1.08
    vernment
    1.05
    ulty
    1.03
    ¯¯
    1.00
    ufact
    0.99
    conom
    0.98
    £
    0.97
    efe
    0.95
    ember
    0.90
    ¢
    0.88
    Act Density 0.353%

    No Known Activations