INDEX
    Explanations

    references to entertainment media such as movies, webcomics, and TV shows

    Negative Logits
    Token                  Logit
    "abwe"                 -0.80
    "cius"                 -0.78
    " independence"        -0.73
    " withdraw"            -0.70
    "atism"                -0.69
    " voluntarily"         -0.69
    " withdrawal"          -0.68
    " responsibilities"    -0.68
    " cross"               -0.65
    " inflation"           -0.64
    Positive Logits
    Token                  Logit
    "PHOTOS"               0.97
    "pmwiki"               0.91
    "Fans"                 0.90
    "Advertisement"        0.87
    "Scroll"               0.86
    "Anyway"               0.84
    "Speaking"             0.83
    "³³³³³³³³³³³³³³³³"     0.83
    "Bonus"                0.81
    " Featuring"           0.81
    Activation Density: 0.479%

    No Known Activations