INDEX
    Explanations

    words related to a specific TV show or franchise

    mentions of the TV show "Game of Thrones."

    New Auto-Interp
    Negative Logits
    ought
    -0.66
    iffe
    -0.65
    ROR
    -0.62
    ancies
    -0.60
     bapt
    -0.59
    orate
    -0.58
     maxim
    -0.58
    attery
    -0.57
    hips
    -0.57
     Eisen
    -0.57
    POSITIVE LOGITS
    FAQ
    1.13
    Cube
    1.11
    Stop
    1.11
    zeb
    1.06
    Spot
    1.05
    cube
    1.04
    cock
    1.01
    Maker
    0.95
    boy
    0.94
     Freak
    0.90
    Act Density 0.033%

    No Known Activations