INDEX
    Explanations

    words related to humor or comedic elements

    references to humor and comedic elements

    NEGATIVE LOGITS
    holder              -0.68
    arnaev              -0.67
    FT                  -0.62
    Ag                  -0.60
    minster             -0.59
     Peaks              -0.58
    ¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯    -0.57
     opio               -0.57
     eyed               -0.57
    REAM                -0.57
    POSITIVE LOGITS
    ously               1.29
     humour             0.95
     humor              0.94
    ably                0.91
    ingly               0.81
    lessly              0.81
    netflix             0.80
    isma                0.76
    atur                0.76
    osity               0.75
    Activation Density 0.017%

    No Known Activations