INDEX
    Explanations

    the structure and formatting of film reviews, including titles and release years

    New Auto-Interp
    Negative Logits
    ublish
    -0.16
    ListOf
    -0.15
    DataStream
    -0.14
    ugg
    -0.14
    FRING
    -0.14
    ugins
    -0.14
    IRS
    -0.14
    omik
    -0.14
    reate
    -0.14
    CONS
    -0.14
    POSITIVE LOGITS
    Pixels
    0.21
     Venom
    0.20
     Maze
    0.20
     Suicide
    0.19
     Sic
    0.19
     Aqu
    0.18
    Padding
    0.18
     Deadpool
    0.18
     Padding
    0.17
     Kings
    0.17
    Act Density 0.116%

    No Known Activations