INDEX
    Explanations

    mentions of documents and details related to investigations or disclosures

    Negative Logits
    Token               Logit
    "Videos"            -0.73
    " Videos"           -0.71
    " videos"           -0.69
    " films"            -0.68
    " Interviews"       -0.65
    "ArgsConstructor"   -0.64
    " documentaries"    -0.64
    " Photographs"      -0.64
    " movies"           -0.64
    " Reports"          -0.64
    Positive Logits
    Token               Logit
    " piece"            1.25
    " article"          1.14
    " document"         1.12
    " passage"          0.90
    " sentence"         0.87
    " poem"             0.83
    "piece"             0.81
    " message"          0.81
    " item"             0.80
    "document"          0.77
    Activation Density: 0.615%

    No Known Activations