    Explanations

    references to simulated or imitation versions of something

    references to "mockumentaries" or similar formats

    Negative Logits

    Token                   Logit
    " Horizon"              -0.74
    " cryst"                -0.72
    "ettel"                 -0.67
    " violet"               -0.66
    "pins"                  -0.64
    "ItemThumbnailImage"    -0.64
    "OA"                    -0.63
    " arrang"               -0.63
    "omen"                  -0.63
    " compan"               -0.62

    Positive Logits

    Token                   Logit
    " Mock"                 1.00
    "ument"                 0.98
    "eries"                 0.89
    "ito"                   0.84
    "ingly"                 0.83
    "atory"                 0.82
    "ery"                   0.81
    "eting"                 0.79
    " mock"                 0.77
    "tails"                 0.75

    Activation Density: 0.029%

    No Known Activations