INDEX
    Explanations

    phrases related to conspiracies or evil schemes

    references to "plot" in various contexts

    New Auto-Interp
    Negative Logits
    IDA
    -0.71
     salts
    -0.68
    agles
    -0.66
    Scot
    -0.64
    Downloadha
    -0.63
     Splash
    -0.61
     Rio
    -0.61
    Sales
    -0.60
    Occ
    -0.60
    angelo
    -0.60
    POSITIVE LOGITS
    ters
    0.99
     Plot
    0.87
     twists
    0.86
     Twist
    0.82
    line
    0.82
     hatched
    0.81
    ter
    0.81
     plot
    0.79
     plotting
    0.78
    zag
    0.78
    Act Density 0.031%

    No Known Activations