INDEX
    Explanations

    words related to intentional harm or deceitful actions

    instances of the word "screw" and its variations

    Negative Logits
    Token             Logit
    "åī"              -0.79
    "usable"          -0.74
    "CTV"             -0.71
    "vation"          -0.69
    "CVE"             -0.65
    "outh"            -0.65
    "Interstitial"    -0.64
    "obook"           -0.64
    " sovere"         -0.63
    "icipated"        -0.63
    Positive Logits
    Token         Logit
    "driver"      1.31
    "drivers"     1.11
    "hole"        0.94
    " Whedon"     0.94
    " screws"     0.91
    "holes"       0.88
    "ball"        0.85
    " screw"      0.83
    "balls"       0.82
    "nuts"        0.76
    Activation Density: 0.014%

    No Known Activations