INDEX
    Explanations

    phrases indicating a specific sequence of steps or instructions

    occurrences of the word "The" across various contexts

    Negative Logits
    Token          Logit
    "SHARE"        -0.77
    "20439"        -0.74
    "LGBT"         -0.72
    "Brexit"       -0.71
    "Scotland"     -0.68
    "abuse"        -0.67
    "Pol"          -0.66
    "SPONSORED"    -0.66
    "ghazi"        -0.66
    "AIDS"         -0.66
    Positive Logits
    Token           Logit
    "oret"          1.52
    " easiest"      1.47
    " downside"     1.36
    " simplest"     1.31
    " drawback"     1.30
    " resultant"    1.22
    " resulting"    1.18
    " difference"   1.18
    " quickest"     1.17
    " goal"         1.17
    Activation Density: 0.346%

    No Known Activations