INDEX
    Explanations

    instances of the word "The" and its variations

    New Auto-Interp
    Negative Logits
    etheless
    -0.82
     beware
    -0.80
     forcefully
    -0.72
     separately
    -0.71
    thood
    -0.69
     froze
    -0.69
    imposed
    -0.68
    vernment
    -0.67
     subjected
    -0.67
     patiently
    -0.66
    POSITIVE LOGITS
    atre
    1.19
    oret
    1.09
     Simpsons
    1.01
     Verge
    0.97
     Amazing
    0.96
     Big
    0.96
     Week
    0.96
     Greatest
    0.95
     Economist
    0.95
     Conversation
    0.94
    Act Density 0.053%

    No Known Activations