INDEX
    Explanations

    instances of the word "the" with varying emphasis levels based on context

    occurrences of the word "the" in various contexts

    New Auto-Interp
    Negative Logits
    leground
    -0.74
    ielding
    -0.73
     thereby
    -0.72
    Whether
    -0.71
    agger
    -0.70
     amid
    -0.68
    / 
    -0.67
    ģĸ
    -0.67
    Develop
    -0.67
     borne
    -0.66
    POSITIVE LOGITS
     coolest
    1.18
     pics
    1.04
     guy
    1.01
     damn
    0.98
     fuck
    0.96
     slightest
    0.95
     whole
    0.93
     shitty
    0.92
     shit
    0.91
     crap
    0.89
    Act Density 0.550%

    No Known Activations