INDEX
    Explanations

    occurrences of the word "the"

    Followed by "alternative"

    New Auto-Interp
    Negative Logits
    Jîn
    -0.56
     ymin
    -0.55
    кість
    -0.55
     Wikimédia
    -0.54
    UnknownFieldSet
    -0.54
     Gonna
    -0.53
     newName
    -0.53
    gonna
    -0.53
    χή
    -0.52
     nông
    -0.51
    POSITIVE LOGITS
     متعلقه
    0.74
    Билгалдахарш
    0.68
    '],
    
    0.66
    ViewFeatures
    0.66
    '])
    
    0.64
    Lähteet
    0.62
    TestTools
    0.61
    '];
    
    0.61
    expandindo
    0.61
    Referanser
    0.59
    Act Density 0.473%

    No Known Activations