INDEX
    Explanations

    references to the word "the."

    Negative Logits
        Token               Logit
        " decidedly"        -0.52
        " ostensibly"       -0.51
        " normalerweise"    -0.51
        "typically"         -0.50
        "Toponymie"         -0.49
        "berdayakan"        -0.48
        " admittedly"       -0.48
        " daarvoor"         -0.48
        "Often"             -0.47
        " 太郎"             -0.47
    Positive Logits
        Token               Logit
        " said"             0.66
        " concerned"        0.59
        " mentioned"        0.58
        " stuffs"           0.55
        " whole"            0.54
        " equipments"       0.54
        " evidences"        0.52
        " beginners"        0.52
        " matters"          0.52
        " suitable"         0.50
    Activation Density: 1.071%

    No Known Activations