INDEX
    Explanations

    instances of the word "the."

    New Auto-Interp
    Negative Logits
    éĤ¦
    -0.15
    stadt
    -0.15
     area
    -0.14
    ahn
    -0.14
    228
    -0.14
    igid
    -0.13
    _area
    -0.13
    å
    -0.13
    hoff
    -0.13
    Extent
    -0.13
    POSITIVE LOGITS
     expense
    0.40
    expense
    0.30
     Expense
    0.30
     mercy
    0.27
     helm
    0.26
     end
    0.25
     urging
    0.24
     beginning
    0.24
     request
    0.24
     discretion
    0.23
    Act Density 0.104%

    No Known Activations