INDEX
    Explanations

    references to historical events and character interactions

    followed by "was" or other auxiliary verbs

    token followed by specific word

    New Auto-Interp
    Negative Logits
    autonomie
    -0.57
    bildeten
    -0.48
    initas
    -0.48
    appartamento
    -0.47
     rightfully
    -0.47
     acrylique
    -0.46
    améli
    -0.46
    ddots
    -0.46
     genauso
    -0.45
    ziehungs
    -0.45
    POSITIVE LOGITS
    WriteTagHelper
    0.67
     estekak
    0.66
     propOrder
    0.66
     eenig
    0.65
    +#+#
    0.64
     InputDecoration
    0.64
    uxxxx
    0.63
    Tikang
    0.63
     ModelExpression
    0.62
    UrlResolution
    0.62
    Act Density 0.311%

    No Known Activations