INDEX
    Explanations

    phrases of the form "a [something] manner/way" or similar

    New Auto-Interp
    Negative Logits
     nahilalakip
    -1.13
    GEBURTSDATUM
    -1.02
     '\\;'
    -0.97
     itſelf
    -0.94
     myſelf
    -0.94
     doubtnut
    -0.93
     ―――――
    -0.93
     Theſe
    -0.91
    -0.91
     pinulongan
    -0.91
    POSITIVE LOGITS
     n
    0.59
     A
    0.57
     to
    0.55
    0.55
     a
    0.54
    -
    0.53
     N
    0.51
     independent
    0.50
    addOn
    0.49
     is
    0.49
    Act Density 0.021%

    No Known Activations