INDEX
    Explanations

    references to addresses and numerical data

    numbers and punctuation

    Negative Logits
     selfies              -0.60
     autorytatywna        -0.59
     '\\;'                -0.58
     incentiv             -0.57
     للمعارف              -0.53
     selfie               -0.53
     utafitiHapana        -0.53
    expandindo            -0.52
    StructEnd             -0.52
    Cyfeiriadau           -0.51
    Positive Logits
    faßt                   0.62
     Einfluß               0.52
     aDecoder              0.49
     Bewußt                0.48
     muß                   0.43
     dentaire              0.42
     daß                   0.41
     Saddam                0.41
    vskip                  0.41
    BrowserModule          0.39
    Activation Density: 0.010%

    No Known Activations