INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     CreateTagHelper
    -0.84
     Wikispecies
    -0.77
    mobileqq
    -0.72
    \{\\
    -0.71
    Portale
    -0.69
    -0.66
    GEBURTSDATUM
    -0.66
     يتيمه
    -0.65
     bezeichneter
    -0.65
    بوابة
    -0.62
    POSITIVE LOGITS
    mo
    0.41
    Orig
    0.40
    mim
    0.40
    ocin
    0.40
    less
    0.40
    ly
    0.39
    whelmed
    0.38
    d
    0.38
    men
    0.38
     circulating
    0.37
    Act Density 0.005%

    No Known Activations