INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Administrativna
    -0.81
    MemoryWarning
    -0.81
    windowFixed
    -0.72
    NameInMap
    -0.69
     Préférences
    -0.69
    tagHelperRunner
    -0.68
    EndContext
    -0.68
    RefNanny
    -0.67
     disambiguazione
    -0.66
    ')";
    -0.65
    POSITIVE LOGITS
     delin
    0.50
    aronder
    0.43
     default
    0.42
    default
    0.41
     “
    0.41
     شع
    0.41
    IDATE
    0.39
     "
    0.38
     сом
    0.38
    old
    0.38
    Act Density 0.002%

    No Known Activations