INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    featureID
    -0.66
    DoubleQuotes
    -0.60
     pylint
    -0.59
     consultato
    -0.58
    ########.
    -0.56
    BarStyle
    -0.56
    stdc
    -0.55
    EndGlobalSection
    -0.55
    celle
    -0.55
     relâche
    -0.53
    POSITIVE LOGITS
     disambiguazione
    0.61
    цездатний
    0.58
     Wiktionnaire
    0.54
    SearchParams
    0.48
    braio
    0.48
     متعلقه
    0.46
    oredCriteria
    0.46
     <>",
    0.46
    jgl
    0.46
    adaptiveStyles
    0.44
    Act Density 0.105%

    No Known Activations