INDEX
    Explanations

    instances of the word "de" and its variations

    After token "De" or "de"

    de followed by specific words

    New Auto-Interp
    Negative Logits
    WebVitals
    -0.75
    /**
    -0.59
    StructEnd
    -0.59
    contentLoaded
    -0.59
     forward
    -0.55
    Demografie
    -0.54
     createState
    -0.54
     oprot
    -0.53
    UnsafeEnabled
    -0.53
    دانشنامهٔ
    -0.53
    POSITIVE LOGITS
     facto
    0.43
    商品説明
    0.43
    ValueStyle
    0.43
     للمعارف
    0.41
    udas
    0.41
     bedste
    0.40
    irdre
    0.40
    liber
    0.40
     graded
    0.39
     Beers
    0.39
    Act Density 0.103%

    No Known Activations