    Explanations

    frequent occurrences of the word "de" in various contexts

    Followed by a preposition

    Negative Logits
        Token          Logit
        " faſt"        -0.66
        " raiſ"        -0.63
        " pleaſure"    -0.60
        " slutt"       -0.59
        " chrétien"    -0.58
        " ſtand"       -0.56
        " ainfi"       -0.56
        " ſever"       -0.56
        " ſta"         -0.54
        "ſelf"         -0.54
    Positive Logits
        Token          Logit
        " de"          1.34
        " De"          0.91
        " of"          0.91
        " di"          0.88
        "De"           0.86
        " OF"          0.79
        " DE"          0.78
        " von"         0.75
        "ของ"          0.75
        " де"          0.72
    Activation Density: 0.069%

    No Known Activations