INDEX
    Explanations

    the presence of the letter 'd' in words

    New Auto-Interp
    Negative Logits
    parsedMessage
    -0.95
     tartalomajánló
    -0.82
    verwijspagina
    -0.78
    SourceChecksum
    -0.74
    Искәрмәләр
    -0.72
    *************
    -0.72
     distanciation
    -0.71
    cifix
    -0.70
    lím
    -0.70
    ciosa
    -0.69
    POSITIVE LOGITS
    d
    0.86
    ve
    0.81
     been
    0.69
     had
    0.69
    談社
    0.68
    ll
    0.67
     params
    0.65
    elems
    0.65
    д
    0.64
    ப்
    0.64
    Act Density 0.040%

    No Known Activations