INDEX
    Explanations

    the presence of the word "der" in various contexts

    New Auto-Interp
    Negative Logits
    OrNil
    -0.50
    InjectAttribute
    -0.45
     consum
    -0.45
    guan
    -0.45
     recommendation
    -0.44
    __*/
    -0.44
     Guan
    -0.44
     recom
    -0.43
     developed
    -0.42
    NameValuePair
    -0.42
    POSITIVE LOGITS
     der
    1.59
    der
    0.87
     Der
    0.83
    Der
    0.79
     DER
    0.65
    deri
    0.56
    DER
    0.53
    dert
    0.52
    derma
    0.52
    dering
    0.48
    Act Density 0.060%

    No Known Activations