INDEX
    Explanations

    negations and expressions of inability or non-existence

    New Auto-Interp
    Negative Logits
    adpleegd
    -0.78
    setSource
    -0.76
     Thales
    -0.75
     merce
    -0.75
     Weiss
    -0.75
     Gibbs
    -0.74
     PACE
    -0.73
     _('
    -0.71
    ebe
    -0.71
    Meyer
    -0.71
    POSITIVE LOGITS
     isn
    1.28
    __":
    
    1.26
     wasn
    1.22
     weren
    1.20
     Wasn
    1.19
     didn
    1.16
     Isn
    1.15
     aren
    1.15
    __':
    
    1.14
     mustn
    1.14
    Act Density 0.076%

    No Known Activations