INDEX
    Explanations

    negations and uncertainty

    New Auto-Interp
    Negative Logits
    тип
    -0.09
     Dieser
    -0.08
     четыр
    -0.08
     Trib
    -0.08
    дя
    -0.08
     cuarto
    -0.08
     époque
    -0.08
    牢记
    -0.08
     đô
    -0.08
     चार
    -0.08
    POSITIVE LOGITS
     inherently
    0.09
     lett
    0.08
     compared
    0.07
    Cmp
    0.07
    APP
    0.07
    +"\
    0.07
    POS
    0.07
    িন্ন
    0.07
     inherent
    0.07
    Therefore
    0.07
    Act Density 0.183%

    No Known Activations