INDEX
    Explanations

    the transition words or phrases that indicate contrast or opposition in statements

    New Auto-Interp
    Negative Logits
     autorytatywna
    -1.19
    fjspx
    -0.96
    MLLoader
    -0.94
     gynhyrchwyd
    -0.93
     مرئيه
    -0.91
    protoimpl
    -0.89
    MessageOf
    -0.89
    LookAnd
    -0.87
    الحياه
    -0.86
    principalColumn
    -0.85
    POSITIVE LOGITS
    norr
    0.51
     leiden
    0.50
     Hartmann
    0.49
    在我的
    0.47
    iro
    0.46
     interference
    0.45
     cómics
    0.44
     Neto
    0.43
    fato
    0.42
    が高い
    0.41
    Act Density 0.039%

    No Known Activations