INDEX
    Explanations

    conjunctions and contrasting phrases in the text

    New Auto-Interp
    Negative Logits
    atividad
    -0.52
    irs
    -0.52
     Référence
    -0.47
    ir
    -0.45
    ))).
    -0.44
     Madre
    -0.44
    wards
    -0.43
    yn
    -0.42
    ceus
    -0.41
    स्व
    -0.41
    POSITIVE LOGITS
    sizeCache
    0.81
    UserScript
    0.80
     محفوظة
    0.76
     مرئيه
    0.73
     continúas
    0.71
    BeginContext
    0.70
    djangoproject
    0.69
    ParallelGroup
    0.66
     gouttes
    0.66
    зулта
    0.65
    Act Density 0.485%

    No Known Activations