INDEX
    Explanations

    contradictory statements or contrasts in the text

    New Auto-Interp
    Negative Logits
    ]--;
    -0.60
     Tiberius
    -0.59
     Kraken
    -0.56
     medesimo
    -0.55
     Kolo
    -0.54
     coar
    -0.54
     Baylor
    -0.54
     bacio
    -0.52
     helical
    -0.52
     helico
    -0.52
    POSITIVE LOGITS
     simply
    0.84
    aarrggbb
    0.75
    ValueGeneration
    0.72
     vielmehr
    0.72
     just
    0.71
     lenker
    0.68
    simply
    0.64
     merely
    0.64
    انجليز
    0.61
    Референце
    0.59
    Act Density 0.151%

    No Known Activations