INDEX
    Explanations

    contradictory statements or opposing viewpoints within a text

    New Auto-Interp
    Negative Logits
    onomy
    -0.71
     Balkans
    -0.70
    =-=-=-=-=-=-=-=-
    -0.69
     labs
    -0.67
     ted
    -0.64
     Annotations
    -0.63
    burgh
    -0.62
    anus
    -0.62
     laboratories
    -0.61
     anth
    -0.60
    POSITIVE LOGITS
    nings
    0.85
    SPONSORED
    0.82
    âĶĢâĶĢ
    0.76
    essage
    0.70
    invoke
    0.68
    oths
    0.67
     suppose
    0.66
    è£ħ
    0.64
    epad
    0.64
    olphin
    0.64
    Act Density 10.455%

    No Known Activations