INDEX
    Explanations

    phrases expressing misconceptions and clarifications regarding ownership and historical contexts

    Contradiction or qualification

    Negative Logits
     EconPapers       -0.50
     utafitiHapana    -0.38
    IntoConstraints   -0.33
    propor            -0.32
    Diwedd            -0.32
    brille            -0.31
     Drag             -0.31
     Peter            -0.31
     Lieber           -0.31
    sorting           -0.31
    Positive Logits
     nonetheless      0.63
     betyr            0.63
     betekent         0.60
     nevertheless     0.59
     doesn            0.59
     betyder          0.58
    並不              0.58
     notwithstanding  0.53
     doesnt           0.53
     does             0.52
    Activation Density: 0.287%

    No Known Activations