INDEX
    Explanations

    instances of negation or denial in the text

    New Auto-Interp
    Negative Logits
    unal
    -0.15
     misunderstanding
    -0.15
    eno
    -0.14
    iven
    -0.14
     TestData
    -0.14
    ambi
    -0.14
    ertino
    -0.14
    ernel
    -0.14
    amb
    -0.13
     Marino
    -0.13
    POSITIVE LOGITS
     realize
    0.41
     realization
    0.39
     realise
    0.37
     realizes
    0.36
     realized
    0.36
     realizing
    0.33
     realised
    0.33
     réalis
    0.28
    æĦıè¯Ĩ
    0.25
     realiz
    0.25
    Act Density 0.077%

    No Known Activations