INDEX
    Explanations

    the presence of quotation marks or apostrophes in the text

    New Auto-Interp
    Negative Logits
    ный
    -0.67
    tft
    -0.66
     Harn
    -0.66
    ة
    -0.65
    ment
    -0.65
     Genova
    -0.64
     Martens
    -0.64
    éraux
    -0.64
     Healey
    -0.63
     CDT
    -0.63
    POSITIVE LOGITS
     ’
    1.12
     ''
    1.08
    SpringBootTest
    1.06
    :''
    0.87
    (''
    0.84
    : 
    0.84
    ?''
    0.83
    menistan
    0.81
     isShow
    0.81
     Nicky
    0.80
    Act Density 0.175%

    No Known Activations