    Explanations

    phrases indicating negation or denial in various contexts

    Negative Logits
    Token               Logit
    "]);                -0.67
    )});                -0.58
    )];                 -0.58
    '));                -0.55
    ')));               -0.55
     }}$}               -0.54
    )');                -0.53
    ')")                -0.53
    cosh                -0.53
    Sklici              -0.53
    Positive Logits
    Token               Logit
    ulitan              0.63
     necessarily        0.62
     quelconque         0.57
     BoxDecoration      0.57
    mehl                0.57
    oredCriteria        0.56
    necessarily         0.56
     except             0.55
     any                0.54
     necessariamente    0.54
    Activation Density: 0.810%

    No Known Activations