INDEX
    Explanations

    phrases that involve inconsistencies or contrasts in statements

    New Auto-Interp
    Negative Logits
    )))));
    -0.67
    rungsseite
    -0.64
    ))));
    -0.64
    "]="
    -0.63
    ContentAsync
    -0.62
    ]));
    
    -0.62
    ModelSerializer
    -0.62
    }],
    
    -0.62
     ویکی‌پدی
    -0.61
    ]<<"
    -0.60
    POSITIVE LOGITS
     often
    0.80
     usually
    0.79
     sometimes
    0.79
     souvent
    0.75
     oftentimes
    0.73
     kadang
    0.73
     Often
    0.72
     parfois
    0.72
    often
    0.71
    usually
    0.71
    Act Density 0.264%

    No Known Activations