    Explanations

    references to legal consequences or punishments

    Negative Logits
     surla          -0.59
    følgelig        -0.57
    __);            -0.54
    Datuak          -0.54
    جانب            -0.52
     malheur        -0.50
     preuve         -0.49
     coté           -0.48
     vorbei         -0.47
    ָד              -0.47

    Positive Logits
    tinyos          0.77
    ValueStyle      0.69
     kasarigan      0.67
    Tikang          0.62
     künftig        0.58
     PyLong         0.58
    awaiter         0.55
    Saiba           0.54
    queryInterface  0.54
    artifactId      0.53
    Activation Density: 0.411%

    No Known Activations