INDEX
    Explanations

    words that indicate actions or processes related to improvement and evaluation

    New Auto-Interp
    Negative Logits
     suivante
    -0.49
    лыша
    -0.48
     relazioni
    -0.47
    <bos>
    -0.46
    حوالہ
    -0.45
     alimentaires
    -0.45
     alimentaire
    -0.43
     ویکی‌پدی
    -0.43
     Wicidata
    -0.42
     honte
    -0.42
    POSITIVE LOGITS
    ThroughAttribute
    0.85
    LEncoder
    0.81
    yargs
    0.75
    sequelize
    0.73
    Clik
    0.72
    клопе
    0.71
    parsedMessage
    0.69
     فريبيس
    0.69
    MockBean
    0.67
    CloseOperation
    0.67
    Act Density 0.480%

    No Known Activations