INDEX
    Explanations

    emphatic phrases and affirmations

    New Auto-Interp
    Negative Logits
     InputDecoration
    -0.92
     CreateTagHelper
    -0.91
    ])):
    -0.66
    RectangleBorder
    -0.66
    ≦)
    -0.62
    :])
    -0.62
     مشين
    -0.61
    cline
    -0.60
    πάρχ
    -0.60
    +#+#
    -0.60
    POSITIVE LOGITS
    principalTable
    0.55
     Bucure
    0.51
    komen
    0.50
     București
    0.50
     Benav
    0.49
     говорю
    0.48
     secund
    0.48
     Đồng
    0.48
    Dettagli
    0.47
     étranger
    0.47
    Act Density 0.003%

    No Known Activations