INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     mourut
    -0.66
     soutenu
    -0.55
     illustrazione
    -0.52
     juger
    -0.52
     valutazione
    -0.51
     vidare
    -0.50
     montagna
    -0.49
     preghiera
    -0.49
    cemment
    -0.49
     pequenas
    -0.48
    POSITIVE LOGITS
    <bos>
    0.86
    DeleteBehavior
    0.79
     дописавши
    0.70
     الحره
    0.68
    migrationBuilder
    0.68
    ContentAsync
    0.67
    aarrggbb
    0.66
    afficheront
    0.66
     שוליים
    0.65
    homonymie
    0.65
    Act Density 0.000%

    No Known Activations