INDEX
    Explanations

    references to past studies or research

    New Auto-Interp
    Negative Logits
    -0.85
     mourut
    -0.81
     bicara
    -0.78
     françaises
    -0.73
     QtGui
    -0.70
    Milo
    -0.69
     trouvera
    -0.69
     découv
    -0.68
     metallo
    -0.67
     sfida
    -0.66
    POSITIVE LOGITS
     Previous
    1.20
     previous
    1.20
    Previous
    1.14
    previous
    1.07
     PREVIOUS
    1.05
    PREVIOUS
    1.03
     previos
    1.02
     Prev
    0.95
     Previously
    0.94
    previously
    0.92
    Act Density 0.069%

    No Known Activations