INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Luglio
    -0.88
     Settembre
    -0.84
     Ottobre
    -0.84
     Giugno
    -0.78
     LIRE
    -0.77
     DELLE
    -0.77
     chiaramente
    -0.76
     soulign
    -0.71
     préc
    -0.70
     affez
    -0.70
    POSITIVE LOGITS
     McInt
    0.53
     розта
    0.50
    ,
    0.50
    .
    0.48
     Republics
    0.47
     countryman
    0.47
     tarihinde
    0.46
     –
    0.46
     інтер
    0.45
     вік
    0.45
    Act Density 0.183%

    No Known Activations