INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Ottobre
    -0.93
     Luglio
    -0.79
     Giugno
    -0.78
     Settembre
    -0.74
    strona
    -0.66
    Πηγές
    -0.66
    Și
    -0.62
    nasel
    -0.61
     dovr
    -0.60
    -0.59
    POSITIVE LOGITS
    0
    0.56
    4
    0.54
    8
    0.54
    5
    0.54
    6
    0.53
    7
    0.53
     kredi
    0.52
    3
    0.52
     Respectfully
    0.52
    9
    0.50
    Act Density 0.123%

    No Known Activations