INDEX
    Explanations

    names of individuals, likely public figures or experts

    New Auto-Interp
    Negative Logits
     affez
    -1.26
     allarg
    -1.26
     cammin
    -1.26
     parati
    -1.25
     soggior
    -1.24
     rilass
    -1.11
     dirit
    -1.07
     cioc
    -1.07
     tramont
    -1.07
     lele
    -1.07
    POSITIVE LOGITS
    0.77
     himself
    0.76
    '
    0.74
     has
    0.59
     is
    0.58
     was
    0.56
    ׳
    0.56
     had
    0.55
     went
    0.55
     herself
    0.54
    Act Density 0.125%

    No Known Activations