INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     rè
    -0.56
     diagon
    -0.56
     Primaria
    -0.53
     pyram
    -0.53
    sune
    -0.50
     eviden
    -0.50
     Fichier
    -0.50
     simplif
    -0.50
     priva
    -0.49
     cenar
    -0.49
    POSITIVE LOGITS
     folks
    1.18
     Folks
    1.15
    Folks
    1.09
     folk
    1.01
    folk
    0.93
     Folk
    0.86
    Folk
    0.79
    fol
    0.71
     felicity
    0.65
    FOLK
    0.65
    Act Density 0.068%

    No Known Activations