INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     accla
    -0.67
     idolat
    -0.67
     racon
    -0.63
     milf
    -0.62
     nicolas
    -0.59
     fash
    -0.59
     hugo
    -0.59
     disgra
    -0.58
     fann
    -0.57
     pamph
    -0.57
    POSITIVE LOGITS
     software
    1.37
     Software
    1.22
    Software
    1.22
    software
    1.21
     SOFTWARE
    1.11
    SOFTWARE
    0.96
    软件
    0.95
     softwares
    0.82
    oftware
    0.77
     logiciel
    0.72
    Act Density 0.064%

    No Known Activations