INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    rsiniz
    -0.59
    chemy
    -0.52
     gdyby
    -0.49
    出版年
    -0.49
    Spotify
    -0.49
     Chomsky
    -0.47
    Montserrat
    -0.47
    apatalk
    -0.46
    Rahman
    -0.46
     picasso
    -0.46
    POSITIVE LOGITS
     middle
    2.48
    middle
    2.33
    Middle
    2.20
     Middle
    2.08
     MIDDLE
    2.05
    MIDDLE
    1.75
     Middleton
    1.23
     Middles
    1.23
     Middel
    1.16
     middel
    1.11
    Act Density 0.007%

    No Known Activations