INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Personendaten
    -0.42
     hemel
    -0.42
     ethical
    -0.42
    protoimpl
    -0.41
     péné
    -0.40
     wiers
    -0.39
     abito
    -0.38
    fficio
    -0.38
    fileID
    -0.38
    愛知県
    -0.38
    POSITIVE LOGITS
    Thanks
    0.90
    thanks
    0.89
     Thanks
    0.89
    THANKS
    0.86
     thanks
    0.85
     THANKS
    0.82
     Thx
    0.79
    Thx
    0.73
    Thanx
    0.72
    thx
    0.68
    Act Density 0.086%

    No Known Activations