INDEX
    Explanations

    discussions about personal growth and maintaining daily activities

    New Auto-Interp
    Negative Logits
    élé
    -0.17
    wers
    -0.16
    wyn
    -0.15
    isma
    -0.14
    aket
    -0.14
    ãĥĩãĤ£ãĥ¼ãĤ¹
    -0.14
     Rosenstein
    -0.14
     Kenn
    -0.14
    uz
    -0.14
    adr
    -0.13
    POSITIVE LOGITS
     normal
    0.31
    Normal
    0.24
     Normal
    0.24
     NORMAL
    0.24
    normal
    0.21
    æŃ£å¸¸
    0.21
    -normal
    0.21
    _normal
    0.20
    NORMAL
    0.20
     normals
    0.19
    Act Density 0.187%

    No Known Activations