INDEX
    Explanations

    expressions of personal opinions and reflections on events or choices

    Negative Logits
    unction     -0.19
    elsinki     -0.17
    unden       -0.15
    opis        -0.15
    ohana       -0.15
     hed        -0.15
    inho        -0.14
     persons    -0.14
     seems      -0.14
     Ñģб        -0.14
    Positive Logits
    æģ          0.17
    angan       0.16
    dar         0.16
    ayi         0.15
    krom        0.14
     dư         0.14
     δεδο       0.14
    ardin       0.14
    å¢          0.13
    Gram        0.13
    Act Density 0.330%

    No Known Activations