INDEX
    Explanations

    references to personal narratives and historical documents

    New Auto-Interp
    Negative Logits
    ä¹ĥ
    -0.14
    ATUS
    -0.14
     hav
    -0.14
     converged
    -0.13
    jak
    -0.13
    Äħż
    -0.13
     alleged
    -0.13
     Hav
    -0.13
     hed
    -0.13
     Lage
    -0.12
    POSITIVE LOGITS
     entries
    0.17
     Routine
    0.16
    æĿŁ
    0.15
    NX
    0.15
    Entries
    0.14
    entries
    0.14
     richest
    0.14
     zengin
    0.14
    odate
    0.14
    ãĥ¬ãĥ³
    0.14
    Act Density 0.019%

    No Known Activations