INDEX
    Explanations

    terms related to home life and community events

    Negative Logits
    Token            Logit
    "irs"            -0.18
    "avad"           -0.16
    "/the"           -0.15
    "orate"          -0.15
    "رÙĪØ²"         -0.14
    "olean"          -0.14
    "央"             -0.14
    "AMESPACE"       -0.13
    " Kostenlose"    -0.13
    " mej"           -0.13
    Positive Logits
    Token            Logit
    "acro"           0.17
    " quot"          0.16
    "illion"         0.16
    "%E"             0.15
    "bsp"            0.15
    "idden"          0.14
    "sock"           0.14
    "ován"           0.14
    " Repos"         0.14
    " same"          0.14
    Activation Density: 0.071%

    No Known Activations