INDEX
    Explanations

    Understanding other cultures

    Negative Logits
    Token          Logit
    " bd"          -0.07
    " incapac"     -0.07
    " úkol"        -0.06
    "IND"          -0.06
    " χώ"          -0.06
    "?('"          -0.06
    " setters"     -0.06
    "dorf"         -0.06
    "CHE"          -0.06
    "άκ"           -0.06
    Positive Logits
    Token          Logit
    " Neo"         0.07
    " trom"        0.06
    "MessageBox"   0.06
    "states"       0.06
    "[user"        0.06
    " Shirley"     0.06
                   0.06
    " Occup"       0.06
    "INIT"         0.06
                   0.06
    Activation Density: 0.080%

    No Known Activations