INDEX
    Explanations

    phrases related to the establishment or founding of organizations and their history

    New Auto-Interp
    Negative Logits
    ears
    -0.15
     wsz
    -0.14
     ÑĥÑģÑĤановлен
    -0.14
     lẫn
    -0.14
    ipa
    -0.13
     että
    -0.13
    ãģ©
    -0.13
    å¾Ģ
    -0.12
    cluding
    -0.12
    دث
    -0.12
    POSITIVE LOGITS
     out
    0.27
     initially
    0.26
     originally
    0.23
     with
    0.22
     under
    0.22
    aim
    0.22
    initial
    0.21
     Initially
    0.20
    prim
    0.20
     aiming
    0.20
    Act Density 0.160%

    No Known Activations