INDEX
    Explanations

    references to various foundations and their activities

    Negative Logits
    "oline"          -0.18
    "reff"           -0.18
    "union"          -0.16
    "ingen"          -0.16
    " Foundation"    -0.16
    "275"            -0.15
    "sik"            -0.15
    "isions"         -0.15
    " baz"           -0.15
    "asso"           -0.15
    Positive Logits
    "ally"            0.20
    "ality"           0.19
    "aries"           0.18
    "lation"          0.17
    "lay"             0.17
    "ary"             0.17
    "arity"           0.17
    "/Foundation"     0.17
    "aire"            0.17
    "hei"             0.16
    Activation Density: 0.021%

    No Known Activations