    Explanations

    suffix 'ing' or specific words (simple, linear, context, handle, implement, family)
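As a rough illustration of the explanation above, the pattern can be written as a plain string check. This is only a sketch: the function name `matches_explanation` and the constant `SPECIFIC_WORDS` are hypothetical, and the matching rule (strip whitespace, ignore case) is an assumption. The feature itself is a learned activation, not a rule like this.

```python
# Hypothetical word list, copied from the explanation text above.
SPECIFIC_WORDS = {"simple", "linear", "context", "handle", "implement", "family"}


def matches_explanation(token: str) -> bool:
    """Return True if the token ends in 'ing' or is one of the listed words.

    Assumption: leading/trailing whitespace and letter case are ignored,
    since tokens often carry a leading space.
    """
    word = token.strip().lower()
    return word.endswith("ing") or word in SPECIFIC_WORDS
```

A check like this could be used to compare the explanation against whatever tokens a feature is later observed to fire on.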

    Negative Logits
    Token           Logit
                    1.09
    "IIS"           1.02
    " leukocytes"   1.00
    "INER"          0.96
    " ulcerative"   0.94
    "IANS"          0.90
    "IAC"           0.90
    "ClFN"          0.89
    " Buddh"        0.89
    " Insects"      0.89
    Positive Logits
    Token           Logit
    "ap"            1.16
    "ain"           1.09
    "ти"            1.05
    "us"            1.04
    "ac"            1.03
    " fiel"         0.97
    "ine"           0.96
    "it"            0.96
    "ik"            0.94
    " classique"    0.92
    Activation Density: 0.001%

    No Known Activations