INDEX
    Explanations

    specific instances of various activities, events, and initiatives

    New Auto-Interp
    Negative Logits
     various
    -0.19
     etc
    -0.19
     everywhere
    -0.18
     stuff
    -0.18
     Various
    -0.18
     all
    -0.17
    åIJĦç§į
    -0.17
    æīĢæľī
    -0.16
    Various
    -0.15
     throughout
    -0.15
    POSITIVE LOGITS
    åĪĨåĪ«
    0.28
    ãĢģä¸Ģ
    0.21
    —one
    0.20
     respectively
    0.20
    -two
    0.20
    -one
    0.20
    ãģĿãĤĮ
    0.19
     ê°ģê°ģ
    0.18
    atat
    0.18
    عاÙĨ
    0.18
    Act Density 0.280%

    No Known Activations