    Explanations

    instances of negative sentiments or conditions

    Negative Logits
    Token           Logit
    "s"             -0.16
    "S"             -0.16
    "M"             -0.15
    "("             -0.15
    "|"             -0.14
    "A"             -0.14
    "I"             -0.14
    " following"    -0.14
    "-grow"         -0.14
    "æĸ"            -0.14
    Positive Logits
    Token                 Logit
    "styleType"           0.19
    " Redistributions"    0.18
    "webkit"              0.18
    "=-=-=-=-=-=-=-=-"    0.18
    "'gc"                 0.17
    "wahl"                0.16
    "~-~-~-~-"            0.16
    " بوابة"              0.15
    "ysz"                 0.15
    "شناسی"               0.15
    Act Density 0.038%

    No Known Activations