INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    UNT
    -0.07
    100
    -0.07
    _COMMENT
    -0.06
     fog
    -0.06
     nonce
    -0.06
     moments
    -0.06
    _Node
    -0.06
     decades
    -0.06
     rumours
    -0.06
    {\"
    -0.06
    POSITIVE LOGITS
     apply
    0.15
     applied
    0.14
    apply
    0.13
     Applied
    0.13
    Apply
    0.11
     Apply
    0.11
     applying
    0.11
     applies
    0.11
    _apply
    0.10
     Applying
    0.10
    Act Density 0.040%

    No Known Activations