INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Plasma
    -0.07
     Memory
    -0.06
    Nearly
    -0.06
     alma
    -0.06
     zeigen
    -0.06
     진행
    -0.06
    eyim
    -0.06
     παιδ
    -0.06
     neurons
    -0.06
    -0.06
    POSITIVE LOGITS
    opher
    0.07
    _DE
    0.06
     liaison
    0.06
    onder
    0.06
    _MESSAGE
    0.06
     usur
    0.06
    net
    0.06
    _PR
    0.06
    RPC
    0.06
    0.06
    Act Density 0.002%

    No Known Activations