INDEX
    Explanations

    instances of competition and elimination events

    Negative Logits
    bjerg       -0.17
    eah         -0.16
    Įĵ          -0.16
    ilan        -0.15
    εÏĦ         -0.15
    rame        -0.14
    quee        -0.14
    serter      -0.14
    icone       -0.14
    headline    -0.14
    Positive Logits
     Reality    0.16
    ray         0.15
     reality    0.14
     deb        0.14
     Musk       0.14
     Cr         0.14
     Trab       0.14
    Scene       0.14
    imes        0.14
    ons         0.14
    Activation Density: 0.012%

    No Known Activations