INDEX
    Explanations

    references to popular game shows and quiz formats

    New Auto-Interp
    Negative Logits
    ieg
    -0.15
    (OP
    -0.14
     Gill
    -0.14
     mud
    -0.13
    alue
    -0.13
    arena
    -0.13
     Mud
    -0.13
    енка
    -0.13
    iry
    -0.13
    _ex
    -0.13
    POSITIVE LOGITS
    º
    0.20
    ozo
    0.15
    /cpp
    0.15
    wards
    0.15
    nÃŃ
    0.15
    abled
    0.15
    stakes
    0.14
    дÑı
    0.14
    abwe
    0.14
     Imm
    0.14
    Act Density 0.003%

    No Known Activations