INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     exerc
    -0.76
     Tall
    -0.75
     Sed
    -0.72
     Pend
    -0.72
     Reborn
    -0.71
     Gors
    -0.71
     Tid
    -0.69
     Vid
    -0.69
     Redd
    -0.66
     Freed
    -0.66
    POSITIVE LOGITS
    $$
    1.25
    100
    1.24
    250
    1.22
    500
    1.22
    150
    1.21
    300
    1.21
    200
    1.20
    400
    1.18
    1
    1.16
    350
    1.15
    Act Density 0.508%

    No Known Activations