INDEX
    Explanations
    Negative Logits
    Token              Logit
    " Abr"             -0.08
    "579"              -0.06
    " Recursive"       -0.06
    " Entr"            -0.06
    "Receipt"          -0.06
    " dissertation"    -0.06
    "ация"             -0.06
    "사는"             -0.06
    "_INIT"            -0.06
    " Assignment"      -0.06
    Positive Logits
    Token              Logit
    " calorie"         0.10
    " calories"        0.08
    "-value"           0.07
    ".coordinates"     0.07
    "-fat"             0.07
    "оли"              0.06
    "Charlie"          0.06
    "cool"             0.06
    "Ram"              0.06
    " Charlie"         0.06
    Activation Density: 0.003%

    No Known Activations