INDEX
    Explanations

    instances of significant change or transformation

    New Auto-Interp
    Negative Logits
    Whenever
    -0.23
     whenever
    -0.22
     Whenever
    -0.21
    Upon
    -0.20
     wherever
    -0.20
     upon
    -0.20
    upon
    -0.19
     WHILE
    -0.18
    _while
    -0.18
     Upon
    -0.18
    POSITIVE LOGITS
     wh
    0.20
     happens
    0.15
    ,$_
    0.15
     whe
    0.15
     Gul
    0.15
    ents
    0.14
    urs
    0.14
    Âł
    0.14
    uri
    0.14
    imi
    0.14
    Act Density 0.083%

    No Known Activations