INDEX
    Explanations
    NEGATIVE LOGITS
    Token             Logit
    "grammar"         -0.09
    " personality"    -0.08
    "worthiness"      -0.08
    " temperament"    -0.08
    " sequel"         -0.08
    "assion"          -0.07
    "([("             -0.07
    "akhi"            -0.07
    " faculdade"      -0.07
    "Grammar"         -0.07
    POSITIVE LOGITS
    Token             Logit
    " flooded"        0.09
    " evacuated"      0.08
    " airborne"       0.08
    " drowned"        0.08
    "_inside"         0.08
    " swallowed"      0.08
    " immersed"       0.08
    " Phys"           0.08
    " evacu"          0.07
    "-Speed"          0.07
    Activation Density: 0.030%

    No Known Activations