INDEX
    Negative Logits
    " Grims"        -0.42
    " springfox"    -0.41
    " Aquinas"      -0.40
    " in"           -0.38
    " Schwim"       -0.38
    " Corpo"        -0.36
    " civi"         -0.36
    " repa"         -0.36
    " response"     -0.36
    " במה"          -0.35
    Positive Logits
    " Let"          1.02
    "Let"           1.01
    " LET"          1.00
    " Letting"      0.97
    " let"          0.97
    "letting"       0.90
    "Letting"       0.90
    "let"           0.87
    " letting"      0.84
    " Lets"         0.84
    Activation Density: 0.139%

    No Known Activations