    Negative Logits
    Token       Logit
    "-"         0.31
    "."         0.28
    " via"      0.26
    "2"         0.26
    "   "       0.26
    "The"       0.26
    "--"        0.25
    "cation"    0.25
    ","         0.25
    "    "      0.25
    Positive Logits
    Token               Logit
    " mencegah"         0.44
    " menjaga"          0.41
    " جلوگیری"          0.41
    " ensure"           0.40
    " memastikan"       0.39
    " asegurar"         0.39
    "确保"              0.39
    " evitare"          0.38
    "ការពារ"            0.38
    " gewährleisten"    0.38
    Activation Density: 0.025%

    No Known Activations