INDEX
    Explanations

    references to significant or extreme situations

    New Auto-Interp
    Negative Logits
    eya
    -0.17
    isel
    -0.15
    eln
    -0.15
    ence
    -0.15
    iz
    -0.15
    iac
    -0.14
    amen
    -0.14
    ORAGE
    -0.13
    irm
    -0.13
    gage
    -0.13
    POSITIVE LOGITS
    era
    0.17
    /cop
    0.14
     bod
    0.14
    amaha
    0.14
    aders
    0.14
     cop
    0.14
    allon
    0.14
     EntityState
    0.14
    á»ģn
    0.14
    ĽĦ
    0.14
    Act Density 0.007%

    No Known Activations