INDEX
    Explanations

    verbs of existence or to be in various forms

    New Auto-Interp
    Negative Logits
    ContentAlignment
    -0.82
    profen
    -0.64
     angelo
    -0.59
    </h5>
    -0.59
    unknownFields
    -0.58
     kháng
    -0.56
    unst
    -0.55
     Forrest
    -0.54
     néz
    -0.54
    UnsafeEnabled
    -0.54
    POSITIVE LOGITS
     é
    2.34
     É
    1.47
    É
    1.36
     È
    1.01
     è
    0.97
    Eacute
    0.95
    é
    0.94
     são
    0.92
    eacute
    0.91
    È
    0.87
    Act Density 0.044%

    No Known Activations