INDEX
    Explanations

    References to specific entities, particularly nouns related to categories or classifications.

    Negative Logits
    Token             Logit
    "errHandler"      -0.48
    "UnsafeEnabled"   -0.36
    " oiseaux"        -0.36
    " skeleton"       -0.36
    "rollup"          -0.36
    " shawl"          -0.36
    " Bewußt"         -0.35
    " écrans"         -0.35
    "exemplar"        -0.34
    "Publica"         -0.34
    Positive Logits
    Token             Logit
    " autoridade"     0.56
    "ніципалі"        0.56
    "Informações"     0.54
    "ActionCreators"  0.50
    "ações"           0.49
    " consciência"    0.48
    " ERSITY"         0.47
    " dignité"        0.47
    " idéia"          0.47
    " dabei"          0.47
    Activation Density: 0.094%

    No Known Activations