INDEX
    Explanations

    references to social identity and pride in cultural or regional contexts

    New Auto-Interp
    Negative Logits
    componentWill
    -0.55
    et
    -0.50
     (
    -0.47
    -0.44
    Network
    -0.43
     network
    -0.43
    network
    -0.41
    某个
    -0.40
     og
    -0.40
     W
    -0.40
    POSITIVE LOGITS
    Демографія
    0.97
    kháu
    0.90
    ArgsConstructor
    0.90
     Italijanski
    0.88
    ніципа
    0.87
    TypedDataSet
    0.84
    LEncoder
    0.83
    GEBURTSDATUM
    0.82
    AndEndTag
    0.82
     betweenstory
    0.82
    Act Density 0.121%

    No Known Activations