INDEX
    Explanations

    phrases related to cultural identity and heritage

    New Auto-Interp
    Negative Logits
    änder
    -0.15
    feld
    -0.15
    ảm
    -0.15
    _rg
    -0.14
    ÑĢоÑĤ
    -0.14
    portrait
    -0.14
    üst
    -0.14
    alue
    -0.14
    wig
    -0.14
    etik
    -0.14
    POSITIVE LOGITS
     means
    0.24
     direct
    0.20
     sheer
    0.19
     use
    0.18
     various
    0.18
    /by
    0.18
    ought
    0.18
    puts
    0.17
     participation
    0.16
     continued
    0.16
    Act Density 0.096%

    No Known Activations