INDEX
    Explanations

    phrases indicating social or political dynamics, particularly involving recognition and accountability

    New Auto-Interp
    Negative Logits
    avern
    -0.15
    aurus
    -0.15
    isma
    -0.15
    æľĢæĸ°
    -0.15
    agli
    -0.15
    hrad
    -0.14
    enda
    -0.14
    atham
    -0.14
    INE
    -0.14
    68
    -0.14
    POSITIVE LOGITS
     serious
    0.18
     permanent
    0.17
    ÑĢÑĥн
    0.17
     major
    0.17
     sophisticated
    0.16
    sut
    0.16
     entire
    0.16
    뢰
    0.16
    permanent
    0.16
    vip
    0.15
    Act Density 0.020%

    No Known Activations