INDEX
    Explanations

    rights, exploited, basic, governance

    New Auto-Interp
    Negative Logits
     outcrops
    0.37
     drenched
    0.36
     جب
    0.33
     edific
    0.33
     meandering
    0.33
     ridd
    0.32
     ridges
    0.32
     frown
    0.31
     fertilized
    0.31
     YouTuber
    0.31
    POSITIVE LOGITS
    руем
    0.40
     корпора
    0.36
    和服务
    0.36
    和社会
    0.36
    0.36
    现金
    0.35
    лл
    0.34
    性和
    0.33
    Governance
    0.33
    сима
    0.33
    Act Density 0.002%

    No Known Activations