    Explanations

    rules and compliance

    Negative Logits
    -0.08  ".STRING"
    -0.07  " Rico"
    -0.07  " aici"
    -0.07  " Roi"
    -0.07  ".pick"
    -0.07  "يران"
    -0.07  " Mao"
    -0.07  " czy"
    -0.07  " tutaj"
    -0.07  " объявления"
    Positive Logits
    0.17  (token missing)
    0.16  " पालन"
    0.16  " adherence"
    0.15  " соблю"
    0.14  " compliance"
    0.14  "Compliance"
    0.14  " Compliance"
    0.13  " соблюдать"
    0.12  " conformité"
    0.12  " conformity"
    Activation Density: 0.110%

    No Known Activations