INDEX
    Explanations

    checking, auditing, analyzing

    New Auto-Interp
    Negative Logits
     проверки
    0.75
     checking
    0.74
     checks
    0.71
    查看
    0.71
    发现
    0.71
     Checks
    0.71
     Checking
    0.69
    Checks
    0.69
     проверка
    0.69
     확인
    0.68
    POSITIVE LOGITS
     vetted
    0.60
     evaluated
    0.49
     analyzed
    0.48
     screened
    0.48
     vet
    0.45
    evaluated
    0.45
     audited
    0.44
     Sc
    0.43
     anal
    0.42
    anal
    0.41
    Act Density 0.085%

    No Known Activations