INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    <Project
    -0.07
     LocalDateTime
    -0.07
     کار
    -0.07
     upwards
    -0.07
    าบาล
    -0.07
     професій
    -0.06
    266
    -0.06
     Squad
    -0.06
     modelBuilder
    -0.06
    -functions
    -0.06
    POSITIVE LOGITS
     deny
    0.17
     denying
    0.16
     denial
    0.14
     denied
    0.14
     denies
    0.13
     Denied
    0.10
    Denied
    0.09
    deny
    0.09
     undeniable
    0.09
    enin
    0.07
    Act Density 0.007%

    No Known Activations