INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    \{\\
    -0.75
     متعلقه
    -0.75
    AndEndTag
    -0.72
    SourceChecksum
    -0.71
     $_"
    -0.70
    tagHelperRunner
    -0.66
    MessageTagHelper
    -0.64
     المعيارى
    -0.63
    batik
    -0.63
    toHaveBeenCalled
    -0.62
    POSITIVE LOGITS
    ids
    0.40
    idov
    0.40
    ida
    0.39
    idis
    0.37
    idor
    0.36
    eces
    0.35
    ache
    0.34
    ēs
    0.34
    idea
    0.34
    ides
    0.34
    Act Density 0.000%

    No Known Activations