INDEX
    Explanations

    repeated phrases and references to specific examples or concepts

    New Auto-Interp
    Negative Logits
    thon
    -0.15
    ABCDEFGHI
    -0.14
     addCriterion
    -0.14
    çģµ
    -0.14
    íĮ
    -0.13
    	damage
    -0.13
    (#)
    -0.13
    RunWith
    -0.13
    öl
    -0.13
    åºŁ
    -0.13
    POSITIVE LOGITS
     kind
    0.38
     kinds
    0.33
     type
    0.33
    kind
    0.27
    -type
    0.26
    type
    0.25
     types
    0.23
     sorts
    0.23
     sort
    0.23
     exact
    0.21
    Act Density 0.189%

    No Known Activations