INDEX
    Explanations

    terms related to understanding and comprehension

    Negative Logits
                      -0.63
    tığı              -0.62
     pick             -0.59
    zości             -0.58
     BIBLIO           -0.56
     drop             -0.55
    fabs              -0.55
    spyOn             -0.54
     PICK             -0.54
    ofire             -0.54
    Positive Logits
     understand       3.48
    understand        3.23
     understanding    3.20
     understands      3.17
     Understand       3.17
     understood       3.09
    Understand        3.07
    understanding     2.96
     Understanding    2.76
    understood        2.76
    Activation Density: 0.071%

    No Known Activations