INDEX
    Explanations

    references to limitations or restrictions in various contexts

    New Auto-Interp
    Negative Logits
     barg
    -0.17
    alu
    -0.17
    oki
    -0.15
    _criteria
    -0.14
    mans
    -0.14
    yu
    -0.14
    ulu
    -0.14
    alus
    -0.14
    astically
    -0.13
     Tou
    -0.13
    POSITIVE LOGITS
     matter
    0.29
     indeed
    0.25
    matter
    0.23
    aille
    0.20
     Matter
    0.20
    oda
    0.16
     Cham
    0.16
     for
    0.15
     matters
    0.15
    Indeed
    0.15
    Act Density 0.102%

    No Known Activations