INDEX
    Explanations

    instructional/technical content

    New Auto-Interp
    Negative Logits
    ldb
    -0.07
    .school
    -0.07
    pectrum
    -0.06
     Chen
    -0.06
    Poll
    -0.06
    .projects
    -0.06
    Ross
    -0.06
    snow
    -0.06
    _LIB
    -0.06
    .light
    -0.06
    POSITIVE LOGITS
    ewn
    0.07
     Hein
    0.07
    0.06
    NAMESPACE
    0.06
    ?('
    0.06
    [:-
    0.06
     Sommer
    0.06
     jal
    0.06
    させる
    0.06
     şik
    0.06
    Act Density 0.005%

    No Known Activations