INDEX
    Explanations

    putting skills to the test, minds at rest

    New Auto-Interp
    Negative Logits
     centers
    0.36
     deductions
    0.36
     wakes
    0.35
     degradation
    0.35
     deduction
    0.34
     perox
    0.34
     frig
    0.34
     wake
    0.33
     cavities
    0.33
     inferences
    0.33
    POSITIVE LOGITS
    0.37
    ("#{
    0.36
    0.35
    0.35
    0.35
    0.34
    眼镜
    0.34
    0.34
    0.34
     frère
    0.34
    Act Density 0.001%

    No Known Activations