INDEX
    Explanations

    fair trade, fair use, fair enough

    New Auto-Interp
    Negative Logits
    𝟑
    1.11
    1.08
    𝟏
    1.06
    𝟐
    1.03
     Thick
    0.92
     glam
    0.92
    0.90
     lion
    0.89
     piercings
    0.87
     rav
    0.87
    POSITIVE LOGITS
    ytale
    1.97
    yland
    1.44
    skinned
    1.28
    yt
    1.26
    grounds
    1.25
    ground
    1.23
    erweise
    1.20
     skinned
    1.16
     haired
    1.12
    ytail
    1.11
    Act Density 0.010%

    No Known Activations