INDEX
    Explanations

    cute toys and characters

    New Auto-Interp
    Negative Logits
     Sinai
    0.74
    🦅
    0.72
    0.70
     напряжения
    0.69
     rectifier
    0.68
     diagonals
    0.68
     Resolve
    0.67
    💪
    0.67
    0.66
     palestra
    0.66
    POSITIVE LOGITS
     cartoon
    2.09
     adorable
    2.00
     kawaii
    1.91
     teddy
    1.88
     cudd
    1.87
     cute
    1.86
     dolls
    1.81
    Cute
    1.80
     plush
    1.73
     Cute
    1.72
    Act Density 0.583%

    No Known Activations