    Negative Logits
     delicate    0.45
     Bloom       0.44
    🌷           0.44
     Bloss       0.42
     छात्रा       0.42
     Blossom     0.41
     Bouquet     0.41
     délic       0.41
    🌸           0.41
    溶液         0.41
    Positive Logits
     Männer      0.64
     masculina   0.60
     mascul      0.59
     Mascul      0.58
     erkek       0.57
     муж         0.57
     masculine   0.57
     masculino   0.55
     homens      0.54
     ஆண்கள்      0.54
    Activation Density: 0.007%

    No Known Activations