INDEX
    Explanations

    expressions of admiration and delight

    Negative Logits
     containment  0.43
     contain  0.38
     chứa  0.37
     precarious  0.37
     попыта  0.35
     содержит  0.35
     mars  0.34
     содержа  0.34
     హె  0.34
    🌑  0.33
    Positive Logits
     மகிழ்ச்ச  0.72
     begeistert  0.71
     মুগ্ধ  0.71
     praises  0.68
     delighted  0.66
     memnun  0.66
     happily  0.64
     만족  0.63
     overjoyed  0.63
     pleased  0.62
    Act Density 0.100%

    No Known Activations