INDEX
    Explanations

    adjectives and phrases indicating high quality or excellence

    Negative Logits
    " Züge"  -0.38
    " preocupar"  -0.35
    " Ankunft"  -0.35
    "那就是"  -0.32
    " identitas"  -0.32
    " alanı"  -0.31
    "Huile"  -0.31
    " الجميع"  -0.30
    "RenderAtEndOf"  -0.30
    "риди"  -0.30
    Positive Logits
    " excellent"  0.96
    "excellent"  0.96
    "Excellent"  0.91
    " good"  0.91
    " Excellent"  0.88
    " 좋은"  0.82
    " wonderful"  0.82
    "good"  0.81
    " quality"  0.81
    " excelente"  0.80
    Act Density 0.196%

    No Known Activations