INDEX
    Explanations

    rhyming couplets or text structures

    New Auto-Interp
    Negative Logits
     رغم
    0.49
    esus
    0.44
     Zanu
    0.42
     तत्कालीन
    0.38
     recuerdo
    0.35
    awanda
    0.35
     despite
    0.35
    当時の
    0.35
     liberdade
    0.35
    kende
    0.34
    POSITIVE LOGITS
    ثيل
    0.42
     tòa
    0.38
     behaviors
    0.38
    0.36
    Visual
    0.36
    све
    0.36
    ட்டை
    0.35
    ಗೊಳ
    0.35
     संवै
    0.35
    Templates
    0.35
    Act Density 0.002%

    No Known Activations