INDEX
    Explanations

    technical terms related to alignment in various contexts

    New Auto-Interp
    Negative Logits
     }}"></
    -0.98
    expandindo
    -0.87
     Мексичка
    -0.73
     مشين
    -0.73
    IVEREF
    -0.72
     kaarangay
    -0.70
    InjectAttribute
    -0.70
    ?>">
    -0.69
     $_"
    -0.69
    出版年
    -0.68
    POSITIVE LOGITS
    Alignment
    1.08
    aligned
    0.93
     aligned
    0.77
    alignment
    0.77
    align
    0.75
     align
    0.75
     alignment
    0.75
     Alignment
    0.72
     alignments
    0.63
    forall
    0.63
    Act Density 0.147%

    No Known Activations