INDEX
    Explanations

    phrases related to traditional or standard practices

    references to conventional designs and practices

    New Auto-Interp
    Negative Logits
    oning
    -0.78
    gur
    -0.76
    oran
    -0.72
    haw
    -0.71
    hop
    -0.68
    hoff
    -0.67
    ander
    -0.66
    han
    -0.64
     Dalai
    -0.64
     Wanted
    -0.63
    POSITIVE LOGITS
     wisdom
    1.04
    ventional
    0.96
    etheless
    0.87
     conventional
    0.86
    ually
    0.78
    ization
    0.77
    aneously
    0.76
    nesday
    0.76
    idad
    0.76
    arily
    0.75
    Act Density 0.008%

    No Known Activations