INDEX
    Explanations

    detailed descriptions on procedures or instructions

    New Auto-Interp
    Negative Logits
    oubted
    -0.66
    room
    -0.62
    UM
    -0.61
     ........
    -0.60
    iculture
    -0.59
    odder
    -0.58
    oubt
    -0.57
    piece
    -0.57
    peak
    -0.56
    esides
    -0.56
    POSITIVE LOGITS
    soever
    1.05
    beit
    0.96
    ls
    0.94
     much
    0.94
    itzer
    0.90
    ells
    0.88
    ever
    0.83
    ling
    0.82
    much
    0.80
     MUCH
    0.77
    Act Density 0.289%

    No Known Activations