INDEX
    Explanations

    the word "if" followed by a numerical value

    conditional statements or hypothetical scenarios

    New Auto-Interp
    Negative Logits
     depths
    -0.69
    omi
    -0.65
    GMT
    -0.64
    WAYS
    -0.64
    ================================================================
    -0.63
    holm
    -0.62
    nect
    -0.61
    avor
    -0.61
    grey
    -0.60
    ho
    -0.59
    POSITIVE LOGITS
    yip
    0.93
    fy
    0.82
     Gutenberg
    0.76
     thou
    0.75
     you
    0.74
    ever
    0.69
     Melania
    0.68
    rame
    0.67
    yon
    0.67
    soever
    0.67
    Act Density 0.033%

    No Known Activations