INDEX
    Explanations

    the word "are" at a high activation level

    phrases indicating probability or likelihood

    New Auto-Interp
    Negative Logits
    azel
    -0.79
    chall
    -0.78
     rele
    -0.69
    kind
    -0.68
     streng
    -0.67
    inav
    -0.67
    Els
    -0.67
     coh
    -0.66
    lik
    -0.65
    omics
    -0.65
    POSITIVE LOGITS
     elapsed
    0.82
    aneously
    0.81
     expires
    0.72
     WHEN
    0.68
     hesitation
    0.67
     PST
    0.67
     separating
    0.66
    */(
    0.66
     PDT
    0.66
     CST
    0.66
    Act Density 0.308%

    No Known Activations