INDEX
    Explanations

    questions or prompts starting with "Does"

    questions that begin with "Does."

    New Auto-Interp
    Negative Logits
     Islands
    -0.74
    rets
    -0.73
     Canal
    -0.70
    haul
    -0.68
     Typhoon
    -0.67
     Ascension
    -0.67
     palms
    -0.67
    bush
    -0.66
    boards
    -0.66
     Methods
    -0.64
    POSITIVE LOGITS
    omething
    1.11
    berra
    0.91
    paces
    0.87
    VIDIA
    0.82
    hift
    0.81
    omorphic
    0.81
    ettings
    0.80
    ometimes
    0.80
    onga
    0.80
    pace
    0.79
    Act Density 0.042%

    No Known Activations