INDEX
    Explanations

    instances of the word "as" indicating comparisons or descriptions

    New Auto-Interp
    Negative Logits
    aska
    -0.64
     ASD
    -0.59
    ese
    -0.57
    number
    -0.57
    lot
    -0.57
    psey
    -0.56
    english
    -0.54
     understanding
    -0.54
    gin
    -0.54
    idth
    -0.53
    POSITIVE LOGITS
    icipated
    0.72
    natureconservancy
    0.71
    opped
    0.71
     mathemat
    0.67
    MIT
    0.66
    rowing
    0.65
    med
    0.63
    lighting
    0.62
    paio
    0.61
    idates
    0.61
    Act Density 0.080%

    No Known Activations