INDEX
    Explanations

    sentences instructing to read more on a particular topic

    references to reading-related actions or discussions

    New Auto-Interp
    Negative Logits
    oult
    -0.75
    TEXTURE
    -0.70
    ascal
    -0.68
    cffffcc
    -0.66
     heel
    -0.66
     USSR
    -0.63
    aviour
    -0.62
    pload
    -0.62
    VP
    -0.61
     WWF
    -0.61
    POSITIVE LOGITS
    Write
    0.96
     aloud
    0.90
    Read
    0.86
    just
    0.81
    ahead
    0.81
    gon
    0.79
    iances
    0.79
    dress
    0.78
     Continued
    0.76
     Read
    0.75
    Act Density 0.019%

    No Known Activations