INDEX
    Explanations
    NEGATIVE LOGITS
    "271"          -0.07
    "269"          -0.07
    " likeness"    -0.07
    "/init"        -0.07
    "663"          -0.07
    "257"          -0.07
    " disposal"    -0.07
    "Two"          -0.07
    " spinner"     -0.07
    " Two"         -0.07
    POSITIVE LOGITS
    " read"        0.27
    " Read"        0.21
    "read"         0.18
    "Read"         0.17
    "-read"        0.17
    " READ"        0.16
    ".Read"        0.14
    " reads"       0.14
    "READ"         0.14
    " reading"     0.13
    Activation Density: 0.055%

    No Known Activations