INDEX
    Explanations

    punctuated dialogue and quotation marks in text

    New Auto-Interp
    Negative Logits
    bakan
    -0.09
    ì£
    -0.09
    nga
    -0.09
     disap
    -0.09
    [image
    -0.09
    hend
    -0.08
    ddb
    -0.08
    ãĥ³ãĥIJ
    -0.08
    á»§
    -0.08
    ë¨
    -0.08
    POSITIVE LOGITS
     I
    0.06
     in
    0.06
     about
    0.05
     E
    0.05
     Owens
    0.05
     ,
    0.05
    ucas
    0.05
     between
    0.05
     Cas
    0.05
    ket
    0.05
    Act Density 0.001%

    No Known Activations