INDEX
    Explanations

    instances of dialogue and emotional expressions in the text

    New Auto-Interp
    Negative Logits
    ĶåĽŀ
    -0.14
     reversible
    -0.14
    ench
    -0.14
     '-';↵
    -0.13
    /pub
    -0.13
     doz
    -0.13
    atz
    -0.13
    bs
    -0.13
     ,↵↵
    -0.13
    .sz
    -0.13
    POSITIVE LOGITS
     "
    0.26
     "↵
    0.17
     '"
    0.16
     ""
    0.16
    ัà¸Ļà¸ĺ
    0.15
     `
    0.15
    icker
    0.15
     "$
    0.15
    ares
    0.14
     ".
    0.14
    Act Density 0.092%

    No Known Activations