INDEX
    Explanations

    direct quotations or reported speech

    New Auto-Interp
    Negative Logits
    estern
    -0.75
    anu
    -0.75
    ãĥ´
    -0.67
    acting
    -0.65
    plex
    -0.64
    entimes
    -0.64
    ãĥĩ
    -0.63
     Unch
    -0.62
    mental
    -0.62
    ãĥ¼ãĥĨ
    -0.60
    POSITIVE LOGITS
     "...
    0.97
     "â̦
    0.90
    :"
    0.90
     "[
    0.87
     ""
    0.84
     "'
    0.82
     "#
    0.81
    :
    0.78
     ".
    0.78
     "(
    0.77
    Act Density 0.116%

    No Known Activations