INDEX
    Explanations

    personal opinions and reactions in a conversational context

    expressions of personal feelings and opinions

    Negative Logits
    Âł           -0.54
    arist        -0.52
     Cells       -0.50
     âī¡         -0.50
     Material    -0.48
    akeru        -0.47
    WAR          -0.47
    arak         -0.45
     «           -0.45
     Âł          -0.44
    Positive Logits
    .'"          0.81
    .")          0.76
    !'"          0.72
    )."          0.72
     â̦"        0.71
    '."          0.64
    ."           0.64
    }"           0.60
    ]."          0.59
    '"           0.59
    Act Density 0.785%

    No Known Activations