INDEX
    Explanations

    expressions of dialogue and emotional responses

    Negative Logits
     ok
    -0.18
     OK
    -0.16
    oka
    -0.15
    Ok
    -0.15
    ulty
    -0.14
    ette
    -0.14
    aña
    -0.14
    ीश
    -0.14
    Anti
    -0.14
     Rosenstein
    -0.14
    Positive Logits
     Trilogy
    0.14
    aso
    0.14
    .listBox
    0.14
    anou
    0.14
    kees
    0.14
     nons
    0.14
    啊啊
    0.14
    kee
    0.13
     .
    0.13
     Mast
    0.13
    Act Density 0.011%

    No Known Activations