INDEX
    Explanations

    statements and phrases related to dialogue or speech acts

    Negative Logits
     Millennium     -0.36
     Paper          -0.34
    parseColor      -0.33
     Pod            -0.31
     Geothermal     -0.31
     Novo           -0.31
     çalışan        -0.30
     cookie         -0.29
     Innovative     -0.29
     soru           -0.29
    Positive Logits
    himself         0.66
     zijne          0.65
     nahilalakip    0.65
     himſelf        0.62
     Himself        0.61
     himself        0.61
    ftagPool        0.61
    rungsseite      0.59
    iſchen          0.59
    +#+#            0.59
    Activation Density: 0.524%

    No Known Activations