INDEX
    Explanations

    instances of dialogue or quotations

    New Auto-Interp
    Negative Logits
    istrovstvÃŃ
    -0.17
    ymous
    -0.16
     addCriterion
    -0.14
    Ú©Ø´
    -0.14
    ÑĢог
    -0.14
     samo
    -0.14
     åı·
    -0.14
    ipa
    -0.14
    üzel
    -0.14
    zap
    -0.13
    POSITIVE LOGITS
     convers
    0.15
    ween
    0.14
     Kov
    0.13
     chir
    0.13
     reactive
    0.13
    ležit
    0.13
     Orn
    0.13
    geist
    0.13
    077
    0.13
     bey
    0.13
    Act Density 0.170%

    No Known Activations