INDEX
    Explanations

    end of sentence punctuation

    New Auto-Interp
    Negative Logits
     ripping
    0.24
     fondness
    0.24
     rigging
    0.24
    實現
    0.23
     occasionally
    0.23
     impregnation
    0.23
     scalp
    0.23
    ލ
    0.23
     hearth
    0.22
     lasers
    0.22
    POSITIVE LOGITS
    .",
    0.40
    .)
    0.38
    .;
    0.38
    .")
    0.35
    .");
    0.35
    .);
    0.35
    .),
    0.35
    .):
    0.34
    .).
    0.33
     );
    0.32
    Act Density 0.079%

    No Known Activations