INDEX
    Explanations

    phrases related to instructions, disclaimers, and user engagement prompts

    Sentences ending with a punctuation mark

    Negative Logits
    ViewImports         -0.85
     tartalomajánló     -0.73
     виправивши         -0.72
    Hochspringen        -0.71
     estekak            -0.70
     Мексичка           -0.69
                        -0.68
    styleType           -0.67
    apimachinery        -0.67
    kloped              -0.66
    Positive Logits
     ...                0.65
    The                 0.62
     The                0.56
    A                   0.55
    1                   0.54
     Verw               0.54
    It                  0.53
     .                  0.52
    Get                 0.52
    .                   0.50
    Act Density 0.262%

    No Known Activations