INDEX
    Explanations

    phrases related to legal documents and formal communication

    Following special characters or formatting

    legal and mathematical structures

    New Auto-Interp
    Negative Logits
    )");
    
    -0.59
     برانيه
    -0.57
    >");
    
    -0.55
    })`
    -0.54
    ")"
    -0.51
    "</
    -0.49
    はじめに
    -0.47
    ztő
    -0.46
    ."</
    -0.46
    _${
    -0.45
    POSITIVE LOGITS
    ↵↵↵
    2.10
    ↵↵↵↵
    1.97
    ↵↵↵↵↵
    1.84
    ↵↵↵↵↵↵
    1.74
    ↵↵↵↵↵↵↵
    1.63
    ↵↵↵↵↵↵↵↵
    1.51
    ↵↵↵↵↵↵↵↵↵
    1.43
    ↵↵↵↵↵↵↵↵↵↵
    1.36
    ↵↵↵↵↵↵↵↵↵↵↵
    1.36
    ↵↵↵↵↵↵↵↵↵↵↵↵
    1.32
    Act Density 0.977%

    No Known Activations