INDEX
    Explanations

    sections of text containing legal or procedural terminology, particularly those indicating judgments or rulings

    Negative Logits
    ed
    -1.03
     تضيفلها
    -0.87
     Pank
    -0.81
    InputBorder
    -0.78
     Newberry
    -0.76
     Kw
    -0.76
     Tick
    -0.75
     Eddy
    -0.75
     TICK
    -0.74
    Tikang
    -0.71
    Positive Logits
    ↵↵↵
    2.05
    ↵↵↵↵
    1.36
    ↵↵↵↵↵↵
    1.19
    ↵↵↵↵↵
    1.17
    ↵↵↵↵↵↵↵
    1.14
    ↵↵↵↵↵↵↵↵↵
    0.99
    ↵↵↵↵↵↵↵↵
    0.98
    ↵↵↵↵↵↵↵↵↵↵↵
    0.98
    );↵↵
    0.94
    ↵↵↵↵↵↵↵↵↵↵↵↵
    0.93
    Act Density 0.167%

    No Known Activations