INDEX
    Explanations

    structural or organizational elements within the text

    New Auto-Interp
    Negative Logits
    ksom
    -0.52
    ThemeOverlay
    -0.49
     Савезне
    -0.49
    anyahu
    -0.47
     setEmail
    -0.47
     }{@
    -0.47
    arap
    -0.46
    irm
    -0.46
     CWE
    -0.45
    entech
    -0.45
    POSITIVE LOGITS
    ↵↵
    2.31
    ↵↵↵
    0.77
    </h4>
    0.68
    </h3>
    0.68
    ↵↵↵↵
    0.67
    '):
    
    0.66
    </h2>
    0.63
    "):
    
    0.61
    ");
    
    0.60
    ":
    
    0.60
    Act Density 0.347%

    No Known Activations