INDEX
    Explanations

    phrases indicating legal arguments or discussions related to court cases

    New Auto-Interp
    Negative Logits
    %");
    -0.99
    %";
    -0.99
     nakalista
    -0.95
    />";
    -0.94
    >");
    
    -0.92
    .";
    
    -0.91
     />';
    -0.90
    tvguidetime
    -0.89
    */;
    -0.89
    "]);
    
    -0.87
    POSITIVE LOGITS
    .
    0.62
    !
    0.56
    :
    0.53
     "
    0.49
    so
    0.47
     (
    0.47
     Because
    0.47
     so
    0.45
     ("
    0.45
    Because
    0.44
    Act Density 0.135%

    No Known Activations