INDEX
    Explanations

    references to legal terms and concepts

    New Auto-Interp
    Negative Logits
     -)
    -0.72
    '),
    
    -0.71
    ''')
    -0.69
    `;
    
    -0.69
    ]})
    -0.67
    ')));
    -0.67
    ?')
    -0.67
     дописавши
    -0.66
    ...]
    -0.66
    }}
    
    -0.65
    POSITIVE LOGITS
     soprav
    0.75
     étions
    0.74
    PhysRev
    0.74
     vocale
    0.74
     carottes
    0.70
     plads
    0.69
     chimie
    0.68
     Jérusalem
    0.68
     feroit
    0.68
    Literals
    0.67
    Act Density 0.053%

    No Known Activations