INDEX
    Explanations

    phrases indicating legal outcomes or emotional responses related to crime and punishment

    New Auto-Interp
    Negative Logits
    4
    -1.65
     four
    -1.49
     Four
    -1.33
    four
    -1.29
     cuatro
    -1.27
     FOUR
    -1.24
     quatro
    -1.22
     quatre
    -1.21
    -1.21
    FOUR
    -1.20
    POSITIVE LOGITS
     Tenth
    0.52
    Sixth
    0.52
    Seventh
    0.51
     Eighth
    0.50
     tenth
    0.49
     bezeichneter
    0.49
     ten
    0.49
     eighth
    0.49
     seventh
    0.48
     Seventh
    0.47
    Act Density 0.757%

    No Known Activations