INDEX
    Explanations

    legal references and case citations

    New Auto-Interp
    Negative Logits
    two
    -0.49
     Two
    -0.48
     two
    -0.47
    Two
    -0.46
     Zwei
    -0.45
     hai
    -0.45
     deux
    -0.44
    2
    -0.43
     två
    -0.42
     zwei
    -0.42
    POSITIVE LOGITS
    3
    1.05
    0.83
    0.73
     thirty
    0.71
     three
    0.70
     thirties
    0.69
    0.67
    三十
    0.66
     Thirty
    0.65
    ۳
    0.64
    Act Density 1.774%

    No Known Activations