INDEX
Explanations
legal references and case citations
Negative Logits
Token     Logit
two       -0.49
Two       -0.48
two       -0.47
Two       -0.46
Zwei      -0.45
hai       -0.45
deux      -0.44
2         -0.43
två       -0.42
zwei      -0.42
Positive Logits
Token     Logit
3         1.05
.³        0.83
3         0.73
thirty    0.71
three     0.70
thirties  0.69
₃         0.67
三十      0.66
Thirty    0.65
۳         0.64
Activations Density 1.774%