INDEX
Explanations
references to the number three or associated terms
New Auto-Interp
Negative Logits
../../../
-0.79
Johnston
-0.78
frow
-0.74
González
-0.72
wwwwwwww
-0.71
UnknownFields
-0.69
Johnston
-0.68
Juana
-0.68
spesies
-0.67
manqué
-0.67
POSITIVE LOGITS
3
1.82
Three
1.29
rd
1.28
THREE
1.25
THREE
1.23
Three
1.19
iii
1.19
three
1.16
三
1.16
three
1.13
Activations Density 0.196%