Explanations
references to balls in various contexts
Negative Logits
Token     Logit
McCabe    -0.70
]";       -0.69
Myra      -0.68
Prieto    -0.67
]';       -0.67
opsida    -0.66
Fino      -0.66
prä       -0.65
=         -0.64
helves    -0.63
Positive Logits
Token     Logit
ball      2.48
balls     2.37
BALL      2.33
Ball      2.29
ball      2.22
Ball      2.18
Balls     2.16
Balls     2.06
BALL      2.03
balls     2.03
Activations Density: 0.034%