INDEX
Explanations
references to sports figures and organizations, particularly in the context of baseball
New Auto-Interp
Negative Logits
producenta
-0.59
écrire
-0.57
ilustracja
-0.57
Anfitrión
-0.56
réfléchir
-0.54
étoient
-0.54
feroit
-0.53
Autorin
-0.53
ambilan
-0.53
būtų
-0.53
POSITIVE LOGITS
stepper
0.57
baller
0.50
exer
0.50
shir
0.50
footer
0.50
padd
0.49
jo
0.49
camper
0.49
parker
0.49
pol
0.48
Activations Density 0.490%