Explanations
references to female pilots and their achievements
Negative Logits
utut         -0.17
Bernstein    -0.16
дÑı          -0.15
alin         -0.14
orman        -0.14
GRADE        -0.14
Pur          -0.14
iken         -0.14
Bernardino   -0.14
omit         -0.14
Positive Logits
batch        0.36
Batch        0.32
batches      0.31
batch        0.31
Batch        0.29
BATCH        0.27
_batch       0.27
cohort       0.26
cohorts      0.26
.batch       0.25
Activations Density: 0.200%