INDEX
Explanations
instances of gaps or disparities in various contexts
New Auto-Interp
Negative Logits
Shou
-0.90
dioses
-0.77
principes
-0.68
McCarty
-0.66
carros
-0.64
genitori
-0.64
Roscoe
-0.63
torchvision
-0.63
Boas
-0.63
Cartwright
-0.62
POSITIVE LOGITS
gap
2.82
Gap
2.58
gap
2.47
gaps
2.44
Gap
2.36
Gaps
2.35
gaps
2.06
GAP
1.98
GAP
1.66
chasm
1.25
Activations Density 0.103%