Explanations
references to gaps or disparities
references to disparities or differences across various contexts
Negative Logits
ocations      -0.71
oran          -0.68
ise           -0.65
MT            -0.62
Interstitial  -0.61
istic         -0.61
abad          -0.59
ocation       -0.59
vez           -0.59
icas          -0.58
Positive Logits
gap           1.09
gaps          0.95
wid           0.90
brid          0.82
Gap           0.82
locks         0.82
widened       0.78
byss          0.78
ersed         0.77
junction      0.76
Activations Density 0.009%