INDEX
Explanations
references to gaps or disparities in various contexts, particularly those relating to inequality or balance
New Auto-Interp
Negative Logits
resco
-0.18
tte
-0.16
hq
-0.15
orias
-0.14
enschaft
-0.14
bÃŃ
-0.14
aque
-0.14
er
-0.14
à¹Ģà¸Ħ
-0.14
vale
-0.14
POSITIVE LOGITS
between
0.25
filler
0.24
filled
0.23
wid
0.23
-gap
0.22
closing
0.22
opening
0.22
Closing
0.22
fill
0.22
brid
0.22
Activations Density 0.027%