content: revise ultimate-channel

golang-design · Aug 10, 2021 · 8bfe0cc · 8bfe0cc
1 parent 57369a1
commit 8bfe0cc
Show file tree

Hide file tree

Showing 2 changed files with 130 additions and 49 deletions.
diff --git a/content/assets/ultimate-channel/go.mod b/content/assets/ultimate-channel/go.mod
@@ -1,3 +1,3 @@
-module deadlock-drawcall
+module ultimate-chan
 
 go 1.17
diff --git a/content/posts/ultimate-channel.md b/content/posts/ultimate-channel.md
@@ -13,15 +13,21 @@ Author(s): [Changkun Ou](https://changkun.de)
 
 Permalink: https://golang.design/research/ultimate-channel
 
-Recently, I have been rethinking the programming patterns regarding computer graphics applications, and have been written a 3D graphics package in Go, called [polyred](https://poly.red).
-While I was designing the rendering execution pipeline, a tricky deadlock struggled with me for a long time and led to creating an unbounded channel as a workaround solution eventually.
+Recently, I have been rethinking the programming patterns regarding
+graphics applications, and already wrote a 3D graphics package in Go,
+called [polyred](https://poly.red).
+While I was designing the rendering pipeline APIs, a tricky deadlock
+struggled with me for a while and led to creating an unbounded channel
+as a workaround solution eventually.
 
 <!--more-->
 
 ## The problem
 
-At the beginning of my design, I had to deal with [OpenGL](https://github.com/go-gl/gl) where a chunk of APIs must be executed on the mainthread and issue a draw call is one of those
-infamous. The common pattern in graphics programming is as follows:
+At the beginning of my design, I had to deal with [OpenGL](https://github.com/go-gl/gl)
+where a chunk of APIs must be executed on the main thread and issue
+a draw call is one of those infamous. The common pattern in graphics
+programming is as follows:
 
 ```go
 app := newApp()
@@ -35,14 +41,27 @@ for !app.IsClosed() {
 }
 ```
 
-The entire GUI application is executed in an infinite loop that contains two parts: draw call processing and event processing.
+The entire GUI application is executed in an infinite loop that contains
+two parts: draw call processing and event processing.
 
-Typically, all these codes run on the CPU, and the actual rendering computation executes on a GPU. That means the graphics API provided by a graphic driver (such as OpenGL, Vulkan, Metal, Direct X) is just a communication command send from the CPU to the GPU or even waiting for a response from the GPU.
-For some special reasons, the [polyred](https://poly.red) is limited to software implementation, a pure-CPU implementation. Hence, the execution should utilize the full power of CPU parallelization. It makes much more sense to execute rendering on a separate goroutine so that it won't block the event processing thread.
+Typically, all these codes run on the CPU, and the actual rendering
+computation executes on a GPU. That means, the graphics API provided by
+a graphic driver (such as OpenGL, Vulkan, Metal, Direct X) is just a
+communication command send from the CPU to the GPU or even waiting for
+a response from the GPU.
+For some special reasons, the [polyred](https://poly.red) is limited to
+software implementation, a pure-CPU implementation. Hence, the execution
+should utilize the full power of CPU parallelization. It makes much more
+sense to execute rendering on a separate goroutine so that it won't block
+the event processing thread.
 
-*_In fact, to guarantee an application's responsiveness, it is ideal not to block the event processing since there might also be system invocation._
+*_Aside: To guarantee an application's responsiveness, it is ideal not
+to block the event processing since there might also be system invocation._
 
-Nevertheless, I turned the rendering loop into a separate goroutine and sent the rendering result to the event processing loop to be flushed to the hardware display. The entire application works as the following code snippet:
+Subsequently, I turned the rendering loop into a separate goroutine and
+sent the rendering result to the event processing loop to be flushed to
+the hardware display. The entire application works as the following code
+snippet:
 
 ```go
 // WARNING: This example contains a deadlock.
@@ -58,14 +77,14 @@ type ResizeEvent struct {
 	width, height int
 }
 
-type renderProfile struct {
+type RenderProfile struct {
 	id     int
 	width  int
 	height int
 }
 
 // Draw executes a draw call by the given render profile
-func (p *renderProfile) Draw() interface{} {
+func (p *RenderProfile) Draw() interface{} {
 	return fmt.Sprintf("draw-%d-%dx%d", p.id, p.width, p.height)
 }
 
@@ -83,7 +102,7 @@ func main() {
 	// the event thread notifies the rendering setting change, and here
 	// increases the rendering setting id.
 	go func() {
-		p := &renderProfile{id: 0, width: 800, height: 500}
+		p := &RenderProfile{id: 0, width: 800, height: 500}
 		for {
 			select {
 			case size := <-change:
@@ -119,9 +138,16 @@ func main() {
 }
 ```
 
-As one can observe from the above example, it simulates a resize event of a GUI window at every event processing loop. Whenever the size of the GUI window is changed, the underlying rendering should adapt to that, for instance, reallocating the rendering buffers. To let the rendering thread understand the change, another channel is used to communicate from the event thread to the rendering thread.
+As one can observe from the above example, it simulates a resize event
+of a GUI window at every event processing loop. Whenever the size of
+the GUI window is changed, the underlying rendering should adapt to that,
+for instance, reallocating the rendering buffers. To allow the rendering
+thread understand the change, another channel is used to communicate from
+the event thread to the rendering thread.
 
-It sounds like a perfect design, but a nasty deadlock is hidden in the dark if one executes the program, and the program will freeze until a manual interruption:
+It sounds like a perfect design, but a nasty deadlock is hidden in the
+dark if one executes the program, and the program will freeze until
+a manual interruption:
 
 ```
 draw-0-800x500
@@ -139,13 +165,12 @@ If we take a closer look into the program pattern:
 2. The `E` thread receives communication from the `R` thread
 3. The `R` thread receives communication from the `E` thread
 
-Found the problem? The problem happens in the two-way communication:
+Did you find the problem? The problem happens in the two-way communication:
 If the communication channels are unbuffered channel (wait until the
 receive is complete), the deadlock happens when `E` is waiting for `R` to
 complete the receive, and `R` is also waiting for `E` to complete the receive.
 
-One may argue instantly that the deadlock can be resolved using a
-buffered channel:
+One may argue that the deadlock can be resolved using a buffered channel:
 
 ```diff
 -draw := make(chan interface{})
@@ -154,16 +179,32 @@ buffered channel:
 +change := make(chan ResizeEvent, 100)
 ```
 
-But unfortunately, it remains problematic. Let's do a thought experiment: if `E` is too busy, and quickly exploits the entire buffer of the communication channel `change`, then the communication channel falls back to an unbuffered channel. Then `E` starts to wait to proceed; On the otherwise, `R` is busy working on the draw call, when it is finished, `R` tries to send the draw call to `E`.  However, at this moment. the `E` is already waiting for `R` to receive the `change` signal. Hence, we will fall back to the same case -- deadlock.
-
-Is the problem a producer-consumer scenario? Indeed, the case is quite similar but not entirely identical. The producer-consumer scenario focuses on producing content for the buffer while the consumer consumes the buffer. If the buffer is full, it is easy to send either producer or consumer to sleep. However, the key difference here is: On the two sides of communication, they both play the role of producer and consumer simoutainiously, and they both relying on each other.
-
-What can we do to solve the above deadlock? There are two
-approaches.
+But unfortunately, it remains problematic. Let's do a thought experiment:
+if `E` is too busy, and quickly exploits the entire buffer of the
+communication channel `change`, then the communication channel falls
+back to an unbuffered channel. Then `E` starts to wait to proceed;
+On the otherwise, `R` is busy working on the draw call, when it is
+finished, `R` tries to send the draw call to `E`. 
+However, at this moment. the `E` is already waiting for `R` to receive
+the `change` signal. Hence, we will fall back to the same case -- deadlock.
+
+Is the problem a producer-consumer scenario? Indeed, the case is quite similar
+but not entirely identical. The producer-consumer scenario focuses on
+producing content for the buffer while the consumer consumes the buffer.
+If the buffer is full, it is easy to send either producer or consumer to
+sleep. However, the key difference here is: On the two sides of
+communication, they both play the role of producer and consumer
+simoutainiously, and they both relying on each other.
+
+What can we do to solve the above deadlock? Let's reveal two approaches in this
+article.
 
 ## Solution 1: Send in select's case
 
-The first approach is a simple one. We utilize the power of the select statement, that a send operation to any channel won't block if there is a default statement. Hence, we could simply turn the draw call sends statement into a nested select statement:
+The first approach is a simple one. We utilize the power of the select statement:
+a send operation to any channel won't block, if there is a default statement.
+Hence, we could simply turn the draw call sends statement into a nested select
+statement:
 
 ```diff
 go func() {
@@ -186,18 +227,24 @@ go func() {
 }()
 ```
 
-In this case, if the `draw <- p.Draw()` is blocking, the newly introduced `select` statement will not block on the send and execute the default statement then resolves the deadlock.
+In this case, if the `draw <- p.Draw()` is blocking, the newly introduced
+`select` statement will not block on the send and execute the default
+statement then resolves the deadlock.
 
 However, there are two drawbacks to this approach: 
 
 1. If a draw call is skipped, there will be one frame loss of rendering. Because the next loop will start to calculate a new frame.
 2. The event thread remains blocked until a frame rendering in the rendering thread is complete. Because the new select statement can only be executed after all rendering calculation is complete.
 
-These two drawbacks are there intrinsically, and with this approach, it seems there is no better way to improve it. What else could we do?
+These two drawbacks are there intrinsically, and with this approach, it
+seems there is no better way to improve it. What else could we do?
 
 ## Solution 2: Unbounded Channel
 
-We may first come up with this idea: can we make a channel that contains an infinite-sized buffer, i.e. unbounded channel? Though the language, it is not possible yet. However, such a pattern can be easily constructed:
+We may first come up with this idea: Can we make a channel that contains
+an buffer with infinite capacity, i.e. unbounded channel? Though the
+language, it is not possible yet. However, such a pattern can be easily
+constructed:
 
 ```go
 // MakeChan returns a sender and a receiver of a buffered channel
@@ -236,9 +283,14 @@ func MakeChan() (chan<- interface{}, <-chan interface{}) {
 }
 ```
 
-In the above implementation, we created two unbuffered channels. To not block the communication, a separate goroutine is created from the call. Whenever there is a send operation, it appends to a buffer `q`. To send the value to the receiver, a nested select loop that checks whether send is possible or not. If not, it keeps appending the data to the queue `q`.
+In the above implementation, we created two unbuffered channels. To not
+block the communication, a separate goroutine is created from the call. 
+Whenever there is a send operation, it appends to a buffer `q`. To send
+the value to the receiver, a nested select loop that checks whether send
+is possible or not. If not, it keeps appending the data to the queue `q`.
 
-When the input channel is closed, an additional loop over the queue `q` is used to run out all cached elements, then close the output channel.
+When the input channel is closed, an additional loop over the queue `q`
+is used to run out all cached elements, then close the output channel.
 
 Hence, another fix of the deadlock using unbounded channel would be:
 
@@ -277,18 +329,37 @@ func main() {
 }
 ```
 
-This unbounded channel is very similar to the commonly used standard graphics API pattern: CommandBuffer, a buffer that caches a series of draw calls, and does batch execution of a chunk of draw calls.
-
-## Conclusion: A Generic Channel Abstraction
-
-In this article, we discussed a form of deadlock in the select statement and two possible ways to address it. In the second approach, we discussed a possible way of implementing an unbounded channel construction. The implementation constructs an `interface{}` typed channel. 
-
-We may ask ourselves, does unbounded make sense to have in the Go language with this particular example? Does the Go team ever consider such usage?
-
-The answer to the second question is: Yes. They do, see [golang/go#20352](https://golang.org/issue/20352). The discussion thread shows that unbounded channels indeed serve a certain application, but clear drawbacks may hurt the application.
-The major drawback is that an unbounded channel may run out of memory (OOM). If there is a concurrency bug, the running application will keep eats memory from OS and eventually leads to OOM. Developers argue that an unbounded channel should be added to the language mainly because the `MakeChan` function is returning an `interface{}` typed channel which brings weakly typed flaw into the statically typed Go code. Eventually, Ian Lance Taylor from the Go team [clarifies](https://golang.org/issue/20352#issuecomment-365438524) that an unbounded channel may have a sort of usage but is unworthy to be added to the language. As long as we have generics, a type-safe unbounded channel can be easily implemented in a library, answering the first question.
-
-As of Go 1.18, soon we have type parameters, the above difficulty finally can be resolved. Here I provide a generic channel abstraction that is able to construct a type-safe, arbitrary sized channel:
+This unbounded channel is very similar to the commonly used standard
+graphics API pattern: CommandBuffer, a buffer that caches a series of
+draw calls, and does batch execution of a chunk of draw calls.
+
+## A Generic Channel Abstraction
+
+We have discussed a form of deadlock in the select statement
+and two possible ways to address it. In the second approach, we discussed
+a possible way of implementing an unbounded channel construction. The
+implementation constructs an `interface{}` typed channel.
+
+We may ask ourselves, does unbounded make sense to have in the Go language
+with this particular example? Does the Go team ever consider such usage?
+
+The answer to the second question is: Yes. They do, see [golang/go#20352](https://golang.org/issue/20352).
+The discussion thread shows that unbounded channels indeed serve a
+certain application, but clear drawbacks may hurt the application.
+The major drawback is that an unbounded channel may run out of memory (OOM).
+If there is a concurrency bug, the running application will keep eats memory
+from OS and eventually leads to OOM. Developers argue that an unbounded channel
+should be added to the language mainly because the `MakeChan` function is
+returning an `interface{}` typed channel which brings weakly typed flaw into
+the statically typed Go code. Eventually, Ian Lance Taylor from the Go team
+[clarifies](https://golang.org/issue/20352#issuecomment-365438524) that
+an unbounded channel may have a sort of usage but is unworthy to be added
+to the language. As long as we have generics, a type-safe unbounded channel
+can be easily implemented in a library, answering the first question.
+
+As of Go 1.18, soon we have type parameters, the above difficulty finally
+can be resolved. Here I provide a generic channel abstraction that is able
+to construct a type-safe, arbitrary sized channel:
 
 ```go
 // MakeChan is a generic implementation that returns a sender and a
@@ -297,8 +368,8 @@ As of Go 1.18, soon we have type parameters, the above difficulty finally can be
 // If the given size is positive, the returned channel is a regular
 // fix-sized buffered channel.
 // If the given size is zero, the returned channel is an unbuffered channel.
-// If the given size is -1, the returned an unbounded channel with an
-// infinite-sized buffer.
+// If the given size is -1, the returned an unbounded channel contains an
+// internal buffer with infinite capacity.
 func MakeChan[T any](size int) (chan<- T, <-chan T) {
 	switch {
 	case size == 0:
@@ -349,20 +420,30 @@ func MakeChan[T any](size int) (chan<- T, <-chan T) {
 ```go
 func main() {
 	in, out := MakeChan[int](1)
+	// Or:
 	// in, out := MakeChan[int](0)
 	// in, out := MakeChan[int](-1)
 
-	go func() {
-		in <- 42
-	}()
-
+	go func() { in <- 42 }()
 	println(<-out)
 }
 ```
 
-This code is executable on go2go playground: https://go2goplay.golang.org/p/krLWm7ZInnL.
+*_This code is executable on go2go playground:_ https://go2goplay.golang.org/p/krLWm7ZInnL
+
+## Conclusion
+
+In this article, we talked about a generic implementation of a channel with arbitrary capacity through a real-world deadlock example. We might ask again: Is it perfect?
+
+Well, the answer is non-trivial. As a generalization of channels, the other common operations should also be supported, such as `len()`, `cap()`, and `close()`.
+If we think carefully about the semantics of closing a channel, it is really just about closing the ability of input to that channel. Hence, implementing the `close()` functionality, it is simple and straightforward.
+
+However, the `len()` is not a thread-safe operation for arrays, slices, and maps, but it becomes preety clear that it has to be thread safe for channels, otherwise, there is no way to fetch channel length atomically. Nonetheless, does it really make sense to get the length of a channel? As we know that channel is typically used for synchronization purposes. If there is a `len(ch)` that happens concurrently with a send/receive operation, there is no guarantee what is the return of the `len()`. The length is outdated immediately as `len()` returns.
+This scenario is neither discussed in the [language specification](https://golang.org/ref/spec), or the [Go's memory model](https://golang.org/ref/mem). After all, Do we really need a `len()` operation for the ultimate channel abstraction? The answer speaks for itself.
 
 ## Further Reading Suggestions
 
 - Ian Lance Taylor. Type Parameters. March 19, 2021. https://golang.org/design/43651-type-parameters
 - rgooch. proposal: spec: add support for unlimited capacity channels. 13 May 2017. https://golang.org/issue/20352
+- The Go Authors. The Go Programming Language Specification. Feb 10, 2021. https://golang.org/ref/spec
+- The Go Authors. The Go Memory Model. May 31, 2014. https://golang.org/ref/mem