[Bug] Coordinator miss to change the partition's replicas hold by restarted server to online #334
Open
1 of 2 tasks
Labels
Milestone
Search before asking
Fluss version
main
Minimal reproduce step
Create a partitioneded table with
table.replication.factor
=3
, and stop a tablet server hold one of the replica..The replica will be kicked off from ISR. After restart the server again, the replica should be added to ISR, but I found it didn't..What doesn't meet your expectations?
When a tablet server started, the
CoordinatorEventProcessor
will find all replicas in this server via method replicasOnTabletServer , and change the state of the replicas to online..But this method only consider non-partitioned-table, miss the replica for the partitioned table...
We should add the logic for get replica for partitioned table in this method..
Anything else?
No response
Are you willing to submit a PR?
The text was updated successfully, but these errors were encountered: