Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DAOS-17146 engine: call only pmemobj_close() on stop #16051

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

janekmi
Copy link
Contributor

@janekmi janekmi commented Mar 6, 2025

  • dmg system stop -> SIGINT will trigger a new server stop path which:
    • stops all the execution streams to stop them from using the PMEM pools
    • close all the PMEM pools (pmemobj_close())
    • terminate the daos_engine process
  • dmg system stop --force -> SIGKILL remains as the last resort in case the previous one stucks
  • the graceful teardown process will be unavailable till we fix it (DAOS-XXXX)

Priority: 2

Steps for the author:

  • Commit message follows the guidelines.
  • Appropriate Features or Test-tag pragmas were used.
  • Appropriate Functional Test Stages were run.
  • At least two positive code reviews including at least one code owner from each category referenced in the PR.
  • Testing is complete. If necessary, forced-landing label added and a reason added in a comment.

After all prior steps are complete:

  • Gatekeeper requested (daos-gatekeeper added as a reviewer).

- `dmg system stop` -> SIGINT will trigger a new server stop path which:
  - stops all the execution streams to stop them from using the PMEM
    pools
  - close all the PMEM pools (`pmemobj_close()`)
  - terminate the `daos_engine` process
- `dmg system stop --force` -> SIGKILL remains as the last resort in
  case the previous one stucks
- the graceful teardown process will be unavailable till we fix it
  (DAOS-XXXX)

Priority: 2

Signed-off-by: Jan Michalski <[email protected]>
@janekmi janekmi requested review from a team as code owners March 6, 2025 16:41
Copy link

github-actions bot commented Mar 6, 2025

Ticket title is 'Make sure pmemobj_close() is called despite stop --force'
Status is 'In Progress'
Labels: 'ALCF'
https://daosio.atlassian.net/browse/DAOS-17146

@@ -195,6 +195,8 @@ struct dss_xstream *dss_get_xstream(int stream_id);
int dss_xstream_cnt(void);
void dss_mem_total_alloc_track(void *arg, daos_size_t bytes);
void dss_mem_total_free_track(void *arg, daos_size_t bytes);
void
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please maintain consistency in formatting.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I can't. It is what clang-format does. I don't like it either.

*
* \param[in] is_sysdb close sysdb.
*/
void
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

.

@@ -622,6 +623,9 @@ int d_uhash_link_insert(struct d_hash_table *htable, struct d_uuid *key,
void *cmp_args, struct d_ulink *ulink);
struct d_ulink *d_uhash_link_lookup(struct d_hash_table *htable,
struct d_uuid *key, void *cmp_args);
typedef int (*d_uhash_traverse_cb_t)(struct d_ulink *link, void *arg);
int
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

Successfully merging this pull request may close these issues.

2 participants