Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[META] 8.x PQ improvements #11447

Open
21 of 32 tasks
colinsurprenant opened this issue Dec 17, 2019 · 3 comments
Open
21 of 32 tasks

[META] 8.x PQ improvements #11447

colinsurprenant opened this issue Dec 17, 2019 · 3 comments

Comments

@colinsurprenant
Copy link
Contributor

colinsurprenant commented Dec 17, 2019

These are the PQ issues that should be looked at in 7.x

Phase 1

Permissions & .lock file

Page file size too small exception

Docs

Recovery (pkcheck/pqrepair)

Phase 2

Review, triage and/or fix issues in this list persistent queues

Needs fixing

No fix required:

Phase 3 & Beyond

Needs discussion

Needs watch

Recovery (pkcheck/pqrepair)

  • Add a new pqdump utility which can dump in JSON all/any data in a queue dir.
  • Improve automatic recovery at logstash startup

Timeouts and batching

  • Re-assess the state of queue write timeout handling WRT plugins like the http input, anything else required to move forward?
  • [META] Queue timeouts + Batching [META] Queue timeouts + Batching #9389

Performance

@zez3
Copy link

zez3 commented Feb 18, 2021

I was reading this https://www.elastic.co/blog/using-parallel-logstash-pipelines-to-improve-persistent-queue-performance

I see that there is an issue(“To put this another way, a single pipeline can only drive the disk with a single thread. This is true even if a pipeline were to have multiple inputs, as additional inputs in a single pipeline do not increase disk I/O threads.”)

The proposed "Solution" for improving overall performance I would call it a workaround that unfortunately does not work for all cases. We where planing to send syslog -> LS directly no multiple filebeats instances.

Does the support of additional persistent queue threads running in parallel matches any issues described on this meta?

@zez3
Copy link

zez3 commented Feb 19, 2021

I would also add one more issue.

After filling up the PQ file LS should check if the ES cluster is in read-only state and stop outputting to that cluster.
A recheck every x seconds would help

Similar to:
#10023

@roaksoax roaksoax changed the title [META] 7.x PQ improvements [META] 8.x PQ improvements Jan 19, 2022
@zalseryani
Copy link

Greetings,
any update on this ?

Thank you.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

6 participants
@colinsurprenant @roaksoax @zez3 @zalseryani @kaisecheng and others