-
Notifications
You must be signed in to change notification settings - Fork 900
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug]: TimescaleDB corrupt catalog tables #3874
Comments
…corruption with VACUUM FULL
…corruption with VACUUM FULL
…corruption with VACUUM FULL.
…corruption with VACUUM FULL
…corruption with VACUUM FULL
@markrui3 Many thanks for the bug report. I executed the script against PG 13.5 and This time I got:
It can fail differently, e.g.:
I repeated the test, this time without TS installed. This time I couldn't reproduce the bug. I repeated the entire experiment several times just to exclude the possibility that the problem doesn't always reproduce. I can confirm there is a bug in TS and it reproduces well. I also discovered that it reproduces against PG 14.1. Now I'm going to check the proposed fix #3875 |
Doing catalog queries inside a cache invalidation callback is a very problematic approach. We will need to rethink this entire logic in the |
Closing this as a duplicate of #3924 |
What type of bug is this?
Data corruption
What subsystems and features are affected?
Data node
What happened?
#2680 report the bug of catalog table corruption and 3a8f396bc7d014a2722b8113236b34b0eda73eec solved it. But it only discuss the situation with preload timescaledb.
When I create extension timescaledb and running the following script, the catalog table would be corrupted again.
We believe the corruption happens in the ParallelWorker situation.
The function call is like this:
If we call VACUUM FULL, parallel workers will actually do the things. If the above code reproduce cache, parallel workers will delete catalog because of the incorrect cache.
TimescaleDB version affected
2.5.0
PostgreSQL version used
13.5
What operating system did you use?
RedHat 7u2 3.10.0-327
What installation method did you use?
Source
What platform did you run on?
On prem/Self-hosted
Relevant log output and stack trace
No response
How can we reproduce the bug?
The text was updated successfully, but these errors were encountered: