Postgres-XC / Bugs / #398 Concurrent updates/deletes can cause data inconsistency

Amit Khandekar - 2013-03-04

Here is one way we can fix :
The scan subplan for UPDATE should use FOR UDPATE if there are no coordinator quals.
If there are coordinator quals, there would be two nodes:
The upper node would do :
SELECT * from tab1 where ctid in (.....) FOR UPDATE
The lower node would perhaps be a SubPlan node which would do:
select ctid from tab1
Coordinator filter .....
The lower node would supply the ctids for the IN clause of upper node.

The above will ensure that when it comes to updating the rows, those rows will already be locked because of FOR UPDATE.

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Koichi Suzuki - 2013-03-12

priority: 4 --> 8
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Koichi Suzuki - 2013-06-11

milestone: 2663467 --> 1,2 Dev Q
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Koichi Suzuki - 2013-06-12

Would like to determine if we can fix this in 1.2 or later in the F2F meeting, Sept, 2013.

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Amit Khandekar - 2013-07-01

There is a precise mention of this scenario in PostgreSQL documentation:

"UPDATE, DELETE, SELECT FOR UPDATE, and SELECT FOR SHARE commands behave the same as SELECT in terms of searching for target rows: they will only find target rows that were committed as of the command start time. However, such a target row might have already been updated (or deleted or locked) by another concurrent transaction by the time it is found. In this case, the would-be updater will wait for the first updating transaction to commit or roll back (if it is still in progress). If the first updater rolls back, then its effects are negated and the second updater can proceed with updating the originally found row. If the first updater commits, the second updater will ignore the row if the first updater deleted it, otherwise it will attempt to apply its operation to the updated version of the row. The search condition of the command (the WHERE clause) is re-evaluated to see if the updated version of the row still matches the search condition. If so, the second updater proceeds with its operation using the updated version of the row. In the case of SELECT FOR UPDATE and SELECT FOR SHARE, this means it is the updated version of the row that is locked and returned to the client."

In Postgres-XC, if the first updater commits, the second updater does not re-apply the WHERE clause , and then does its operation (either delete, update or trigger func) using the OLD values. Not re-applying the WHERE clause is the main reason why this issue occurs.

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Koichi Suzuki - 2013-12-02

Mason submitted a patch to use primary key if available. The patch has not been tested yet.

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Koichi Suzuki - 2013-12-03

Group: 1.2 Dev Q --> 1.3 Dev Q
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Koichi Suzuki - 2014-06-11

Use of ctid has been removed in usual GUC settings.

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Koichi Suzuki - 2014-06-11

status: open --> closed
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Concurrent updates/deletes can cause data inconsistency

Group

Searches

Help

#398 Concurrent updates/deletes can cause data inconsistency

Discussion