Thread: re[2]: [Jdbm-developer] Use of 2PL and MVCC for concurrency


<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0
 Transitional//EN">
<HTML><HEAD>
<STYLE type=text/css> P, UL, OL, DL, DIR,
 MENU, PRE { margin: 0 auto;}</STYLE>

<META content="MSHTML 6.00.2900.2802" name=GENERATOR></HEAD>
<BODY leftMargin=1 topMargin=1 rightMargin=1><FONT
 face=Tahoma>
<DIV><FONT face=Arial size=2>Bryan-</FONT></DIV>
<DIV><FONT face=Arial size=2></FONT>&nbsp;</DIV>
<DIV><FONT face=Arial size=2></FONT></DIV>
<DIV><FONT face=Arial size=2>For thoroughness,
 the answer is "yes" - I believe that 2PL
 could be used for jdbm.&nbsp; Lock releasing
 at the record level would occur during the
 first phase of the commit itself.&nbsp; The
 B-link trees and other special data structures
 would reside outside the CC layer to maximize
 concurrency (they will have to implement
 their own form of CC).</FONT></DIV>
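<DIV><FONT face=Arial size=2>As a rough illustration of the 2PL rule described here (locks accumulate until commit, are released together, and nothing may be acquired afterwards), a minimal Java sketch follows. The class TwoPhaseTx and its methods are hypothetical names for illustration only, not part of jdbm, and the sketch tracks a single transaction's own lock set rather than conflicts between transactions.</FONT></DIV>

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical sketch of one transaction's lock discipline under 2PL; not jdbm API.
class TwoPhaseTx {
    private final Set<Long> held = new HashSet<>(); // record ids currently locked
    private boolean shrinking = false;              // true once locks have been released

    // Growing phase: acquiring a lock is only legal before the first release.
    void lock(long recid) {
        if (shrinking) {
            throw new IllegalStateException("2PL violated: lock requested after release");
        }
        held.add(recid);
    }

    // Commit is the locked point; all record locks are released together.
    void commit() {
        shrinking = true;
        held.clear();
    }

    int heldCount() {
        return held.size();
    }
}
```

<DIV><FONT face=Arial size=2>Calling lock() after commit() throws, which is the "no more locks once any lock has been released" half of the protocol.</FONT></DIV>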
<DIV><FONT face=Arial size=2></FONT>&nbsp;</DIV>
<DIV><FONT face=Arial size=2>I think that
 using 2PL to induce timestamps in MVCC will
 negate the biggest advantage of MVCC, namely
 removing the potential for r-w and w-r conflicts.&nbsp;
 I believe that these kinds of conflicts will
 outnumber w-w conflicts by a significant
 margin (probably measured in orders of magnitude)
 in typical use cases...</FONT></DIV>
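<DIV><FONT face=Arial size=2>To make the r-w / w-r point concrete, here is a minimal sketch of how a pure MVCC read could avoid conflicting with a writer: each write installs a new version, and a reader picks the newest version at or before its own snapshot timestamp. VersionedRecord, install and read are hypothetical names, the sketch is single-threaded, and it is not how jdbm stores records.</FONT></DIV>

```java
import java.util.Map;
import java.util.TreeMap;

// Hypothetical single-record version chain; illustrative only, not jdbm API.
class VersionedRecord {
    // commit timestamp -> value; TreeMap keeps versions ordered by timestamp
    private final TreeMap<Long, String> versions = new TreeMap<>();

    // A writer installs a new version instead of overwriting in place.
    void install(long commitTs, String value) {
        versions.put(commitTs, value);
    }

    // A reader sees the newest version at or before its snapshot timestamp,
    // so a later write never invalidates what the reader has already seen.
    String read(long snapshotTs) {
        Map.Entry<Long, String> e = versions.floorEntry(snapshotTs);
        return e == null ? null : e.getValue();
    }
}
```

<DIV><FONT face=Arial size=2>A reader with snapshot 15 keeps seeing the version committed at 10 even after a writer commits at 20, which is the r-w conflict that locking on reads would otherwise introduce.</FONT></DIV>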
<DIV><FONT face=Arial size=2></FONT>&nbsp;</DIV>
<DIV><FONT face=Arial size=2>I'm challenged
 to think of how MVCC will prevent write conflicts
 - by my understanding, 2PL is what is going
 to be preventing the write conflict implicitly
 because it will not allow two tx to even
 read the same row (and in jdbm at least,
 a read is required prior to performing a
 write on an existing record).</FONT></DIV>
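<DIV><FONT face=Arial size=2>The argument above can be sketched as a lock table in which a read takes an exclusive lock: the second transaction is refused at the read, so it never gets far enough to create a competing version. ExclusiveReadLocks, tryRead and releaseAll are hypothetical names under that assumption (exclusive lock on every read), not jdbm API, and a real implementation would block or queue rather than return false.</FONT></DIV>

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical lock table that takes an exclusive lock on read; not jdbm API.
class ExclusiveReadLocks {
    private final Map<Long, Integer> owner = new HashMap<>(); // recid -> owning tx id

    // Returns true if the tx may read recid; false means it would have to wait.
    boolean tryRead(int txId, long recid) {
        Integer current = owner.putIfAbsent(recid, txId);
        return current == null || current == txId;
    }

    // Release every lock held by the tx (done at commit under 2PL).
    void releaseAll(int txId) {
        owner.values().removeIf(id -> id == txId);
    }
}
```

<DIV><FONT face=Arial size=2>With this discipline only one transaction at a time can hold a given row between its read and its commit, so at most one uncommitted version of that row can exist.</FONT></DIV>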
<DIV><FONT face=Arial size=2></FONT>&nbsp;</DIV>
<DIV><FONT face=Arial size=2>Anything I'm
 missing here?</FONT></DIV>
<DIV><FONT face=Arial size=2></FONT>&nbsp;</DIV>
<DIV><FONT face=Arial size=2>- K</FONT></DIV>
<DIV><FONT face=Arial size=2></FONT>&nbsp;</DIV>
<DIV><FONT face=Arial size=2></FONT>&nbsp;</DIV>
<DIV><FONT face=Arial size=2>&nbsp;</FONT>
 
<TABLE>
<TBODY>
<TR>
<TD width=1 bgColor=blue><FONT face=Arial
 size=2></FONT></TD>
<TD><FONT face=Arial size=2><FONT color=red>&gt;
 Kevin,<BR><BR>I have one more issue for this
 list:<BR><BR>3. &nbsp;Do we believe that
 we can use a 2PL protocol for jdbm?<BR><BR>A
 2PL protocol is more than simply locking
 resources. &nbsp;It requires<BR>that locks
 are acquired during one phase and then released
 during<BR>a second phase. &nbsp;Once any
 lock has been released, no more locks may<BR>be
 acquired. &nbsp;The transition between the
 lock acquisition stage and<BR>the lock release
 stage is the "locked point" of the transaction.<BR><BR>It's
 been a while since I read the b-link article,
 but it seems to <BR>me that it had some non-2PL
 locking. &nbsp;If so, then how do we manage<BR>that
 in a context in which 2PL is being used to
 induce timestamps<BR>for MVCC? &nbsp;(Perhaps
 we can discount btree locks if they are only<BR>used
 as index structures for records since the
 record state is<BR>primary? &nbsp;Or perhaps
 a distinct locking mechanism is required
 for<BR>the b-link tree?)<BR><BR>-bryan<BR><BR>-----Original
 Message-----<BR>From: Thompson, Bryan B.<BR>To:
 'Kevin Day '; <A href="mailto:jdb...@li..."><FONT
 color=#0000ff>'jdb...@li...</FONT></A>
 '; 'JDBM<BR>Developer listserv '<BR>Sent:
 2/28/2006 8:19 AM<BR>Subject: RE: [Jdbm-developer]
 Use of 2PL and MVCC for concurrency<BR><BR>Kevin
 wrote:<BR><BR>&gt; &nbsp;1. &nbsp;Do any
 of us believe that pre-declared write sets
 are feasible<BR>&gt; &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;in
 jdbm?<BR><BR>No. &nbsp;Based on reading and
 on my conversation with the postgres people<BR>I
 do not believe that any "real" databases
 use pre-declared write sets.<BR><BR>&gt;
 &nbsp;2. &nbsp;Does anyone see a way of creating
 a progressive (blocking)<BR>&gt; &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;transaction
 set without pre-declared write sets?<BR><BR>No.
 &nbsp;"Progressive" means that transaction
 rollbacks are not used in the<BR>CC strategy.
 &nbsp;This is not something that I can see
 us achieving.<BR><BR>However, there is another
 option. &nbsp;See [1], section 5.3.1. &nbsp;&nbsp;The<BR>option
 is to use locking to induce timestamps. &nbsp;This
 is not a<BR>progressive protocol and transactions
 may result in deadlocks<BR>in which case
 some transaction(s) will be restarted. &nbsp;However<BR>it
 allows the use of share locks, which means
 that you have<BR>concurrent readers, and
 SIX locks, which means that a share<BR>lock
 can be escalated to an exclusive lock once
 the other readers<BR>have released their
 locks. &nbsp;Once the "locked point" of the
 2PL<BR>protocol has been reached for a transaction
 a timestamp is computed<BR>for that transaction.
 &nbsp;Any deadlocks that arise do so in the 2PL
 protocol.<BR>ww synchronization never
 conflicts since it uses MVCC and<BR>always
 creates a new version.<BR><BR>I also have
 questions about the effective concurrency
 of this CC<BR>strategy and the situations
 under which it is possible to have a<BR>ww
 synchronization resulting in concurrent creation
 of multiple<BR>versions. &nbsp;It may be
 that you get all the concurrency with 2PL
 and<BR>just more overhead from the MVCC aspect.
 &nbsp;I have not seen any papers<BR>on the
 performance of this approach.<BR><BR>It appears
 that the highest concurrency comes from an
 optimistic CC<BR>strategy. &nbsp;However
 this means that you must retain the read
 sets and<BR>write sets of all concurrent
 transactions, so this is pretty much in<BR>direct
 conflict with the use of VLR Tx concurrently
 with short txs.<BR><BR>-bryan<BR><BR>[1]
 Bernstein, P. A. and Goodman, N. 1981. Concurrency
 Control in<BR>&nbsp;&nbsp;&nbsp;Distributed
 Database Systems. ACM Comput. Surv. 13, 2
 (Jun. 1981),<BR>&nbsp;&nbsp;&nbsp;185-221.
 DOI= <A href="http://doi.acm.org/10.1145/356842.356846"><FONT
 color=#0000ff>http://doi.acm.org/10.1145/356842.356846</FONT></A><BR><BR><A
 href="http://www-static.cc.gatech.edu/classes/AY2003/cs8803i_fall/ConcurrencyControl.pdf"><FONT
 color=#0000ff>http://www-static.cc.gatech.edu/classes/AY2003/cs8803i_fall/ConcurrencyControl.pdf</FONT></A><BR><BR><BR>-----Original
 Message-----<BR>From: <A href="mailto:jdb...@li..."><FONT
 color=#0000ff>jdb...@li...</FONT></A><BR>To:
 JDBM Developer listserv<BR>Sent: 2/27/2006
 9:09 PM<BR>Subject: [Jdbm-developer] Use
 of 2PL and MVCC for concurrency<BR><BR>Hi
 all- &nbsp;finally back from fun and frolic.
 &nbsp;I'm going to send a couple<BR>of emails
 with my comments on some of the discussions
 that have happened<BR>over the past week.
 &nbsp;I originally wrote these in a single
 email, but I<BR>think it will be better to
 have them separated...<BR><BR>Here's the
 first:<BR><BR>Use of 2PL and MVCC for concurrency
 - <BR><BR>In the JPEG that Bryan sent out
 outlining things, he indicated that RW<BR>synchronization
 will be done by locking, and ww sync would
 be done<BR>using MVCC. &nbsp;I'm struggling
 to see why MVCC would be of any advantage<BR>over
 2PL in a ww conflict scenario if it is not
 also applied in a rw<BR>scenario... &nbsp;When
 I consider that a transaction will have to
 read a<BR>record before it can write to it,
 2PL will pretty much prevent any two<BR>tx
 from ever having multiple versions of a single
 record...<BR><BR>The two papers ([1] and
 [2]) that Bryan has pointed us to are quite<BR>descriptive
 on the general strategy of mixing MVCC and
 2PL. &nbsp;As I read<BR>it, the only apparent
 way to actually combine 2PL and MVCC is if
 you<BR>have pre-defined write-sets. &nbsp;Without
 that, 2PL forces an exclusive,<BR>blocking
 lock on each read (it's the only way to ensure
 that the<BR>transaction's are both serializable
 AND progressive - the term<BR>'progressive'
 was missing from my vocabulary last week).<BR><BR>As
 I apply my understanding of 2PL to an MVCC
 implementation (and as<BR>implied in [1],
 and explicitly stated in [2] - see page 212
 in original<BR>paper, sentence: &nbsp;"These
 methods all require predeclaration of<BR>writelocks"),
 without a pre-declared writeset, all bets
 are off in terms<BR>of combining 2PL and
 MVCC.<BR><BR>[1] - <A href="http://www.vldb.org/conf/1983/P074.PDF"><FONT
 color=#0000ff>http://www.vldb.org/conf/1983/P074.PDF</FONT></A><BR>[2] - <A
 href="http://www-static.cc.gatech.edu/classes/AY2003/cs8803i_fall/ConcurrencyControl.pdf"><FONT
 color=#0000ff>http://www-static.cc.gatech.edu/classes/AY2003/cs8803i_fall/ConcurrencyControl.pdf</FONT></A><BR><BR>If
 2PL is implemented without pre-declared write
 sets, any advantages of<BR>MVCC will be lost
 (unless I am missing something very important,
 under<BR>2PL, there would never be more than
 one version of any given row). &nbsp;This<BR>pushes
 a hybrid scheme in the direction of either
 allowing the user to<BR>specify which model
 they want during start up, or having the
 system<BR>dynamically flip to 2PL for a period
 of time when it detects a bunch of<BR>tx
 restarts.<BR><BR>Alternatively, intention
 locks could be used on the read side of things<BR>to
 help the CC system determine whether it actually
 needs to abort a<BR>transaction in a write
 conflict (and determine which transaction
 to<BR>abort). &nbsp;But unless I see a very
 compelling mathematical proof<BR>otherwise,
 I do not believe that ensuring 'progressive'
 behavior will be<BR>possible with intention
 locks, in which case, we are back to aborting<BR>transactions
 during conflict - maybe the number of aborts
 will be<BR>reduced because the intention
 locks provide information that will<BR>determine
 whether a transaction should be blocked or
 aborted? &nbsp;I'd be<BR>curious about even
 that. &nbsp;With 2PL (unless we have predeclared
 write<BR>sets), we still have the problem
 of deadlock and lock contention, so you<BR>are
 going to have tx aborts there as well.<BR><BR><BR><BR>Another
 thing to consider: &nbsp;Given that there
 are typically many more<BR>reads in a database
 system than writes, the overhead of intention<BR>locking
 could be quite high. &nbsp;With a pure MVCC
 implementation, only<BR>write locks are required.<BR><BR><BR><BR>I
 think that this is a critical aspect of the
 discussion that really<BR>needs to be hammered
 out, and relatively soon. &nbsp;To that end,
 here are<BR>some specific questions:<BR><BR>1.
 &nbsp;Do any of us believe that pre-declared
 write sets are feasible in<BR>jdbm?<BR>2.
 &nbsp;Does anyone see a way of creating a
 progressive (blocking)<BR>transaction set
 without pre-declared write sets?<BR><BR><BR><BR><BR>Cheers!<BR><BR>-
 K<BR><BR><BR>Kevin Day<BR>Trumpet, Inc.<BR><A href="http://www.trumpetinc.com"><FONT
 color=#0000ff>www.trumpetinc.com</FONT></A><BR><A href="mailto:ke...@tr..."><FONT
 color=#0000ff>ke...@tr...</FONT></A><BR>602-438-7030<BR><BR>_______________________________________________
 Jdbm-developer mailing<BR>list <A href="mailto:Jdb...@li..."><FONT
 color=#0000ff>Jdb...@li...</FONT></A><BR><A
 href="https://lists.sourceforge.net/lists/listinfo/jdbm-developer"><FONT
 color=#0000ff>https://lists.sourceforge.net/lists/listinfo/jdbm-developer</FONT></A><BR><BR></FONT></FONT></TD></TR></TBODY></TABLE></DIV></FONT></BODY></HTML>