|
From: Michel B. <mi...@bo...> - 2005-06-30 09:45:18
|
Le Jeudi 30 Juin 2005 10:25, Lionel Bouton a =E9crit : > Michel Bouissou wrote the following on 30.06.2005 09:08 : > >Le Mardi 28 Juin 2005 16:05, Lionel Bouton a =E9crit : > >>>BTW, why state that MySQL is a poor choice ? > >> > >>It's often buggy and doesn't follow the standard ? > > > >I didn't use MySQL until a few weeks ago, so I couldn't tell for older > >versions, but MySQL 4.1.12 seems to work perfectly for me. > > > >And it's 15 times faster (!!!) than PostgreSQL for the very DB-intensi= ve > > DSPAM (this nice crowd of bugs ;-) > > DSPAM is a special case. I've had a quick look into its backend drivers= : > they use proprietary features of MySQL to speed things up. Their > PostgreSQL driver is known to be less worked on too. That's true. But in my case, I had to focus on the actual result ;-) It also seems that the "proprietary features of MySQL" that allows to spe= ed=20 things up are missing in PostgreSQL. > For pure *transactional* (MySQL's MyISAM tables aren't in the same > competitive field) SQL, MySQL and PostgreSQL are roughly at the same > performance level although PostgreSQL is often reported to scale better > under concurrent accesses. I'm using "InnoDB" tables in MySQL, which are transactional (although I=20 configured MySQL not in "completely atomic" mode, as I didn't need full=20 transactional features, and to speed things up). In this configuration, a= nd=20 on the same machine, DSPAM still is 15 times faster using MySQL than usin= g=20 PostgreSQL... > On the stability front, of all the database admins I've had personnal > contacts with, only the MySQL ones witnessed crashes (and chronic ones > at that) and data loss not related to hardware failures. InnoDB tables are reported to be very safe and robust (and that's why I=20 choosed to use them). > But for 6 years I've personnaly worked with PostgreSQL in various kinds > of loads and queries, I've *never* had a crash. I don't have any doubt that PostgreSQL is very good and stable. I've used= it=20 for years as well, and have never lost data, nor have I seen any PostgreS= QL=20 crash. Only this speed issue with DSPAM made me shift from Pg to MySQL. > In the quirkiness department, in SQLgrey, setting aside the now() > function declaration for SQLite, although I originally developped > SQLgrey with MySQL, all the special cases are for it, from the top of m= y > head: > - the "I update the timestamp because I think it's better for you" bugf= ix, > - the need to tell the client to autoreconnect because the server drops > the connection without being told to, > - the heavy SQL involved with timestamp differences. > > The first two introduced bugs in SQLgrey, the last one made me spent > quite some time looking for a working SQL syntax... Well, I don't know for "the server drops the connection without being tol= d=20 to", but the fact that 2 different SQL servers behave differently on some= =20 details isn't surprising, and wouldn't make me say that one is better tha= n=20 the other, as long as this is documented -- which is the case for timesta= mp=20 fields for example. > <OT> Yep. In my case, the lack of performance is one reason I consider > evaluating other statistical filters. Even with MySQL, DSPAM doesn't > seem to scale well enough for thousands of users. Too bad, DSPAM comes > with a nice web interface out of the box. I'll have to do some php/perl > web hacking and I lack time for this :-(</OT> The things that mostly pisses me off with DSPAM is the number of bugs it=20 contains, the fact that long reported bugs aren't fixed (as the developpe= r=20 prefers adding new features rather than fixing known bugs, and quite ofte= n=20 seems not to want to hear that his software may have bugs...), plus the f= act=20 that half of the times the fixing of a bug introduces another new bug. I have more than serious doubts about the quality of this software and it= s=20 development. OTOH, it "mostly works" (surprisingly enough ;-) and when it works, it us= ually=20 works very good. But for example I've noticed than on my system, since I've upgraded to th= e=20 latest (supposedly stable) DSPAM 3.4.8, retraining missed spams and false= =20 positives doesn't work anymore :-(( DSPAM says it has done it and doesn't complain, but the tokens in the DB=20 aren't actually reversed :-(( </OT> --=20 Michel Bouissou <mi...@bo...> OpenPGP ID 0xDDE8AC6E |