It can be done, the query form just appears to prevent it. I have a tool I wrote for my own research that I'd be happy to share. For now, try running the query formatted as follows through the web form:
 
count(distinct group_id)  as group_count 
 
(Select g.group_id, count(distinct ug.user_id)  as user_count From sf0507.groups g,sf0507.user_group ug Where g.is_public = 1 and g.status = 'A' and ug.group_id = g.group_id Group by g.group_id) a
 
user_count between 6 and 10
 
The three lines above are the input for the three text boxes on the query form. (Sorry if my email program inserted any line breaks.)
 
If anyone is interested in the graphical client I've developed for querying the Research Archive, let me know. I'll be happy to share the binaries and the source code. It is written in Delphi 5, however, so I don't know how many people will want the source.
 
On 5/22/07, Kevin Crowston <crowston@syr.edu> wrote:
That's a reasonable explanation, but the query you provide is not one
that can be done using the public query form, AFAIK.

On 22 May 2007, at 2:38 AM, Dan Delorey wrote:

> One thought that comes to mind is that the developers column in the
> stats table may not mean what you think it means. Those tables are
> calculated tables created at various intervals by the
> SourceForge.net back-end system and stored to the database which is
> then dumped monthly to Notre Dame. There are no firm answers in the
> SourceForge documentation regarding the database schema. However,
> my impression from what I read here ( http://sourceforge.net/docman/
> display_doc.php?docid=14040&group_id=1#project_statistics) is that
> development statistics are based on CVS and SVN commits. It could
> be that the developers column in that table tracks the number of
> developers who committed to the source repository over the
> corresponding time period. It would make sense for this number to
> always be less than or equal too the number of developers indicated
> by the FLOSSMole dataset (and usually for this number to be
> strictly less than the FLOSSMole number based on existing research).
>
> You may try this query which pulls the data from the UND database
> that corresponds to the data in the FLOSSMole tables you mention:
>
> select count(distinct group_id) group_count
> from (select g.group_id,
>              count(distinct ug.user_id) as user_count
>       from groups g,
>            user_group ug
>       where g.is_public = 1
>         and g.status = 'A'
>         and ug.group_id = g.group_id
>       group by g.group_id) a
> where user_count between 6 and 10
>
> The subquery counts the number of users per project for all
> projects that are public and active. The main query counts the
> number of projects from the subquery that have between 6 and 10
> users. The numbers do not match exactly. From this query I get 4335
> projects compared to your 4322, but that's a difference I'm
> comfortable chalking up to different data collection cycles.
>
> Hope that helps.
>
> On 5/21/07, Megan Conklin <mconklin@elon.edu> wrote: Kevin Crowston
> wrote:
> > The only thing that comes to mind is a different definition of
> > developers in the two data sets--it seems that flossmole's
> dev_count is
> > greater than Notre Dame's:
>
> Just to be clear, dev_count in the projects table is not a calculated
> field. Rather, it is simply the number of developers as reported on
> the main project page.
>
> When our spider "clicks" into the devs page for a project and gathers
> the specific devs, those go into the developers and dev_projects
> tables, the theory being that if you add up the number of devs for
> each project, it would probably match the dev_count value.
>
> To test this, I looked at a few of the projects you sent, such as ac3:
>
> FLOSSMole data:
>
> ac3     10
>
> Notre Dame's data:
>
> 8,ac3,
>
> the number '10' in our data set is of course the dev_count value which
> comes from their project page.
>
> I then ran this query to see exactly who the devs are:
>
> SELECT *
> FROM developer_projects
> WHERE datasource_id =57
> AND proj_unixname = 'ac3'
>
> And indeed there are 10 devs listed for this project:
>
> ge2kme
> hib (is_admin)
> jansb000 (is_admin)
> jvlau
> mafattah
> majanson
> saunup
> snrichards
> structor
> s_armondi
>
> 2 are marked as admins as you see there. But then Kevin says:
>
> > I just looked at the acmemail page and it does say 8 developers,
> not 6
> > as in the ND data. Odd...  I was hypothesizing that the ND data
> doesn't
> > include admins, but acmemail has 3 admins, not 2. So it's a
> mystery...
>
> That is very strange. Here are the devs listed for acmemail:
>
> acme (is_admin)
> muttley
> peterw (is_admin)
> tfraser
> twoshortplanks
> unbekannt
> waaa
> wim (is_admin)
>
> So yeah, 8 devs with 3 admins. These are the ones listed on the web
> site (still), so I'm not sure what to tell you about the ND data.
>
> Let us know when you find out!
>
> -megan
>
>
> ----------------------------------------------------------------------
> ---
> This SF.net email is sponsored by DB2 Express
> Download DB2 Express C - the FREE version of DB2 express and take
> control of your XML. No limits. Just data. Click to get it now.
> http://sourceforge.net/powerbar/db2/
> _______________________________________________
> Ossmole-discuss mailing list
> Ossmole-discuss@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/ossmole-discuss
>
> ----------------------------------------------------------------------
> ---
> This SF.net email is sponsored by DB2 Express
> Download DB2 Express C - the FREE version of DB2 express and take
> control of your XML. No limits. Just data. Click to get it now.
> http://sourceforge.net/powerbar/db2/
> _______________________________________________
> Ossmole-discuss mailing list
> Ossmole-discuss@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/ossmole-discuss


Kevin Crowston
Syracuse University                            Phone:  +1 (315) 443-1676
School of Information Studies                    Fax:    +1 (866)
265-7407
348 Hinds Hall                                Web:    http://
crowston.syr.edu/
Syracuse, NY   13244-4100   USA

*PS: The attachment named "PGP.sig" of type "application/pgp-
signature" is an electronic signature that may be used to verify that
this email came from me if you have PGP or GPG. Otherwise, you may
safely ignore the attachment.



-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Ossmole-discuss mailing list
Ossmole-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ossmole-discuss