From: Antonio V. <ant...@ti...> - 2013-06-25 18:02:57
Hi Sebastian,

On 25/06/2013 09:36, Wagner Sebastian wrote:
> Thanks for your fast responses. It's great to hear all features are now
> free to use, though it took me a week and a half to find this out. [...]
> I would suggest that these texts be updated.

Thank you for reporting the issue; I will fix it ASAP. The same problem
also affects the corresponding cookbook page [1]. Anyway, please feel free
to update the wiki if you find outdated material.

[1] http://pytables.github.io/cookbook/hints_for_sql_users.html

--
Antonio Valentino

From: Antonio V. <ant...@ti...> - 2013-06-25 17:48:28
Hi Andre',

On 25/06/2013 10:26, Andre' Walker-Loud wrote:
> In my case, I won't have a table, but really just want a single object
> containing my metadata. I am wondering if there is a recommended way to
> do this? [...]

For leaf nodes (Tables, Arrays, etc.) you can use the "attrs" attribute
set [1] as described in [2]. For group objects (e.g. "root") you can use
the "set_node_attr" method [3] of File objects, or "_v_attrs".

cheers

[1] http://pytables.github.io/usersguide/libref/declarative_classes.html#attributesetclassdescr
[2] http://pytables.github.io/usersguide/tutorials.html#setting-and-getting-user-attributes
[3] http://pytables.github.io/usersguide/libref/file_class.html#tables.File.set_node_attr

--
Antonio Valentino

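For reference, Antonio's attribute approach in a minimal, runnable form;
the file name and the metadata values are assumptions, and the /data_1
layout follows this thread:

    import numpy as np
    import tables

    # Sketch: store fit metadata as HDF5 attributes (values are made up).
    with tables.open_file("fits.h5", "w") as h5:
        stat = h5.create_array("/data_1", "stat", np.zeros(10),
                               createparents=True)
        # Leaf nodes expose their AttributeSet as ``attrs``:
        stat.attrs.rng_seed = 42
        stat.attrs.fit_range = (0.5, 2.0)
        # For Group nodes, use ``_v_attrs`` or File.set_node_attr():
        h5.root.data_1._v_attrs.model = "two-exponential"
        h5.set_node_attr("/data_1", "initial_guess", [1.0, 0.1])
        print(stat.attrs.rng_seed, h5.get_node_attr("/data_1", "model"))
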
From: Anthony S. <sc...@gm...> - 2013-06-25 15:08:36
Also, depending on how much metadata you really need to store, you could
just use attributes. That is what they are there for.

On Tue, Jun 25, 2013 at 10:06 AM, Josh Ayers <jos...@gm...> wrote:
> Another option is to create a Python object - dict, list, or whatever
> works - containing the metadata and then store a pickled version of it
> in a PyTables array. [...]

From: Josh A. <jos...@gm...> - 2013-06-25 15:06:52
Another option is to create a Python object - dict, list, or whatever
works - containing the metadata and then store a pickled version of it in
a PyTables array. It's nice for this sort of thing because you have the
full flexibility of Python's data containers.

For example, if the Python object is called 'fit', then
numpy.frombuffer(pickle.dumps(fit), 'u1') will pickle it and convert the
result to a NumPy array of unsigned bytes. It can be stored in a PyTables
array using a UInt8Atom. To retrieve the Python object, just use
pickle.loads(hdf5_file.root.data_1.fit[:]).

It gets a little more complicated if you want to be able to modify the
Python object, because the length of the pickle will change. In that case,
you can use an EArray (for the case when the pickle grows), and store the
number of bytes as an attribute. Storing the number of bytes handles the
case when the pickle shrinks and doesn't use the full length of the
on-disk array. To load it, use
pickle.loads(hdf5_file.root.data_1.fit[:num_bytes]), where num_bytes is
the previously stored attribute. To modify it, just overwrite the array
with the new version, expanding if necessary, then update the num_bytes
attribute.

Using a PyTables VLArray with an 'object' atom uses a similar technique
under the hood, so that may be easier. It doesn't allow resizing though.

Hope that helps,
Josh

On Tue, Jun 25, 2013 at 1:33 AM, Andreas Hilboll <li...@hi...> wrote:
> For complex information I'd probably indeed use a table object. [...]

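For reference, a runnable sketch of Josh's recipe; the file name and the
contents of 'fit' are made up, and num_bytes follows his attribute
suggestion:

    import pickle

    import numpy as np
    import tables

    fit = {"seed": 42, "p0": [1.0, 0.1], "range": (0.5, 2.0)}  # hypothetical

    with tables.open_file("fits.h5", "w") as h5:
        # Pickle to bytes, view as unsigned bytes, and store in an EArray
        # so the node can grow if a later pickle is longer.
        buf = np.frombuffer(pickle.dumps(fit), "u1")
        arr = h5.create_earray("/data_1", "fit", atom=tables.UInt8Atom(),
                               shape=(0,), createparents=True)
        arr.append(buf)
        arr.attrs.num_bytes = len(buf)  # remember the live length

    with tables.open_file("fits.h5", "r") as h5:
        node = h5.root.data_1.fit
        restored = pickle.loads(node[:node.attrs.num_bytes].tobytes())
        print(restored["seed"])
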
From: Andreas H. <li...@hi...> - 2013-06-25 08:34:15
On 25.06.2013 10:26, Andre' Walker-Loud wrote:
> In my case, I won't have a table, but really just want a single object
> containing my metadata. I am wondering if there is a recommended way to
> do this? The "Table" does not seem optimal, but I don't see what else I
> would use. [...]

For complex information I'd probably indeed use a table object. It doesn't
matter if the table only has one row, but still you have all the
information there nicely structured.

-- Andreas.

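For reference, a sketch of the one-row metadata table Andreas suggests;
the column set and values are invented for illustration:

    import tables

    class FitMeta(tables.IsDescription):
        # Hypothetical columns describing one fit; adapt to your options.
        seed = tables.Int64Col()
        guess = tables.Float64Col(shape=(3,))
        fit_range = tables.Float64Col(shape=(2,))

    with tables.open_file("fits.h5", "w") as h5:
        meta = h5.create_table("/data_1", "fit", FitMeta,
                               createparents=True)
        row = meta.row
        row["seed"] = 42
        row["guess"] = (1.0, 0.1, 0.0)
        row["fit_range"] = (0.5, 2.0)
        row.append()  # the table's single row
        meta.flush()
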
From: Andre' Walker-L. <wal...@gm...> - 2013-06-25 08:25:42
Dear PyTables users,

I am trying to figure out the best way to write some metadata into some
files I have. The hdf5 file looks like

    /root/data_1/stat
    /root/data_1/sys

where "stat" and "sys" are Arrays containing statistical and systematic
fluctuations of numerical fits to some data I have. What I would like to
do is add another object

    /root/data_1/fit

where "fit" is just a metadata key that describes all the choices I made
in performing the fit, such as the seed for the random number generator,
and many choices of fitting options, like initial guesses for parameters,
the fitting range, etc.

I began to follow the example in the PyTables manual, in Section 1.2 "The
Object Tree", where first a class is defined

    class Particle(tables.IsDescription):
        identity = tables.StringCol(itemsize=22, dflt=" ", pos=0)
        ...

and then this class is used to populate a table.

In my case, I won't have a table, but really just want a single object
containing my metadata. I am wondering if there is a recommended way to do
this? The "Table" does not seem optimal, but I don't see what else I would
use.

Thanks,

Andre

From: Wagner S. <Seb...@ai...> - 2013-06-25 07:36:31
Hi Anthony and Antonio,

Thanks for your fast responses. It's great to hear all features are now
free to use, though it took me a week and a half to find this out.

The first reference I read to learn the usage of PyTables was Hints for
SQL Users [1], where it is stated several times, for example in the
section 'Creating an index':

> Indexing is supported in the commercial version of PyTables
> (PyTablesPro).

I would suggest that these texts be updated. Being convinced it was only
available in the Pro version after reading it so often, I also overlooked
the warning on the PyTables Pro page [2] (as I was only interested in the
features not available in the free version, I just scrolled down
immediately, reading diagonally...). So the next suggestion is to give the
warning text there a color :)

[1]
http://www.pytables.org/moin/HintsForSQLUsers#Creatinganindex
http://www.pytables.org/moin/HintsForSQLUsers#Selectingdata
[2]
http://www.pytables.org/moin/PyTablesPro

regards,
Sebastian

From: Antonio V. <ant...@ti...> - 2013-06-24 18:24:00
Hi Sebastian,

On 24/06/2013 11:25, Wagner Sebastian wrote:
> So the thing I don't understand: How can PyTables be so fast without
> any indexing? [...]
> I'm using 3.0.0rc2 coming with WinPython.

The indexing features of PyTables Pro have been available in the open
source version of PyTables since version 2.3 (please see [1]).

[1]
http://pytables.github.io/release-notes/RELEASE_NOTES_v2.3.x.html#changes-from-2-2-1-to-2-3

ciao

--
Antonio Valentino

From: Anthony S. <sc...@gm...> - 2013-06-24 18:17:35
On Mon, Jun 24, 2013 at 4:25 AM, Wagner Sebastian <Seb...@ai...> wrote:
> For testing purposes I use a PyTables DB with 4 columns (1x Uint8 and
> 3x Float) with 750k rows; the total file size is about 90MB. [...] But
> PyTables took only 0.05 seconds for a full table search (in-kernel, so
> near C-speed, but nevertheless a full table scan), while my bisecting
> algorithm with a precomputed sorted list wrapped around PyTables (but
> saved in there) took about 0.5 seconds.
>
> So the thing I don't understand: How can PyTables be so fast without
> any indexing?

Hi Sebastian,

First, there is no longer a non-free version of PyTables, and v3.0 *does*
have indexing capabilities. However, you have to enable them, so you
probably weren't using them.

PyTables is fast because HDF5 is a binary format, it uses pthreads under
the covers to parallelize some tasks, and it uses numexpr (which is also
parallel) to evaluate many expressions. All of these things help make
PyTables great!

Be Well
Anthony

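For reference, a minimal sketch of both query modes mentioned above: a
plain in-kernel where-query, and the same query after explicitly creating
an index (the file name, schema, and data are made up):

    import numpy as np
    import tables

    with tables.open_file("testdb.h5", "w") as h5:
        table = h5.create_table("/", "data",
                                {"value": tables.Float64Col()})
        rows = np.zeros(750000, dtype=[("value", "f8")])
        rows["value"] = np.random.random(750000)
        table.append(rows)
        table.flush()

        # In-kernel query: the condition is compiled by numexpr and run
        # at near-C speed over the whole table; no index required.
        hits = table.read_where("(value > 0.5) & (value < 0.50001)")

        # Indexing (free since PyTables 2.3) must be created explicitly:
        table.cols.value.create_index()
        hits_idx = table.read_where("(value > 0.5) & (value < 0.50001)")
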
From: Anthony S. <sc...@gm...> - 2013-06-24 18:11:38
Hello Giovanni,

Great to hear that everything is working much better for you now and that
everything is much faster and smaller than NPY ;)

> Do you know how the default value is set btw?

This is computed via a magical heuristic algorithm written by Francesc (?)
called computechunksize() [1]. This is really optimized for dense data
(Tables), so it is not surprising that it performs poorly in your case.
Any updates you want to make to PyTables to also handle sparse data well
out of the box would be very welcome ;)

1. https://github.com/PyTables/PyTables/blob/develop/tables/idxutils.py#L54

On Mon, Jun 24, 2013 at 10:51 AM, Giovanni Luca Ciampaglia <glc...@gm...> wrote:
> thanks for the explanation and the links, it's much clearer now. So
> without compression a CArray is really a smarter type of sparse file,
> but you have to set a sensible chunk shape. [...]

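For what it's worth, the chunk shape that the heuristic picks can be
inspected on any chunked leaf; a sketch (file and node names are made up):

    import tables

    with tables.open_file("tmp.h5", "w") as h5:
        # With chunkshape omitted, PyTables computes one automatically;
        # the chosen value is stored with the node.
        carr = h5.create_carray("/", "adj", tables.Float64Atom(),
                                shape=(3400000, 3400000))
        print(carr.chunkshape)  # heuristic default; dataset-dependent
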
From: Giovanni L. C. <glc...@gm...> - 2013-06-24 15:52:13
Hi Anthony,

thanks for the explanation and the links, it's much clearer now. So
without compression a CArray is really a smarter type of sparse file, but
you have to set a sensible chunk shape. Do you know how the default value
is set, btw? I am asking because I didn't see any change in performance
between using the default value and using (1, N), where (N, N) is the
shape of the matrix. I guess that the write performance depends crucially
on the size of the I/O buffer, so the default must be choosing a similar
setting.

Anyway, I have played a bit with other values of the chunk shape in
conjunction with the compression level, and using a shape of (1, 100) and
complevel=5 gives speeds that are only 10-15% slower than what I get with
shape=(1, 1) and complevel=0. The resulting file is 10 times smaller, and
something like 35 times smaller than a NPY sparse file, btw!

Thanks!

Giovanni

--
Giovanni Luca Ciampaglia

Postdoctoral fellow
Center for Complex Networks and Systems Research
Indiana University

✎ 910 E 10th St ∙ Bloomington ∙ IN 47408
☞ http://cnets.indiana.edu/
✉ gci...@in...

From: Wagner S. <Seb...@ai...> - 2013-06-24 09:25:23
Dear PyTables-Users,

For testing purposes I use a PyTables DB with 4 columns (1x Uint8 and 3x
Float) with 750k rows; the total file size is about 90MB. As the free
version does not support indexing, I thought that a (full-table) search on
this database would take at least one or two seconds, because the file has
to be loaded first (the bottleneck being I/O) and then the search over
~20k rows can begin. But PyTables took only 0.05 seconds for a full table
search (in-kernel, so near C-speed, but nevertheless a full table scan),
while my bisecting algorithm with a precomputed sorted list wrapped around
PyTables (but saved in there) took about 0.5 seconds.

So the thing I don't understand: How can PyTables be so fast without any
indexing?

I'm using 3.0.0rc2 coming with WinPython.

Regards,
Sebastian

From: Anthony S. <sc...@gm...> - 2013-06-23 06:00:38
Hi Giovanni!

I think that you may have some misunderstanding about how chunking works,
which is leading you to get terrible performance. In fact what you
describe (write it all and zip it) is a great strategy for using normal
Arrays.

However, chunking and CArrays don't work like this. If a chunk contains no
data, it is not written at all! Also, all zipping takes place on the chunk
level. Thus for very small chunks you can actually increase the file size
and access time by using compression.

For sparse matrices and CArrays, you need to play around with the
chunkshape argument to create_carray() and compression. Performance is
going to be affected by how dense the matrix is and how grouped it is. For
example, for a very dense and randomly distributed matrix, chunkshape=1
and no compression is best. For block diagonal matrices, the chunkshape
should be the nominal block shape. Compression is only useful here if the
blocks all have similar values or the block shape is large. For example

    1 1 0 0 0 0
    1 1 0 0 0 0
    0 0 1 1 0 0
    0 0 1 1 0 0
    0 0 0 0 1 1
    0 0 0 0 1 1

is well suited to a chunkshape of (2, 2).

For more information on the HDF model please see my talk slides and video
:) [1,2] I hope this helps.

Be Well
Anthony

PS. Glad to see you using the new API ;)

1. https://github.com/scopatz/hdf5-is-for-lovers
2. http://www.youtube.com/watch?v=Nzx0HAd3FiI

On Sat, Jun 22, 2013 at 6:34 PM, Giovanni Luca Ciampaglia <glc...@gm...> wrote:
> I have a sparse 3.4M x 3.4M adjacency matrix with nnz = 23M and wanted
> to see if CArray was an appropriate solution for storing it. [...]

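For reference, a sketch of the two knobs named above, the chunkshape
argument to create_carray() and compression, using the block-diagonal
pattern from the example (the shape, file name, and filter settings are
assumptions):

    import numpy as np
    import tables

    with tables.open_file("adjacency.h5", "w") as h5:
        # Chunk shape chosen to match the nominal block size of the
        # matrix; chunks that never receive data are not written at all.
        carr = h5.create_carray("/", "adj", tables.Float64Atom(),
                                shape=(6000, 6000), chunkshape=(2, 2),
                                filters=tables.Filters(complevel=5,
                                                       complib="zlib"))
        carr[0:2, 0:2] = np.ones((2, 2))  # touches exactly one 2x2 chunk
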
From: Giovanni L. C. <glc...@gm...> - 2013-06-22 23:34:20
Hi all,

I have a sparse 3.4M x 3.4M adjacency matrix with nnz = 23M and wanted to
see if CArray was an appropriate solution for storing it. Right now I am
using the NumPy binary format for storing the data in coordinate format
and loading the matrix with Scipy's sparse coo_matrix class. As far as I
understand, with CArray the matrix would be written in full (zeros
included), but a) since it's chunked, accessing it does not take memory,
and b) with compression enabled it would be possible to keep the size of
the file reasonable.

If my assumptions are correct, then here is my problem: I am running into
problems when writing the CArray to disk. I adapted the example from the
documentation [1] and when I run the code on a 6000x6000 matrix with
nnz = 17K I achieve a decent speed of roughly 4100 elements/s. However,
when I try it on the full matrix the writing speed drops to 4 elements/s.
Am I doing something wrong? Any feedback would be greatly appreciated!

Code: https://gist.github.com/junkieDolphin/5843064

Cheers,

Giovanni

[1]
http://pytables.github.io/usersguide/libref/homogenous_storage.html#the-carray-class

--
Giovanni Luca Ciampaglia

☞ http://www.inf.usi.ch/phd/ciampaglia/
✆ (812) 287-3471
✉ glc...@gm...

From: Francesc A. <fa...@gm...> - 2013-06-10 21:35:31
Ah, that's good to know. Yes, I see the warning is definitely helping
people to flush periodically and helping to prevent data corruption.

Thanks for the feedback,
Francesc

On 6/10/13 5:16 PM, Edward Vogel wrote:
> I initially didn't sync at all until after completing writing - about
> 1 million rows total. My main concern was preventing data corruption.
> After seeing the warning I had a sync for every iteration of the inner
> loop, which was slow. Syncing after the inner loop is a little slower
> than not syncing, but seems fine. [...]

--
Francesc Alted

From: Edward V. <edw...@gm...> - 2013-06-10 21:16:17
I initially didn't sync at all until after completing writing - about
1 million rows total. My main concern was preventing data corruption.
After seeing the warning I had a sync for every iteration of the inner
loop, which was slow. Syncing after the inner loop is a little slower than
not syncing, but seems fine.

Thanks,
Ed

On Mon, Jun 10, 2013 at 4:37 PM, Francesc Alted <fa...@gm...> wrote:
> After fixing the issue, has performance been enhanced? I'm the one who
> put the warning, so I'm curious whether this actually helps people or
> not. [...]

From: Francesc A. <fa...@gm...> - 2013-06-10 20:37:43
Hi Ed,

After fixing the issue, has performance been enhanced? I'm the one who put
the warning, so I'm curious whether this actually helps people or not.

Thanks,
Francesc

On 6/10/13 3:28 PM, Edward Vogel wrote:
> Adding the flush after the inner loop does fix the issue. (Thanks!)
> So, my followup question: why do I need a flush after the inner loop,
> but not when moving from the outer loop to the inner loop? [...]

--
Francesc Alted

From: Anthony S. <sc...@gm...> - 2013-06-10 19:42:45
On Mon, Jun 10, 2013 at 2:28 PM, Edward Vogel <edw...@gm...> wrote:
> Adding the flush after the inner loop does fix the issue. (Thanks!)

No problem! I am glad this worked.

> So, my followup question: why do I need a flush after the inner loop,
> but not when moving from the outer loop to the inner loop?

It has to do with when the write buffer gets created / filled / flushed.
These steps need to happen at the proper time or you can lose the data you
were writing, overflow memory, etc.

Be Well
Anthony

From: Edward V. <edw...@gm...> - 2013-06-10 19:28:59
Yes, exactly.

I'm pulling data out of C that has a 1-to-many relationship, and dumping
it into PyTables for easier analysis. I'm creating extension classes in
Cython to get access to the C structures. It looks like this (basically,
each cv1 has several cv2s):

    h5.create_table('/', 'cv1', schema_cv1)
    h5.create_table('/', 'cv2', schema_cv2)
    cv1_row = h5.root.cv1.row
    cv2_row = h5.root.cv2.row
    for cv in sf.itercv():
        cv1_row['addr'] = cv['addr']
        ...
        cv1_row.append()
        for cv2 in cv.itercv2():
            cv2_row['cv1_addr'] = cv['addr']
            cv2_row['foo'] = cv2['foo']
            ...
            cv2_row.append()
        h5.root.cv2.flush()  # This fixes the issue

Adding the flush after the inner loop does fix the issue. (Thanks!) So, my
followup question: why do I need a flush after the inner loop, but not
when moving from the outer loop to the inner loop?

Thanks!

From: Anthony S. <sc...@gm...> - 2013-06-10 18:48:32
Hi Ed,

Are you inside of a nested loop? You probably just need to flush after the
innermost loop.

Do you have some sample code you can share?

Be Well
Anthony

On Mon, Jun 10, 2013 at 1:44 PM, Edward Vogel <edw...@gm...> wrote:
> I have a dataset that I want to split between two tables. But, when I
> iterate over the data and append to both tables, I get a warning: [...]

From: Edward V. <edw...@gm...> - 2013-06-10 18:44:58
I have a dataset that I want to split between two tables. But, when I
iterate over the data and append to both tables, I get a warning:

    /usr/local/lib/python2.7/site-packages/tables/table.py:2967:
    PerformanceWarning: table ``/cv2`` is being preempted from alive nodes
    without its buffers being flushed or with some index being dirty. This
    may lead to very ineficient use of resources and even to fatal errors
    in certain situations. Please do a call to the .flush() or
    .reindex_dirty() methods on this table before start using other nodes.

However, if I flush after every append, I get awful performance. Is there
a correct way to append to two tables without doing a flush? Note, I don't
have any indices defined, so it seems reindex_dirty() doesn't apply.

Thanks,
Ed

From: Anthony S. <sc...@gm...> - 2013-06-06 02:29:03
|
Thanks Tim! You are the best. Hopefully I will get to this later tonight.

Be Well
Anthony

On Wed, Jun 5, 2013 at 9:20 PM, Tim Burgess <tim...@ma...> wrote:

> On Jun 06, 2013, at 04:19 AM, Anthony Scopatz <sc...@gm...> wrote:
>
>> Thanks Antonio and Tim!
>>
>> These are great. I think that one of these should definitely make it into
>> the examples/ dir.
>>
>> Be Well
>> Anthony
>
> OK. I have put up a pull request with the code added.
> https://github.com/PyTables/PyTables/pull/266
>
> Cheers, Tim
|
From: Tim B. <tim...@ma...> - 2013-06-06 02:21:46
|
On Jun 06, 2013, at 04:19 AM, Anthony Scopatz <sc...@gm...> wrote:

> Thanks Antonio and Tim!
>
> These are great. I think that one of these should definitely make it into
> the examples/ dir.
>
> Be Well
> Anthony

OK. I have put up a pull request with the code added.
https://github.com/PyTables/PyTables/pull/266

Cheers, Tim
|
From: Anthony S. <sc...@gm...> - 2013-06-05 18:31:37
|
Hi Jeff,

I have made some comments in the issue. Thanks for investigating this so thoroughly.

Be Well
Anthony

On Tue, Jun 4, 2013 at 8:16 PM, Jeff Reback <jr...@ya...> wrote:

> Anthony,
>
> I created an issue with more info.
>
> I am not sure if this is a bug, or just the way both numexpr and PyTables
> treat strings that need to touch an encoded value.
>
> I found a workaround by specifying the condvars to readWhere. Any more
> thoughts on this?
>
> thanks Jeff
>
> https://github.com/PyTables/PyTables/issues/265
>
> I can be reached on my cell (917)971-6387
>
> From: Anthony Scopatz <sc...@gm...>
> To: Jeff Reback <je...@re...>
> Cc: Discussion list for PyTables <pyt...@li...>
> Sent: Tuesday, June 4, 2013 6:39 PM
> Subject: Re: [Pytables-users] pytable 30 - encoding
>
> Hi Jeff,
>
> Hmmm, could you try doing the same thing on just an in-memory numpy array
> using numexpr? If this succeeds, it tells us that the problem is in
> PyTables, not numexpr.
>
> Be Well
> Anthony
>
> On Tue, Jun 4, 2013 at 11:35 AM, Jeff Reback <jr...@ya...> wrote:
>
> Anthony,
>
> I am using numexpr 2.1 (latest).
>
> This is puzzling; it doesn't matter what I pass (bytes or str), same result:
>
> (column == 'str-2')
>
> /mnt/code/arb/test/pytables-3.py(38)<module>()
> -> result = handle.root.test.table.readWhere(selector)
> (Pdb) handle.root.test.table.readWhere(selector)
> *** TypeError: string argument without an encoding
> (Pdb) handle.root.test.table.readWhere(selector.encode(encoding))
> *** TypeError: string argument without an encoding
> (Pdb)
>
> From: Anthony Scopatz <sc...@gm...>
> To: Jeff Reback <je...@re...>; Discussion list for PyTables <pyt...@li...>
> Sent: Tuesday, June 4, 2013 12:25 PM
> Subject: Re: [Pytables-users] pytable 30 - encoding
>
> Hi Jeff,
>
> Have you also updated numexpr to the most recent version? The error is
> coming from numexpr not compiling the expression correctly. Also, you
> might try making selector a str, rather than bytes:
>
> selector = "(column == 'str-2')"
>
> rather than
>
> selector = "(column == 'str-2')".encode(encoding)
>
> Be Well
> Anthony
>
> On Tue, Jun 4, 2013 at 8:51 AM, Jeff Reback <jr...@ya...> wrote:
>
> Anthony, where am I going wrong here?
>
> #!/usr/local/bin/python3
> import tables
> import numpy as np
> import datetime, time
>
> encoding = 'UTF-8'
> test_file = 'test_select.h5'
> handle = tables.openFile(test_file, "w")
> node = handle.createGroup(handle.root, 'test')
> table = handle.createTable(node, 'table', dict(
>     index=tables.Int64Col(),
>     column=tables.StringCol(25),
>     values=tables.FloatCol(shape=(3,)),
> ))
>
> # add data
> r = table.row
> for i in range(10):
>     r['index'] = i
>     r['column'] = ("str-%d" % (i % 5)).encode(encoding)
>     r['values'] = np.arange(3)
>     r.append()
> table.flush()
> handle.close()
>
> # read
> handle = tables.openFile(test_file, "r")
> result = handle.root.test.table.read()
> print("table data\n")
> print(result)
>
> # where
> print("\nselector\n")
> selector = "(column == 'str-2')".encode(encoding)
> print(selector)
> result = handle.root.test.table.readWhere(selector)
> print(result)
>
> and the following output:
>
> [sheep-jreback-/code/arb/test] python3 pytables-3.py
> table data
>
> [(b'str-0', 0, [0.0, 1.0, 2.0]) (b'str-1', 1, [0.0, 1.0, 2.0])
>  (b'str-2', 2, [0.0, 1.0, 2.0]) (b'str-3', 3, [0.0, 1.0, 2.0])
>  (b'str-4', 4, [0.0, 1.0, 2.0]) (b'str-0', 5, [0.0, 1.0, 2.0])
>  (b'str-1', 6, [0.0, 1.0, 2.0]) (b'str-2', 7, [0.0, 1.0, 2.0])
>  (b'str-3', 8, [0.0, 1.0, 2.0]) (b'str-4', 9, [0.0, 1.0, 2.0])]
>
> selector
>
> b"(column == 'str-2')"
> Traceback (most recent call last):
>   File "pytables-3.py", line 37, in <module>
>     result = handle.root.test.table.readWhere(selector)
>   File "/usr/local/lib/python3.3/site-packages/tables-3.0.0-py3.3-linux-x86_64.egg/tables/_past.py", line 35, in oldfunc
>     return obj(*args, **kwargs)
>   File "/usr/local/lib/python3.3/site-packages/tables-3.0.0-py3.3-linux-x86_64.egg/tables/table.py", line 1522, in read_where
>     self._where(condition, condvars, start, stop, step)]
>   File "/usr/local/lib/python3.3/site-packages/tables-3.0.0-py3.3-linux-x86_64.egg/tables/table.py", line 1484, in _where
>     compiled = self._compile_condition(condition, condvars)
>   File "/usr/local/lib/python3.3/site-packages/tables-3.0.0-py3.3-linux-x86_64.egg/tables/table.py", line 1358, in _compile_condition
>     compiled = compile_condition(condition, typemap, indexedcols)
>   File "/usr/local/lib/python3.3/site-packages/tables-3.0.0-py3.3-linux-x86_64.egg/tables/conditions.py", line 419, in compile_condition
>     func = NumExpr(expr, signature)
>   File "/usr/local/lib/python3.3/site-packages/numexpr-2.1-py3.3-linux-x86_64.egg/numexpr/necompiler.py", line 559, in NumExpr
>     precompile(ex, signature, context)
>   File "/usr/local/lib/python3.3/site-packages/numexpr-2.1-py3.3-linux-x86_64.egg/numexpr/necompiler.py", line 511, in precompile
>     constants_order, constants = getConstants(ast)
>   File "/usr/local/lib/python3.3/site-packages/numexpr-2.1-py3.3-linux-x86_64.egg/numexpr/necompiler.py", line 294, in getConstants
>     for a in constants_order]
>   File "/usr/local/lib/python3.3/site-packages/numexpr-2.1-py3.3-linux-x86_64.egg/numexpr/necompiler.py", line 294, in <listcomp>
>     for a in constants_order]
>   File "/usr/local/lib/python3.3/site-packages/numexpr-2.1-py3.3-linux-x86_64.egg/numexpr/necompiler.py", line 284, in convertConstantToKind
>     return kind_to_type[kind](x)
> TypeError: string argument without an encoding
> Closing remaining open files: test_select.h5... done
|
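For anyone landing on this thread with the same TypeError, here is a minimal sketch of the condvars workaround Jeff mentions; the name val is illustrative, and this assumes the test_select.h5 file produced by the script above:

import tables

with tables.open_file('test_select.h5', 'r') as handle:
    table = handle.root.test.table
    # Keep the condition itself a plain str and pass the bytes comparison
    # value separately via condvars, so numexpr never has to re-encode a
    # literal embedded in the expression.
    result = table.read_where("column == val", condvars={'val': b'str-2'})
    print(result)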
From: Anthony S. <sc...@gm...> - 2013-06-05 18:20:16
|
Thanks Antonio and Tim!

These are great. I think that one of these should definitely make it into the examples/ dir.

Be Well
Anthony

On Wed, Jun 5, 2013 at 8:10 AM, Francesc Alted <fa...@gm...> wrote:

> On 6/5/13 11:45 AM, Andreas Hilboll wrote:
> > On 05.06.2013 10:31, Andreas Hilboll wrote:
> >> On 05.06.2013 03:29, Tim Burgess wrote:
> >>> I was playing around with in-memory HDF5 prior to the 3.0 release.
> >>> Here's an example based on what I was doing.
> >>> I looked over the docs and it does mention that there is an option to
> >>> throw away the 'file' rather than write it to disk.
> >>> Not sure how to do that and can't actually think of a use case where I
> >>> would want to :-)
> >>>
> >>> And be wary, it is H5FD_CORE.
> >>>
> >>> On Jun 05, 2013, at 08:38 AM, Anthony Scopatz <sc...@gm...> wrote:
> >>>> I think that you want to set parameters.DRIVER to H5FD_CORE [1]. I
> >>>> haven't ever used this personally, but it would be great to have an
> >>>> example script, if someone wants to write one ;)
> >>>
> >>> import numpy as np
> >>> import tables
> >>>
> >>> CHUNKY = 30
> >>> CHUNKX = 8640
> >>>
> >>> if __name__ == '__main__':
> >>>
> >>>     # create dataset and add global attrs
> >>>     file_path = 'demofile_chunk%sx%d.h5' % (CHUNKY, CHUNKX)
> >>>
> >>>     with tables.open_file(file_path, 'w', title='PyTables HDF5 In-memory example', driver='H5FD_CORE') as h5f:
> >>>
> >>>         # dummy some data
> >>>         lats = np.empty([4320])
> >>>         lons = np.empty([8640])
> >>>
> >>>         # create some simple arrays
> >>>         lat_node = h5f.create_array('/', 'lat', lats, title='latitude')
> >>>         lon_node = h5f.create_array('/', 'lon', lons, title='longitude')
> >>>
> >>>         # create a 365 x 4320 x 8640 CArray of 32bit float
> >>>         shape = (365, 4320, 8640)
> >>>         atom = tables.Float32Atom(dflt=np.nan)
> >>>
> >>>         # chunk into daily slices and then further chunk days
> >>>         sst_node = h5f.create_carray(h5f.root, 'sst', atom, shape, chunkshape=(1, CHUNKY, CHUNKX))
> >>>
> >>>         # dummy up an ndarray
> >>>         sst = np.empty([4320, 8640], dtype=np.float32)
> >>>         sst.fill(30.0)
> >>>
> >>>         # write ndarray to a 2D plane in the HDF5
> >>>         sst_node[0] = sst
> >>
> >> Thanks Tim,
> >>
> >> I adapted your example for my use case (I'm using the EArray class,
> >> because I need to continuously update my database), and it works well.
> >>
> >> However, when I use this with my own data (but also creating the arrays
> >> like you did), I'm running into errors like "Could not wait on barrier".
> >> It seems like the HDF library is spawning several threads.
> >>
> >> Any idea what's going wrong? Can I somehow avoid HDF5 multithreading at
> >> runtime?
> >
> > Update:
> >
> > When setting max_blosc_threads=2 and max_numexpr_threads=2, everything
> > seems to work as expected (but a bit on the slow side ...).
>
> BTW, can you really notice the difference between using 1, 2 or 4
> threads? Can you show some figures? Just curious.
>
> --
> Francesc Alted
|
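Following up on Tim's aside about throwing the 'file' away rather than writing it to disk: a hedged sketch, assuming the driver_core_backing_store parameter behaves as described for PyTables 3.0 in-memory image files (a false value discards the H5FD_CORE image on close) and that the lowercase thread-limit parameters Andreas mentions can be passed straight to open_file:

import numpy as np
import tables

# Purely in-memory HDF5: with driver_core_backing_store=0 nothing is
# written to disk on close, so the filename acts only as a label.
with tables.open_file('scratch.h5', 'w',
                      driver='H5FD_CORE',
                      driver_core_backing_store=0,
                      max_blosc_threads=2,      # thread limits as reported
                      max_numexpr_threads=2) as h5f:  # to avoid the barrier error
    lats = h5f.create_array('/', 'lat', np.zeros(4320), title='latitude')
    print(lats[:5])

# 'scratch.h5' should not exist on disk after the block above.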