From: Anthony S. <sc...@gm...> - 2013-07-17 21:59:44

Hi Pushkar,

I agree with Antonio. You should load your data with NumPy functions and
then write back out to PyTables. This is the fastest way to do things.

Be Well
Anthony

On Wed, Jul 17, 2013 at 2:12 PM, Antonio Valentino <ant...@ti...> wrote:
> numpy has some tools for loading data from csv files like loadtxt [1],
> genfromtxt [2] and other variants. None of them is OK for you?
>
> [1] http://docs.scipy.org/doc/numpy/reference/generated/numpy.loadtxt.html#numpy.loadtxt
> [2] http://docs.scipy.org/doc/numpy/reference/generated/numpy.genfromtxt.html#numpy.genfromtxt
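A minimal sketch of the NumPy route suggested above, using the PyTables 2.x
API seen elsewhere in this archive. The two-column dtype and the file names
are hypothetical stand-ins for the real 100-plus-column schema:

import numpy as np
import tables

# Hypothetical schema; extend the dtype to cover all real columns.
dtype = np.dtype([('id', np.int64), ('value', np.float64)])

# genfromtxt parses and type-checks in compiled NumPy code;
# filling_values supplies a default wherever a field fails to convert.
data = np.genfromtxt('input.csv', delimiter=',', dtype=dtype,
                     filling_values=0)

h5file = tables.openFile('output.h5', mode='w')
# A NumPy structured array can serve directly as the table description,
# and its contents are appended in a single call.
table = h5file.createTable('/', 'records', data)
h5file.close()

This keeps the per-row work out of the Python loop entirely, which is where
the reported factor of ~50 was being lost.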
From: Valeriy S. <sok...@gm...> - 2013-07-17 20:15:02

Not sure if the quoted message was delivered to the list (maybe because I
was not registered on this list), so reposting it this way...

On Fri, Jul 12, 2013 at 5:40 PM, Valeriy Sokolov <sok...@gm...> wrote:
> Hi,
>
> I am trying to store lots of small (~2Kb) files in filenodes with
> PyTables, and I ran into trouble with size overhead.
>
> 200 such files, which consume ~2Mb in total on the filesystem, take 14Mb
> in the .h5 file produced by PyTables. My experiments show that if I
> create 200 file nodes and store 1 byte in each, I get an .h5 of 14Mb.
> From roughly 200Kb per file node onwards, size increases linearly: 400Kb
> per node leads to 89Mb, and 800Kb per node leads to 164Mb.
>
> But I would like to store ~2Kb per node, and the current overhead (about
> 70Kb per file node) is pretty huge.
>
> Could you please help me with a work-around for this issue?
>
> Thank you in advance.
>
> --
> Best regards,
> Valeriy Sokolov.

--
Best regards,
Valeriy Sokolov.
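A hedged sketch of one possible work-around, again using the PyTables 2.x
API: instead of one filenode per payload, pack all the small payloads into
a single VLArray, one row per file, so the fixed per-node HDF5 cost is paid
once rather than 200 times. The input file names here are hypothetical:

import tables

h5file = tables.openFile('blobs.h5', mode='w')

# One enlargeable node holds every payload; each row is one
# variable-length byte string.
blobs = h5file.createVLArray(h5file.root, 'blobs', tables.VLStringAtom())

for fname in ('a.txt', 'b.txt'):
    blobs.append(open(fname, 'rb').read())

data = h5file.root.blobs[0]   # read payload 0 back as a string
h5file.close()

The trade-off is that the rows are no longer exposed through the file-like
interface that filenode provides.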
From: Antonio V. <ant...@ti...> - 2013-07-17 19:13:02

Hi Pushkar,

Il 17/07/2013 19:28, Pushkar Raj Pande ha scritto:
> I am trying to figure out the best way to bulk load data into pytables.
> [...]

numpy has some tools for loading data from csv files like loadtxt [1],
genfromtxt [2] and other variants. None of them is OK for you?

[1] http://docs.scipy.org/doc/numpy/reference/generated/numpy.loadtxt.html#numpy.loadtxt
[2] http://docs.scipy.org/doc/numpy/reference/generated/numpy.genfromtxt.html#numpy.genfromtxt

cheers

--
Antonio Valentino
From: Pushkar R. P. <top...@gm...> - 2013-07-17 17:29:01

Hi all,

I am trying to figure out the best way to bulk load data into PyTables.
This question may have already been answered, but I couldn't find what I
was looking for.

The source data is in the form of CSV, which may require parsing, type
checking, and setting default values when a field doesn't conform to the
type of its column. There are over 100 columns in a record. Doing this in
a Python loop for each row is very slow compared to just fetching the rows
from one PyTables file and writing them to another; the difference is
almost a factor of ~50.

I believe that if I load the data using a C procedure that does the
parsing and builds the records to write to PyTables, I can get close to
the speed of just copying rows from one PyTables file to another. But
maybe something simple and better already exists. Can someone please
advise? If a C procedure is what I should write, can someone point me to
some examples or snippets that I can refer to in putting this together?

Thanks,
Pushkar
From: Anthony S. <sc...@gm...> - 2013-07-12 18:40:58

Hi Robert,

Glad these materials can be helpful. (Note: these questions really should
be asked on the pytables-users mailing list, CC'd here, so please join
that list: https://lists.sourceforge.net/lists/listinfo/pytables-users)

On Fri, Jul 12, 2013 at 12:48 PM, Robert Nelson <rrn...@at...> wrote:
> Dr. Scopatz,
>
> I came across your SciPy 2012 "HDF5 is for lovers" video and thought you
> might be able to help me.
>
> I'm trying to read large (>1GB) HDF files and do multidimensional
> indexing (with repeated values) on them. I saw a post of yours
> <http://www.mail-archive.com/pyt...@li.../msg02586.html>
> from over a year ago saying that the best solution would be to convert
> it to a NumPy array, but this takes too long.

I think the strategy is the same as before. Ask (to the best of my
recollection) did not open an issue, so no changes have been made to
PyTables to handle this. Also, in this strategy you should only be loading
the indices to start with. I doubt (though I could be wrong) that you have
1 GB worth of index data alone. The whole idea here is to do a unique
(set) and a sort operation on the much smaller index data AND THEN use
fancy indexing to pull the actual data back out.

As always, some sample code and a sample file would be extremely helpful.
I don't think I can do much more for you without these.

Be Well
Anthony

> Have there been any updates in PyTables that would make this possible?
>
> Thank you!
>
> Robert Nelson
> Colorado State University
> Rob...@gm...
> 763-354-8411
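A sketch of that unique/sort/fancy-index strategy. The node and index names
are hypothetical, and it assumes a PyTables version whose arrays accept
NumPy-style point selection along the first axis; on versions that do not,
the point selection can be replaced by a read() over a bounding slice:

import numpy as np
import tables

h5file = tables.openFile('big.h5', mode='r')
node = h5file.root.data              # large on-disk array

# Index data with repeats: much smaller than the array itself.
idx = np.array([5, 2, 5, 9, 2])

uniq = np.unique(idx)                # unique() also sorts its result
subset = node[uniq, ...]             # one read of only the needed rows

# searchsorted maps each original index to its position in uniq,
# reintroducing the repeats cheaply, in memory.
result = subset[np.searchsorted(uniq, idx)]
h5file.close()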
From: Anthony S. <sc...@gm...> - 2013-07-12 16:13:25

On Fri, Jul 12, 2013 at 1:51 AM, Mathieu Dubois <dub...@ya...> wrote:
> Hi Anthony,
>
> Thank you very much for your answer (it works). I will try to remodel my
> code around this trick but I'm not sure it's possible because I use a
> framework that needs arrays.

I think that this method still works. You can always send back a numpy
array to the main process that you pull out from a subprocess.

> Can somebody explain what is going on? I was thinking that PyTables
> keeps a weakref to the file for lazy loading but I'm not sure.
>
> In any case, the PyTables community is very helpful.

Glad to help!

Be Well
Anthony

> [...]
From: Mathieu D. <dub...@ya...> - 2013-07-12 06:51:34

Hi Anthony,

Thank you very much for your answer (it works). I will try to remodel my
code around this trick, but I'm not sure it's possible because I use a
framework that needs arrays.

Can somebody explain what is going on? I was thinking that PyTables keeps
a weakref to the file for lazy loading, but I'm not sure.

In any case, the PyTables community is very helpful.

Thanks,
Mathieu

Le 12/07/2013 00:44, Anthony Scopatz a écrit :
> Hi Mathieu,
>
> I think you should try opening a new file handle per process. The
> following works for me on v3.0:
> [...]
From: Anthony S. <sc...@gm...> - 2013-07-11 22:44:40

Hi Mathieu,

I think you should try opening a new file handle per process. The
following works for me on v3.0:

import tables
import random
import multiprocessing

# Use multiprocessing to perform a simple computation (column average)
def f(filename):
    h5file = tables.openFile(filename, mode='r')
    name = multiprocessing.current_process().name
    column = random.randint(0, 10)
    print '%s use column %i' % (name, column)
    rtn = h5file.root.X[:, column].mean()
    h5file.close()
    return rtn

p = multiprocessing.Pool(2)
col_mean = p.map(f, ['test.hdf5', 'test.hdf5', 'test.hdf5'])

Be well
Anthony

On Thu, Jul 11, 2013 at 3:43 PM, Mathieu Dubois <dub...@ya...> wrote:
> Thanks for your answer. Maybe you can point me to a working example?
>
> Here is the script that I have used to generate the data:
> [...]
>
> I hope it's not a stupid mistake. I am using PyTables 2.3.1 on Ubuntu
> 12.04 (libhdf5 is 1.8.4patch1).
From: Mathieu D. <dub...@ya...> - 2013-07-11 20:43:48

Le 11/07/2013 21:56, Anthony Scopatz a écrit :
> I have used multiprocessing and files opened in read mode many times so
> I am not sure what is going on here.

Thanks for your answer. Maybe you can point me to a working example?

> Could you provide the test.hdf5 file so that we could try to reproduce
> this?

Here is the script that I have used to generate the data:

import tables
import numpy

# Create data & store it
n_features = 10
n_obs = 100
X = numpy.random.rand(n_obs, n_features)

h5file = tables.openFile('test.hdf5', mode='w')
Xatom = tables.Atom.from_dtype(X.dtype)
Xhdf5 = h5file.createCArray(h5file.root, 'X', Xatom, X.shape)
Xhdf5[:] = X
h5file.close()

I hope it's not a stupid mistake. I am using PyTables 2.3.1 on Ubuntu
12.04 (libhdf5 is 1.8.4patch1).

> Only the slice that you ask for is brought into memory and it is
> returned as a non-view numpy array.

OK. I will be careful about that.
From: Anthony S. <sc...@gm...> - 2013-07-11 19:57:20

On Thu, Jul 11, 2013 at 2:49 PM, Mathieu Dubois <dub...@ya...> wrote:
> Hello,
>
> I wanted to use PyTables in conjunction with multiprocessing for some
> embarrassingly parallel tasks. However, it seems that it is not
> possible. [...]
>
> PicklingError: Can't pickle <type 'weakref'>: attribute lookup
> __builtin__.weakref failed
>
> I have googled for weakref and pickle but can't find a solution.
> Any help?

Hello Mathieu,

I have used multiprocessing and files opened in read mode many times so I
am not sure what is going on here. Could you provide the test.hdf5 file so
that we could try to reproduce this?

> By the way, I have noticed that by slicing a CArray, I get a numpy array
> (I created the HDF5 file with numpy). Therefore, everything is copied to
> memory. Is there a way to avoid that?

Only the slice that you ask for is brought into memory, and it is returned
as a non-view numpy array.

Be Well
Anthony
From: Mathieu D. <dub...@ya...> - 2013-07-11 19:49:33

Hello,

I wanted to use PyTables in conjunction with multiprocessing for some
embarrassingly parallel tasks. However, it seems that it is not possible.
In the following (very stupid) example, X is a CArray of size (100, 10)
stored in the file test.hdf5:

import random
import tables
import multiprocessing

n_features = 10

# Reload the data
h5file = tables.openFile('test.hdf5', mode='r')
X = h5file.root.X

# Use multiprocessing to perform a simple computation (column average)
def f(X):
    name = multiprocessing.current_process().name
    column = random.randint(0, n_features)
    print '%s use column %i' % (name, column)
    return X[:, column].mean()

p = multiprocessing.Pool(2)
col_mean = p.map(f, [X, X, X])

Executing it gives the following error:

Exception in thread Thread-2:
Traceback (most recent call last):
  File "/usr/lib/python2.7/threading.py", line 551, in __bootstrap_inner
    self.run()
  File "/usr/lib/python2.7/threading.py", line 504, in run
    self.__target(*self.__args, **self.__kwargs)
  File "/usr/lib/python2.7/multiprocessing/pool.py", line 319, in _handle_tasks
    put(task)
PicklingError: Can't pickle <type 'weakref'>: attribute lookup __builtin__.weakref failed

I have googled for weakref and pickle but can't find a solution. Any help?

By the way, I have noticed that by slicing a CArray, I get a numpy array
(I created the HDF5 file with numpy). Therefore, everything is copied to
memory. Is there a way to avoid that?

Mathieu
From: Tony Yu <ts...@gm...> - 2013-07-09 19:34:27

On Tue, Jul 9, 2013 at 1:58 PM, Anthony Scopatz <sc...@gm...> wrote:
> I have made my comments on the issue, but the short version is that I
> don't think this is a bug, iteration needs a rewrite, and you should use
> iterrows().
>
> PS you should upgrade to 3.0 and use the new API :)

Hey Anthony,

Thanks for your thorough response and explanation on the ticket. I closed
the ticket, and I'll be using `iterrows` instead of `islice` from now on.

I'll have to wait a bit to upgrade to 3.0, but I'm looking forward to
getting rid of all the camelCase.

Cheers!
-Tony
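For reference, a small sketch of the behavioral difference, reusing the
test.h5 file from the demonstration script later in this thread. Unlike
islice over the node's shared iterator, iterrows() takes absolute bounds:

import itertools
import tables

h5 = tables.openFile('test.h5')

for i in range(5):
    # islice resumes the node's internal iterator, so each pass
    # starts where the previous one stopped:
    print list(itertools.islice(h5.root.array, 0, 10))

for i in range(5):
    # iterrows takes explicit start/stop, so every pass yields rows 0..9:
    print list(h5.root.array.iterrows(0, 10))

h5.close()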
From: Anthony S. <sc...@gm...> - 2013-07-09 18:59:20

On Tue, Jul 9, 2013 at 8:57 AM, Tony Yu <ts...@gm...> wrote:
> Thanks for your quick reply. Ticket filed here:
>
> https://github.com/PyTables/PyTables/issues/267

Thanks Tony,

I have made my comments on the issue, but the short version is that I
don't think this is a bug, iteration needs a rewrite, and you should use
iterrows().

Be Well
Anthony

PS you should upgrade to 3.0 and use the new API :)
From: Tony Yu <ts...@gm...> - 2013-07-09 13:58:30

On Tue, Jul 9, 2013 at 12:58 AM, Antonio Valentino <ant...@ti...> wrote:
> Yes, this is a bug IMO. Thank you for reporting and thank you for the
> small demonstration script.
>
> Can you please file a bug report on github [1]?
> Please also add info about the PyTables version you used for the test.
>
> [1] https://github.com/PyTables/PyTables/issues

Thanks for your quick reply. Ticket filed here:

https://github.com/PyTables/PyTables/issues/267

Best,
-Tony
From: Antonio V. <ant...@ti...> - 2013-07-09 05:59:20

Hi Tony,

Il giorno 09/lug/2013, alle ore 06:38, Tony Yu <ts...@gm...> ha scritto:
> Hi,
>
> I ran into a subtle, unexpected issue while using `itertools.islice`. I
> wanted to pass slices of an array for processing without actually
> reading the entire array, and I wanted the processing function to know
> nothing about how I'm taking that slice. To that end, I had a loop that
> sliced the array using `itertools.islice` and called the function on
> each slice. Instead of returning the slice I specified, `islice`
> treated the previous slice's end as the starting point of the next
> slice.
>
> That description is a bit confusing, but the example below (along with
> the attached test data) should illustrate the point. Maybe I'm missing
> something, but the only work-around I found was to set a private flag
> (e.g. `h5.root.array._init = False`) on each call to `islice` to reset
> the counter used in `__iter__`.
>
> I'm not sure if this is expected behavior or not, but it does differ
> from how `islice` works on numpy arrays (as demonstrated in the example
> below). I googled and nothing similar came up, so I thought I'd post
> here.
>
> Best,
> -Tony
>
> import tables
> import itertools
> import numpy as np
>
> h5 = tables.openFile('test.h5')
> array = np.arange(100)
> for i in range(5):
>     # NumPy array slice always returns 0..10
>     print list(itertools.islice(array, 0, 10))
>     # PyTables array slice shifts with each iteration
>     print list(itertools.islice(h5.root.array, 0, 10))
> h5.close()

Yes, this is a bug IMO. Thank you for reporting and thank you for the
small demonstration script.

Can you please file a bug report on github [1]?
Please also add info about the PyTables version you used for the test.

[1] https://github.com/PyTables/PyTables/issues

--
Antonio Valentino
From: Anthony S. <sc...@gm...> - 2013-07-05 23:54:22

Thanks Mathieu! I am glad this is working for you now. File this one under
"Mysterious Errors of the Universe" :).

Be Well
Anthony

On Fri, Jul 5, 2013 at 6:51 PM, Mathieu Dubois <dub...@ya...> wrote:
> Hi,
>
> Sorry for the late response.
>
> First of all, I have managed to achieve what I wanted to do differently.
> [...]
From: Mathieu D. <dub...@ya...> - 2013-07-05 23:52:07

Hi,

Sorry for the late response.

First of all, I have managed to achieve what I wanted to do differently.

The code Francesc sent works well (I had to adapt it because I use version
2.3.1 under Ubuntu 12.04).

I was also able to reproduce something similar with a class like this
(copied & pasted from the tutorial):

import tables as tb
import numpy as np

class Subject(tb.IsDescription):
    # Subject information
    Id = tb.UInt16Col()
    Image = tb.Float32Col(shape=(121, 145, 121))

h5file = tb.openFile("tutorial1.h5", mode="w", title="Test file")
group = h5file.createGroup("/", 'subject', 'Subject information')
table = h5file.createTable(group, 'readout', Subject, "Readout example")

subject = table.row
for i in xrange(10):
    subject['Id'] = i
    subject['Image'] = np.ones((121, 145, 121))
    subject.append()

This code works well too.

So I don't really know why nothing was working yesterday: it was the same
class and a very similar program. I will try to investigate this later.

Thanks for everything,
Mathieu

Le 05/07/2013 16:54, Anthony Scopatz a écrit :
> Hi Francesc,
>
> I disagree that this shape is too large for a table. [...]
From: Anthony S. <sc...@gm...> - 2013-07-05 14:55:22

On Fri, Jul 5, 2013 at 8:40 AM, Francesc Alted <fa...@gm...> wrote:
> This is a bit large for a row in the Table object. My recommendation
> for these cases is to use an associated EArray with shape
> (0, 121, 145, 121) and then append the images there. [...]

Hi Francesc,

I disagree that this shape is too large for a table. Here is a minimal
example that works for me:

import tables as tb
import numpy as np

images = np.ones(100, dtype=[('id', np.uint16),
                             ('image', np.float32, (121, 145, 121))])

with tb.open_file('temp.h5', 'w') as f:
    f.create_table('/', 'images', images)

I think that there is something else going on with the initialization, but
Mathieu hasn't given us enough information to figure it out =/. A minimal
failing script would be super helpful here!

(BTW Mathieu, Tables can also take advantage of compression. Though
Francesc's solution is nicer for a lot of reasons too.)

Be Well
Anthony
From: Francesc A. <fa...@gm...> - 2013-07-05 13:40:12

On 7/5/13 1:33 AM, Mathieu Dubois wrote:
>> This shouldn't be the case. What is the value of IMAGE_SIZE?
>
> IMAGE_SIZE is a tuple containing (121, 145, 121).

This is a bit large for a row in the Table object. My recommendation for
these cases is to use an associated EArray with shape (0, 121, 145, 121)
and then append the images there. You can always refer to an image by
issuing a __getitem__() operation on the EArray object with the index of
the row in the table. Easy as pie, and you will allow the compression
library (in case you are using compression) to work much more efficiently
than it would on the table.

HTH,

-- Francesc Alted
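A minimal sketch of that layout, using the PyTables 2.x API seen elsewhere
in this archive; the file name, Filters settings, and single-column
Subject class are illustrative stand-ins:

import numpy as np
import tables

class Subject(tables.IsDescription):
    Id = tables.UInt16Col()    # the other subject columns go here

h5file = tables.openFile('subjects.h5', mode='w')
table = h5file.createTable(h5file.root, 'subjects', Subject)

# Enlargeable along the first axis: one slot per stored image.
images = h5file.createEArray(h5file.root, 'images', tables.Float32Atom(),
                             shape=(0, 121, 145, 121),
                             filters=tables.Filters(complevel=5))

img = np.ones((121, 145, 121), dtype=np.float32)   # stand-in image
images.append(img[np.newaxis])   # append() expects a (1, ...) block

row = table.row
row['Id'] = 0                    # row n of the table pairs with images[n]
row.append()
table.flush()

first_image = h5file.root.images[0]   # fetch an image by row index
h5file.close()

Keeping the row order of the table aligned with the first axis of the
EArray preserves the subject-to-image relation without storing the big
array inside the table row.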
From: Mathieu D. <dub...@ya...> - 2013-07-05 05:33:41

Le 05/07/2013 00:31, Anthony Scopatz a écrit :
> On Thu, Jul 4, 2013 at 4:13 PM, Mathieu Dubois <dub...@ya...> wrote:
>> Hello,
>>
>> I'm a beginner with PyTables.
>>
>> I wanted to store a database in an HDF5 file using PyTables. The DB is
>> made of a CSV file (which contains the subject information) and a lot
>> of images (I work on MRI, so the images are 3-dimensional float32
>> arrays of shape (121, 145, 121)). The relation is very simple: there
>> are 3 images per subject.
>>
>> My first idea was to create a class Subject like this:
>>
>> class Subject(tables.IsDescription):
>>     # Subject information
>>     Id = tables.UInt16Col()
>>     ...
>>     Image = tables.Float32Col(shape=IMAGE_SIZE)
>>
>> And then proceed like in the tutorial (open a file, create a group and
>> a table associated to the Subject class, and then append data to this
>> table).
>>
>> Unfortunately I got an error when creating the table (even before
>> inserting data):
>>
>> HDF5-DIAG: Error detected in HDF5 (1.8.4-patch1) thread 140612945950464:
>>   #000: ../../../src/H5Ddeprec.c line 170 in H5Dcreate1(): unable to create dataset
>>     major: Dataset
>>     minor: Unable to initialize object
>>   #001: ../../../src/H5Dint.c line 428 in H5D_create_named(): unable to create and link to dataset
>>     major: Dataset
>>     minor: Unable to initialize object
>>   #002: ../../../src/H5L.c line 1639 in H5L_link_object(): unable to create new link to object
>>     major: Links
>>     minor: Unable to initialize object
>>   #003: ../../../src/H5L.c line 1862 in H5L_create_real(): can't insert link
>>     major: Symbol table
>>     minor: Unable to insert object
>>   #004: ../../../src/H5Gtraverse.c line 877 in H5G_traverse(): internal path traversal failed
>>     major: Symbol table
>>     minor: Object not found
>>   #005: ../../../src/H5Gtraverse.c line 703 in H5G_traverse_real(): traversal operator failed
>>     major: Symbol table
>>     minor: Callback failed
>>   #006: ../../../src/H5L.c line 1685 in H5L_link_cb(): unable to create object
>>     major: Object header
>>     minor: Unable to initialize object
>>   #007: ../../../src/H5O.c line 2677 in H5O_obj_create(): unable to open object
>>     major: Object header
>>     minor: Can't open object
>>   #008: ../../../src/H5Doh.c line 296 in H5O_dset_create(): unable to create dataset
>>     major: Dataset
>>     minor: Unable to initialize object
>>   #009: ../../../src/H5Dint.c line 1034 in H5D_create(): can't update the metadata cache
>>     major: Dataset
>>     minor: Unable to initialize object
>>   #010: ../../../src/H5Dint.c line 799 in H5D_update_oh_info(): unable to update new fill value header message
>>     major: Dataset
>>     minor: Unable to initialize object
>>   #011: ../../../src/H5Omessage.c line 188 in H5O_msg_append_oh(): unable to create new message in header
>>     major: Attribute
>>     minor: Unable to insert object
>>   #012: ../../../src/H5Omessage.c line 228 in H5O_msg_append_real(): unable to create new message
>>     major: Object header
>>     minor: No space available for allocation
>>   #013: ../../../src/H5Omessage.c line 1940 in H5O_msg_alloc(): unable to allocate space for message
>>     major: Object header
>>     minor: Unable to initialize object
>>   #014: ../../../src/H5Oalloc.c line 1032 in H5O_alloc(): object header message is too large
>>     major: Object header
>>     minor: Unable to initialize object
>>
>> Traceback (most recent call last):
>>   File "00_build_dataset.tmp.py", line 52, in <module>
>>     dump_in_hdf5(**vars(args))
>>   File "00_build_dataset.tmp.py", line 32, in dump_in_hdf5
>>     data_api.Subject)
>>   File "/usr/lib/python2.7/dist-packages/tables/file.py", line 770, in createTable
>>     chunkshape=chunkshape, byteorder=byteorder)
>>   File "/usr/lib/python2.7/dist-packages/tables/table.py", line 832, in __init__
>>     byteorder, _log)
>>   File "/usr/lib/python2.7/dist-packages/tables/leaf.py", line 291, in __init__
>>     super(Leaf, self).__init__(parentNode, name, _log)
>>   File "/usr/lib/python2.7/dist-packages/tables/node.py", line 296, in __init__
>>     self._v_objectID = self._g_create()
>>   File "/usr/lib/python2.7/dist-packages/tables/table.py", line 983, in _g_create
>>     self._v_new_title, self.filters.complib or '', obversion)
>>   File "tableExtension.pyx", line 195, in tables.tableExtension.Table._createTable (tables/tableExtension.c:2181)
>> tables.exceptions.HDF5ExtError: Problems creating the table
>>
>> I think that the size of the column is too large (if I remove the Image
>> field, everything works perfectly).
>
> Hi Mathieu,
>
> This shouldn't be the case. What is the value of IMAGE_SIZE?

IMAGE_SIZE is a tuple containing (121, 145, 121).

> Be Well
> Anthony

>> Therefore what is the best way to store the images (while keeping the
>> relation)? I have read various posts about this subject on the web but
>> could not find a definitive answer (the most helpful was
>> http://stackoverflow.com/questions/8843062/python-how-to-store-a-numpy-multidimensional-array-in-pytables).
>>
>> I was thinking of creating an extensible array and storing each image
>> in the same order as the subjects. However, I would feel more
>> comfortable if the subject Id could be inserted too (to join the
>> tables).
>>
>> Any help?
>>
>> Mathieu
From: Tim B. <tim...@ma...> - 2013-07-04 23:11:45
|
Hi Mathieu,

As Anthony indicates, it's hard to discern the exact issue when you don't
provide much in the way of code to look at. If it helps, here is an example
of creating an HDF5 file with a float32 array of the dimensions you
specified. The shape value should be a tuple.

>>> import numpy as np
>>> import tables
>>> x = np.random.random((121, 145, 121))
>>> x.shape
(121, 145, 121)
>>> f = tables.openFile('patient.h5', 'w')
>>> atom = tables.Float32Atom()
>>> image1 = f.createCArray(f.root, 'image1', atom, x.shape)
>>> image1[:] = x
>>> f.close()

You could have an HDF5 file per patient and keep the three float32 arrays
and the CSV data as separate nodes. I would suggest that you have the HDF5
structure and the resulting PyTables code worked out before thinking about
how to wrap it in an object.

Cheers,
Tim

On 05/07/2013, at 7:13 AM, Mathieu Dubois <dub...@ya...> wrote:

> Hello,
>
> I'm a beginner with PyTables.
>
> I wanted to store a database in an HDF5 file using PyTables. The DB is
> made of a CSV file (which contains the subject information) and a lot of
> images (I work on MRI, so the images are 3-dimensional float32 arrays of
> shape (121, 145, 121)). The relation is very simple: there are 3 images
> per subject.
>
> My first idea was to create a class Subject like this:
> class Subject(tables.IsDescription):
>     # Subject information
>     Id = tables.UInt16Col()
>     ...
>     Image = tables.Float32Col(shape=IMAGE_SIZE)
>
> And then proceed as in the tutorial (open a file, create a group and a
> table associated with the Subject class, and then append data to this
> table).
>
> Unfortunately I got an error when creating the table (even before
> inserting data):
> HDF5-DIAG: Error detected in HDF5 (1.8.4-patch1) thread 140612945950464:
>   #000: ../../../src/H5Ddeprec.c line 170 in H5Dcreate1(): unable to
<snip>
|
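Building on Tim's suggestion, a minimal sketch of a per-patient file with the three image arrays plus the subject's CSV fields kept as attributes on the root group. The node and attribute names are illustrative assumptions, not part of the thread, and the 2.x-style method names follow Tim's example:

import numpy as np
import tables

IMAGE_SIZE = (121, 145, 121)

f = tables.openFile('patient_0001.h5', 'w')   # PyTables 2.x-style API
atom = tables.Float32Atom()
for name in ('image1', 'image2', 'image3'):
    arr = f.createCArray(f.root, name, atom, IMAGE_SIZE)
    arr[:] = np.random.random(IMAGE_SIZE).astype('float32')  # placeholder data

# Subject information from the CSV, stored as root-group attributes
f.root._v_attrs.subject_id = 1
f.root._v_attrs.age = 42
f.close()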
From: Anthony S. <sc...@gm...> - 2013-07-04 22:31:50
|
On Thu, Jul 4, 2013 at 4:13 PM, Mathieu Dubois <dub...@ya...> wrote:

> Hello,
>
> I'm a beginner with PyTables.
>
> I wanted to store a database in an HDF5 file using PyTables. The DB is
> made of a CSV file (which contains the subject information) and a lot of
> images (I work on MRI, so the images are 3-dimensional float32 arrays of
> shape (121, 145, 121)). The relation is very simple: there are 3 images
> per subject.
>
> My first idea was to create a class Subject like this:
>
> class Subject(tables.IsDescription):
>     # Subject information
>     Id = tables.UInt16Col()
>     ...
>     Image = tables.Float32Col(shape=IMAGE_SIZE)
>
> And then proceed as in the tutorial (open a file, create a group and a
> table associated with the Subject class, and then append data to this
> table).
>
> Unfortunately I got an error when creating the table (even before
> inserting data):
>
> HDF5-DIAG: Error detected in HDF5 (1.8.4-patch1) thread 140612945950464:
>   #000: ../../../src/H5Ddeprec.c line 170 in H5Dcreate1(): unable to create dataset
> <snip>
>   #014: ../../../src/H5Oalloc.c line 1032 in H5O_alloc(): object header message is too large
>     major: Object header
>     minor: Unable to initialize object
> <snip>
> tables.exceptions.HDF5ExtError: Problems creating the table
>
> I think that the size of the column is too large (if I remove the Image
> field, everything works perfectly).

Hi Mathieu,

This shouldn't be the case. What is the value of IMAGE_SIZE?

Be Well
Anthony

> Therefore, what is the best way to store the images (while keeping the
> relation)? I have read various posts about this subject on the web but
> could not find a definitive answer (the most helpful was
> http://stackoverflow.com/questions/8843062/python-how-to-store-a-numpy-multidimensional-array-in-pytables).
>
> I was thinking of creating an extensible array and storing each image in
> the same order as the subjects. However, I would feel more comfortable if
> the subject Id could be inserted too (to join the tables).
>
> Any help?
>
> Mathieu
|
From: Mathieu D. <dub...@ya...> - 2013-07-04 21:14:08
|
Hello,

I'm a beginner with PyTables.

I wanted to store a database in an HDF5 file using PyTables. The DB is made
of a CSV file (which contains the subject information) and a lot of images
(I work on MRI, so the images are 3-dimensional float32 arrays of shape
(121, 145, 121)). The relation is very simple: there are 3 images per
subject.

My first idea was to create a class Subject like this:

class Subject(tables.IsDescription):
    # Subject information
    Id = tables.UInt16Col()
    ...
    Image = tables.Float32Col(shape=IMAGE_SIZE)

And then proceed as in the tutorial (open a file, create a group and a
table associated with the Subject class, and then append data to this
table).

Unfortunately I got an error when creating the table (even before inserting
data):

HDF5-DIAG: Error detected in HDF5 (1.8.4-patch1) thread 140612945950464:
  #000: ../../../src/H5Ddeprec.c line 170 in H5Dcreate1(): unable to create dataset
    major: Dataset
    minor: Unable to initialize object
  #001: ../../../src/H5Dint.c line 428 in H5D_create_named(): unable to create and link to dataset
    major: Dataset
    minor: Unable to initialize object
  #002: ../../../src/H5L.c line 1639 in H5L_link_object(): unable to create new link to object
    major: Links
    minor: Unable to initialize object
  #003: ../../../src/H5L.c line 1862 in H5L_create_real(): can't insert link
    major: Symbol table
    minor: Unable to insert object
  #004: ../../../src/H5Gtraverse.c line 877 in H5G_traverse(): internal path traversal failed
    major: Symbol table
    minor: Object not found
  #005: ../../../src/H5Gtraverse.c line 703 in H5G_traverse_real(): traversal operator failed
    major: Symbol table
    minor: Callback failed
  #006: ../../../src/H5L.c line 1685 in H5L_link_cb(): unable to create object
    major: Object header
    minor: Unable to initialize object
  #007: ../../../src/H5O.c line 2677 in H5O_obj_create(): unable to open object
    major: Object header
    minor: Can't open object
  #008: ../../../src/H5Doh.c line 296 in H5O_dset_create(): unable to create dataset
    major: Dataset
    minor: Unable to initialize object
  #009: ../../../src/H5Dint.c line 1034 in H5D_create(): can't update the metadata cache
    major: Dataset
    minor: Unable to initialize object
  #010: ../../../src/H5Dint.c line 799 in H5D_update_oh_info(): unable to update new fill value header message
    major: Dataset
    minor: Unable to initialize object
  #011: ../../../src/H5Omessage.c line 188 in H5O_msg_append_oh(): unable to create new message in header
    major: Attribute
    minor: Unable to insert object
  #012: ../../../src/H5Omessage.c line 228 in H5O_msg_append_real(): unable to create new message
    major: Object header
    minor: No space available for allocation
  #013: ../../../src/H5Omessage.c line 1940 in H5O_msg_alloc(): unable to allocate space for message
    major: Object header
    minor: Unable to initialize object
  #014: ../../../src/H5Oalloc.c line 1032 in H5O_alloc(): object header message is too large
    major: Object header
    minor: Unable to initialize object
Traceback (most recent call last):
  File "00_build_dataset.tmp.py", line 52, in <module>
    dump_in_hdf5(**vars(args))
  File "00_build_dataset.tmp.py", line 32, in dump_in_hdf5
    data_api.Subject)
  File "/usr/lib/python2.7/dist-packages/tables/file.py", line 770, in createTable
    chunkshape=chunkshape, byteorder=byteorder)
  File "/usr/lib/python2.7/dist-packages/tables/table.py", line 832, in __init__
    byteorder, _log)
  File "/usr/lib/python2.7/dist-packages/tables/leaf.py", line 291, in __init__
    super(Leaf, self).__init__(parentNode, name, _log)
  File "/usr/lib/python2.7/dist-packages/tables/node.py", line 296, in __init__
    self._v_objectID = self._g_create()
  File "/usr/lib/python2.7/dist-packages/tables/table.py", line 983, in _g_create
    self._v_new_title, self.filters.complib or '', obversion )
  File "tableExtension.pyx", line 195, in tables.tableExtension.Table._createTable (tables/tableExtension.c:2181)
tables.exceptions.HDF5ExtError: Problems creating the table

I think that the size of the column is too large (if I remove the Image
field, everything works perfectly).

Therefore, what is the best way to store the images (while keeping the
relation)? I have read various posts about this subject on the web but
could not find a definitive answer (the most helpful was
http://stackoverflow.com/questions/8843062/python-how-to-store-a-numpy-multidimensional-array-in-pytables).

I was thinking of creating an extensible array and storing each image in
the same order as the subjects. However, I would feel more comfortable if
the subject Id could be inserted too (to join the tables).

Any help?

Mathieu
|
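The extensible-array idea from the last paragraph can keep the relation by row order: one EArray holds the images, and a small table holds the subject Ids in the same order, so row i of the table describes images[i]. A minimal sketch, assuming the 2.x-style names that appear in the traceback above and an illustrative one-column description:

import numpy as np
import tables

IMAGE_SIZE = (121, 145, 121)

class SubjectInfo(tables.IsDescription):
    # One row per image; row order matches the 'images' EArray below
    Id = tables.UInt16Col()

f = tables.openFile('db.h5', 'w')
images = f.createEArray(f.root, 'images', tables.Float32Atom(),
                        shape=(0,) + IMAGE_SIZE)   # extensible first axis
info = f.createTable(f.root, 'subjects', SubjectInfo)

img = np.zeros(IMAGE_SIZE, dtype='float32')        # placeholder image
images.append(img[np.newaxis, ...])                # append along axis 0
row = info.row
row['Id'] = 1
row.append()
info.flush()
f.close()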
From: Wagner S. <Seb...@ai...> - 2013-06-27 10:38:20
|
Dear PyTables users and devs,

I tried to figure out whether PyTables keeps indexes up to date on inserts
and updates, or whether I have to call reindex() or reindex_dirty() manually
after every change or series of changes, but I couldn't find any clear
statement in the docs or in the mailing list archives (the search function
provided by SourceForge is not very helpful).

If the index is always updated automatically: how can this be turned off,
to avoid the indexing overhead when applying a bunch of changes?

If the index is "dirty" after every change: could you state this clearly in
the docs (the SQL pages, and also the reference sections on create_index
and reindex)?

It is, IMO, unclear to the user/reader of the docs how indexes have to be
maintained.

Regards,
Sebastian
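The thread does not show an answer, but the documented behaviour appears to be that an indexed table is reindexed automatically on changes while its autoindex property is true, and that switching it off defers the work to an explicit reindex(). A minimal sketch of that pattern, assuming the 3.x-style names Sebastian uses:

import tables

class Record(tables.IsDescription):
    key = tables.Int64Col()

f = tables.open_file('indexed.h5', 'w')
t = f.create_table(f.root, 'records', Record)
t.cols.key.create_index()

t.autoindex = False        # defer index maintenance during bulk changes
row = t.row
for i in range(100000):
    row['key'] = i
    row.append()
t.flush()
t.reindex()                # rebuild the now-dirty index in one pass
t.autoindex = True
f.close()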
From: Andre' Walker-L. <wal...@gm...> - 2013-06-26 05:33:51
|
Hi Andreas, Josh, Anthony and Antonio,

Thanks for your help.

Andre

On Jun 26, 2013, at 2:48 AM, Antonio Valentino wrote:

> Hi Andre',
>
> On 25/06/2013 10:26, Andre' Walker-Loud wrote:
>> Dear PyTables users,
>>
>> I am trying to figure out the best way to write some metadata into some
>> files I have.
>>
>> The hdf5 file looks like
>>
>> /root/data_1/stat
>> /root/data_1/sys
>>
>> where "stat" and "sys" are Arrays containing statistical and systematic
>> fluctuations of numerical fits to some data I have. What I would like
>> to do is add another object
>>
>> /root/data_1/fit
>>
>> where "fit" is just a metadata key that describes all the choices I
>> made in performing the fit, such as the seed for the random number
>> generator, and many choices of fitting options, like initial guesses
>> for parameters, the fitting range, etc.
>>
>> I began to follow the example in the PyTables manual, in Section 1.2
>> "The Object Tree", where first a class is defined
>>
>> class Particle(tables.IsDescription):
>>     identity = tables.StringCol(itemsize=22, dflt=" ", pos=0)
>>     ...
>>
>> and then this class is used to populate a table.
>>
>> In my case, I won't have a table, but really just want a single object
>> containing my metadata. I am wondering if there is a recommended way to
>> do this? The "Table" does not seem optimal, but I don't see what else I
>> would use.
>>
>> Thanks,
>>
>> Andre
>
> For leaf nodes (Tables, Arrays, etc.) you can use the "attrs" attribute
> set [1] as described in [2].
> For group objects (like e.g. "root") you can use the "set_node_attr"
> method [3] of File objects, or "_v_attrs".
>
> cheers
>
> [1] http://pytables.github.io/usersguide/libref/declarative_classes.html#attributesetclassdescr
> [2] http://pytables.github.io/usersguide/tutorials.html#setting-and-getting-user-attributes
> [3] http://pytables.github.io/usersguide/libref/file_class.html#tables.File.set_node_attr
>
> --
> Antonio Valentino
|
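A short sketch of Antonio's suggestion as applied to the question: the fit choices become attributes on the data_1 group rather than a separate node. The attribute names mirror the question and are otherwise assumptions:

import tables

f = tables.open_file('fits.h5', 'a')
grp = f.create_group(f.root, 'data_1')

# Attributes on the group itself, via _v_attrs ...
grp._v_attrs.seed = 12345
grp._v_attrs.fit_range = (2, 15)         # illustrative fit options
grp._v_attrs.initial_guess = [1.0, 0.5]

# ... or equivalently via File.set_node_attr
f.set_node_attr(grp, 'fitter', 'least_squares')

print(f.get_node_attr(grp, 'seed'))      # -> 12345
f.close()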