From: Fabian M. <fa...@my...> - 2009-07-29 01:27:37
|
Patrick, the tools loads the data and map into memory. with a 32bit jvm you can only allocate about 1.3-1.5GB of memory independent of what is in your machine. with a 64bit jvm more is possible. the code could be changed to not load the data into memory to deal with larger files. the standard SOM algorithms we implemented to not accept a distance matrix as an input but repeatedly use the data to train the map. I know there are distance matrix algorithms but they are not in our tool. not that distance matrix methods have a complexity of O(n^2) while the standard SOM algorithms have O(n). Of course for particular datasets either one could be faster, in particular for extremely high dimensionality as you have described. yes you can save a png of the maps. hope this helps fabian Patrick De Maziere wrote: > Hello, > > I want to use the ESOM tools starting from a distance matrix since I question > whether the tools can handle > 7000 vectors with a dimensionality > 800000. I > used the Euclidean distance metric to construct that distance matrix but I > now wonder what the next steps are to create a U-matrix map and if I can run > everything from batch (another program is dealing with these huge datasets > and with the visualisation of any result). So, I like also to know whether I > can "export" the U-map to a PNG or other graphic format. > > (I tried with a smaller data set (100 vectors, dimension +/- 50000, but even > then he sometimes fails due to memory problems: out of memory, and setting > the memory parameter in the script to more than 3GB, e.g., Xmx4096m resulted > in an error (of the program?) that he could not handle such sizes ....) > > Many thanks in advance and best regards, > Patrick > > ------------------------------------------------------------------------------ > Register Now for Creativity and Technology (CaT), June 3rd, NYC. CaT > is a gathering of tech-side developers & brand creativity professionals. Meet > the minds behind Google Creative Lab, Visual Complexity, Processing, & > iPhoneDevCamp as they present alongside digital heavyweights like Barbarian > Group, R/GA, & Big Spaceship. http://p.sf.net/sfu/creativitycat-com > _______________________________________________ > Databionic-ESOM-User mailing list > Dat...@li... > https://lists.sourceforge.net/lists/listinfo/databionic-esom-user > > > |