iometer-devel Mailing List for Iometer (Page 4)

SourceForge Headquarters 1320 Columbia Street Suite 310 San Diego, CA 92101 +1 (858) 422-6466

Hello Vedran,
     Thank you for your time to reply. However it seems filesystem cache is
not the cause for this problem. I tried 2 times run after boot, and each is
around 28K (although a little less than yesterday). Regarding SSD, I was
trying to ask each read I/O to access different LBA to avoid to hit SSD
cache.

      Any further suggestion? Thanks!

Nai Yan.

2012/2/13 Vedran Degoricija <ve...@ya...>

> Nai Yan,
>
> Your program does not specify anything about the file caching attributes,
> so the data is most likely coming out of the filesystem cache. And if you
> have a bunch of threads scanning through the same set of LBAs, the SSD
> cache might be helping as well.
>
> Try running 1 thread with 1 iteration right after boot and see what
> numbers you get.
>
> Regards,
> Ved
>
>    *From:* Nai yan zhao <zha...@gm...>
> *To:* jo...@ei...
> *Cc:* Iom...@li...
> *Sent:* Sunday, February 12, 2012 6:59 PM
>
> *Subject:* Re: [Iometer-devel] Please advise - Why IOPS by IOMeter is
> much slower than windows multi-threading data fetch IOPS?
>
> Hello Joe,
>      Again, thank you for your reply!  I will take your suggestion and try
> again. But I am very looking forward to your further investigation on
> Windows system for my program.
>
>      I trust IOMeter, but I can't explain why and where's the problem with
> my program. And further speaking,  would you give me some comments?
>      1) What's the difference between IOmeter I/O calculation and my
> program (although it's much much simpler)? From the behavior of IOMeter, it
> also seems to create a file on target disk and MAYBE fetch data from that
> file by pre-defined I/O size and policy. If I am wrong?
>           If I am not wrong, then why there's so much difference.  Joe, by
> your experience, if my program has any big defect?
>
>      2) My major purpose is to have a program in our production env.
> ,which will frequently fetch data from SSD, and there are also some
> additional operations/work after data fetched - this is also why you see I
> put some additional work after each I/O (such as memory allocation and
> de-allocation in I/O calculation).
>          What I expect to see, its benchmark SHOULD be less than I/OMeter
> benchmark.
>
>      Would you advise more? Is there any big defect in my program for
> either doing file I/O or I/O calculation?
>
>      Thanks in advance!!
>
> Nai Yan.
>
>
>
> 2012/2/13 <jo...@ei...>
>
> Manufacturer's quoted sequential MB/s won't be with 512byte reads. In
> Iometer, try 256KB sequential reads with about 8 outstanding I/Os. That
> should come closer to the maximum throughput(I doubt you'll be able to get
> your laptop to actually get close to 520MB/s though).
>
> I'll see if I can find a windows system to try to compile/run your
> program, but I can't make any promises.
>
>
> Joe
>
>
> Quoting Nai yan zhao <zha...@gm...>:
>
>  Hello Joe,
>     Thank you again for your time!
>     It's wired that from IOMeter, the throughput for sequential IOPS
> (512B, queue depth is 64) is ONLY 42MB/s with around 82K IOPS.  However,
> from that SSD official website, this SSD sequential throughput should be
> around 510MB/s (
> http://www.plextoramericas.com/index.php/ssd/px-m3-series?start=1, my SSD
> is 128G). If there's any parameter I didn't set correctly in IOMeter?
>
>     As you suggested, I try to create a 12GB sample file (my test bed
> memory is 6GB and without RAID) and use 1 thread to do IO. The result
> is 33666; However, with I/O meter, it's 11572 (throughput this time is ONLY
> 5.93MB/s); IOPS still 3 times!!
>
>     I attach my IOMeter settings, if there's anything wrong? Also, I
> attach my modified code.  Joe, could you help again to see where's the
> problem?
>
>     Thank you so much!!
>
> Nai Yan.
>
> 2012/2/13 <jo...@ei...>
>
>  82K sounds reasonable for iops on an SSD. You should check the specs of
> your drive to see what you should expect.
>
> You need to remember that you are doing file i/o so you have several
> layers of cache involved. think of it was file cache -> block cache ->
> controller cache -> drive cache (you aren't testing a HW RAID, so you
> probably don't have cache in you controller) My personal run of thumb for
> random I/O is to have my file size be about 3x my combined cache size. For
> example, 4G ram in system, 512MB RAID cache, (8 drives*32MB) = 4.75GB I'd
> do a 16GB file.
>
> If in iometer you are accessing a PHYSICALDISK, then you are avoiding
> window's file cache.
>
> I just pulled up the code and (keep in mind I'm not much of a windows guy)
> something looks odd in your GetSecs routine. The cast to double is going to
> lose resolution, I think I would store the start/end times as
> LARGE_INTEGER. And you probably only have to call the frequency routine
> once
>
> Also windows used to have issues in the HAL where if a thread got moved to
> a different processor you'd get odd results. There is a Windows API call
> for setting affinity, similar to the linux sched_set_affinity.
>
> This doesn't really matter for what we are talking about, it is just a pet
> peeve of mine, your "delete c;" should be "delete [] c;" (are you intending
> tp be timing your allocator calls as well? you may be if you are simulating
> system performance, but typically for disk performance you'd try to
> preallocate as much as possible so your only timing the transfers)
>
>
> If it were me I would start with something simplier, (say single threaded
> sequential read) and see if your program gets the correct values then.  You
> could also fire up windows performance monitor and try to correlate to its
> counts as well (PHYSICALDISK transfers/sec).
>
> Good Luck,
>
> Joe
>
>
>
> Quoting Nai yan zhao <zha...@gm...>:
>
>  Hello Fabian and Joe,
>
>     Thank you so much for your reply.
>
>    Actually, what I am trying to do, is to split a file into 32 parts,
> and each part will be assigned to a thread to read. Each thread each time
> to open file, read 512B, and close file.  I was trying to avoid 2 read
> I/Os
> hit 1 block(512B) - i.e. to avoid cache in SSD (it's 128MB), although most
> read I/Os are ordered but not
> contiguous<http://en.**wikiped**ia.org/wiki/Contiguity#****
> Computer_science <http://wikipedia.org/wiki/Contiguity#**Computer_science>
> <http://en.**wikipedia.org/wiki/Contiguity#**Computer_science<http://en.wikipedia.org/wiki/Contiguity#Computer_science>>
>
>
> >
> .
>
>
>    By your suggestion, I tried 512B sequential I/O with settings below,
>
>    Max disk size - 8388608
>    # of Outstanding I/O - 32 (for 64, it's also around 82K)
>    Transfer request size - 512B,
>    100% sequential
>    Reply size - no reply
>    Align I/Os on - Sector boundaries
>
>     The result is around 82K, still much slower than my program.
>
>     If my program has any defect in calculating IOPS? Or if I have any
> misunderstanding of caching of SSD or file system, which causes my program
> fetches data most from RAM of SSD? Or what parameters I should set in I/O
> meter to simulate my program I/O?
>
>     Thank you again in advance for your time to help investigate it!!
>
> Nai Yan.
>
> 2012/2/11 Fabian Tillier <fa...@ti...>
>
>  If I read the test correctly, all threads start at offset 0, and then
>
>  perform 512b reads with a 1024b stride between reads.  As Joe said,
> this is pretty much sequential reading, and all threads are reading
> the same data, so most are likely to be satisifed from cache, either
> in the OS or on the SSD itself.  They'll do 320000/16=20000 IO
> operations total each, so end up reading 20MB of the file.  It's quite
> likely that the whole 20MB that you are reading will sit happilly in
> the file cache.
>
> Create an access pattern that mimics your app (512b sequential with
> 1024b stride), create 32 workers, and see if the results are similar.
> Best would be if you created a test file of 20MB, too.  You can then
> see how things compare if you go with async I/O and a single thread.
>
> Cheers,
> -Fab
>
> On Fri, Feb 10, 2012 at 5:40 AM,  <jo...@ei...> wrote:
> > Forgive me if I missed it, but I don't see any randomization in your
> > file reads.
> >
> > It looks like you just skip ahead so thread 0 reads the first
> > 512bytes, thread 1 the next 512b.  So any storage will be prefetching
> > very effectively.
> >
> > Tell Iometer to do sequential instead of random and see how much
> > closer the numbers are.  Or better yet, make your program randomize
> > its reads over the entire disk.
> >
> > Joe
> >
> >
> > Quoting Nai yan zhao <zha...@gm...>:
> >
> >> Greetings,
> >>      Could anybody help me a little out of my difficulty?
> >>
> >>      I have a SSD and I am trying to use it to simulate my program I/O
> >> performance, however, IOPS calculated from my program is much much
> faster
> >> than IOMeter.
> >>
> >>      My SSD is PLEXTOR PX-128M3S, by IOMeter, its max 512B random read
> >> IOPS is around 94k (queue depth is 32).
> >>      However my program (32 windows threads) can reach around 500k
> 512B
> >> IOPS, around 5 times of IOMeter!!! I did data validation but didn't
> find
> >> any error in data fetching. It's because my data fetching in order?
> >>
> >>      I paste my code belwo (it mainly fetch 512B from file and release
> it;
> >> I did use 4bytes (an int) to validate program logic and didn't find
> >> problem), can anybody help me figure out where I am wrong?
> >>
> >>      Thanks so much in advance!!
> >>
> >> Nai Yan.
> >>
> >> #include <stdio.h>
> >> #include <Windows.h>
> >> /*
> >> **  Purpose: Verify file random read IOPS in comparison with IOMeter
> >> **  Author:  Nai Yan
> >> **  Date:    Feb. 9th, 2012
> >> **/
> >> //Global variables
> >> long completeIOs = 0;
> >> long completeBytes = 0;
> >> int  threadCount = 32;
> >> unsigned long long length = 1073741824;                  //4G test
> file
> >> int interval = 1024;
> >> int resultArrayLen = 320000;
> >> int *result = new int[resultArrayLen];
> >> //Method declarison
> >> double GetSecs(void);            //Calculate out duration
> >> int InitPool(long long,char*,int);         //Initialize test data for
> >> testing, if successful, return 1; otherwise, return a non 1 value.
> >> int * FileRead(char * path);
> >> unsigned int DataVerification(int*, int sampleItem);
> >> //Verify data fetched from pool
> >> int main()
> >> {
> >> int sampleItem = 0x1;
> >> char * fPath = "G:\\workspace\\4G.bin";
> >> unsigned int invalidIO = 0;
> >> if (InitPool(length,fPath,****sampleItem)!= 1)
>
> >>   printf("File write err... \n");
> >> //start do random I/Os from initialized file
> >> double start = GetSecs();
> >> int * fetchResult = FileRead(fPath);
> >>  double end = GetSecs();
> >> printf("File read IOPS is %.4f per second.. \n",completeIOs/(end -
> start));
> >> //start data validation, for 4 bytes fetch only
> >> // invalidIO = DataVerification(fetchResult,****sampleItem);
>
> >> // if (invalidIO !=0)
> >> // {
> >> // printf("Total invalid data fetch IOs are %d", invalidIO);
> >> // }
> >> return 0;
> >> }
> >>
> >>
> >> int InitPool(long long length, char* path, int sample)
> >> {
> >> printf("Start initializing test data ... \n");
> >> FILE * fp = fopen(path,"wb");
> >> if (fp == NULL)
> >> {
> >> printf("file open err... \n");
> >> exit (-1);
> >> }
> >> else //initialize file for testing
> >> {
> >> fseek(fp,0L,SEEK_SET);
> >> for (int i=0; i<length; i++)
> >> {
> >> fwrite(&sample,sizeof(int),1,****fp);
>
> >> }
> >> fclose(fp);
> >> fp = NULL;
> >> printf("Data initialization is complete...\n");
> >> return 1;
> >> }
> >> }
> >> double GetSecs(void)
> >> {
> >>    LARGE_INTEGER frequency;
> >>    LARGE_INTEGER start;
> >>    if(! QueryPerformanceFrequency(&****frequency))
> >>        printf("****QueryPerformanceFrequency Failed\n");
> >>    if(! QueryPerformanceCounter(&****start))
> >>        printf("****QueryPerformanceCounter Failed\n");
> >>  return ((double)start.QuadPart/(****double)frequency.QuadPart);
>
> >> }
> >> class input
> >> {
> >> public:
> >> char *path;
> >> int starting;
> >> input (int st, char * filePath):starting(st),path(****filePath){}
>
> >> };
> >> //Workers
> >> DWORD WINAPI FileReadThreadEntry(LPVOID lpThreadParameter)
> >> {
> >> input * in = (input*) lpThreadParameter;
> >> char* path = in->path;
> >> FILE * fp = fopen(path,"rb");
> >> int sPos = in->starting;
> >> // int * result = in->r;
> >> if(fp != NULL)
> >> {
> >> fpos_t pos;
> >> for (int i=0; i<resultArrayLen/threadCount;****i++)
>
> >> {
> >> pos = i * interval;
> >> fsetpos(fp,&pos);
> >> //For 512 bytes fetch each time
> >> unsigned char *c =new unsigned char [512];
> >> if (fread(c,512,1,fp) ==1)
> >> {
> >> InterlockedIncrement(&****completeIOs);
>
> >> delete c;
> >> }
> >> //For 4 bytes fetch each time
> >> /*if (fread(&result[sPos + i],sizeof(int),1,fp) ==1)
> >> {
> >> InterlockedIncrement(&****completeIOs);
>
> >> }*/
> >> else
> >> {
> >> printf("file read err...\n");
> >> exit(-1);
> >> }
> >> }
> >> fclose(fp);
> >> fp = NULL;
> >> }
> >> else
> >> {
> >> printf("File open err... \n");
> >> exit(-1);
> >> }
> >> }
> >> int * FileRead(char * p)
> >> {
> >> printf("Starting reading file ... \n");
> >>  HANDLE mWorkThread[256];                      //max 256 threads
> >> completeIOs = 0;
> >>  int slice = int (resultArrayLen/threadCount);
> >> for(int i = 0; i < threadCount; i++)
> >> {
> >> mWorkThread[i] = CreateThread(
> >> NULL,
> >> 0,
> >> FileReadThreadEntry,
> >> (LPVOID)(new input(i*slice,p)),
> >> 0,
> >> NULL);
> >> }
> >>   WaitForMultipleObjects(****threadCount, mWorkThread, TRUE,
> INFINITE);
>
> >>   printf("File read complete... \n");
> >>   return result;
> >> }
> >> unsigned int DataVerification(int* result, int sampleItem)
> >> {
> >> unsigned int invalid = 0;
> >> for (int i=0; i< resultArrayLen/interval;i++)
> >> {
> >> if (result[i]!=sampleItem)
> >> {
> >> invalid ++;
> >> continue;
> >> }
> >> }
> >> return invalid;
> >> }
> >>
> >
> >
> >
> >
> >
> ------------------------------****----------------------------**--**
>
> ------------------
> > Virtualization & Cloud Management Using Capacity Planning
> > Cloud computing makes use of virtualization - but cloud computing
> > also focuses on allowing computing to be delivered as a service.
> > http://www.accelacomm.com/jaw/****sfnl/114/51521223/<http://www.accelacomm.com/jaw/**sfnl/114/51521223/>
> <http://**www.accelacomm.com/jaw/sfnl/**114/51521223/<http://www.accelacomm.com/jaw/sfnl/114/51521223/>
> >
> > ______________________________****_________________
> > Iometer-devel mailing list
> > Iometer-devel@lists.**sourcefo**rge.net <http://sourceforge.net/><
> Iometer-devel@lists.**sourceforge.net<Iom...@li...>
> >
> > https://lists.sourceforge.net/****lists/listinfo/iometer-devel<https://lists.sourceforge.net/**lists/listinfo/iometer-devel>
> **<https://lists.sourceforge.**net/lists/listinfo/iometer-**devel<https://lists.sourceforge.net/lists/listinfo/iometer-devel>
> >
>
>
>
>
>
>
>
>
>
>
>
>
>
> ------------------------------------------------------------------------------
> Try before you buy = See our experts in action!
> The most comprehensive online learning library for Microsoft developers
> is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3,
> Metro Style Apps, more. Free future releases when you subscribe now!
> http://p.sf.net/sfu/learndevnow-dev2
>
> _______________________________________________
> Iometer-devel mailing list
> Iom...@li...
> https://lists.sourceforge.net/lists/listinfo/iometer-devel
>
>
>

2003	Jan	Feb (17)	Mar (26)	Apr (3)	May (20)	Jun (3)	Jul (22)	Aug (15)	Sep (3)	Oct (12)	Nov (1)	Dec (6)
2004	Jan (4)	Feb (11)	Mar (14)	Apr (48)	May (14)	Jun (24)	Jul (38)	Aug (17)	Sep (29)	Oct (13)	Nov (19)	Dec (21)
2005	Jan (16)	Feb (14)	Mar (23)	Apr (36)	May (15)	Jun (13)	Jul (39)	Aug (29)	Sep (5)	Oct (2)	Nov (13)	Dec (8)
2006	Jan (6)	Feb (12)	Mar (8)	Apr (34)	May (8)	Jun (36)	Jul (8)	Aug (22)	Sep (16)	Oct (54)	Nov (33)	Dec (16)
2007	Jan (8)	Feb (18)	Mar (6)	Apr	May (1)	Jun (12)	Jul (2)	Aug (10)	Sep (31)	Oct (1)	Nov (5)	Dec (3)
2008	Jan (3)	Feb (6)	Mar (33)	Apr (21)	May	Jun (8)	Jul (1)	Aug (1)	Sep	Oct (13)	Nov (2)	Dec (4)
2009	Jan (3)	Feb (2)	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec
2010	Jan	Feb	Mar (1)	Apr (1)	May (2)	Jun (1)	Jul	Aug (12)	Sep (4)	Oct	Nov (4)	Dec (2)
2011	Jan (11)	Feb	Mar	Apr (1)	May	Jun	Jul (5)	Aug (3)	Sep (9)	Oct	Nov	Dec
2012	Jan	Feb (19)	Mar (7)	Apr (2)	May (7)	Jun	Jul (3)	Aug	Sep (2)	Oct (1)	Nov	Dec
2013	Jan	Feb	Mar	Apr	May (3)	Jun	Jul	Aug (6)	Sep	Oct	Nov	Dec (1)
2014	Jan (21)	Feb	Mar	Apr	May	Jun (1)	Jul (2)	Aug (2)	Sep	Oct (1)	Nov	Dec
2015	Jan	Feb (1)	Mar (3)	Apr (1)	May	Jun	Jul	Aug	Sep (2)	Oct (3)	Nov	Dec (1)
2025	Jan	Feb	Mar (1)	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec

iometer-devel Mailing List for Iometer (Page 4)

iometer-devel — Dedicated for people who are contributing to the project / are interested in.