sfscode-updates Mailing List for sfs_code
Brought to you by:
luricchio,
ryan_hernandez
You can subscribe to this list here.
2008 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
(5) |
Jul
(1) |
Aug
(1) |
Sep
|
Oct
|
Nov
|
Dec
|
---|---|---|---|---|---|---|---|---|---|---|---|---|
2009 |
Jan
|
Feb
(1) |
Mar
|
Apr
|
May
|
Jun
|
Jul
|
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
2010 |
Jan
(1) |
Feb
|
Mar
|
Apr
|
May
|
Jun
|
Jul
|
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
2014 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
|
Jul
(1) |
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
2015 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
|
Jul
|
Aug
(1) |
Sep
|
Oct
|
Nov
|
Dec
|
From: Ryan H. <rh...@gm...> - 2015-08-20 11:55:50
|
Dear SFS_CODE'ers, I've posted an update to SFS_CODE to the project page. There are several updates, including bug fixes and new features. You can read about them in the Release Notes pasted below. Let me know if you have any questions. Happy simulating! Ryan **** CURRENT RELEASE: **** DATE: 2015-08-15 NEW FEATURES: Generally much faster. I made a few improvements to the way data structures are handled. optimizeLL.pl When simulating long loci, SFS_CODE can be slow. This is because it stores only unique haplotypes, and there can be many unique haplotypes with a long locus. By dividing the locus into chunks, you gain the ability to store far few haplotypes. Optimizing this used to be a fairly arbitrary process. I've created a heuristic solution to this problem called optimizeLL.pl. It will run a very simple version of your desired constraints (population size, locus length, etc), and find the optimal partitioning of the loci to minimize runtime. --VCF now output directly to VCF files. --selDistType (-W) ... [H <h>] ... This option now allows dominance! Note that h=1.0 is codominance for backwards compatibility. Arbitrary models of dominance are allowed, using the same distributions as the selection coefficients (1: constant, 2: gamma distribution, 3: normal distribution). --mutation ... [H <h>] ... [F <file>] This option now allows dominance [H] as above, and the ability to simulate a conditional trajectory. The idea is that you can generate a trajectory using another software (or outputing the trajectory from an SFS_CODE simulation), and input it here to fix the path of a given mutation and simulate linked loci under the resulting structured population. BUG FIXES: --indel and --longIndel no longer delete more sequence than is possible. Previously, it was possible to have a 10bp deletion that occured at the last position of the simulated sequence, effectively deleting DNA from the end of the chromosome. Thanks to William Nicholson for pointing this out. The way --neutPop worked did not maintain neutrality in the indicated populations in some situations. This has been fixed. Many fixes in convertSFS_CODE, which had caused may people problems... My apologies for taking so long! |
From: Ryan H. <rh...@gm...> - 2014-07-19 21:01:19
|
Hi Folks, It has been a while, but I've made some significant updates to SFS_CODE! You can find a description in the release notes below. This release includes some new features, and importantly, some significant optimizations to the code. It is now tremendously faster than it was, and returns SFS_CODE to being one of the fastest forward simulation programs out there! Best, Ryan DATE: 2014-07-19 This release includes two primary bug fixes, a couple of additional features, and a major efficiency optimization. BUGS: Bug 1: Time to fixation conditional on fixation. This should affect only simulations where a large number of bases are simulated (e.g., 1Mb or larger), and you are looking at the time to fixation conditional on fixation for mutations in a small simulated chunk (e.g., 1kb). This could arise when you are interested in a locus surrounded by deleterious background, or if you are simulating an exome. The previous version reported fixation times that were too large. This did not impact segregating variants. Bug 2: Recombination. Previously, when the number of recombination events were low (e.g., when the simulated number of recombination events entering a population in a given generation was < 0.1*N for the population), the recombination events were only being distributed amongst females (though both maternal and paternal chromosomes were recombining, only females inherited recombinant chromosomes). This has been resolved. NEW FEATURES: Output Samples: You can now output samples throughout the simulation. This is implemented as an extension of the -n flag to a -Tn timed event. Note that -n still works as before, but if you use -Tn, you can now indicate specific times to sample the population, or specify a regular interval to sample the population. The -Tn option changes the output slightly, adding a flag like "//gen=XX" prior to the sequence data at that time point. The final sample is still taken and reported as usual. Here is the updated description: -n* --sampSize [P <pop>] [R <tau_d>] [S <tau_1>]] <SS1> [<SS2>...<SSNpops>] set the number of individuals sampled from a population. When used as -Tn, a time must be specified for sampling, and opens the ability for [R]ecurrent sampling starting at the pre-specified time, and repeat every tau_d*PN_A generations until the end of the simulation, or a specified [S]top time tau_1. Note that the sample size must be specified at the end (either a single number to be applied to all populations or include a number for every population). You can use -Tn multiple times. Soft Sweeps: You can now do a form of selection on standing variation in SFS_CODE. This is implemented through the -TW argument. You need to specify the time that selection starts, the allele frequency, and the type of selection you want to act on it. SFS_CODE will find a random allele that ideally is at the specified frequency F +/-0.05F. If a SNP cannot be found in this range, the SNP nearest in frequency will be used. Here is the updated description: -W* --selDistType [P <pop>] [L <locus>] [F <allele_freq> [w] [T [R <min_freq> <max_freq> [S]] [A [G <gens>]] [M <max_reps>] [F [a] <file>]]] <type> [args] set distribution of selective effects. See documentation for proper usage of <type> [args]. Set the distribution for a [P]opulation or [L]ocus. In -TW mode, change selection coefficient of an existing polymorphism with a particular [F]requency. If a frequency is specified, you can [T]rack the allele to ensure that it achieves a particular frequency [R]ange at time of sampling ([S]topping the first time in range), [A]utomatically restart if the allele is lost (up to a [M]aximal number of tries), and output the trajectory to a [F]ile (or [a]ppend to existing). Note that tracking a mutation here prevents you from tracking a locus using --trackTrajectory. Unfortunately, tracking only occurs after selection coefficient changes. OPTIMIZATIONS: For simulations with long sequences and/or large numbers of individuals, SFS_CODE was rather slow. This is partly because there were a few redundant calculations being performed on each mutation every generation. As generations went on and the number of variants (both fixed and polymorphic) grew, this led to a roughly exponential growth in computation time per generation. Eliminating these redundant calculations has resulted in a DRAMATIC increase in efficiency for long sequences and/or large numbers of individuals. |
From: Ryan H. <rhe...@uc...> - 2010-01-20 23:25:48
|
Hello SFS_CODEers, Several new features (some long awaited) to announce!! 1) --admixture (-TJ) You can now directly simulate admixture! The options are extremely flexible, allowing for an arbitrary number of ancestral populations, differing representation of males and females from ancestral populations, etc. 2) --mutation Introduce a new mutation at any time, with a user-specified selection coefficient! 3) --trackAncestry This allows you to know the true genetic ancestry of individuals at every site when you are simulating with migration or admixture. Very handy to know the exact widths and locations of each admixture block. 4) --trackTrajectory Follow the frequency trajectory of a mutation generation by generation. This option can also be used to specify the final frequency of a derived allele at the time of sampling, and automatically restart the simulation if all the mutations you are tracking are fixed or lost. Updates: I have fixed the way that convertSFS_CODE creates ms-style output so that the output can be used directly with sample_stats (supplied with the ms download). Download the new version, and try out the new features: http://sfscode.sourceforge.net/ |
From: Ryan H. <rh...@gm...> - 2009-02-20 04:25:23
|
A new version of sfs_code has been released! The major new addition is gene conversion, which can also allow for biased gene conversion. I've also added a few other features such as an additive model of selection, which are all described in the users manual. As always, comments are very welcome. Happy simulating! -Ryan ---------- Forwarded message ---------- From: SourceForge.net <no...@so...> Date: Thu, Feb 19, 2009 at 6:19 PM Subject: [SourceForge.net Release] sfscode : sfscode To: no...@so... Project: sfs_code (sfscode) Package: sfscode Date : 2009-02-19 19:19 Project "sfs_code" ('sfscode') has released the new version of package 'sfscode'. You can download it from SourceForge.net by following this link: < https://sourceforge.net/project/showfiles.php?group_id=211239&release_id=662521 > or browse Release Notes and ChangeLog by visiting this link: <https://sourceforge.net/project/shownotes.php?release_id=662521> You receive this email because you requested to be notified when new versions of this package were released. If you don't wish to be notified in the future, please login to SourceForge.net and click this link: < https://sourceforge.net/project/filemodule_unmonitor.php?filemodule_id=253795 > If you lost your SourceForge.net login name or password, refer to this document: <http://p.sf.net/sitedocs/accountproblems> Note that you may receive this message indirectly via one of your mailing list subscriptions. Please review message headers before reporting unsolicited mailings. -- *************************************** Ryan D. Hernandez Postdoctoral Researcher Human Genetics University of Chicago 418 CLSC Chicago, IL 60637 email: rhe...@uc... phone: (773) 834-8932 |
From: Ryan H. <rhe...@uc...> - 2008-08-11 18:02:12
|
Through ongoing testing of sfs_code, I discovered a bug when exactly 2 loci were simulated with partial linkage (no recombination events were thrown down between the loci). I've corrected the error, and posted a new version on the website below. I will be submitting a manuscript on sfs_code in the next day or two. Happy Simulating! -Ryan ---------- Forwarded message ---------- From: SourceForge.net <no...@so...> Date: Mon, Aug 11, 2008 at 12:14 PM Subject: [SourceForge.net Release] sfscode : sfscode To: no...@so... Project: sfs_code (sfscode) Package: sfscode Date : 2008-08-11 12:14 Project "sfs_code" ('sfscode') has released the new version of package 'sfscode'. You can download it from SourceForge.net by following this link: < https://sourceforge.net/project/showfiles.php?group_id=211239&release_id=619186 > or browse Release Notes and ChangeLog by visiting this link: <https://sourceforge.net/project/shownotes.php?release_id=619186> You receive this email because you requested to be notified when new versions of this package were released. If you don't wish to be notified in the future, please login to SourceForge.net and click this link: < https://sourceforge.net/project/filemodule_unmonitor.php?filemodule_id=253795 > If you lost your SourceForge.net login name or password, refer to this document: <https://sourceforge.net/docman/display_doc.php?docid=760&group_id=1> Note that you may receive this message indirectly via one of your mailing list subscriptions. Please review message headers before reporting unsolicited mailings. -- *************************************** Ryan D. Hernandez Postdoctoral Researcher Human Genetics University of Chicago 418 CLSC Chicago, IL 60637 email: rhe...@uc... phone: (773) 834-8932 |
From: Ryan H. <rh...@gm...> - 2008-07-29 23:09:19
|
---------- Forwarded message ---------- From: SourceForge.net <no...@so...> Date: Tue, Jul 29, 2008 at 5:59 PM Subject: [SourceForge.net Release] sfscode : sfscode To: no...@so... Project: sfs_code (sfscode) Package: sfscode Date : 2008-07-29 17:59 Project "sfs_code" ('sfscode') has released the new version of package 'sfscode'. You can download it from SourceForge.net by following this link: < https://sourceforge.net/project/showfiles.php?group_id=211239&release_id=616488 > or browse Release Notes and ChangeLog by visiting this link: <https://sourceforge.net/project/shownotes.php?release_id=616488> You receive this email because you requested to be notified when new versions of this package were released. If you don't wish to be notified in the future, please login to SourceForge.net and click this link: < https://sourceforge.net/project/filemodule_unmonitor.php?filemodule_id=253795 > If you lost your SourceForge.net login name or password, refer to this document: <https://sourceforge.net/docman/display_doc.php?docid=760&group_id=1> Note that you may receive this message indirectly via one of your mailing list subscriptions. Please review message headers before reporting unsolicited mailings. -- *************************************** Ryan D. Hernandez Postdoctoral Researcher Human Genetics University of Chicago 418 CLSC Chicago, IL 60637 email: rhe...@uc... phone: (773) 834-8932 |
From: Ryan H. <rh...@gm...> - 2008-06-30 17:23:35
|
The previous post did not go through, so I'm trying my gmail account. Let me know if you actually receive this email. Thanks! Ryan On Sun, Jun 29, 2008 at 4:03 PM, Ryan Hernandez <rhe...@uc...> wrote: > SFS_CODE had a memory leak when performing multiple simulations with > multiple populations across multiple loci. This should not have affected > any previous results, just makes the program a bit more sound. Thanks > valgrind! > > A new release has been posted on sourceforge: sfscode.sourceforge.net. > > happy simulating! > > Ryan > -- *************************************** Ryan D. Hernandez Postdoctoral Researcher Human Genetics University of Chicago 418 CLSC Chicago, IL 60637 email: rhe...@uc... phone: (773) 834-8932 |
From: Ryan H. <rhe...@uc...> - 2008-06-29 21:03:23
|
SFS_CODE had a memory leak when performing multiple simulations with multiple populations across multiple loci. This should not have affected any previous results, just makes the program a bit more sound. Thanks valgrind! A new release has been posted on sourceforge: sfscode.sourceforge.net. happy simulating! Ryan |
From: Ryan H. <rh...@gm...> - 2008-06-18 22:15:03
|
Hello, As it turns out, I'm still working out the kinks on putting files together for upload. I've you've already downloaded SFS_CODE, please download again (one of the files was not right, and the documentation left out the information on simulating indels). If not, then no worries. Cheers! Ryan http://sfscode.sourceforge.net |
From: Ryan H. <rh...@gm...> - 2008-06-18 14:49:33
|
The SFS_CODE source code has finally been released! It is not quite "pretty" code, but it has been tested against coalescent simulations in a wide range of cases and shown to be quite accurate. I've also compared site-frequency spectra under various selective effects with and without demography to their Poisson Random Field expectations. In each case, the program performs very well. The details of these simulation tests will be posted on the website at http://sfscode.sourceforge.net (or on the sourceforge wiki if I get adventurous) in the near future. I'll also include performance statistics so you can get a sense for how long a simulation will take, and how much memory will be required for various population sizes and sequence lengths. Happy Simulating!! Ryan |