Name | Modified | Size | Downloads / Week |
---|---|---|---|
Parent folder | |||
FT5.txt | 2018-07-30 | 524.7 kB | |
FT4.txt | 2018-07-30 | 1.8 MB | |
FT3.txt | 2018-07-30 | 1.6 MB | |
FT2.txt | 2018-07-30 | 116.7 kB | |
FT1.txt | 2018-07-30 | 647.5 kB | |
CG5.fasta | 2018-07-30 | 2.5 MB | |
CG4.fasta | 2018-07-30 | 4.3 MB | |
CG3.fasta | 2018-07-30 | 4.1 MB | |
CG2.fasta | 2018-07-30 | 649.9 kB | |
CG1.fasta | 2018-07-30 | 2.6 MB | |
CDS5.txt | 2018-07-30 | 2.6 MB | |
CDS4.txt | 2018-07-30 | 4.4 MB | |
CDS3.txt | 2018-07-30 | 4.0 MB | |
CDS2.txt | 2018-07-30 | 647.5 kB | |
CDS1.txt | 2018-07-30 | 2.9 MB | |
Totals: 15 Items | 33.3 MB | 0 |
You can find here all script files and supplementary materials of the paper entitled "Análise de composição de conjunto de treinamento para avaliação de aprendizagem de máquina aplicada à predição de genes", available at ... Files were organized into 2 folders, as follows: | docs: Folder which contains supplementary docs with additional information ---| Results_by_Species.pdf: File which contains all results by species ---| Genomes.pdf: File which contains all genome names used | scripts: Folder which contains the scripts used to predict genes in metagenomic sequences ---| input: Folder with annotated genomes ---|train: Folder with annotated genomes used to train ---|test: Folder with annotated genomes used to test ---| output: Folder with generated models ---| dataPreProcessing.R: Script to preprocess data ---| extractSequences.R: Script to extract sequences ---| functions.R: Script which contains all functions used ---| getTableFeatures.R: Script to get a table with features from data ---| main.R: Script to run train and test scripts ---| modelGenerating.R: Scrit to generate models ---| PaperPrediction.Rproj: Gene prediction project ---| prediction.R: Script to predict genes ---| readFragGeneScanResults.R: Script to read FragGeneScan results ---| retirar_ids.sh: Script to remove ids