init_folder_structure¶
Creates a basic Arx folders structure.
Overview¶
The init_folder_structure command creates the foundational directory structure required for Arx genome data organization. This is typically the first step in setting up a new Arx project.
Usage¶
export FOLDER_STRUCTURE=/path/to/folder_structure
init_folder_structure
Or specify the directory directly:
init_folder_structure --folder_structure_dir=/path/to/folder_structure
Created Structure¶
After running the command, you'll have the following folder structure:
folder_structure
├── organisms
├── annotations.json
├── annotation-descriptions
│ ├── SL.tsv
│ ├── KO.tsv
│ ├── KR.tsv
│ ├── EC.tsv
│ └── GO.tsv
├── orthologs
└── pathway-maps
├── type_dictionary.json
└── svg
Directory Descriptions¶
organisms/: Contains individual organism directories with their genome dataannotations.json: Central annotation configuration fileannotation-descriptions/: Contains description files for different annotation typesSL.tsv: Subcellular localization annotationsKO.tsv: KEGG Orthology annotationsKR.tsv: KEGG Reaction annotationsEC.tsv: Enzyme Commission annotationsGO.tsv: Gene Ontology annotationsorthologs/: Stores orthology analysis resultspathway-maps/: Contains pathway visualization filestype_dictionary.json: Pathway type definitionssvg/: SVG pathway map files
Next Steps¶
Once the folder structure has been initiated:
- Use
import_genometo add genomes to the folder structure - Use
download_ncbi_genomeandimport_genometo download and add genomes from NCBI - When all genomes have been added, use
init_orthofinderandimport_orthofinderto calculate orthologs (optional)
Help¶
init_folder_structure --help