Quick Start¶
This guide walks you through your first download with EGAfetch.
Prerequisites¶
- An EGA account with access to at least one dataset
- EGAfetch binary installed (see Installation)
Step 1: Log In¶
Verify your session:
Step 2: Explore a Dataset¶
List the files in a dataset:
Output:
File ID Size Check File Name
---------------------------------------------------------------------------
EGAF00001104661 500.0 MB MD5 SLX-9630.A006.bwa.bam
EGAF00001104662 320.0 MB MD5 SLX-9630.A007.bwa.bam
...
60 files, 25.3 GB total
Step 3: Download¶
Download the entire dataset:
Or download specific files:
Or download from a text file with identifiers (one per line, # comments allowed):
You will see live progress for each file:
Downloading 60 file(s) to ./my-data
SLX-9630.A006.bwa.bam [========> ] 45% 225.0 MB / 500.0 MB
SLX-9630.A007.bwa.bam [====> ] 22% 70.4 MB / 320.0 MB
SLX-9631.A001.bwa.bam [waiting...]
Step 4: Resume (If Interrupted)¶
If the download is interrupted (Ctrl+C, network failure, etc.), simply re-run the same command:
Completed files are skipped, partial files resume from the last byte.
Step 5: Verify¶
Re-verify checksums of all completed files:
Step 6: Clean Up¶
Remove temporary chunk files (keeps your completed downloads):
Using a Config File Everywhere¶
Pass --cf to any command to skip interactive login:
egafetch download EGAD00001001938 -o ./data --cf credentials.json
egafetch list EGAD00001001938 --cf credentials.json
egafetch metadata EGAD00001001938 --cf credentials.json
This is especially useful for scripts and batch jobs on HPC clusters.
Next Steps¶
- Download command reference -- all flags and tuning options
- Metadata export -- export dataset metadata as TSV/CSV/JSON
- Configuration -- credentials and config file details