Pull Genes

Command

ncbi:pull-genes

Source

This command uses the following file as the source for a list of Gene instances:

ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/GENE_INFO/Mammalia/Homo_sapiens.gene_info.gz
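
As a rough illustration of reading this kind of source, the sketch below parses a gzip-compressed, tab-separated stream whose first line is a header beginning with '#'. That layout, the column names, and the helper name read_gene_rows are assumptions made for this example and should be checked against the actual file. The sketch is not the command's own implementation, and it reads a trimmed in-memory sample rather than downloading anything.

```python
import csv
import gzip
import io


def read_gene_rows(binary_stream):
    """Yield one dict per data row of a gzip-compressed, tab-separated stream.

    Assumes the first line is a header that starts with '#'.
    """
    with gzip.open(binary_stream, mode="rt", encoding="utf-8", newline="") as text:
        # Strip the line ending and the leading '#' to get the column names.
        header = text.readline().rstrip("\r\n").lstrip("#").split("\t")
        reader = csv.reader(text, delimiter="\t", quoting=csv.QUOTE_NONE)
        for fields in reader:
            if fields:  # skip blank lines
                yield dict(zip(header, fields))


# Trimmed, illustrative sample with only four columns (the real file has more).
sample = "#tax_id\tGeneID\tSymbol\tchromosome\n9606\t1\tA1BG\t19\n9606\t2\tA2M\t12\n"
rows = list(read_gene_rows(io.BytesIO(gzip.compress(sample.encode("utf-8")))))
```

Reading through a generator keeps memory use flat, which matters when the rows are processed one at a time.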

Threading & Synchronization

This command assumes that Gene instances may already exist, so it checks for an existing Gene by symbol before persisting a new instance. Because of this check-then-persist step, the command does not use multithreading: processing records sequentially avoids race conditions, such as two threads both finding a symbol missing and both persisting it.
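
The check-then-persist pattern described above can be sketched as follows. This is a minimal illustration, not the command's code: an in-memory set stands in for the real data store, and the function name persist_new_genes and the "Symbol" key are hypothetical.

```python
def persist_new_genes(rows, existing_symbols):
    """Process rows one at a time and return the symbols that were added.

    existing_symbols is a set standing in for the persisted Gene instances.
    """
    added = []
    for row in rows:
        symbol = row["Symbol"]
        if symbol in existing_symbols:  # existence check by symbol
            continue
        existing_symbols.add(symbol)  # stands in for persisting a new Gene
        added.append(symbol)
    return added


existing = {"A1BG"}
rows = [{"Symbol": "A1BG"}, {"Symbol": "A2M"}, {"Symbol": "A2M"}]
added = persist_new_genes(rows, existing)
```

Because a single loop performs each check and the write that follows it, no other worker can insert the same symbol between the two steps. A concurrent version would need a lock or a unique constraint on the symbol to give the same guarantee.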

Filtering

A gene is ignored if its chromosome value in the source file is '-' or 'Un'.
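
A minimal sketch of this filter, assuming each record is a dict with a "chromosome" key and that the comparison is exact and case-sensitive. The names IGNORED_CHROMOSOMES and is_ignored and the sample symbols are hypothetical.

```python
IGNORED_CHROMOSOMES = {"-", "Un"}


def is_ignored(row):
    """Return True when the row's chromosome marks the gene as one to skip."""
    return row["chromosome"] in IGNORED_CHROMOSOMES


# Placeholder records; the symbols are made up for illustration.
candidates = [
    {"Symbol": "GENE_A", "chromosome": "19"},
    {"Symbol": "GENE_B", "chromosome": "-"},
    {"Symbol": "GENE_C", "chromosome": "Un"},
]
kept = [row for row in candidates if not is_ignored(row)]
```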