blastdb_api contains a set of C routines for reading BLAST databases. Version 1.0.3 has just been put on my FTP site.
Changes since 1.0.2:
A. Fixed the seek by gi function.
B. Added the ability to do gi list based bulk retrieval, equivalent
to the blastall "-l" option.
C. Added the ability to do taxon restricted bulk retrieval
in conjunction with the gi_dbname.bin and gi_dbname.idx files
produced by the gi_subset and prep_taxon_gi programs from my
parallelblast distribution. That software may be retrieved
The gi_subset and prep_taxon_gi programs may be used separately
from the parallel parts of the code. See the README in that distribution for more information.
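For readers who have not used gi lists: in its text form a gi list is simply one gi number per line. The sketch below shows one way such a file could be read in C. It is illustrative only; read_gi_list is a hypothetical helper and is not a routine from blastdb_api, whose own interface is described in its distribution.

```c
#include <stdio.h>
#include <stdlib.h>

/* Hypothetical helper, NOT part of blastdb_api.
   Reads a text gi list (one integer per line) from an open stream
   into gis[]. Lines are assumed to be shorter than the buffer.
   Returns the number of gi values stored (at most max). */
size_t read_gi_list(FILE *fp, long *gis, size_t max)
{
    char line[64];
    size_t n = 0;

    while (n < max && fgets(line, sizeof line, fp) != NULL) {
        char *end;
        long gi = strtol(line, &end, 10);
        /* Skip blank lines, non-numeric lines, and non-positive values. */
        if (end != line && gi > 0)
            gis[n++] = gi;
    }
    return n;
}
```

A caller would open the list with fopen(), pass the stream and a buffer to read_gi_list, and then hand the resulting gi values to whatever retrieval routine is in use.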
A note of caution: NCBI databases often have multiple gi
values associated with a single sequence entry. So in some
cases retrieving by gilist or taxon id (which generates a gilist)
may include the same sequence entry more than once.
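If duplicate entries are a problem, one possible workaround is to collapse the list of entries before retrieving them. The sketch below assumes the caller has already resolved each gi to an integer entry index (the position of the sequence entry in the database); that mapping step, and the names cmp_long and unique_entries, are assumptions for illustration and not part of blastdb_api.

```c
#include <stdlib.h>

/* Comparison function for qsort: ascending order of long values. */
static int cmp_long(const void *a, const void *b)
{
    long x = *(const long *)a;
    long y = *(const long *)b;
    return (x > y) - (x < y);
}

/* Hypothetical helper, NOT part of blastdb_api.
   Sorts the entry indices in idx[] and removes repeats in place.
   Returns the number of distinct indices left at the front of idx[]. */
size_t unique_entries(long *idx, size_t n)
{
    size_t i, out = 0;

    if (n == 0)
        return 0;
    qsort(idx, n, sizeof idx[0], cmp_long);
    for (i = 1; i < n; i++) {
        if (idx[i] != idx[out])   /* keep only the first of each run */
            idx[++out] = idx[i];
    }
    return out + 1;
}
```

Note that sorting changes the retrieval order; if the original gi order matters, a lookup table of already-seen entries would be needed instead.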
Please report any bugs.
mathog at caltech.edu
Manager, Sequence Analysis Facility, Biology Division, Caltech