You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
spacegraphcats uses code from
BBHash, a C++ library for building
minimal perfect hash functions (Guillaume Rizk, Antoine Limasset,
Rayan Chikhi; see
Limasset et al., 2017, arXiv, as
wrapped by pybbhash.
spacegraphcats also uses functionality from
khmer and
sourmash.
The rdomset code for efficently calculating a dominating set of a graph
at a given radius R is in spacegraphcats/catlas/rdomset.py.
The graph denoising code for removing low-abundance pendants from
BCALM cDBGs is in function contract_degree_two in
cdbg/bcalm_to_gxt.py.
Part of the indexPieces code for indexing cDBG nodes by dominating
nodes is
cdbg/index_cdbg_by_kmer.py. The
remainder is implemented in search, below.
The search code for extracting query neighborhoods is in
search/query_by_sequence.py;
see especially the call to kmer_idx.count_cdbg_matches(...).
Interesting library functionality
Code for indexing large FASTQ/FASTA read files by cDBG unitig, and
extracting the reads corresponding to individual unitigs from BGZF
files, is available in
cdbg/label_cdbg.py
and
search/search_utils.py,
get_reads_by_cdbg, respectively.
About
Indexing & querying large assembly graphs -- in space, no one can hear you miao!