This track displays $organism/$o_organism genomic alignment chains. These alignments were generated using blastz and blat alignments of $o_organism genomic sequence from the 13 Nov. 2003 Arachne $o_organism draft assembly.
Alignments were performed using a gap scoring system that allows longer gaps than traditional affine gap scoring systems. It can also tolerate gaps in both $o_organism and $organism simultaneously. These "double-sided" gaps can be caused by local inversions and overlapping deletions in both species.The graphical display shows boxes joined together by single or double lines. The boxes represent aligning regions. Single lines indicate gaps that are largely due to a deletion in the genome of the non-$organism species or an insertion in the $organism assembly. Double lines represent more complex gaps that involve substantial sequence in both species. This may result from inversions, overlapping deletions, an abundance of local mutation, or an unsequenced gap in one species. In cases where there are multiple chains over a particular portion of the $organism genome, chains with single-lined gaps are often due to processed pseudogenes, while chains with double-lined gaps are more often due to paralogs and unprocessed pseudogenes. In the fuller display modes, the individual feature names indicate the scaffold name, strand, and location (in thousands) of the match for each matching alignment.
The alignments were generated by blastz on repeatmasked sequence using the following $organism/$o_organism scoring matrix:
A C G T A 100 -300 -150 -300 C -300 100 -300 -150 G -150 -300 100 -300 T -300 -150 -300 100 K = 4500, L = 3000, Y = 3400, H = 2000The resulting alignments were processed by the axtChain program. AxtChain organizes all the alignments between a single $o_organism scaffold and a single $organism chromosome into a group and makes a kd-tree out of all the gapless subsections (blocks) of the alignments. The maximally-scoring chains of these blocks were found by running a dynamic program over the kd-tree. Chains scoring below a certain threshold were discarded.
To place additional $o_organism scaffolds that were not initially aligned by blastz, a DNA blat of the unmasked sequence was performed. The resulting blat alignments were also chained, and then merged with the blastz-based chains produced in the previous step to produce the chains displayed in this track.
Blastz was developed at Pennsylvania State University by Scott Schwartz, Zheng Zhang, and Webb Miller with advice from Ross Hardison.
The chaining and netting programs were developed at the University of California at Santa Cruz by Jim Kent.
The browser display and database storage of the chains were made by Robert Baertsch and Jim Kent.
Chiaromonte F, Yap VB, Miller W (2002). Scoring pairwise genomic sequence alignments. Pac Symp Biocomput 2002;:115-26.
Kent WJ, Baertsch R, Hinrichs A, Miller W, and Haussler D (2003). Evolution's cauldron: Duplication, deletion, and rearrangement in the mouse and human genomes. Proc Natl Acad Sci USA 100(20):11484-11489 Sep 30 2003.
Schwartz S, Kent WJ, Smit A, Zhang Z, Baertsch R, Hardison R, Haussler D, and Miller W (2003). Human-Mouse Alignments with BLASTZ. Genome Res. 13(1):103-7.