International Journal of Biological Sciences
Impact Factor

ISSN 1449-2288

News feeds of IJBS published articles

Manuscript Status/Login

International Journal of Medical Sciences

Journal of Cancer


Journal of Genomics

PubMed Central

Journal of Genomics now in PubMed/PubMed Central. Submit manuscript...

Indexed in Journal Impact Factor

Int J Biol Sci 2010; 6(4):333-349. doi:10.7150/ijbs.6.333


Water Buffalo Genome Science Comes of Age

Vanessa N. Michelizzi1, Michael V. Dodson1, Zengxiang Pan1, M Elisabete J Amaral2, Jennifer J. Michal1, Derek J. McLean1, James E. Womack3, Zhihua Jiang1, Corresponding address

1. Department of Animal Sciences, Washington State University, Pullman, WA 99164-6351, USA
2. Department of Biologia, UNESP - São Paulo State University, IBILCE, São Jose Rio Preto, SP, Brazil
3. Department of Veterinary Pathobiology, College of Veterinary Medicine, Texas A&M University, College Station, TX 77843-4467, USA

How to cite this article:
Michelizzi VN, Dodson MV, Pan Z, Amaral MEJ, Michal JJ, McLean DJ, Womack JE, Jiang Z. Water Buffalo Genome Science Comes of Age. Int J Biol Sci 2010; 6(4):333-349. doi:10.7150/ijbs.6.333. Available from


The water buffalo is vital to the lives of small farmers and to the economy of many countries worldwide. Not only are they draught animals, but they are also a source of meat, horns, skin and particularly the rich and precious milk that may be converted to creams, butter, yogurt and many cheeses. Genome analysis of water buffalo has advanced significantly in recent years. This review focuses on currently available genome resources in water buffalo in terms of cytogenetic characterization, whole genome mapping and next generation sequencing. No doubt, these resources indicate that genome science comes of age in the species and will provide knowledge and technologies to help optimize production potential, reproduction efficiency, product quality, nutritional value and resistance to diseases. As water buffalo and domestic cattle, both members of the Bovidae family, are closely related, the vast amount of cattle genetic/genomic resources might serve as shortcuts for the buffalo community to further advance genome science and biotechnologies in the species.

Keywords: Water buffalo, Genome resources, Cytogenetics, Whole genome mapping, next generation sequencing, Genome biotechnology.

1. Introduction

Since water buffalo were domesticated 3000-6000 years ago, they have had economic importance as dairy, meat, and draught animals, in many highly populated countries [1-3]. The animals are typically found in tropical and subtropical forests, wet grasslands, marshes and swamps. Although they are terrestrial animals, they spend a good portion of time wallowing in mud holes or rivers in order to keep cool. Their habitats typically contain rivers, streams, mud holes, tall grasses, and trees which provide sufficient drinking and wallowing water, food, and coverage [2]. During the Pleistocene epoch, Bubalus was distributed from southern Asia to Europe. As the climate became increasingly dry, the area of distribution shrunk to India, Indonesia, and parts of Southeast Asia. Buffaloes are thought to have been introduced into Italy from central Europe in the sixth century or by the Bay of Tunis in the seventh century at the same time as the Arab conquests. Introduction of water buffalo populations into Australia, Africa, and the Americas has taken place only recently [1, 3].

There are more than 168 million water buffalo in the world with about 161 million in Asia, 3.7 million in Africa, 3.3 million in South America, and the rest are distributed in Europe and Australia [4]. In South America there is a large population of swamp buffalo/river buffalo hybrids, because many of the buffalo were imported from India where river buffalo are predominant, and Australia, which has a high population of swamp buffalo [5]. These 168 million water buffalo comprise only 11.1% of the world's bovid population, but more people depend on the water buffalo than on any other domesticated species in the world [1]. As such, unlike other domesticated bovids, the water buffalo population has increased about 2% per year world-wide in the last 20 years.

Water buffalo provide more than 5% of the world's milk supply, which contains less water and more fat, lactose, protein, and minerals than cow milk [2, 4]. Water buffalo milk is used to make butter, butter oil, high quality cheeses, and other high quality dairy products. They have leaner meat that contains less fat and cholesterol than beef, while having a comparable taste [2]. Their hide can be used to make good quality leather products and they make good beasts of burden, providing 20% to 30% of all farm power, and are superior draught animals in waterlogged conditions such as rice paddies. They can also be used for transportation and can haul heavier loads than cattle [2, 4]. Water buffalo dung is collected, used for heat or fertilizer, and successfully enriches the soil, which reduces or eliminates the need for chemical fertilizers. Water buffalo utilize less digestible feeds than cattle making them easier to maintain using locally available roughages. In addition, water buffalo are used as cash--to be sold when the need arises; thus securing the economic status of many families.

Water buffalo are fairly healthy animals, even though they typically live in hot/humid regions that are favorable to the development of disease. While they are susceptible to most diseases and parasites, that similarly affect cattle, including trypanosomiasis, tuberculosis, brucellosis, rinderpest and piroplasmosis, the effects of disease are often less deleterious [6]. Due to their wallowing behavior, water buffalo are less susceptible to ticks and other ecto-parasites [4]. The resistance to ticks means that tick-borne diseases such as theileriasis, babesiosis, and anaplasmosis have little effect on water buffalo. If inoculated with East Coast fever (a form of theileriasis), water buffalo and cattle are equally susceptible [6]. Wallowing also makes water buffalo resistant to the screwworm fly, which is a major pest of livestock in Central and South America. In areas where cattle are heavily infected with the screwworm fly larvae, water buffalo are not affected. The mud that is caked on water buffalo after wallowing is thought to suffocate the larvae [6]. The adult water buffalo also has a strong resistance to strongyloid nematodes. Water buffalo do not experience the same nutritional deficiencies and the resulting susceptibility to these worms that cattle do because of their ability to utilize low quality roughages [6]. However, despite the resistance to some parasites that wallowing affords, this behavior increases the susceptibility of water buffalo to liver fluke. The buffalo are easily infected with the waterborne stage of liver fluke, although clinical signs of the disease are not usually manifested. Milking water buffalo are less affected than dairy cattle by mastitis although this is likely to change as buffalo milk yield increases [6].

Although there are many advantages to raising water buffalo as described above, these animals remain underutilized. In particular, water buffalo breeders and farmers have been facing many challenges and problems, such as poor reproductive efficiency, sub-optimal production potential, higher than normal incidence of infertility, and lower rates of calf survival.

Genome research has created a broad basis for promoting and utilizing gene technologies in many fields of livestock production. For example, genome biotechnology will provide a major opportunity to advance sustainable animal production systems of higher productivity through manipulating the variation within and between breeds to realize more rapid and better-targeted gains in breeding value. This type of research will also make it possible to distinguish molecular phenotypes and thus improve the use of genetic resources in domestic animals. Therefore, the present review focuses on the currently available genome resources in water buffalo, thus providing knowledge and technologies that can help optimize production potentials, reproduction efficiency, product quality, nutritional value and resistance to diseases in the species.

2. Genome Anatomies of Water Buffalo

Buffalo belong to the Bovidae family and there are two main species of buffalo: the Asiatic buffalo (Bubalus bubalus) and the African Buffalo (Syncerus caffer) [5]. The Asiatic buffalo originated in India where domestication likely took place in the third millennium BC [7], and China where domestication occurred in the fifth millennium BC [8]. The Asiatic water buffalo can be divided into two subspecies: the river buffalo and the swamp buffalo. The exact phylogenetic relationship between swamp and river buffalo is still in question. Divergence of these two subspecies occurred approximately 10,000 to 1.7 million years ago, long before domestication [1, 9]. Therefore, it is likely that that there were separate domestication events for river buffalo in India and for swamp buffalo in China [10]. As for the African buffalo, there are two subspecies: the cape buffalo (Syncerus caffer caffer) and the forest buffalo (Syncerus caffer nanus) [11].

Cytogenetics of Asiatic Buffalo

Cytogenetic studies show that river buffalo have 25 chromosome pairs while swamp buffalo have 24 pairs. These subspecies differ by one chromosome; a fusion between river buffalo (BBU) chromosome 4 and 9 is comparable to swamp buffalo chromosome 1 [5], and all chromosomes and chromosome arms are preserved between these two subspecies. Crosses between the two subspecies are fertile but hybrids possess 49 chromosomes, which is thought to lead to lower reproductive values in subsequent matings. River buffalo have 5 biarmed chromosome pairs and all others, including the sex chromosomes are acrocentric. Several studies have shown that river buffalo and domestic cattle, both members of the Bovidae family, are closely related. Indeed, both share chromosome banding and gene order homology, and have been cytogenetically characterized [12]. At the cytogenetic level, cattle and river buffalo chromosomes can be matched arm for arm (see examples in Figure 1). For reference, the cattle genome consists of 29 acrocentric chromosome pairs and a pair of XY sex chromosomes, while the river buffalo genome has 5 biarmed and 19 acrocentric chromosome pairs plus the XY sex chromosomes. The 5 biarmed chromosome pairs correspond to the fusion of two cattle acrocentrics, such as BBU 1 for BTA1 and BTA27, BBU2 for BTA2 and BTA23, BBU 3 for BTA8 and BTA19, BBU4 for BTA5 and BTA28, and BBU5 for BTA16 and BTA29, respectively. Each of the acrocentric river buffalo chromosomes corresponds to one of the remaining cattle chromosomes [12].

 Figure 1 

At the cytogenetic level, water buffalo chromosomes can be matched to bovine chromosomes arm for arm. Each biarmed water buffalo chromosome is derived from the fusion of two bovine acrocentrics. (a) This shows the similar banding patterns for bovine chromosomes 29 and 16 to water buffalo chromosome 5 [22], (b) This shows similar banding patterns for bovine chromosome 12 and water buffalo chromosome 13 [22].

Int J Biol Sci Image (Click on the image to enlarge.)

Cytogenetics of African Buffalo

Cytogenetic studies show that S.c. caffer possess 26 chromosome pairs and S.c. nanus has 27 chromosome pairs. These two species can interbreed although their progeny have 53 chromosomes and have reduced fertility due to unbalanced gametes, which give rise to unbalanced zygotes [5]. The main difference between the two subspecies of African buffalo is the presence of four biarmed chromosomes in S.c. caffer and three biarmed chromosomes in S.c. nanus. The rest of the chromosomes, including the sex chromosomes, are acrocentric in both species. The biarmed pairs in S.c. caffer correspond to the fusion of cattle chromosomes 1, 13; 2, 3; 5, 20; and 11, 29 [13]. In addition, Syncerus and Bubalus share no bi-armed chromosomes pairs, which suggest that there can be no crosses between these two genera because the resulting hybrid would have an unbalanced chromosome set. Therefore, chromosome morphology supports the designation of two separate genera [5].

Sex Chromosomes

Several studies have revealed high degrees of homology among autosomal chromosomes of bovids with similar banding patterns and gene order among the chromosome arms of cattle, river buffalo, sheep, and goats [14, 15]. Bovid sex chromosomes, unlike the highly similar autosomal chromosomes, share a slightly more complex rearrangement of sequences [5]. Chromosome banding comparisons show that while large portions of these chromosomes are conserved, BBU-X has large blocks of constitutive heterochromatin that BTA-X lacks. Cytogenetic studies representing loci order on these sex chromosomes show complex rearrangements that may have occurred during the karyotype evolution of river buffalo and cattle. BBU-X and BTA-X share the same gene order but a different centromere position, indicating a centromere translocation event with the loss of constitutive heterochromatin in BTA-X, which differentiates it from BBU-X [5]. Comparative FISH mapping shows the existence of a similar situation in river buffalo and cattle Y-chromosomes. BTA-Y and BBU-Y differ in an inversion including the centromere and breakage points in both arms (pericentric inversion) where BBU-Y is larger than BTA-Y and gains heterochromatin [5].

Nucleolar Organizing Regions (NORs)

Studies have shown that nucleolar organizing regions (NORs) are present in the telomeres of five cattle, sheep, and goat chromosomes and six water buffalo chromosomes [5]. The presence of these NORs and the highly conserved nature of bovid chromosomes indicated that the same nucleolus organizer chromosomes (NOCs) would also be present in bovids. Research by Gallagher et al. [16] found that some NORs, and not NOCs, were conserved to homologous chromosomes or chromosome arms. Moreover, goats, sheep, cattle, and river buffalo share one NOC; and two of the NOCs are common to river buffalo and cattle showing a close evolutionary proximity to each other [5].

Phylogenetics of Water Buffalo

Barker et al. [17] studied 21 microsatellite loci in 8 swamp and 3 river buffalo populations, and found that there were considerable intra- and interpopulation differences in allelic variability between and within each buffalo type. Ritz et al. [18] studied the relationship between bovid species, including river buffalo. This study compared the analysis of 20 bovine microsatellites and showed that B. bubalis and S. caffer were the most divergent species in the Bos clade. Finally, Kumar et al. [19] used 27 microsatellite loci to show genetic variation among 8 Indian breeds. More recently, amplified fragment length polymorphism (AFLP) fingerprinting analysis was used in the phylogenetic analysis of bovid species that clustered African and water buffalo. This study showed several tree constructions including African buffalo with water buffalo [5]. Analysis of the mitochondrial D-loop DNA sequence of 19 swamp buffaloes and 61 river buffaloes of different breeds was done by Kierstein et al. [1]. The results of this study suggested that there was only one domestication event of water buffalo, which occurred on the Indian subcontinent 5,000 years ago. The authors hypothesized that these water buffalo interbred with wild buffalo or domestic buffalo from China resulting in the buffalo found on the South-East Asian mainland. Alternatively, Kumar et al. [10] suggested an independent domestication for river and swamp buffalo which is supported by the comparison of the mitochondrial D-loop regions of seven Chinese swamp buffalo and other swamp and river buffalo from Australia, India, Brazil, Italy, and Southeast Asia done by Lei et al [20]. These results showed that river buffalo and swamp buffalo diverged before domestication indicating that the domestication of swamp buffalo in China was independent from the domestication of river buffalo on the Indian subcontinent.

Analysis of water buffalo interleukin-12 (IL12) sequences and expression revealed significant sequence identity to bovine IL12 and functional cross-reactivity with bovine immune cells [5]. Furthermore, Indian water buffalo interleukin-18 (IL18) cDNA showed similar amino acid sequence (99%) to cattle.

3. Whole Genome Mapping of Water Buffalo

Several methods have been developed and used in mapping of genomes, such as linkage, radiation hybrid (RH), and in situ hybridization mapping. The linkage map was developed soon after the re-discovery of Mendel's work at the beginning of the 20th century. Linkage maps are generated by counting the number of offspring that receive either parental or recombinant allele combinations from a parent that carries two different alleles at two or more loci. Analyses of this type of data determine whether loci are "linked" to each other and, their relative order and the distance that separate them. Therefore, linkage mapping requires polymorphic markers and reference populations. The starting materials for RH mapping are cell lines that are constructed by fusing irradiated donor cells from the species of interest with a rodent cell line (usually hamster). The irradiation shatters each chromosome into multiple fragments at random locations. The resolution of this technique is as high as linkage analysis but it does not depend upon breeding and polymorphic markers. In situ hybridization uses a labeled probe to detect and localize specific RNA or DNA sequences on a chromosome. The modified in situ protocol that utilizes fluorescent tags is referred to as FISH (for fluorescent in situ hybridization). In general, for the method to be effective, the probe should be at least 500 bp in length. However, the estimated resolution of in situ hybridization is limited to ~10,000 kb. Current research only reports the use of the latter two methods in mapping of the water buffalo genome.

Fluorescent In Situ Hybridization (FISH)

The first cytogenetic map for water buffalo with only 68 loci, mostly assigned using FISH, was reported by Di Meo in 2008 [21]. This map contained at least one bovine molecular marker assigned to each river buffalo chromosome or chromosome arm. Subsequent maps were reported that contain 171 known genes and 122 microsatellites. Of these, 293 were assigned genes and 247 were assigned using FISH [5]. The total number of mapped loci for river buffalo is now 309, which cover all chromosomes and chromosome regions of the river buffalo genome (reviewed in Iannuzzi and Di Meo 2009 [5]).

Radiation Hybrid Mapping

Radiation hybrid mapping is used to generate medium to high resolution maps, and are available for several mammalian species including the cow. The RH panel for river buffalo is a recent development that has been used to construct preliminary RH maps for several of the water buffalo chromosomes [22-26]. The preliminary RH maps for BBU1, BBU3, BBU6, BBU7, BBU10, and BBUX were based on markers derived from cattle, and showed that the bovine genome is a useful source of markers for buffalo genome mapping. This one element is significant and demonstrates that rapid and efficient transfer of genetic information may occur from cattle to water buffalo. Indeed Amaral et al. [12] used the BBURH5000 panel to construct a first generation genome RH map of the river buffalo with 2621 cattle-derived loci covering all chromosomes. This map demonstrates improved coverage with considerable increases in the number of mapped markers, when compared to preliminary maps (previously constructed). After completion of the first generation whole genome RH map for river buffalo, the marker order was compared to the current bovine genome sequence assembly Btau_4.0. This comparison showed that the marker order within the linkage groups for the buffalo chromosomes was consistent with the bovine genome assembly [12]. As such, Stafuzza et al. [27] generated RH maps for BBUY which contain a total of 28 markers distributed within one linkage group. Figure 2 illustrates the bovine genome regions that have been linked to the water buffalo genomes based on the current maps of the species.

4. Whole Genome Sequencing of Water Buffalo

Genome sequencing in farm and other animals has advanced significantly in recent years. For example, sequence data are available in the public domain for many livestock species. As of March 1, 2010, there are 2,509,850 cattle, 3,237,358 pigs, 2,195,532 chicken, 6,259,791 sheep, 470,489 horse, 2,886,083 cat and 2,599,789 dog sequences available in the GenBank nucleotide databases ( Whole genome sequencing has been completed in cattle, horse, chicken and dog and sequencing of the porcine genome is almost completed. A total of 66,935 nucleotide sequences for the water buffalo have been deposited in the GenBank database and are mainly 64,212 whole genome shotgun sequences, while the rest includes 974 mitochondrial genomic sequences and 1,748 nuclear gene/genomic DNA sequences. The latter may be further classified into nuclear gene-related sequences (981), satellite-related sequences (689) and others (78) (Figure 3). The 689 satellite sequences involve satellite (17), microsatellite (311) and minisatellite (361) sequences, respectively, and the 78 other sequences mainly consist of cis-acting regulator, CpG island, repeat sequence, gene-like sequence or some other genomic sequences.

 Figure 2 

Genome regions that have markers linked between river buffalo and cattle (Data source: [12]).

Int J Biol Sci Image (Click on the image to enlarge.)
 Figure 3 

Categories of 66935 water buffalo sequences deposited in the GenBank Database.

Int J Biol Sci Image (Click on the image to enlarge.)

Mitochondrial Genome Sequencing of Water Buffalo

The mitochondrial genome is present as a circular DNA molecule. Unlike nuclear chromosomes that are paired in mammals (except X and Y sex chromosomes), there are many copies of the mitochondrial molecule in every cell. However, the copy number can be extremely variable in different cells. For example, an egg contains 100,000 to 1,000,000 mitochondrial DNA (mtDNA) molecules, while a sperm contains only 100 to 1000. The complete water buffalo mitochondrial genome was sequenced by three groups of scientists at Hainan Medical College, China (see Genbank accession number: AY702168), Centre for Cellular and Molecular Biology, Hyderabad, India (see GenBank accession number: AF547270) and Istituto Spallanzani, Italy (see GenBank accession number: AY488491). Similar to other mammals, each mtDNA molecule in water buffalo harbors genetic material that encodes 37 genes: 13 for proteins (polypeptides), 22 for transfer RNA (tRNA) and one each for the small and large subunits of ribosomal RNA (rRNA) (Figure 4).

 Figure 4 

Water buffalo mitochondrial genome structure and annotation based on AY488491.

Int J Biol Sci Image (Click on the image to enlarge.)

Among 13 coding mitochondrial genes in water buffalo, the length of the coding sequences of four genes - COX1, ND4L, ND4 and ND6 are identical to those in Bison bison (American bison, NC_012346), Bos grunniens (domestic yak, NC_006380), Bos indicus (zebu, AF492350), Bos taurus (domestic cattle, NC_006853), Capra hircus (goat, NC_005044), Equus asinus (donkey, NC_001788), Equus caballus (horse, NC_001640), Lama glama (llama, NC_012102), Oryctolagus cuniculus (rabbit, NC_001913), Ovis aries (sheep, NC_001941) and Sus scrofa (swine, NC_000845). However, the D-loop region (926 bp) is relatively short in water buffalo, whereas it ranges from 888 bp in B. bison to 1800 bp in O. cuniculus. Interestingly, sixteen of these 37 genes overlap in the water buffalo mitochondrial genome, including two three-gene-overlaps (ND1/tRNA-Ile/tRNA-Gln and ATP8/ATP6/COX3), and five two-gene-overlaps (ND2/tRNA-Trp, tRNA-Tyr/COX1, ND4L/ND4, ND5/ND6 and tRNA-Thr/tRNA-Pro) (see GeneBank accession number: AY488491). The overlapping size varies from 1 bp to 40 bp. Certainly some genes are also distanced from each other, but the distance gap ranges only from 1 bp to 4 bp in length. In addition to these complete mitochondrial genome sequences of water buffalo, the D-loop and CYTB regions have also been investigated. Currently, there are 784 entries for the former region and 162 entries for the latter region in the GenBank databases.

Nuclear Gene/Genomic Sequencing of Water Buffalo

As shown in Table 1, 971 known gene sequences have been contributed to the GenBank database for the species. Our annotation (Table 1) revealed that the sequences represent 277 functional genes based on the BLAST searches against orthologous genes in mammals. In particular, we observed that 29 genes/clusters have been heavily investigated, and they may account for 54% (527/971) of the known gene entries. Among them, 243 entries are sequences for genes related to growth and milk production, such as oxidized low density lipoprotein (lectin-like) receptor 1 (OLR1, 53 entries), leptin (LEP, 33 entries), growth hormone receptor (GHR, 29 entries), lactalbumin, alpha (LALBA, 20 entries), casein beta (CSN2, 18 entries), growth hormone 1 (GH1, 15 entries), insulin-like growth factor 1 (IGF1, 14 entries), casein kappa (CSN3, 13 entries), casein alpha s1 (CSN1S1, 13 entries), stearoyl-CoA desaturase (SCD, 10 entries), myostatin (MSTN, 9 entries), diacylglycerol O-acyltransferase homolog 1 (DGAT1, 9 entries) and butyrophilin, subfamily 1, member A1 (BTN1A1, 7 entries). Major histocompatibility complex (BULA@, 86 entries), solute carrier family 11, member 1 (SLC11A1, 24 entries), lactoferrin (LTF, 23 entries), integrin, beta 2 (ITGB2, 11 entries), CD14 molecule (CD14, 11 entries), toll-like receptor 4 (TLR4, 10 entries), lysozyme (LYZ, 10 entries), cathelicidin (CATHL@, 8 entries) and interleukin 2 (IL2, 7 entries) were the targets to study disease resistance in water buffalo. Variables of reproduction is another focus of water buffalo genome research, including studies on follicle stimulating hormone receptor (FSHR, 17 entries), sex determining region Y (SRY, 15 entries), progestagen-associated endometrial protein (PAEP, 14 entries), luteinizing hormone beta polypeptide (LHB, 10 entries), cytochrome P450, family 19, subfamily A, polypeptide 1 (CYP19A1, 9 entries), follicle stimulating hormone, beta polypeptide (FSHB, 8 entries) and estrogen receptor 1 (ESR1, 6 entries).

Genomic sequencing in water buffalo also involved satellite, minisatellite and microsatellite sequencing. Satellite sequences were mainly contributed by two groups of researchers, one at the Nagoya University, Japan [28] and one at National Institute of Immunology, India [29]. With digestion of two restriction endonucleases, BamHI and StuI, Tanaka and colleagues [28] identified two types of satellites in water buffalo by sequence analysis: one with ~ 1,400 bp tandem repeat unit and another with ~700 bp tandem repeat unit. The former shows 79% similarity to the bovine satellite I DNA, while the latter is 81% identical to the bovine satellite II DNA. The authors found that both satellite DNAs are localized to the centromeric regions of all chromosomes in either river or swamp type of buffaloes. Furthermore, the hybridization signals with the satellite I DNA on the acrocentric autosomes and X chromosome were much stronger than those on the biarmed autosomes and Y chromosome. However, the hybridization signals with buffalo satellite II DNA was almost the same over all the chromosomes, including the Y chromosome. Pathak and coworkers [29] further confirmed these two types of satellite sequences in the buffalo genome: the 1378- and 673-bp repeat fragments. Using real-time PCR analysis, the authors uncovered 1234 and 3420 copies of 1378- and 673-bp fragments per haploid genome, corresponding to 30 and 68 copies per chromosome, respectively. In addition, both 1378- and 673-bp repeat fragments are abundantly expressed in the spleen and liver.

There are a total of 361 minisatellite sequences deposited in the GenBank database for buffalo, which were mainly contributed by the National Institute of Immunology, India. Srivastava and colleagues [30] performed the minisatellite-associated sequence amplification with an oligo (5' CACCTCTCCACCTGCC 3'), that was designed based on consensus of 33.15 repeat loci using cDNA from the testis, ovary, spleen, kidney, heart, liver, and lung of water buffalo. The authors defined six different sizes of minisatellites with 1,263, 846/847, 602, 576, 487, and 324 bp, respectively. BLAST searches revealed that the 846/847-bp fragment has homology with the adenylate kinase gene, while the 1,263, 324, and 487-bp fragments show homology with the secreted modular calcium binding protein (SMOC-1), leucine-rich repeat neuronal 6A (LRRN6A) mRNA, and human TTTY5 mRNA, respectively. As for the microsatellites, there are a total of 311 submissions in the current GenBank database. Most of them were submitted by the Centre for Cellular and Molecular Biology, India (see AY775830 - AY775944, AY779565 - AY779623, AY787147 - AY787166, AY805331 - AY805389 and AY912133 - AY912182). These microsatellites have yet to be further mapped.

Next Generation Sequencing of Water Buffalo Genome

In the past, two general strategies have been widely used for whole genome sequencing: BAC by BAC sequencing and shotgun sequencing. Both strategies employ the Sanger method, which is relatively costly, time consuming, and labor intensive [31]. Therefore, the high demand for low-cost sequencing has led to the development of high-throughput sequencing technologies, called next-generation sequencing. As recently reviewed by Jiang et al. [32], three such next-generation sequencing technologies have been commercialized, such as Roche/454 life science (, Illumina/Solexa ( and Applied Biosystem/SOLiD ( These new generation sequencing methods no longer use the Sanger method for sequencing. Instead, the 454 technology is based on pyrosequencing and emulsion PCR; the Solexa technology utilizes a sequencing-by-synthesis approach for sequencing single DNA molecules attached to microspheres and the SOLiD (supported oligonucleotide ligation and detection) technology is a short-read sequencing method based on ligation. Nevertheless, these next-generation sequencing methods can produce a large amount of sequences in a relatively short time: for example, 500 Mb within 10 hours for Roche 454 GS FLX system, 1.5 Gb within 2.5 days for Illumina Genome Analyzer and 4 Gb within 6 days for Applied Biosystems SOLiD system [33].

Using Roche 454 GS-FLX Titanium technology, a group of researchers at the Anand Agricultural University, India completed <1x genome sequencing of water buffalo in 2009 with a total of 64,212 sequences submitted to the GenBank database (ACZF01000001-ACZF01064212). The submission information from NCBI shows that a mature buffalo bull of Jaffrabadi breed was used to provide DNA materials for the project. The bull was tested free for bovine tuberculosis, brucellosis and Johne's disease. The submitted sequences range from 92 bp to 4,726 bp in size, but entries with 201 bp - 500 bp account for 83.3% (Figure 5). Overall, the 454 GS-FLX sequencing technologyhas contributed 21,675,247 bp in total to the species.

Also in 2009, Jayaraman [34] reported India's ambitious goal to sequence the complete water buffalo genome (, because the species represents the mainstay of the country's dairy industry, producing up to 55 percent of milk in addition to meat, hides and draught power. At that announcement, the Indian Council of Agricultural Research awarded the National Bureau of Animal Genetic Resources in Karnal, Haryana a substantial grant for their research. The Buffalo Genome Resources website at the National Center for Biotechnology Information recently confirmed that collaborations were established among the National Bureau of Animal Genetic Resources in Karnal, the Central Institute for Research on Buffaloes in Hisar, and the Animal Science Division of the Indian Council of Agricultural Research to sequence the buffalo genome. It is anticipated that >95% coverage of the buffalo genome will be sequenced by the end of 2010. In collaboration with the University of Florida, Washington State University will use the Illumina GAIIx technology to produce approximately 40 Gb of sequences for water buffalo during the summer of 2010.

 Figure 5 

Size distributions of 64,212 water buffalo whole genome shotgun sequences deposited in the GenBank Database.

Int J Biol Sci Image (Click on the image to enlarge.)
 Table 1 

Annotation of known gene sequences in water buffalo.

Gene symbolDescriptionNACCESSION
ABCG2ATP-binding cassette, sub-family G, member 22DQ272524 DQ205444
ACTA2actin, alpha 2, smooth muscle, aorta1DQ846868
ACTBactin, beta4FJ468013 AF156978 DQ087263 DQ661647*
ACTC1actin, alpha, cardiac muscle 11DQ904371
AKR1C4aldo-keto reductase family 1, member C41DQ884879
AMELXamelogenin (amelogenesis imperfecta 1, X-linked)2DQ469598 EF062996
AMELYamelogenin, Y-linked2DQ469599 EF062997
ANGPT1angiopoietin 11AM237586
ANGPT2angiopoietin 21AM237589
ART4ADP-ribosyltransferase 4 (Dombrock blood group)1EF118387
BMP15bone morphogenetic protein 153FJ613788 FJ613792 EF375880*
BOP1block of proliferation 11DQ082844
BPIbactericidal/permeability-increasing protein1AB234871
BTN1A1butyrophilin, subfamily 1, member A17EU199797-8 AY491471-4 EU194868
BULA-A**MHC class I antigen-like20AY894407-10 AY188812-13 AY785758-61 AY925136 AY785759 DQ157373-81
BULA-DIB**MHC class II DI-beta antigen, Bubu-DIB*0101 allele1AY155224
BULA-DQA**MHC class II antigen, BuLA-DQA*01 allele7DQ822570-3 DQ868979-81
BULA-DQA1**MHC class II antigen3DQ116959 AY954685 DQ440647*
BULA-DQA2**MHC class II antigen precursor1DQ440648*
BULA-DQB**MHC class II antigen24DQ908903-4 EU025857-66 AY699876-87
BULA-DRA**MHC class II antigen, Bula-DRA*01 allele3AF385488-9 DQ016629*
BULA-DRB**MHC class II antigen, Bula-DRB*01 allele8AF385473-80
BULA-DRB1**MHC class II antigen precursor1DQ146947*
BULA-DRB3**MHC class II antigen28AF261955-6 AF270653-74 DQ057985* DQ187336-7 AY496063
BULA-DYA**MHC class II antigen3DQ188097* AY155211-2
BYS1**Y chromosome repeat sequence1X93551
CAPN1calpain 1, (mu/I) large subunit1AB362998
CAPN2calpain 2, (m/II) large subunit1AB362999
CATHL@**cathelicidin-like cluster8DQ661043* AJ812216 AY762972 DQ832665* EF050453* DQ832666 GQ396636 GQ354416
CATSPER1cation channel, sperm associated 11GU372967
CCL3chemokine (C-C motif) ligand 31EF562448*
CCNB2cyclin B21EF367720
CCT8chaperonin containing TCP1, subunit 8 (theta)2EF121558 EF113594
CD14CD14 molecule11DQ444324 EU370398-404 DQ457089* EU871785 GU368103
CD2CD2 molecule1EF118474
CD247CD247 molecule1DQ057984*
CGAglycoprotein hormones, alpha polypeptide1AJ302670
CHI3L1chitinase 3-like 1 (cartilage glycoprotein-39)1AY295929
CLEC7AC-type lectin domain family 7, member A1EF636900*
CPceruloplasmin (ferroxidase)1FN649762
CScitrate synthase1FJ468012
CSF2colony stimulating factor 2 (granulocyte-macrophage)1AY553190*
CSN1S1casein alpha s113AF529305 FJ392261* EF025978-83 AY948385* DQ111783* AY514427 AJ005430 EF133464
CSN1S2**casein alpha-S215AJ005431 AY514428 DQ133467* DQ173244* EF025984-9 EF066480 FM865618-21
CSN2casein beta18AY352050 DQ191170-2 Y17836-9 FM946182 FN424088 EF115306 AY599833 DQ317447* DQ631829* EF066481 AJ005165 AJ005432 FM986648
CSN3casein kappa13AF164024 AM900443 EF133463 DQ191173-4 AY750857* DQ645429* EF066482 FJ770200 U96662 AJ011387 AJ628346 D14370^
CSRP3cysteine and glycine-rich protein 31GQ231523
CTLA4cytotoxic T-lymphocyte-associated protein 41FJ827143
CXCR2chemokine (C-X-C motif) receptor 21AY864732
CYM**chymosin2EU265816* AF177290*
CYP11A1cytochrome P450, family 11, subfamily A, polypeptide 11AM237596
CYP19A1cytochrome P450, family 19, subfamily A, polypeptide 113EF126034-6 EF178281 DQ407274 EU621845 FJ572657-9 AH015824 DQ489713 DQ489714 EU308111
CYP1A1cytochrome P450, family 1, subfamily A, polypeptide 12FJ984598-9
CYP21**cytochrome P450, subfamily XXI1AF163768
DAZ1deleted in azoospermia 11EU106872
DDX3YDEAD (Asp-Glu-Ala-Asp) box polypeptide 3, Y-linked5AY928814 AY928821 AY936546 AY936552 GQ259328
DEFB@**beta defensin cluster10GQ231526-9 EF418029* EF418030* EF418031 AY392452* AY301005* EF489402
DGAT1diacylglycerol O-acyltransferase homolog 1 (mouse)9AY999090^ DQ886485^ DQ182702 FJ014704-6 DQ120929* DQ173243 AJ971291
DHX36DEAH (Asp-Glu-Ala-His) box polypeptide 361EF088288
EIF1ADeukaryotic translation initiation factor 1A domain containing1FJ415608
ERO1LERO1-like (S. cerevisiae)1EF088290
ESR1estrogen receptor 16AM237582 EU662287-91
ESR2estrogen receptor 2 (ER beta)1AM237583
F11coagulation factor XI2DQ232885 DQ233653
FABP3fatty acid binding protein 3, muscle and heart2GQ402490 AY758205*
FAM13A1**family with sequence similarity 13, member A12DQ246517 DQ297675
FBXL6F-box and leucine-rich repeat protein 62DQ097508 DQ082845
FCGRTFc fragment of IgG, receptor, transporter, alpha1EU262263
FGF1fibroblast growth factor 1 (acidic)1AM237590
FGF2fibroblast growth factor 2 (basic)1AM237591
FN1fibronectin 11AY862395
FOXD3forkhead box D31DQ487024
FSHBfollicle stimulating hormone, beta polypeptide11EU340285* AY449463^ DQ666186* DQ862125-7 EF650039-42 EF710660*
FSHRfollicle stimulating hormone receptor17EU148059 DQ785802* EU016216* EF650043-50 AM237585 DQ845245* EU662283-6
G6PDglucose-6-phosphate dehydrogenase3AJ56462-8 GU324292
GAPDHglyceraldehyde-3-phosphate dehydrogenase2GU324291 AY974798
GATA4GATA binding protein 41DQ487031
GCNT3glucosaminyl (N-acetyl) transferase 3, mucin type3AY847312-3^ AY847321^
GDF9growth differentiation factor 93FJ529501* FJ529502 EF202171*
GGT1gamma-glutamyltransferase 11AM260200
GH1growth hormone 116AJ000549 AF177288 EF417860 EF417870 EU344998-9 FJ754322 GU223914 AJ005116 AJ011533 AY940159 X72947 AJ011513-4 D16297 DQ307367
GHRgrowth hormone receptor29AY608917* AY053546-69 EF207441^ EU178741* AY739705 AY775296
GHRHgrowth hormone releasing hormone1DQ064594
GHRHRgrowth hormone releasing hormone receptor2EF600712^ EF600713*
GHRLghrelin/obestatin prepropeptide5GU071074-5 DQ118139 EF583468^ EU604028
GJA1gap junction protein, alpha 1, 43kDa1GQ896281
GNL3Lguanine nucleotide binding protein-like 3 like1GQ853551
GNRH1gonadotropin-releasing hormone 11EU395713*
GNRH2gonadotropin-releasing hormone 21EU887534#
GNRHRgonadotropin-releasing hormone receptor4AJ422122 AJ621497 DQ821403* EU621854*
GNRHR2gonadotropin-releasing hormone (type 2) receptor 21EU541504
GOT1glutamic-oxaloacetic transaminase 1, soluble1AB437282*
GPAA1glycosylphosphatidylinositol anchor attachment protein 1 homolog (yeast)1DQ082846
GPR33G protein-coupled receptor 33 (gene/pseudogene)2AY490667-8
GYG1glycogenin 11DQ885595
H2BH2B histone1EF526308*
HAMPhepcidin antimicrobial peptide1EU399814*
HAS2hyaluronan synthase 21DQ989300
HBA1hemoglobin, alpha 14AJ242731-4
HBBhemoglobin, beta5AM886147-51 (AM886149#)
HERC3hect domain and RLD 33AH015380 DQ272525-6
HEXAhexosaminidase A (alpha polypeptide)1EF057418
HK1hexokinase 11GU324294
HPS3Hermansky-Pudlak syndrome 31DQ885601
HSF1heat shock transcription factor 11DQ097507
HSPA1Aheat shock 70kDa protein 1A2AJ812563 EU099315*
IFN@**interferon cluster7AY323971-2 AY681350-1 EF503726 AY535404 AY665673
IFNGinterferon, gamma5AB246350* AF484688* AY466119 EF424252* EF567075*
IGF1insulin-like growth factor 1 (somatomedin C)14Y10691 Y16248 Y18832 EU159114-6 FJ032308 AM237594 AY803777 GQ301206* GQ385227 AY743324-5 DQ085612
IGF1Rinsulin-like growth factor 1 receptor3FJ032309 Y12700 AM237595
IGF2insulin-like growth factor 2 (somatomedin A)2FJ032306 DQ085613
IGF2Rinsulin-like growth factor 2 receptor1FJ032307
IGFBP2insulin-like growth factor binding protein 2, 36kDa3AJ223171 Y16352 DQ079594
IGFBP3insulin-like growth factor binding protein 35AJ223172 Y16351 AY338972 AY816186 AY304829
IGFBP4insulin-like growth factor binding protein 42AJ223170 DQ079592
IGFBP5insulin-like growth factor binding protein 52DQ079593 AF045566
IGHG3immunoglobulin heavy constant gamma 31GQ140262
IL10interleukin 103AB246351* AY325267* EF362779*
IL10RAinterleukin 10 receptor, alpha1AY864730
IL12Ainterleukin 12A4AB246352* EF424253* AY232819* DQ885596
IL12Binterleukin 12B3AB246353 EF424254 AY198121
IL13interleukin 132DQ083523* EF201845
IL15interleukin 153AJ891036 DQ083522* EF201846
IL18interleukin 185AJ891035 AY394479* EF424256* AY436506* DQ083521
IL1Ainterleukin 1, alpha2AB246786* AY514120*
IL1Binterleukin 1, beta2AB246787* AY514903*
IL2interleukin 27EF118562^ AB246354* EF397241 EF407852* DQ010150 AF255665 AF363786*
IL2RGinterleukin 2 receptor, gamma2AY864727 AY864729
IL3interleukin 31AY553189*
IL4interleukin 45AJ891034 DQ083520 AY293620* AB246355* EF407850*
IL4Rinterleukin 4 receptor1AY864728
IL5interleukin 51EF407851*
IL6interleukin 63AB246788* AY347710* AY601639
IL6Rinterleukin 6 receptor1AY864731
IL8interleukin 83FJ595833^ AY952930* AY862415
INHAinhibin, alpha1EU884446*
INSRinsulin receptor1Y14017
ISG15ISG15 ubiquitin-like modifier3DQ118136-8^
ITGB1integrin, beta 13AH015379 DQ251725-6
ITGB2integrin, beta 211AY629299-302 AY737720-1 AY742803 AY821799 AY842449* EU853307 AY252121
KCNMA1potassium large conductance calcium-activated channel, subfamily M, alpha member 11EU816373
KDRkinase insert domain receptor1AM237593
KITv-kit Hardy-Zuckerman 4 feline sarcoma viral oncogene homolog3FJ232938* AY829238 DQ314491*
KRT14keratin 141DQ846867
LALBAlactalbumin, alpha-20EF408609-10 AH014129-32 AY726609-17 EF408824 AF194373^ DQ785796* EF419240-1
LAPLaryngeal adductor paralysis4DQ458768* EF418028* EF630358* DQ886701
LBPlipopolysaccharide binding protein1FN645646
LDHAlactate dehydrogenase A1GU324295
LEPleptin33AF387813-4 AH013754^ AY177609 AY338973 AY427959 AY495586-7^ DQ490986 DQ497641 DQ676888-90 DQ831142-3 DQ837736-8 EU030441 EU078405 EU194869 EU199796 EU541447* EU662273-7 EU825672-4 EU888289 GQ385228
LEPRleptin receptor2AY177610-1
LHBluteinizing hormone beta polypeptide10AY765376* FJ357421* DQ489560^ EF113097-103
LHCGRluteinizing hormone/choriogonadotropin receptor5DQ858168-72
LIFleukemia inhibitory factor2EU926738 FJ907967
LIPElipase, hormone-sensitive1AY900493
LTFlactotransferrin25Y12775 EU192148 GQ231524 AF281089 DQ522302 DQ838746-51 EF443159 EF488002 EF650854 EU258554 EU263788 EU518482 EU532491 EU581859 EU598149 EU669579 EU683031 EU706286 EU719067 AJ005203
LYZlysozyme10DQ480754-5* GQ497214-6 GQ995483-5 EF535848* AJ225012
MAF1MAF1 homolog (S. cerevisiae)1DQ082847
MAP2K1mitogen-activated protein kinase kinase 11EF057415
MEGF11multiple EGF-like-domains 111EF057416
MMEmembrane metallo-endopeptidase1DQ885597
MOGAT2monoacylglycerol O-acyltransferase 23EF208205 EU239373-4
MPLmyeloproliferative leukemia virus oncogene1GQ411204
MSTNmyostatin13FJ752686-7^ AH014575^ AH013313^ AY254098^ AY363177-8 AY725214 AY854495-7 DQ091762^ DQ159987*
MX1myxovirus (influenza virus) resistance 11AB462524*
MX2myxovirus (influenza virus) resistance 21EF052266*
MYCv-myc myelocytomatosis viral oncogene homolog (avian)1GU296437
MYF5myogenic factor 52EF197850^ EF128444*
MYF6myogenic factor 6 (herculin)1FJ039849*
MYH1myosin, heavy chain 1, skeletal muscle, adult1AB126014
MYOD1myogenic differentiation 11FJ194946^
MYOGmyogenin (myogenic factor 4)2EF636460^ FJ211420
NANOGNanog homeobox1DQ487022
NCAM1neural cell adhesion molecule 11AF025988
NLRP5NLR family, pyrin domain containing 52AM748273 AM941717
NOS2nitric oxide synthase 2, inducible2EF562453 EF655617
NRAMP1natural resistance-associated macrophage protein 124DQ390205 DQ441408 DQ095780-1 DQ376109-10 DQ645386-9 AH015634 DQ658151* AH014155 AY702720 AY707989 AH014681-2 AY860618 AY860620-4 DQ390206
OLR1oxidized low density lipoprotein receptor 153GQ415420-51 GQ478023-42 GQ385226
ORM1orosomucoid 11FN645647
OVGP1oviductal glycoprotein 1, 120kDa2EU382735* DQ482669*
OXToxytocin, prepropeptide2AM234538-9
PAEPprogestagen-associated endometrial protein14AY775796-802 FN377869 DQ785797* AJ005429 DQ340204-5 AJ492506 AM238696
PAG1phosphoprotein associated with glycosphingolipid microdomains 11EU815059*
PAG2**pregnancy-associated glycoprotein 21GU433184
PAPOLApoly(A) polymerase alpha2FJ386493 GQ896282
PAPPApregnancy-associated plasma protein A, pappalysin 11DQ079591
PDCD1programmed cell death 12FJ827145* FJ827147*
PDHBpyruvate dehydrogenase (lipoamide) beta1GU324296
PFKMphosphofructokinase, muscle1GU324297
PGRprogesterone receptor6EU662278-82 AM237584
PKD2polycystic kidney disease 21DQ230325
PKM2pyruvate kinase, muscle1EF057417
PLOD2procollagen-lysine, 2-oxoglutarate 5-dioxygenase 21DQ418774
PLS1plastin 11DQ418775
PLZFPromyelocytic leukemia zinc finger1EU195081
POU1F1POU class 1 homeobox 12AY971145 GQ385224
POU5F1POU class 5 homeobox 12DQ487023 EU926737
PPARGperoxisome proliferator-activated receptor gamma2GU066311 EU887290^
PRDX1peroxiredoxin 11DQ082848
PRKAG3protein kinase, AMP-activated, gamma 3 non-catalytic subunit2DQ646431 DQ525861
PRLprolactin4EF054878* EU352822 DQ287249 EU340420
PRLRprolactin receptor1GQ339914
PRNPprion protein6AY720689-90 AY768533-4 AY320365 AY769894
PROS1protein S (alpha)1DQ885599
PRSS7protease, serine, 7 (enterokinase)1DQ518426
PTGESprostaglandin E synthase1DQ167808
PTGFRprostaglandin F receptor (FP)2DQ100351* AY346134*
PTGS2prostaglandin-endoperoxide synthase 21EF028069
PXMP4peroxisomal membrane protein 4, 24kDa1EU714054
REXO1REX1, RNA exonuclease 1 homolog (S. cerevisiae)1FJ269033
RN18S1RNA, 18S ribosomal 12GU324290 Y12516
RN28S1RNA, 28S ribosomal 11AJ621496
RNASE1ribonuclease, RNase A family, 1 (pancreatic)3AF259554 AJ011842-3
RNF13ring finger protein 131DQ885600
RPSAribosomal protein SA1GQ452383^#
SAA@serum amyloid A1 cluster1FN599527
SCDstearoyl-CoA desaturase (delta-9-desaturase)10FN395259 FM876222 AM600640 AH015836 DQ088625 DQ646700-1 GQ336861-2 EF122150
SDHAsuccinate dehydrogenase complex, subunit A, flavoprotein (Fp)1GU324298
SELTselenoprotein T1DQ899756
SERPINA1serpin peptidase inhibitor, clade A, member 11GQ385225
SERPINA14**serpin peptidase inhibitor, clade A, member 141DQ661648*
SFTPDsurfactant protein D1GQ231525
SLAMF1signaling lymphocytic activation molecule family member 11DQ228868*
SLC11A1solute carrier family 11, member 15FJ827149* U27105* AY702721 DQ855416-7
SLC11A2solute carrier family 11, member 21FJ827151*
SLC24A1solute carrier family 24, member 11EF121559
SLC26A2solute carrier family 26, member 21AY350740^
SLC2A1solute carrier family 2, member 13GU324293 FJ468009 AJ812564
SLC5A4solute carrier family 5, member 41AF254423
SLC5A5solute carrier family 5, member 51FJ468010
SLC5A8solute carrier family 5, member 81FJ468011
SLPIsecretory leukocyte peptidase inhibitor1EU700055*
SMOC1SPARC related modular calcium binding 14AY947405 EU370565* DQ159955* EF446167*
SOX2SRY (sex determining region Y)-box 24GQ451841* GQ853881^ DQ487021 EU661361
SPP1secreted phosphoprotein 13DQ295062* DQ415660* DQ899755^
SRYsex determining region Y15GQ259332^ AY323859 DQ119747^ EF465091 EF465093-4 AY341337^ DQ417872^ EF465092 EF465095-6 EU016229^ DQ336535^ FJ546413-4^
STARsteroidogenic acute regulatory protein1DQ062682
STAT3signal transducer and activator of transcription 31DQ487026
STAT5Asignal transducer and activator of transcription 5A2EU887291 EU030440
TAP**tracheal antimicrobial peptide2AB299981 AB299970
TEKTEK tyrosine kinase, endothelial1AM237587
TFDP2transcription factor Dp-21DQ418776
TIE1tyrosine kinase with immunoglobulin-like and EGF-like domains 11AM237588
TLR1toll-like receptor 12EU005235 GU451251^
TLR10toll-like receptor 101EU005244
TLR2toll-like receptor 25DQ131599 DQ288130* EU005236 EU178742^ GU441859
TLR3toll-like receptor 33DQ508811* EU005237 FJ606788^
TLR4toll-like receptor 410GQ851932 EF609240 EF562451-2 EU005238 GU002362 GU176310 DQ393127 EU386358^ DQ857349*
TLR5toll-like receptor 52EU005239 GQ866978^
TLR6toll-like receptor 62EU005240 GQ866885
TLR7toll-like receptor 73EU005241 GQ925364^ GU214059^
TLR8toll-like receptor 82EU005242 GQ922103^
TLR9toll-like receptor 93EU005243 EU747827 FJ606787^
TNFtumor necrosis factor5EF562449* EF562450* AB246789* EF424255* AY221123*
TNP1transition protein 11EU979632
TNP2transition protein 23EU979633 EU999142-3
TSHBthyroid stimulating hormone, beta1FJ357420*
TSPY**testis-specific protein4EU386184 EU386187-8 EU032586
TUBB3tubulin, beta 31DQ487030
TYROBPTYRO protein tyrosine kinase binding protein1EF118738^
U2AF1U2 small nuclear RNA auxiliary factor 11FJ415609
UACAuveal autoantigen with coiled-coil domains and ankyrin repeats1EF088289
UBA52ubiquitin A-52 residue ribosomal protein fusion product 11AY842446*
USP25ubiquitin specific peptidase 251DQ418777
UTYubiquitously transcribed tetratricopeptide repeat gene, Y-linked1AY936541
VEGFAvascular endothelial growth factor A1AM237592
YWHAZtyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation protein, zeta polypeptide1GU324299
ZFXzinc finger protein, X-linked2Y08942 X99827#
ZFYzinc finger protein, Y-linked5DQ336545 DQ336555 AY928827 X99826 GQ259330


* Entry is mRNA with complete coding sequence for the gene;

^ Entry is genomic DNA with complete coding sequence for the gene;

# Entry is a pseudogene;

**Specific gene symbols are used in water buffalo;

@Entry belongs a gene cluster.

5. Conclusions

  1. As described above, the water buffalo community has been working hard to generate genome resources in terms of cytogenetic characterization, whole genome mapping and whole genome sequencing. Certainly, genome science comes of age in this important animal species.
  2. The vast amount of genetic/genomic resources for cattle research has served as shortcuts for the water buffalo community to initiate genome science in the species. As bovine genome resources have advanced significantly during the last thirty years, so has that of water buffalo.
  3. Completion of the studies reported herein will help develop essential genome technologies for breeders and farmers to improve reproductive performance and production potential in water buffalo.


This work was supported by USDA/FAS grant BIO12-001-009 to Z.J.

Conflict of Interest

The authors have declared that no conflict of interest exists.


1. Kierstein G. et al. Analysis of mitochondrial D-loop region casts new light on domestic water buffalo (Bubalus bubalis) phylogeny. Mol Phylogenet Evol. 2004;30(2):308-24

2. Bubalus bubalis 2004. Roth J and Myers P.

3. Yindee M. et al. Y-chromosomal variation confirms independent domestications of swamp and river buffalo. Anim Genet. 2010 [Epub ahead of print]

4. FAO. Water Buffalo: an asset undervalued, F.R.O.f.A.a. Bangkok, Thailand: Pacific Editor. 2000:1-6

5. Iannuzzi L, Di Meo G. Water Buffalo. In: (ed.) Cockett NE, Kole C. Genome Mapping and Genomics in Domestic Animals. Berlin Herdelberg, Germany: Springer-Verlag. 2009

6. Council N.R. The water buffalo: new prospects for an underutilized animal. Washington DC: National Academy Press. 1981

7. Cockrill W.R. The water buffalo: a review. Br Vet J. 1981;137(1):8-16

8. Chen Y.C, Li X.H. New evidence of the origin and domestication of the Chinese swamp buffalo (Bubalus bubalis). Buffalo Journal. 1989;1:51-55

9. Kumar S. et al. Phylogeography and domestication of Indian river buffalo. BMC Evol Biol. 2007;7:186

10. Kumar S. et al. Mitochondrial DNA analyses of Indian water buffalo support a distinct genetic origin of river and swamp buffalo. Anim Genet. 2007;38(3):227-32

11. Buchholz C. Cattle. In: (ed.) Parker S.P. Grzimek's Encyclopedia of Mammals. New York: McGraw-Hill Publishing Company. 1990:360-417

12. Amaral M.E. et al. A first generation whole genome RH map of the river buffalo with comparison to domestic cattle. Genomics BMC. 2008;9:631

13. Gallagher DSJr, Womack JE. Chromosome conservation in the Bovidae. Hered J. 1992;83(4):287-98

14. Iannuzzi L. et al. Comparative FISH mapping in river buffalo and sheep chromosomes: assignment of forty autosomal type I loci from sixteen human chromosomes. Cytogenet Cell Genet. 2001;94(1-2):43-8

15. Iannuzzi L, King W.A, Di Berardino D. Chromosome evolution in domestic bovids as revealed by chromosome banding and FISH-mapping techniques. Cytogenet Genome Res. 2009;126(1-2):49-62

16. Gallagher DSJr. et al. Applications of chromosomal fish in the Bovidae with emphases on physical mapping in domestic cattle and comparative cytogenetic analyses of the tribe Bovini. Anim Biotechnol. 1999;10(3):105-8

17. Barker J.S. et al. Genetic diversity of Asian water buffalo (Bubalus bubalis): microsatellite variation and a comparison with protein-coding loci. Anim Genet. 1997;28(2):103-15

18. Ritz L.R. et al. Phylogenetic analysis of the tribe Bovini using microsatellites. Anim Genet. 2000;31(3):178-85

19. Kumar S. et al. Genetic variation and relationships among eight Indian riverine buffalo breeds. Mol Ecol. 2006;15(3):593-600

20. Lei C.Z. et al. Independent maternal origin of Chinese swamp buffalo (Bubalus bubalis). Anim Genet. 2007;38(2):97-102

21. Di Meo G.P. et al. An extended river buffalo (Bubalus bubalis, 2n = 50) cytogenetic map: assignment of 68 autosomal loci by FISH-mapping and R-banding and comparison with human chromosomes. Chromosome Res. 2008;16(6):827-37

22. Amaral M.E. et al. Construction of a river buffalo (Bubalus bubalis) whole-genome radiation hybrid panel and preliminary RH mapping of chromosomes 3 and 10. Anim Genet. 2007;38(3):311-4

23. Goldammer T. et al. A radiation hybrid map of river buffalo (Bubalus bubalis) chromosome 7 and comparative mapping to the cattle and human genomes. Cytogenet Genome Res. 2007;119(3-4):235-41

24. Ianella P. et al. First radiation hybrid map of the river buffalo X chromosome (BBUX) and comparison with BTAX. Anim Genet. 2008;39(2):196-200

25. Miziara M.N. et al. A radiation hybrid map of river buffalo (Bubalus bubalis) chromosome 1 (BBU1). Cytogenet Genome Res. 2007;119(1-2):100-4

26. Stafuzza N.B. et al. Preliminary radiation hybrid map for river buffalo chromosome 6 and comparison to bovine chromosome 3. Anim Genet. 2007;38(4):406-9

27. Stafuzza N.B. et al. Comparative RH maps of the river buffalo and bovine Y chromosomes. Cytogenet Genome Res. 2009;126(1-2):132-8

28. Tanaka K. et al. Characterization and chromosomal distribution of satellite DNA sequences of the water buffalo (Bubalus bubalis). Hered J. 1999;90(3):418-22

29. Pathak D. et al. Chromosomal localization, copy number assessment, and transcriptional status of BamHI repeat fractions in water buffalo Bubalus bubalis. DNA Cell Biol. 2006;25(4):206-14

30. Srivastava J. et al. Transcriptional status of known and novel genes tagged with consensus of 33.15 repeat loci employing minisatellite-associated sequence amplification (MASA) and real-time PCR in water buffalo, Bubalus bubalis. DNA Cell Biol. 2006;25(1):31-48

31. Metzker M.L. Emerging technologies in DNA sequencing. Genome Res. 2005;15(12):1767-76

32. Jiang Z, Rokhsar D.S, Harland R.M. Old can be new again: HAPPY whole genome sequencing, mapping and assembly. Int J Biol Sci. 2009;5(4):298-303

33. Voelkerding K.V, Dames S.A, Durtschi J.D. Next-generation sequencing: from basic research to diagnostics. Clin Chem. 2009;55(4):641-58

34. Jayaraman K.S. Indian Buffalo at Crossroads of Science. 2009

Author contact

Corresponding address Corresponding author: Dr. Zhihua Jiang, Department of Animal Sciences, Washington State University, Pullman, WA 99164 - 6351. Tel: +509 335 8761; Fax: +509 335 4246; E-mail: jiangzedu

Received 2010-3-29
Accepted 2010-6-14
Published 2010-6-17