The Peanut Genome Project was launched in 2012 and a genomics symposium was held at the 2012 annual meeting of the American Peanut Research and Education Society. Seven speakers presented a spectrum of topics covering peanut molecular tools and materials to which they have been applied, along with the challenges and benefits of a genome sequence to prebreeding and breeding of cultivated peanut. Highlights of the symposium are presented and are accompanied by three in-depth reviews of population development, utilization of wild species, and genetic mapping in Arachis.

Genomics, the science of analyzing structure and function of the complete set of DNA in an organism, has been applied to plants since sequencing of the Arabidopsis (Arabidopsis thaliana (L.) Heynh.) genome was initiated (Kaul et al., 2000). Although a weed, Arabidopsis was a logical first candidate for DNA sequencing given its small genome size (125 Mb) and use as a genetic model due to small plant stature, rapid life cycle, and rich genetic resources (Ecker, 1998). It also is a member of a plant family, Brassicaceae, which contains important crop plants, namely rapeseed (Brassica napus L.) and cole crops. For Arabidopsis, research long ago entered the post-genomics era of transcriptomics, proteomics and metabolomics. The first crop plant genome to be sequenced was rice (Oryza sativa L.) because of its global importance and relatively small genome size (420 Mb) (Goff et al., 2002; Yu et al., 2002). With advances in sequencing technology and decreases in the cost per base pair of sequence (Zhao and Grant, 2011), genomics for crop species with larger genomes has become accessible. Most notably, the cereal crops maize (Zea mays L.) and barley (Hordeum vulgare L.), with genome sizes of 2.3 and 5.1 Gb, respectively, and the legume crop soybean (Glycine max (L.) Merr.) (1.1 Gb) have now been sequenced (Mayer et al., 2012; Schmutz et al., 2010; Schnable et al., 2009), as have several model and minor legumes (Cannon et al., 2009; Varshney et al., 2012; Varshney et al., 2013). At last, peanut, Arachis hypogaea L., is on the radar scope for whole genome sequencing, through an international effort coordinated by the Peanut Foundation and substantially funded by the U.S. peanut industry (Peanutbioscience, 2012). The history of this effort can be found in Guo et al. (2013).

The cultivated peanut genome will be challenging to assemble because of its size (2.8 Gb) and complexity (polyploidy). Peanut is an allotetraploid derived from the hybridization of two progenitor diploid species, most likely A. duranensis Krapov. & W.C.Greg. and A. ipaensis Krapov. & W.C.Greg. (Kochert et al., 1996). The tetraploidization event imposed a domestication bottleneck thereby constraining genetic diversity within cultivated peanut. Furthermore, the two progenitor genomes are diverged from one another by only 3–3.5 million years (Moretzsohn et al., 2013; Nielen et al., 2012); therefore, their DNA sequences have high levels of similarity, adding to the difficulty for assembly of a tetraploid sequence that is essentially a merger of the two diploid genomes. In anticipation of the assembly difficulty, the two diploid progenitor genomes are also being sequenced.

With the 2012 launch of genome sequencing of the reference genotype ‘Tifrunner’ (Holbrook and Culbreath, 2007) and the two progenitor diploids, APRES became a preferred venue to describe to the larger peanut community the broader constellation of research and advancements expected in the near future through a symposium entitled, “The orphan legume genome whose time has come”.

The assembly challenges for the peanut genome were illustrated by Scott Jackson (Jackson, 2012), University of Georgia, through an analogy to variants of Vincent Van Gogh's “Sunflowers”. While advances in sequencing technologies have made DNA sequencing more affordable, they also have been based on output of shorter sequences which are more difficult to computationally piece back together in proper order. Imperfect reconstruction of the tetraploid peanut genome is expected when the two subgenomes are so similar to one another that many sequence regions from each will collapse into one (Jackson et al., 2011). While many other crops also are polyploid, their subgenomes are more evolutionarily distant than those of peanut, e.g., cotton (7–8 million years ago) and soybean (13 million years ago) (Jackson and Chen, 2010; Schmutz et al., 2010), and are thus more easily parsed during genome assembly (Schlueter et al., 2007). The prediction is that assemblies of the diploid Arachis progenitor genome sequences will guide assembly of the tetraploid peanut genome sequence. To this end, Lutz Froenicke, UC Davis, provided an update on sequencing of the gene space of A. duranensis and A. ipaensis (Froenicke, 2012).

What is the anticipated impact of a peanut genome sequence on the peanut industry? The peanut industry encompasses growers, shellers, and manufacturers who provide products for consumers. The U.S. peanut industry can claim the highest productivity, highest quality peanut product in the world (FAOSTAT, 2013; American Peanut Council, 2011). In order to maintain that claim while remaining economically competitive, the industry advocates increased yields with concurrent decreased production costs that can be attained in part through genetic improvement and management of pests and diseases with host plant resistance (Valentine, 2012). Nutritional and processing quality must likewise be maintained as productivity increases. Peanut cultivar improvement through breeding began in the early part of the 20th century, and as a result, yield gains due to genetic gain have been impressive, yet gains have not kept pace with corn, cotton, and soybean. Through collective improvements in genetics and cultural practices, average U.S. production increased from 964 kg/ha in 1940 to 4695 kg/ha in 2012 (Holbrook et al., 2013a). A peanut genome sequence will enrich the set of molecular markers that can be applied toward more rapid genetic selection for multiple target traits including yield, biotic and abiotic stress tolerance, and seed quality. For example, the soybean genome sequence led to the discovery of more than 50,000 genetic markers whereas marker numbers had remained in the hundreds prior to sequencing. While U.S. breeders are focused primarily on traits of importance to U.S. markets that affect the economics of production (the estimated annual loss in revenue due to pests and diseases in Georgia alone is more than $100 million (Williams-Woodward 2013)), enriching molecular tools for peanut breeding will have global impact on peanut improvement, leading to a healthier and even more nutritious, protein- and calorie-rich product to help alleviate malnutrition in developing countries.

In order to progress from genome sequence to translational genomics, sequence variation must be associated with specific traits of interest. With advances in sequencing technologies, it is now quicker to generate a genome sequence than to develop and analyze populations segregating for those traits of interest. Corley Holbrook summarizes the current status of U.S. efforts to generate mapping populations segregating for multiple disease resistances as well as productivity and quality traits (Holbrook et al., 2012, 2013b). Reliable phenotyping of populations is essential for association of markers with traits, and phenotyping requires replicated testing over multiple years and ultimately in multiple environments. Therefore, the time frame and labor intensiveness of phenotyping emerges as the bottleneck for translational genomics. The Peanut Genome Project has incorporated a significant component for phenotyping (Table 1, component 5), recognizing that the value of genome sequence will be realized by associating sequences with phenotypes. Some of this value already has been extracted through smaller-scale genetic mapping projects. Baozhu Guo (Guo et al., 2012, 2013) presents an update on genetic mapping of recombinant inbred line (RIL) populations of peanut which illustrates the advances made in marker development, from restriction fragment length polymorphisms (RFLPs) and random amplified polymorphic DNAs (RAPDs) in the 1990s to expressed sequence tag-simple sequence repeats (EST-SSRs) in recent years. Even with the now thousands of molecular markers available for cultivated peanut, the polymorphism information content for most is very low and often only 5–10% of markers screened are polymorphic between any one parental pair. Adopting single nucleotide polymorphisms (SNPs) as markers and expanding marker discovery by whole genome sequencing and resequencing will catapult peanut translational genomics tools to parity with major crops.

Table 1.

Components of the Peanut Genome Project Action Plan (www.peanutbioscience.com).

Components of the Peanut Genome Project Action Plan (www.peanutbioscience.com).
Components of the Peanut Genome Project Action Plan (www.peanutbioscience.com).

One essential component to facilitate molecular breeding is a toolbox enabling access to genomic and phenotypic information along with marker associations that can be easily implemented for marker-assisted breeding. Steven Cannon, USDA-ARS, presented a vision for peanut modelled on existing informatics platforms for other legumes such as SoyBase (soybase.org) and the Legume Information System (LIS; www.comparative-legumes.org) (Cannon 2012). The latter was built with the knowledge, initially from genetic maps, that a high degree of synteny or colinearity (gene position and order) often exists between orthologous chromosome regions of closely related species. Thus, comparative genomics can be exploited to search for candidate genes underlying a specific phenotype that may occur across species or to predict gene function (Young and Bharti, 2012). The web portal, PeanutBase.org, now is online with minimal content but is expected to grow rapidly as the peanut genome project matures.

Finally, the breadth of the peanut genome project to encompass wild diploid as well as cultivated tetraploid species, paralleled by renewed interest in utilizing diploid genetic resources to mine for allelic diversity (Bertioli et al., 2011; Stalker et al. 2012, 2013), will promote the use of wild genetic resources in breeding through introgression pathways (Favero et al., 2006; Simpson, 2001). Tom Stalker, North Carolina State University, reviewed the status of wild-species germplasm collections, resistance attributes, taxonomic relationships, crossability, and molecular variation (Stalker et al. 2012, 2013). Molecular variation between cultivated and wild species is much greater than between accessions within the cultivated species; therefore, genome sequence will obtain highly effective application for monitoring introgression of wild chromosome segments in breeding (Chu et al., 2011; Nagy et al., 2010).

Detailed papers originating from three speakers in the Symposium are published in this issue and recap the utilization of wild Arachis species in breeding (Stalker et al. 2013), population development in cultivated peanut (Holbrook et al. 2013b) and the history of the Peanut Genome Project along with the status of genetic mapping in peanut (Guo et al. 2013). Peanut now has “graduated” from a position of orphan crop to one that soon will be replete with genetic and genomic resources (Varshney et al. 2013).

American Peanut Council
.
2011
.
U.S. quality control and research
.
http:// www.peanutsusa.com (verified August 28, 2013)
.
Bertioli
,
D.J
.,
G
.
Seijo
,
F.O
.
Freitas
,
J.F.M
.
Valls
,
S.C.M
.
Leal-Bertioli
,
and
M.C
.
Moretzsohn
.
2011
.
An overview of peanut and its wild relatives
.
Plant Genetic Resources-Characterization and Utilization
9
:
134
149
.
Cannon
,
S.B
.,
G.D
.
May
,
and
S.A
.
Jackson
.
2009
.
Three sequenced legume genomes and many crop species: Rich opportunities for translational genomics
.
Plant Physiol.
151
:
970
977
.
Cannon
,
S.B
.
2012
.
Bioinformatics resources for crop improvement in peanut and across the legumes
.
Proc. Amer. Peanut Res. Educ. Soc.
44
:
57
.
Chu
,
Y
.,
C.L
.
Wu
,
C.C
.
Holbrook
,
B.L
.
Tillman
,
G
.
Person
,
and
P
.
Ozias-Akins
.
2011
.
Marker-assisted selection to pyramid nematode resistance and the high oleic trait in peanut
.
The Plant Genome
4
:
110
117
.
Ecker
,
J.R
.
1998
.
Genome sequencing - genes blossom from a weed
.
Nature
391
:
438
439
.
FAOSTAT
.
2013
.
Food and Agriculture Organization of the United Nations Crop Production Database
.
http://www.faostat.fao.org (verified August 28, 2013)
.
Favero
,
A.P
.,
C.E
.
Simpson
,
J.F.M
.
Valls
,
and
N.A
.
Vello
.
2006
.
Study of the evolution of cultivated peanut through crossability studies among Arachis ipaensis, A. duranensis, and A. hypogaea
.
Crop Sci.
46
:
1546
1552
.
Froenicke
,
L
.,
C
.
Beitel
,
D
.
Scaglione
,
R.W
.
Michelmore
,
D
.
Bertioli
,
M.C
.
Moretzsohn
,
P
.
Guimaraes
,
S.C.M
.
Leal-Bertioli
,
M
.
Pandey
,
H.D
.
Upadhyaya
,
and
R.K
.
Varshney
.
2012
.
Generation of ultra-dense genetic maps for the A and B genomes of peanut
.
Proc. Amer. Peanut Res. Educ. Soc.
44
:
58
59
.
Goff
,
S.A
.,
D
.
Ricke
,
T.H
.
Lan
,
G
.
Presting
,
R.L
.
Wang
,
et al
.
2002
.
A draft sequence of the rice genome (Oryza sativa L. ssp. japonica)
.
Science
296
:
92
100
.
Guo
,
B.Z
.,
M.K
.
Pandey
,
A.K
.
Culbreath
,
and
R.K
.
Varshney
.
2012
.
Recent advances in molecular genetic linkage maps of cultivated peanut (Arachis hypogaea L.)
.
Proc. Amer. Peanut Res. Educ. Soc.
44
:
58
.
Guo
,
B
.,
M
.
Pandey
,
G
.
He
,
X
.
Zhang
,
B
.
Liao
,
A
.
Culbreath
,
R
.
Varshney
,
V
.
Nwosu
,
R
.
Wilson
,
and
T
.
Stalker
. .
Recent advances in molecular genetic linkage maps of cultivated peanut
.
Peanut Sci.
40
:
In press
.
Holbrook
,
C.C
.
and
A.K
.
Culbreath
.
2007
.
Registration of ‘Tifrunner’ peanut
.
J. Plant Registr.
1
:
124
.
Holbrook
,
C.C
.,
T.G
.
Isleib
,
P
.
Ozias-Akins
,
Y
.
Chu
,
S.J
.
Knapp
,
B
.
Tillman
,
C.L
.
Wu
,
B
.
Guo
,
R
.
Gill
,
and
M.D
.
Burow
.
2012
.
Development and phenotyping of recombinant inbred line (RIL) populations
.
Proc. Amer. Peanut Res. Educ. Soc.
44
:
56
57
.
Holbrook
,
C.C
.,
T.B
.
Brenneman
,
H.T
.
Stalker
,
Johnson
,
W.C
. III,
P
.
Ozias-Akins
,
Y
.
Chu
,
G
.
Vellidis
,
and
D
.
McClusky
.
2013a
.
Yield advances in peanut
.
In:
J
.
Specht
,
B
.
Diers
,
B
.
Carver
,
and
S
.
Smith
(
eds.
).
Yield Gains in Major U.S. Field Crops
.
CSSA
,
Madison, WI
, (
In press
).
Holbrook
,
C
.,
T
.
Isleib
,
P
.
Ozias-Akins
,
Y
.
Chu
,
S
.
Knapp
,
B
.
Tillman
,
B
.
Guo
,
and
R
.
Gill
.
2013b
.
Development and phenotyping of recombinant inbred line (RIL) populations for peanut
.
Peanut Sci.
40
In press
.
Jackson
,
S
.
and
Z.J
.
Chen
.
2010
.
Genomic and expression plasticity of polyploidy
.
Curr. Opin. Plant Biol.
13
:
153
159
.
Jackson
,
S.A
.,
A
.
Iwata
,
S.H
.
Lee
,
J
.
Schmutz
,
and
R
.
Shoemaker
.
2011
.
Sequencing crop genomes: Approaches and applications
.
New Phytol.
191
:
915
925
.
Jackson
,
S.A
.
2012
.
Genome sequences in Arachis
.
Proc. Amer. Peanut Res. Educ. Soc.
44
:
56
.
Kaul
,
S
.,
H.L
.
Koo
,
J
.
Jenkins
,
M
.
Rizzo
,
T
.
Rooney
,
et al
.
2000
.
Analysis of the genome sequence of the flowering plant Arabidopsis thaliana
.
Nature
408
:
796
815
.
Kochert
,
G
.,
H.T
.
Stalker
,
M
.
Gimenes
,
L
.
Galgaro
,
C.R
.
Lopes
,
and
K
.
Moore
.
1996
.
RFLP and cytogenetic evidence on the origin and evolution of allotetraploid domesticated peanut, Arachis hypogaea (Leguminosae)
.
Amer. J. Bot.
83
:
1282
1291
.
Mayer
,
K.F.X
.,
R
.
Waugh
,
P
.
Langridge
,
T.J
.
Close
,
R.P
.
Wise
,
and
Intl. Barley Genome Sequencing Consortium
.
2012
.
A physical, genetic and functional sequence assembly of the barley genome
.
Nature
491
:
711
716
.
Moretzsohn
,
M.C
.,
E.G
.
Gouvea
,
P.W
.
Inglis
,
S.C.M
.
Leal-Bertioli
,
J.F.M
.
Valls
,
and
D.J
.
Bertioli
.
2013
.
A study of the relationships of cultivated peanut (Arachis hypogaea) and its most closely related wild species using intron sequences and microsatellite markers
.
Ann. Bot.
111
:
113
126
.
Nagy
,
E.D
.,
Y
.
Chu
,
Y.F
.
Guo
,
S
.
Khanal
,
S.X
.
Tang
,
Y
.
Li
,
W.B.B
.
Dong
,
P
.
Timper
,
C
.
Taylor
,
P
.
Ozias-Akins
,
C.C
.
Holbrook
,
V
.
Beilinson
,
N.C
.
Nielsen
,
H.T
.
Stalker
,
and
S.J
.
Knapp
.
2010
.
Recombination is suppressed in an alien introgression in peanut harboring Rma, a dominant root-knot nematode resistance gene
.
Mol. Breed.
26
:
357
370
.
Nielen
,
S
.,
B.S
.
Vidigal
,
S.C.M
.
Leal-Bertioli
,
M
.
Ratnaparkhe
,
A.H
.
Paterson
,
O
.
Garsmeur
,
A
.
D'hont
,
P.M
.
Guimaraes
,
and
D.J
.
Bertioli
.
2012
.
Matita, a new retroelement from peanut: Characterization and evolutionary context in the light of the Arachis A-B genome divergence
.
Mol. Genet. Genom.
287
:
21
38
.
Peanutbioscience
.
2013
.
The peanut genome project
. .
Schlueter
,
J.A
.,
J.Y
.
Lin
,
S.D
.
Schlueter
,
I.F
.
Vasylenko-Sanders
,
S
.
Deshpande
,
J
.
Yi
,
M
.
O'bleness
,
B.A
.
Roe
,
R.T
.
Nelson
,
B.E
.
Scheffler
,
S.A
.
Jackson
,
and
R.C
.
Shoemaker
.
2007
.
Gene duplication and paleopolyploidy in soybean and the implications for whole genome sequencing
.
BMC Genomics
8
:
330
.
Schmutz
,
J
.,
S.B
.
Cannon
,
J
.
Schlueter
,
J.X
.
Ma
,
T
.
Mitros
,
et al
.
2010
.
Genome sequence of the palaeopolyploid soybean
.
Nature
465
:
178
183
.
Schnable
,
P.S
.,
D
.
Ware
,
R.S
.
Fulton
,
J.C
.
Stein
,
F
.
Wei
,
et al
.
2009
.
The B73 maize genome: Complexity, diversity, and dynamics
.
Science
326
:
1112
1115
.
Simpson
,
C.E
.
2001
.
Use of wild Arachis species/introgression of genes into A. hypogaea L
.
Peanut Sci.
28
:
114
116
.
Stalker
,
H.T
.,
S.P
.
Tallury
,
P
.
Ozias-Akins
,
D.A
.
Bertioli
,
and
S.C.M
.
Leal-Bertioli
.
2012
.
The value of diploid peanut relatives for breeding and genomics
.
Proc. Amer. Peanut Res. Educ. Soc.
44
:
57
58
.
Stalker
,
H.T
.,
S.P
.
Tallury
,
P
.
Ozias-Akins
,
D
.
Bertioli
,
and
S.C
.
Leal Bertioli
.
2013
.
The value of diploid peanut relatives for breeding and genomics
.
Peanut Sci.
40
:
In press
.
Valentine
,
H
.
2012
.
Potential economic impact of the peanut genomic project (PGP)
.
Proc. Amer. Peanut Res. Educ. Soc.
44
:
56
.
Varshney
,
R.K
.,
W
.
Chen
,
Y
.
Li
,
A.K
.
Bharti
,
R.K
.
Saxena
,
et al
.
2012
.
Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop of resource-poor farmers
.
Nature Biotech.
30
:
83
89
.
Varshney
,
R.K
.,
C
.
Song
,
R.K
.
Saxena
,
S
.
Azam
,
S
.
Yu
,
et al
.
2013
.
Draft genome sequence of chickpea (Cicer arietinum) provides a resource for trait improvement
.
Nature Biotech.
31
:
240
246
.
Varshney
,
R.K
.,
S.M
.
Mohan
,
P.M
.
Gaur
,
N.V.P.R
.
Gangarao
,
M.K
.
Pandey
,
et al
.
2013
.
Achievements and prospects of genomics-assisted breeding in three legume crops of the semi-arid tropics
. .
Williams-Woodward
,
J
.
2013
.
Georgia plant disease loss estimates 2011
.
University of Georgia Coop. Exten. Annu. Publ.
102-4
.
Young
,
N.D
.
and
A.K
.
Bharti
.
2012
.
Genome-enabled insights into legume biology
.
Annu. Rev. Plant Biol.
63
:
283
305
.
Yu
,
J
.,
S.N
.
Hu
,
J
.
Wang
,
G.K.S
.
Wong
,
S.G
.
Li
,
et al
.
2002
.
A draft sequence of the rice genome (Oryza sativa L. ssp. indica)
.
Science
296
:
79
92
.
Zhao
,
J
.
and
S.F.A
.
Grant
.
2011
.
Advances in whole genome sequencing technology
.
Curr. Pharm. Biotech.
12
:
293
305
.