2015
Accelerating soybean yield and composition improvement through genomic selection
Category:
Sustainable Production
Keywords:
GeneticsGenomics
Parent Project:
This is the first year of this project.
Lead Principal Investigator:
Brian Diers, University of Illinois at Urbana-Champaign
Co-Principal Investigators:
William Beavis, Iowa State University
Asheesh Singh, Iowa State University
Katy M Rainey, Purdue University
Patrick Brown, University of Illinois at Urbana-Champaign
Matthew Hudson, University of Illinois at Urbana-Champaign
Randall Nelson, University of Illinois at Urbana-Champaign
George Graef, University of Nebraska
Aaron Lorenz, University of Nebraska
James Specht, University of Nebraska
+8 More
Project Code:
Contributing Organization (Checkoff):
Institution Funded:
Brief Project Summary:

Both growers and breeders are frustrated that soybean yields are not increasing as rapidly as yields in maize. The goal of this project is to increase the rate of genetic improvement for yield and protein by developing a new breeding strategy for soybean called Genomic Selection (GS). GS allows breeders to predict high yield-potential crosses and predict the performance of potential varieties by using thousands of genetic markers weighted based on associations with yield or other economically valuable traits. The aim of this project is to use GS to both increase the rate of yield improvement and maintain compositional quality.

GS methods in soybean will be developed through leveraging...

Unique Keywords:
#breeding & genetics, #soybean genetic mapping, #soybean germplasm screening
Information And Results
Project Deliverables

1. Response surface analyses were used to determine the optimal conditions for distinguishing among genomic selection methods.
2. R code has been developed for generating predicted distributions of recombinant inbred lines from any arbitrary pairs of potential parental crosses
3. Genetic marker analysis was completed for 5630 experimental lines from 26 populations using genotyping by sequencing (GBS), which is a new low cost, high throughput marker system. The GBS generated 480 billion base pairs of sequence data resulting in an average of 6,652 markers per line.
4. The marker data were combined with genomic selection models to predict the highest yielding lines and those that combine yield and good protein composition. In addition, lines were selected based on yield in non-replicated trials in 2014. Random lines were also obtained to provide a base set of non-selected lines.
5. The selected and random lines assigned to field trials and planted in spring 2015. The trials consisted of 2,090 plots grown in each of four locations (Illinois, Indiana, Iowa, and Nebraska).
6. The plots were evaluated for maturity, plant height, plant lodging, and harvested to measure grain yield. The data from these tests will be compiled and analyzed to contrast different selection methods.

Final Project Results

1. Response surface analyses showed that the largest differences among genomic selection methods are revealed when all phenotypic variability is due to genetic architecture consisting of epistasis. Since there were no differences among genomic selection methods applied to the SoyNAM population it is most likely that the genetic architecture is additive and any genomic selection method can be used for building a predictive model. The most accessible method is RRBLUP implemented in R.
2. Evaluation of accuracy between predicted and observed yield values revealed that it was preferable to combine all data across the SoyNAM families into a single prediction model. Compared to single-family models, this approach increases predictive ability for yield by 44%. The within-family prediction accuracies ranged from 0.19 to 0.62. When testing the ability to predict between and within families simultaneously, predictive abilities for yield were about 0.75.
3. Information from the genomic selection evaluations were then applied to select lines from active breeding populations. These breeding populations came from the University of Illinois, University of Nebraska, and Purdue University programs. During the summer of 2014, the project collaborators tested 7,500 experimental lines from 26 populations in non-replicated trials. These tests were grown in Illinois, Iowa, Indiana and Nebraska. The lines were rated for maturity, plant height and plant lodging and they were harvested to measure seed yield and to provide seed for 2015 tests.
4. A total of 5630 experimental lines from the 26 populations were genotyped using the GBS technology. The sequence data consisted of 4.8 billion good sequencing reads resulting in the production of about 480 billion base pairs of data. Individual lines were scored with 4,693 to 9,007 segregating markers with an average of 6,652 markers. The genetic marker and yield data were used to select lines in the following categories:
• Greatest yield based on marker predictions.
• Greatest yield combined with acceptable protein levels based on marker predictions.
• Greatest yield based only on yield in the 2014 tests.
• Greatest yield based on a combination of marker predictions and 2014 test results.
• Random lines in each population.
5. 1998 lines were selected according the categories above and organized into field tests. These lines, together with parents and check varieties were planted in non-replicated yield trials consisting of a total of 2090 plots at each of four locations. These plots were evaluated for plant maturity, height, lodging seed yield. The results will be compared with the predicted values during the winter of 2016.

The United Soybean Research Retention policy will display final reports with the project once completed but working files will be purged after three years. And financial information after seven years. All pertinent information is in the final report or if you want more information, please contact the project lead at your state soybean organization or principal investigator listed on the project.