My research interest is bioinformatics, a unique interdisciplinary area, which is the merger of biotechnology and information technology with the goal of revealing new insights and principles in biology. My current grant, entitled “Facile Production and Efficient Indexing of Transposon-tagged Lines Using Next-generation Sequencing Technology for Maize” funded by NSF, uses the next generation sequencing technology to analyze the maize genome. I have been carried out bioinformatics analysis for sequence assembly, annotation, and mapping of millions of short reads to the maize reference genome. My another project is The Genomics Education Partnership (GEP). The Partnership is funded by the Howard Hughes Medical Institute along with fifty other institutions across the country, with the goal of providing research opportunities for undergraduate students. My research results have been published in very prestigious scientific journals, such as Science, Proceedings of the National Academy of Sciences of the United States of America, Genome Research, BMC Biology, BMC Evolutionary Biology, BMC Genomics, and Molecular Plant-Microbe Interaction.
Sokol Faculty/Student Research Grant
Studied on bioinformatics analysis of cDNA microarray data and experimental design. This project
proves a wonderful opportunity for science informatics students to apply multiple disciplines into
one scientific problem. These students will gain a strong background in biology, biochemistry,
computer science, mathematics, and statistics. There are two students diligently setup the pipeline
for microarray data analysis.
Genome Express System
As more and more plant and animal genome sequences are generated and available to public, many
display and management tools for the tremendous genomic data have been arisen during the past few
years. The Genome Express System (GES) is a genome browser, as well as a database server. GES has
two main components. First is the integrated database and graphic server handling data retrieving,
image drawing, and message passing. The second is a client CGI program accepting web user inputs,
connecting database server, and sending the search results to web users. GES is designed to rapidly
search and display large quantitative data, such as mRNAs, cDNAs, ESTs, BACs, genome annotations,
mutations, genetic maps and microarray signals. The main objectives of GES are to serve as a generic
genome browser for major crop genome data and allow public to carry out complex searches.