A cluster of 8 quad-core 64 bit servers and an Agilent G6460 triple-quadrupole mass spectrometer with Agilent MassHunter software were used. First, a MySQL database using text mining techniques for the microarray experiment data was designed. Second, the inaccurate data was removed from the database by developing two novel algorithms: KNN-Delta and Semantic Outlier Factor. Third, thermal gradient focusing technology in the mass spectrometer for ion confinement was implemented. Fourth, conditions inside the mass spectrometer were optimized for efficient ion transmission. Fifth, the ion scattering within the electrospray ionization unit of the mass spectrometer was analyzed to detect areas of high densities. Finally, an inlet capillary to capture ions in the high density areas was designed.
The microarray database was successfully created and was able to automatically annotate over 400,000 experiments using common semantics. The KNN-Delta and Semantic Outlier Factor algorithms were able to increase the accuracy of the database, removing 40,000 inaccurately annotated experiments. The mass spectrometry efficiencies resulted in a signal-to-charge ratio increase from 1000:1 to 6800:1, detecting samples as low as 150 femtomoles.
Based on the successful implementation of the microarray database and the sensitivity improvements of the mass spectrometer, it is possible to detect diseases at small concentrations, before they become lethal.
By developing novel algorithms for a microarray database and improving the sensitivities of a mass spectrometers, we are able to facilitate the early detection of diseases, before they become lethal.
Science Fair Project done By Tony Ho; Ritik Malhotra