The optical network integrated computing environment has been thought of as a promising technology to support large-scale data-intensive distributed computing applications. For such an environment involving so many heterogeneous resources, such as high-performance processors and optical links, faults seem to be inevitable. The faults will lead to the failure of the applications or highly delay the applications' finish times. Therefore, it is necessary to analyze resources' fault probability and then to better schedule the tasks of the application onto the appropriate resources so as to minimize the fault probability of the application. We address the task-scheduling problem based on the fault probability analysis for distributed computing applications over an optical network. We quantitatively analyze the fault probability of the processors and optical links in a given interval and propose a minimal fault probability (MFP) task-scheduling algorithm to minimize the fault probability of the application. We develop a simulator to evaluate the performance of the MFP algorithm. The simulation results prove the efficiency of the MFP algorithm.
© 2008 Optical Society of America
Original Manuscript: April 29, 2008
Revised Manuscript: August 5, 2008
Manuscript Accepted: September 26, 2008
Published: October 31, 2008
Wei Guo, Zheng Liang, Zhenyu Sun, Shilin Xiao, Yaohui Jin, Weiqiang Sun, and Weisheng Hu, "Task scheduling considering fault probability for distributed computing applications over an optical network," J. Opt. Netw. 7, 947-957 (2008)