Estimated vital signs might include a variety of measurements that can be used in detecting any abnormal conditions by analyzing facial images from continuous monitoring with a thermal video camera. To overcome the limitless human visual perceptions, thermal infrared has proven to be the most effective technique for visualizing facial colour changes that could have been reflected by changes in oxygenation levels and blood volume in facial arteries. This study investigated the possibility of vital signs estimation using physiological function images converted from thermal infrared images in the same ways that visible images are used, with a need for an efficient extractor method as correction procedures that have used datasets that include images with and without wearing glasses or protective face masks. This paper, summarize thermal images using advanced machine learning and deep learning methods with satisfactory performance. Also, we presented the evaluation matrices that were included in the assessment based on statistical analysis, accuracy measures and error measures. Finally, to discuss future gaps and directions for further evaluations.