recognition problem as a multi-task learning problem, where the non-rigid facial muscle
motion (mainly the first 17 AUs) and the rigid head motion (the last 6 AUs) are modeled
separately. The co-occurrence of the expression features and the head pose features are
explored. We observe that different AUs converge at various speed. By choosing the optimal
checkpoint for each AU, the recognition results are improved. We are able to obtain a final …