TY - JOUR
T1 - Type-2 Fuzzy Topic Models for Human Action Recognition
AU - Cao, Xiao-Qin
AU - Liu, Zhi-Qiang
PY - 2015/10
Y1 - 2015/10
N2 - Following the "bag-of-words" representation for video sequences, we propose novel type-2 fuzzy topic models (T2 FTM) to recognize human actions. In traditional topicmodels (TM) for visual recognition, each video sequence is modeled as a "document" composed of spatial-temporal interest points called visual words. Topic models automatically assign a "topic" label to explain the action category of each word so that each video sequence becomes a mixture of action topics for recognition. Our T2 FTM differs from previous TM in that it uses type-2 fuzzy sets to encode the higher order uncertainty of each topic. We can use the primary membership function (MF) to measure the degree of uncertainty that a document or a visual word belongs to a specific action topic, and use the secondary MF to evaluate the fuzziness of the primary MF itself. In this paper, we implement two T2 FTM: 1) interval T2 FTM with all secondary grades equal one, and 2) vertical-slice T2 FTM with unequal secondary grades based on our prior knowledge. To estimate parameters in T2 FTM, we derive the efficient message-passing algorithms. Experiments on KTH, Weizmann, UCF, and Hollywood2 human action datasets demonstrate that T2 FTM performs better than other state-of-the-art topic models for human action recognition.
AB - Following the "bag-of-words" representation for video sequences, we propose novel type-2 fuzzy topic models (T2 FTM) to recognize human actions. In traditional topicmodels (TM) for visual recognition, each video sequence is modeled as a "document" composed of spatial-temporal interest points called visual words. Topic models automatically assign a "topic" label to explain the action category of each word so that each video sequence becomes a mixture of action topics for recognition. Our T2 FTM differs from previous TM in that it uses type-2 fuzzy sets to encode the higher order uncertainty of each topic. We can use the primary membership function (MF) to measure the degree of uncertainty that a document or a visual word belongs to a specific action topic, and use the secondary MF to evaluate the fuzziness of the primary MF itself. In this paper, we implement two T2 FTM: 1) interval T2 FTM with all secondary grades equal one, and 2) vertical-slice T2 FTM with unequal secondary grades based on our prior knowledge. To estimate parameters in T2 FTM, we derive the efficient message-passing algorithms. Experiments on KTH, Weizmann, UCF, and Hollywood2 human action datasets demonstrate that T2 FTM performs better than other state-of-the-art topic models for human action recognition.
KW - Human action recognition
KW - latent Dirichlet allocation (LDA)
KW - message passing
KW - type-2 fuzzy sets (T2 FS)
KW - HIDDEN MARKOV-MODELS
KW - CHARACTER-RECOGNITION
KW - RANDOM-FIELDS
KW - LOGIC
KW - SETS
KW - CLASSIFICATION
KW - CATEGORIES
UR - http://www.scopus.com/inward/record.url?scp=84975318510&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-84975318510&origin=recordpage
U2 - 10.1109/TFUZZ.2014.2370678
DO - 10.1109/TFUZZ.2014.2370678
M3 - RGC 21 - Publication in refereed journal
SN - 1063-6706
VL - 23
SP - 1581
EP - 1593
JO - IEEE Transactions on Fuzzy Systems
JF - IEEE Transactions on Fuzzy Systems
IS - 5
ER -