UNTAG: Learning Generic Features for Unsupervised Type-Agnostic Deepfake Detection
Mejri, Nesryne ; Ghorbel, Enjie ; Aouada, Djamila
in IEEE International Conference on Acoustics, Speech and Signal Processing. Proceedings (2023)
This paper introduces a novel framework for unsupervised type-agnostic deepfake detection called UNTAG. Existing methods are generally trained in a supervised manner at the classification level, focusing on detecting at most two types of forgeries, thus limiting their generalization capability across different deepfake types. To handle that, we reformulate the deepfake detection problem as a one-class classification supported by a self-supervision mechanism. Our intuition is that by estimating the distribution of real data in a discriminative feature space, deepfakes can be detected as outliers regardless of their type. UNTAG involves two sequential steps. First, deep representations are learned based on a self-supervised pretext task focusing on manipulated regions. Second, a one-class classifier fitted on authentic image embeddings is used to detect deepfakes. The results reported on several datasets show the effectiveness of UNTAG and the relevance of the proposed new paradigm. The code is publicly available.

You Can Dance! Generating Music-Conditioned Dances on Real 3D Scans.
Dupont, Elona ; Singh, Inder Pal ; et al
Scientific Conference (2023)

IML-GCN: Improved Multi-Label Graph Convolutional Network for Efficient yet Precise Image Classification
Singh, Inder Pal ; Oyedotun, Oyebade ; Ghorbel, Enjie et al
in AAAI-22 Workshop Program-Deep Learning on Graphs: Methods and Applications (2022, February)
In this paper, we propose the Improved Multi-Label Graph Convolutional Network (IML-GCN) as a precise and efficient framework for multi-label image classification. Although previous approaches have shown great performance, they usually make use of very large architectures. To handle this, we propose to combine the small version of a newly introduced network called TResNet with an extended version of Multi-label Graph Convolution Networks (ML-GCN), thereby ensuring the learning of label correlation while reducing the size of the overall network. The proposed approach considers a novel image feature embedding instead of using word embeddings. In fact, the latter are learned from words and not images, making them inadequate for the task of multi-label image classification. Experimental results show that our framework competes with the state-of-the-art on two multi-label image benchmarks in terms of both precision and memory requirements.

MULTI LABEL IMAGE CLASSIFICATION USING ADAPTIVE GRAPH CONVOLUTIONAL NETWORKS (ML-AGCN)
Singh, Inder Pal ; Ghorbel, Enjie ; Oyedotun, Oyebade et al
in IEEE International Conference on Image Processing (2022)
In this paper, a novel graph-based approach for multi-label image classification called Multi-Label Adaptive Graph Convolutional Network (ML-AGCN) is introduced.
Graph-based methods have shown great potential in the field of multi-label classification. However, these approaches heuristically fix the graph topology for modeling label dependencies, which might not be optimal. To handle that, we propose to learn the topology in an end-to-end manner. Specifically, we incorporate an attention-based mechanism for estimating the pairwise importance between graph nodes and a similarity-based mechanism for conserving the feature similarity between different nodes. This offers a more flexible way for adaptively modeling the graph. Experimental results are reported on two well-known datasets, namely, MS-COCO and VG-500. Results show that ML-AGCN outperforms state-of-the-art methods while reducing the number of model parameters.

LSPnet: A 2D Localization-oriented Spacecraft Pose Estimation Neural Network
Garcia Sanchez, Albert ; Mohamed Ali, Mohamed Adel ; Gaudilliere, Vincent et al
in Proceedings of Conference on Computer Vision and Pattern Recognition Workshops (2021, June)
Being capable of estimating the pose of uncooperative objects in space has been proposed as a key asset for enabling safe close-proximity operations such as space rendezvous, in-orbit servicing and active debris removal. Usual approaches for pose estimation involve classical computer vision-based solutions or the application of Deep Learning (DL) techniques.
This work explores a novel DL-based methodology, using Convolutional Neural Networks (CNNs), for estimating the pose of uncooperative spacecraft. Contrary to other approaches, the proposed CNN directly regresses poses without needing any prior 3D information. Moreover, bounding boxes of the spacecraft in the image are predicted in a simple, yet efficient manner. The performed experiments show how this work competes with the state-of-the-art in uncooperative spacecraft pose estimation, including works which require 3D information as well as works which predict bounding boxes through sophisticated CNNs.

Detection & Identification of On-Orbit Objects Using Machine Learning
; Mohamed Ali, Mohamed Adel ; Garcia Sanchez, Albert et al
in European Conference on Space Debris (2021), 8(1)

Vertex Feature Encoding and Hierarchical Temporal Modeling in a Spatio-Temporal Graph Convolutional Network for Action Recognition
Papadopoulos, Konstantinos ; Ghorbel, Enjie ; Aouada, Djamila et al
in International Conference on Pattern Recognition, Milan 10-15 January 2021 (2021)

SPACECRAFT RECOGNITION LEVERAGING KNOWLEDGE OF SPACE ENVIRONMENT: SIMULATOR, DATASET, COMPETITION DESIGN, AND ANALYSIS
Mohamed Ali, Mohamed Adel ; Gaudilliere, Vincent ; Ghorbel, Enjie et al
in 2021 IEEE International Conference on Image Processing (ICIP) (2021)

Home-based rehabilitation system for stroke survivors: a clinical evaluation
Ghorbel, Enjie ; Baptista, Renato ; Shabayek, Abd El Rahman et al
in Journal of medical systems (2020)

DeepVI: A Novel Framework for Learning Deep View-Invariant Human Action Representations using a Single RGB Camera
Papadopoulos, Konstantinos ; Ghorbel, Enjie ; Oyedotun, Oyebade et al
in IEEE International Conference on Automatic Face and Gesture Recognition, Buenos Aires 18-22 May 2020 (2020)

Fast Adaptive Reparametrization (FAR) with Application to Human Action Recognition
Ghorbel, Enjie ; Demisse, Girum ; Aouada, Djamila et al
in IEEE Signal Processing Letters (2020)
In this paper, a fast approach for curve reparametrization, called Fast Adaptive Reparametrization (FAR), is introduced. Instead of computing an optimal matching between two curves, as in Dynamic Time Warping (DTW) and elastic distance-based approaches, our method is applied to each curve independently, leading to linear computational complexity. It is based on a simple replacement of the curve parameter by a variable invariant under specific variations of reparametrization. The choice of this variable is heuristically made according to the application of interest. In addition to being fast, the proposed reparametrization can be applied not only to curves observed in Euclidean spaces but also to feature curves living in Riemannian spaces. To validate our approach, we apply it to the scenario of human action recognition using curves living in the Riemannian product Special Euclidean space SE(3)^n. The obtained results on three benchmarks for human action recognition (MSRAction3D, Florence3D, and UTKinect) show that our approach competes with state-of-the-art methods in terms of accuracy and computational cost.

VIEW-INVARIANT ACTION RECOGNITION FROM RGB DATA VIA 3D POSE ESTIMATION
Baptista, Renato ; Ghorbel, Enjie ; Papadopoulos, Konstantinos et al
in IEEE International Conference on Acoustics, Speech and Signal Processing, Brighton, UK, 12–17 May 2019 (2019, May)
In this paper, we propose a novel view-invariant action recognition method using a single monocular RGB camera.
View-invariance remains a very challenging topic in 2D action recognition due to the lack of 3D information in RGB images. Most successful approaches make use of the concept of knowledge transfer by projecting 3D synthetic data to multiple viewpoints. Instead of relying on knowledge transfer, we propose to augment the RGB data with a third dimension by means of 3D skeleton estimation from 2D images using a CNN-based pose estimator. In order to ensure view-invariance, an alignment pre-processing step is applied, followed by data expansion as a means of denoising. Finally, a Long Short-Term Memory (LSTM) architecture is used to model the temporal dependency between skeletons. The proposed network is trained to directly recognize actions from aligned 3D skeletons. The experiments performed on the challenging Northwestern-UCLA dataset show the superiority of our approach as compared to state-of-the-art ones.

A View-invariant Framework for Fast Skeleton-based Action Recognition Using a Single RGB Camera
Ghorbel, Enjie ; Papadopoulos, Konstantinos ; Baptista, Renato et al
in 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, Prague, 25-27 February 2018 (2019, February)
View-invariant action recognition using a single RGB camera represents a very challenging topic due to the lack of 3D information in RGB images. Recent advances in deep learning have made it possible to extract a 3D skeleton from a single RGB image.
Taking advantage of this impressive progress, we propose a simple framework for fast and view-invariant action recognition using a single RGB camera. The proposed pipeline can be seen as the association of two key steps. The first step is the estimation of a 3D skeleton from a single RGB image using a CNN-based pose estimator such as VNect. The second one aims at computing view-invariant skeleton-based features based on the estimated 3D skeletons. Experiments are conducted on two well-known benchmarks, namely, IXMAS and Northwestern-UCLA datasets. The obtained results prove the validity of our concept, which suggests a new way to address the challenge of RGB-based view-invariant action recognition.

Two-stage RGB-based Action Detection using Augmented 3D Poses
Papadopoulos, Konstantinos ; Ghorbel, Enjie ; Baptista, Renato et al
in 18th International Conference on Computer Analysis of Images and Patterns SALERNO, 3-5 SEPTEMBER, 2019 (2019)
In this paper, a novel approach for action detection from RGB sequences is proposed. This concept takes advantage of the recent development of CNNs to estimate 3D human poses from a monocular camera. To show the validity of our method, we propose a 3D skeleton-based two-stage action detection approach. For localizing actions in unsegmented sequences, Relative Joint Position (RJP) and Histogram Of Displacements (HOD) are used as inputs to a k-nearest neighbor binary classifier in order to define action segments. Afterwards, to recognize the localized action proposals, a compact Long Short-Term Memory (LSTM) network with a de-noising expansion unit is employed.
Compared to previous RGB-based methods, our approach offers robustness to radial motion, view-invariance and low computational complexity. Results on the Online Action Detection dataset show that our method outperforms earlier RGB-based approaches.

Localized Trajectories for 2D and 3D Action Recognition
Papadopoulos, Konstantinos ; Demisse, Girum ; Ghorbel, Enjie et al
in Sensors (2019)
The Dense Trajectories concept is one of the most successful approaches in action recognition, suitable for scenarios involving a significant amount of motion. However, due to noise and background motion, many generated trajectories are irrelevant to the actual human activity and can potentially lead to performance degradation. In this paper, we propose Localized Trajectories as an improved version of Dense Trajectories where motion trajectories are clustered around human body joints provided by RGB-D cameras and then encoded by local Bag-of-Words. As a result, the Localized Trajectories concept provides an advanced discriminative representation of actions. Moreover, we generalize Localized Trajectories to 3D by using the depth modality. One of the main advantages of 3D Localized Trajectories is that they describe radial displacements that are perpendicular to the image plane. Extensive experiments and analysis were carried out on five different datasets.

Home Self-Training: Visual Feedback for Assisting Physical Activity for Stroke Survivors
Baptista, Renato ; Ghorbel, Enjie ; Shabayek, Abd El Rahman et al
in Computer Methods and Programs in Biomedicine (2019)
Background and Objective: With the increase in the number of stroke survivors, there is an urgent need for designing appropriate home-based rehabilitation tools to reduce health-care costs. The objective is to empower the rehabilitation of post-stroke patients in the comfort of their homes by supporting them while exercising without the physical presence of the therapist. Methods: A novel low-cost home-based training system is introduced. This system is designed as a composition of two linked applications: one for the therapist and another one for the patient. The therapist prescribes personalized exercises remotely, monitors the home-based training and re-adapts the exercises if required. On the other side, the patient loads the prescribed exercises, trains the prescribed exercise while being guided by color-based visual feedback and gets updates about the exercise performance. To achieve that, our system provides three main functionalities, namely: 1) Feedback proposals guiding a personalized exercise session, 2) Posture monitoring optimizing the effectiveness of the session, 3) Assessment of the quality of the motion. Results: The proposed system is evaluated on 10 healthy participants without any previous contact with the system. To analyze the impact of the feedback proposals, we carried out two different experimental sessions: without and with feedback proposals. The obtained results give preliminary assessments about the interest of using such feedback.
Conclusions: The results obtained on 10 healthy participants are promising. This encourages testing the system in a realistic clinical context for the rehabilitation of stroke survivors.

Key-Skeleton Based Feedback Tool for Assisting Physical Activity
Baptista, Renato ; Ghorbel, Enjie ; Shabayek, Abd El Rahman et al
in 2018 Zooming Innovation in Consumer Electronics International Conference (ZINC), 30-31 May 2018 (2018, May 31)
This paper presents an intuitive feedback tool able to implicitly guide motion with respect to a reference movement. Such a tool is important in multiple applications that require assistance with physical activities, such as sports or rehabilitation. Our proposed approach is based on detecting key skeleton frames from a reference sequence of skeletons. The feedback is based on the 3D geometry analysis of the skeletons by taking into account the key-skeletons. Finally, the feedback is illustrated by a color-coded tool, which reflects the motion accuracy.

Features and Classification Schemes for View-Invariant and Real-Time Human Action Recognition
; ; Ghorbel, Enjie et al
in IEEE Transactions on Cognitive and Developmental Systems (2018), 10(4), 894--902

Kinematic Spline Curves: A temporal invariant descriptor for fast action recognition
Ghorbel, Enjie ; ; et al
in Image and Vision Computing (2018), 77

An extension of kernel learning methods using a modified Log-Euclidean distance for fast and accurate skeleton-based Human Action Recognition
Ghorbel, Enjie ; ; et al
in Computer Vision and Image Understanding (2018), 175