Categories
Nevin Manimala Statistics

Towards High-Quality and Disentangled Face Editing in a 3D GAN

IEEE Trans Pattern Anal Mach Intell. 2025 Jan 6;PP. doi: 10.1109/TPAMI.2024.3523422. Online ahead of print.

ABSTRACT

Recent methods for synthesizing 3D-aware face images have achieved rapid development thanks to neural radiance fields, allowing for high quality and fast inference speed. However, existing solutions for editing facial geometry and appearance independently usually require retraining and are not optimized for the recent work of generation, thus tending to lag behind the generation process. To address these issues, we introduce NeRFFaceEditing, which enables editing and decoupling geometry and appearance in the pretrained tri-plane-based neural radiance field while retaining its high quality and fast inference speed. Our key idea for disentanglement is to use the statistics of the tri-plane to represent the high-level appearance of its corresponding facial volume. Moreover, we leverage a generated 3D-continuous semantic mask as an intermediary for geometry editing. We devise a geometry decoder (whose output is unchanged when the appearance changes) and an appearance decoder. The geometry decoder aligns the original facial volume with the semantic mask volume. We also enhance the disentanglement by explicitly regularizing rendered images with the same appearance but different geometry to be similar in terms of color distribution for each facial component separately. Our method allows users to edit via semantic masks with decoupled control of geometry and appearance. Both qualitative and quantitative evaluations show the superior geometry and appearance control abilities of our method compared to existing and alternative solutions.

PMID:40030881 | DOI:10.1109/TPAMI.2024.3523422

Categories
Nevin Manimala Statistics

Exploring Contrastive Pre-training for Domain Connections in Medical Image Segmentation

IEEE Trans Med Imaging. 2025 Jan 3;PP. doi: 10.1109/TMI.2024.3525095. Online ahead of print.

ABSTRACT

Unsupervised domain adaptation (UDA) in medical image segmentation aims to improve the generalization of deep models by alleviating domain gaps caused by inconsistency across equipment, imaging protocols, and patient conditions. However, existing UDA works remain insufficiently explored and present great limitations: (i) Exhibit cumbersome designs that prioritize aligning statistical metrics and distributions, which limits the model’s flexibility and generalization while also overlooking the potential knowledge embedded in unlabeled data; (ii) More applicable in a certain domain, lack the generalization capability to handle diverse shifts encountered in clinical scenarios. To overcome these limitations, we introduce MedCon, a unified framework that leverages general unsupervised contrastive pre-training to establish domain connections, effectively handling diverse domain shifts without tailored adjustments. Specifically, it initially explores a general contrastive pre-training to establish domain connections by leveraging the rich prior knowledge from unlabeled images. Thereafter, the pre-trained backbone is fine-tuned using source-based images to ultimately identify per-pixel semantic categories. To capture both intra- and inter-domain connections of anatomical structures, we construct positive-negative pairs from a hybrid aspect of both local and global scales. In this regard, a shared-weight encoder-decoder is employed to generate pixel-level representations, which are then mapped into hyper-spherical space using a non-learnable projection head to facilitate positive pair matching. Comprehensive experiments on diverse medical image datasets confirm that MedCon outperforms previous methods by effectively managing a wide range of domain shifts and showcasing superior generalization capabilities.

PMID:40030864 | DOI:10.1109/TMI.2024.3525095

Categories
Nevin Manimala Statistics

Intuitive Directional Sense Presentation to the Torso Using McKibben-Based Surface Haptic Sensation in Immersive Space

IEEE Trans Haptics. 2024 Dec 26;PP. doi: 10.1109/TOH.2024.3522897. Online ahead of print.

ABSTRACT

In recent years, systems that utilize immersive space have been developed in various fields. Immersive spaces often contain considerable amounts of visual information; therefore, users often fail to obtain their desired information. Therefore, various methods have been developed to guide users toward haptic sensations. However, many of these methods have limitations in terms of the intuitive perception of haptic sensation and require practice for familiarization with haptic sensation. Fabric actuators are wearable haptic devices that combine fabric and McKibben artificial muscles to provide wearers with surface haptic sensation. These sensations can be provided to a wide area of the body with intuitive perception, instead of only to a part of the body. This paper presents a novel air pressure adjustment method for whole-body motion guidance using surface haptic sensations provided by a wearable fabric actuator. The proposed system can provide users with a directional sense without visual information in an immersive space. The effectiveness of the proposed system was evaluated through subject experiments and statistical data analysis. Finally, a directional sense presentation was conducted for users performing micromanipulations in a mixed-reality space to demonstrate the applicability of the proposed system for teleoperation.

PMID:40030816 | DOI:10.1109/TOH.2024.3522897

Categories
Nevin Manimala Statistics

UniAda: Domain Unifying and Adapting Network for Generalizable Medical Image Segmentation

IEEE Trans Med Imaging. 2024 Dec 26;PP. doi: 10.1109/TMI.2024.3523319. Online ahead of print.

ABSTRACT

Learning a generalizable medical image segmentation model is an important but challenging task since the unseen (testing) domains may have significant discrepancies from seen (training) domains due to different vendors and scanning protocols. Existing segmentation methods, typically built upon domain generalization (DG), aim to learn multi-source domain-invariant features through data or feature augmentation techniques, but the resulting models either fail to characterize global domains during training or cannot sense unseen domain information during testing. To tackle these challenges, we propose a domain Unifying and Adapting network (UniAda) for generalizable medical image segmentation, a novel “unifying while training, adapting while testing” paradigm that can learn a domain-aware base model during training and dynamically adapt it to unseen target domains during testing. First, we propose to unify the multi-source domains into a global inter-source domain via a novel feature statistics update mechanism, which can sample new features for the unseen domains, facilitating the training of a domain base model. Second, we leverage the uncertainty map to guide the adaptation of the trained model for each testing sample, considering the specific target domain may be outside the global inter-source domain. Extensive experimental results on two public cross-domain medical datasets and one inhouse cross-domain dataset demonstrate the strong generalization capacity of the proposed UniAda over state-of-the-art DG methods. The source code of our UniAda is available at https://github.com/ZhouZhang233/UniAda.

PMID:40030769 | DOI:10.1109/TMI.2024.3523319

Categories
Nevin Manimala Statistics

Privacy-Aware Data Acquisition Under Data Similarity in Regression Markets

IEEE Trans Neural Netw Learn Syst. 2025 Jan 1;PP. doi: 10.1109/TNNLS.2024.3521056. Online ahead of print.

ABSTRACT

Data markets facilitate decentralized data exchange for applications such as prediction, learning, or inference. The design of these markets is challenged by varying privacy preferences and data similarity among data owners. Related works have often overlooked how data similarity impacts pricing and data value through statistical information leakage. We demonstrate that data similarity and privacy preferences are integral to market design and propose a query-response protocol using local differential privacy (LDP) for a two-party data acquisition mechanism. In our regression data market model, we analyze strategic interactions between privacy-aware owners and the learner as a Stackelberg game over the asked price and privacy factor. Finally, we numerically evaluate how data similarity affects market participation and traded data value.

PMID:40030760 | DOI:10.1109/TNNLS.2024.3521056

Categories
Nevin Manimala Statistics

Reproducibility of Microwave Breast Imaging: Analysis of Regular Scans of a Group of Volunteers

IEEE Trans Biomed Eng. 2024 Dec 26;PP. doi: 10.1109/TBME.2024.3512572. Online ahead of print.

ABSTRACT

Microwave imaging has been proposed for breast cancer detection and treatment monitoring. The introduction of new approaches or next-generation prototype systems requires characterization of expected variability when scanning participants over clinically relevant timeframes.

OBJECTIVE: The objective of this study is to quantify the reliability and variability in scans of a group of 35 volunteers collected at multiple time points with a next-generation microwave imaging system.

METHODS: Multiple scans of the same volunteer are collected both during one visit and over multiple visits. Test-retest reliability and variability in measurements are investigated for the average permittivity of breast tissues.

RESULTS: The average permittivity for each volunteer exhibited similarity during and between sessions, with some variation noted for smaller breast sizes. The average properties of right and left breasts were also similar. Reliability was demonstrated with the intra-class correlation coefficient (ICC) values statistically greater than 0.9 both within and between sessions. Variability of the measurement was typically less than one unit and coefficient of variation less than 5% (within sessions) or 6% (between sessions).

CONCLUSION: The microwave imaging system exhibits excellent reliability when scanning volunteers multiple times during one session and between sessions. This study represents the largest group of participants scanned at multiple time points reported to date.

SIGNIFICANCE: The excellent reliability demonstrated in this study suggests that microwave breast imaging has strong potential for capturing changes over time, such as treatment or therapy related effects, along with detecting changes in breast tissues.

PMID:40030750 | DOI:10.1109/TBME.2024.3512572

Categories
Nevin Manimala Statistics

Trajectory of Fifths Based on Chroma Subbands Extraction – A New Approach to Music Representation, Analysis, and Classification

IEEE Trans Pattern Anal Mach Intell. 2024 Dec 17;PP. doi: 10.1109/TPAMI.2024.3519420. Online ahead of print.

ABSTRACT

In this article, we propose a new method of representing and analyzing music audio records. The method is based on the concept of the trajectory of fifths, which was initially developed for the analysis of music represented in MIDI format. To adapt this concept to the needs of audio signal processing, we implement a short-term spectral analysis of a musical piece, followed by a mapping of its subsequent spectral timeframes onto signatures of fifths reflecting relative intensities of sounds associated with each of the 12 pitch classes. Subsequently, the calculation of the characteristic points of the consecutive signatures of fifths enables the creation of the trajectory of fifths. The results of the experiments and statistical analysis conducted in a set of 8996 audio music pieces belonging to 10 genres indicate that this kind of trajectory, just as its MIDI-compliant precursor, is a source of valuable information (i.e., feature coefficients) concerning the harmonic structure of music, which may find use in audio music classification processes.

PMID:40030732 | DOI:10.1109/TPAMI.2024.3519420

Categories
Nevin Manimala Statistics

Development and Validation of a Class Imbalance-Resilient Cardiac Arrest Prediction Framework Incorporating Multiscale Aggregation, ICA and Explainability

IEEE Trans Biomed Eng. 2024 Dec 18;PP. doi: 10.1109/TBME.2024.3517635. Online ahead of print.

ABSTRACT

OBJECTIVE: Despite advancements in artificial intelligence (AI) for predicting cardiac arrest (CA) with multivariate time-series vital signs data, existing models continue to face significant problems, particularly concerning balance, efficiency, accuracy, and explainability. While neural networks have been proposed to extract multiscale features from raw data in various applications, to our knowledge, no work has utilized multiscale feature extraction, specifically for diagnostic CA prediction. This paper presents a new framework that tackles these difficulties by utilizing multiscale feature aggregation via Independent Component Analysis (ICA).

METHODS: We present the Pareto optimal StrataChron Pyramid Fusion Framework (SPFF) that improves temporal vital signs statistics by capturing long-term dependencies using multi-scale temporal feature aggregation. SPFF integrates with ICA to eliminate information redundancy and enhance model efficiency. We have developed and validated the approach using the public MIMIC IV dataset.

RESULTS: The proposed model demonstrates resilience to data imbalance and enhances explainability through SHapley Additive exPlanations (SHAP), Partial Dependence Plots (PDP), and Individual Conditional Expectation (ICE) across varying time windows. The multilayer perceptron (MLP) using SPFF and ICA achieves an accuracy of 0.982, precision of 0.969, recall of 0.989, F1-score of 0.979, and AUROC of 0.998.

CONCLUSION: The proposed method effectively predicts CA across varying time windows, offering a robust solution to the challenges of efficiency, accuracy, and explainability in current models.

SIGNIFICANCE: This method is significant to biomedical research as it provides superior performance in CA prediction while capturing both short- and long-term dependencies in patient data, potentially improving patient outcomes and guiding clinical decision-making.

PMID:40030724 | DOI:10.1109/TBME.2024.3517635

Categories
Nevin Manimala Statistics

Spectral Information Dynamics of Cortical Signals Uncover the Hierarchical Organization of the Human Brain’s Motor Network

IEEE Trans Biomed Eng. 2024 Dec 13;PP. doi: 10.1109/TBME.2024.3516943. Online ahead of print.

ABSTRACT

OBJECTIVE: Understanding brain dynamics during motor tasks is a significant challenge in neuroscience, often limited to studying pairwise interactions. This study provides a comprehensive hierarchical characterization of node-specific, pairwise and higher-order interactions within the human brain’s motor network during handgrip task execution.

METHODS: The brain source activity was reconstructed from scalp EEG signals of ten healthy subjects performing a motor task, identifying five brain regions within the contralateral and ipsilateral motor networks. Using the spectral entropy rate as the basis for the decomposition of dynamic information in the alpha and beta frequency bands, we assessed the predictability of the individual rhythms within each brain region, the information shared between the activity of pairs of regions, and the higher-order interactions among groups of signals from more than two regions.

RESULTS: An overall decrease in hierarchical interactions at different orders within the motor network was observed during motor task execution. In addition to an increase in the predictability of single-source dynamics and a decrease in the strength of pairwise interactions, a statistically significant reduction in redundancy between brain sources was found. These changes primarily affected the dynamics of the alpha frequency band, driven by the well-known sensorimotor mu rhythm.

CONCLUSIONS AND SIGNIFICANCE: This work emphasizes the importance of examining hierarchically-organized brain source interactions in the frequency domain using a unified framework which fully captures the complex dynamics of the motor network.

PMID:40030679 | DOI:10.1109/TBME.2024.3516943

Categories
Nevin Manimala Statistics

Extraction of three mechanistically different variability and noise sources in the trial-to-trial variability of brain stimulation

IEEE Trans Neural Syst Rehabil Eng. 2024 Dec 25;PP. doi: 10.1109/TNSRE.2024.3522681. Online ahead of print.

ABSTRACT

Motor-evoked potentials (MEPs) are among the few directly observable responses to external brain stimulation and serve a variety of applications, often in the form of input-output (IO) curves. Previous statistical models with two variability sources inherently consider the small MEPs at the low-side plateau as part of the neural recruitment properties. However, recent studies demonstrated that small MEP responses under resting conditions are contaminated and over-shadowed by background noise of mostly technical quality, e.g., caused by the amplifier, and suggested that the neural recruitment curve should continue below this noise level. This work intends to separate physiological variability from background noise and improve the description of recruitment behaviour. We developed a triple-variability-source model around a logarithmic logistic function without a lower plateau and incorporated an additional source for background noise. Compared to models with two or fewer variability sources, our approach better described IO characteristics, evidenced by lower Bayesian Information Criterion scores across all subjects and pulse shapes. The model independently extracted hidden variability information across the stimulated neural system and isolated it from background noise, which led to an accurate estimation of the IO curve parameters. This new model offers a robust tool to analyse brain stimulation IO curves in clinical and experimental neuroscience and reduces the risk of spurious results from inappropriate statistical methods. The presented model together with the corresponding calibration method provides a more accurate representation of MEP responses and variability sources, advances our understanding of cortical excitability, and may improve the assessment of neuromodulation effects.

PMID:40030620 | DOI:10.1109/TNSRE.2024.3522681