We compare QGAN models whose discriminators consist of fully connected layers with those using convolutional layers. We first assess the basic image-generation capability, selecting the digit "0" (MNIST) and "Ankle boot" (Fashion MNIST) as test subjects, and use our RBF-QGAN architecture to generate grayscale images for these categories.
The following figures show the generated results together with the evolution of the generator and discriminator losses during training. As depicted in Fig. 6 and Fig. 7, our algorithm converges quickly to the Nash equilibrium: the cross-entropy losses of both the generator and the discriminator stabilize around 0.7, indicating convergence. Visual inspection further shows that the generated images closely resemble the chosen sample pictures. These findings demonstrate that the proposed RBF-QGAN performs image generation effectively on the MNIST and Fashion MNIST datasets.
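The observed value of roughly 0.7 is consistent with theory: at the Nash equilibrium the discriminator cannot distinguish real from generated samples and outputs 0.5, at which point the binary cross-entropy evaluates to -ln(0.5) ≈ 0.693. A minimal check (plain Python, no framework assumed):

```python
import math

def bce(y_true, y_pred):
    # Binary cross-entropy for a single prediction.
    return -(y_true * math.log(y_pred) + (1 - y_true) * math.log(1 - y_pred))

# At the Nash equilibrium the discriminator outputs 0.5 for every input,
# so its loss on both real (label 1) and fake (label 0) samples is -ln(0.5).
loss_real = bce(1.0, 0.5)
loss_fake = bce(0.0, 0.5)
print(round(loss_real, 3), round(loss_fake, 3))  # 0.693 0.693
```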
Notably, our model achieves a low mean cross-entropy loss between generated and original images within only 70 epochs, underscoring the effectiveness and efficiency of the RBF-QGAN approach in generating high-quality images while maintaining fidelity to the original dataset samples.
5.1 Stability of the RBF-QGANs
To assess the stability of our RBF-QGANs, we adopt the coefficient of variation (CV) [42] as a measure of the consistency of the discriminator's performance. The CV is a standardized dispersion measure, defined as the ratio of the standard deviation to the mean of the data. Because it is unitless, it allows variability to be compared across different datasets or models. By tracking the CV of the loss values throughout training, we obtain detailed insight into the training stability of the model's components. The CV is computed as:
$$CV = \frac{S_D}{M_D} \qquad \left(8\right)$$
where $S_D$ represents the standard deviation and $M_D$ the mean of the discriminator's loss values. A smaller CV value indicates greater system stability, as demonstrated in Ref. [42].
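As a concrete sketch of Eq. (8), the CV of a discriminator's per-iteration loss values can be computed with only Python's standard library (the loss traces below are hypothetical, for illustration only):

```python
import statistics

def coefficient_of_variation(losses):
    """CV = S_D / M_D: standard deviation of the loss values divided by their mean."""
    mean = statistics.mean(losses)
    std = statistics.pstdev(losses)  # population standard deviation
    return std / mean

# Hypothetical per-iteration discriminator losses for two training runs.
stable_run = [0.70, 0.69, 0.71, 0.70, 0.70]
unstable_run = [0.40, 0.95, 0.55, 1.10, 0.50]

print(coefficient_of_variation(stable_run))    # small CV -> stable training
print(coefficient_of_variation(unstable_run))  # larger CV -> unstable training
```

Because the CV normalizes dispersion by the mean, the two runs can be compared even if their loss curves settle at different absolute levels.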
To examine the effectiveness and stability of the RBF-enhanced discriminator more closely, we devised a specific experimental setup. During the training of each model variant, we record the discriminator's loss value at every iteration; from these data we compute the CV of the loss values for each training approach, yielding a quantifiable measure of stability.
Figure 6 visualizes how the CV changes with increasing noise level in our RBF-QGAN framework, compared directly against QGAN setups whose discriminators use fully connected (FC) or 1-dimensional convolutional (CONV) layers. We conducted this comparison on both the MNIST and Fashion MNIST datasets, each subjected to three distinct noise profiles: Gaussian, uniform, and salt-and-pepper noise.
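For reference, the three noise profiles can be injected into normalized grayscale images roughly as follows (a NumPy sketch; the exact strength parameterization used in the experiments is an assumption, as the text does not define it):

```python
import numpy as np

rng = np.random.default_rng(0)

def add_gaussian(img, strength):
    # Zero-mean Gaussian noise with standard deviation `strength`.
    return np.clip(img + rng.normal(0.0, strength, img.shape), 0.0, 1.0)

def add_uniform(img, strength):
    # Uniform noise drawn from [-strength, strength].
    return np.clip(img + rng.uniform(-strength, strength, img.shape), 0.0, 1.0)

def add_salt_pepper(img, strength):
    # Set a fraction `strength` of pixels to 0 (pepper) or 1 (salt).
    noisy = img.copy()
    mask = rng.random(img.shape) < strength
    noisy[mask] = rng.integers(0, 2, img.shape)[mask].astype(img.dtype)
    return noisy

# Example on a dummy 28x28 image with values in [0, 1].
img = rng.random((28, 28))
for fn, s in [(add_gaussian, 0.4), (add_uniform, 0.1), (add_salt_pepper, 0.015)]:
    out = fn(img, s)
    assert out.shape == img.shape and out.min() >= 0.0 and out.max() <= 1.0
```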
This comparative study enables a thorough evaluation of the model's robustness and adaptability under various noise conditions. As shown in Fig. 6, the proposed RBF-QGAN consistently achieves lower CV values across all noise types than the classical FC-based and CONV-based QGAN architectures. These smaller CV values indicate that our model identifies and processes noisy image inputs more stably. Such stability is crucial in real-world scenarios, where noise is inherent to data acquisition and processing pipelines; our approach therefore not only performs well in noisy environments but is also well suited to practical deployments that require resilient and reliable image processing.
5.2 Robustness in RBF-QGANs
Robustness is a cornerstone attribute of generative adversarial networks [43]: a model's ability to maintain performance and effectiveness despite variations in input data, noise, or other perturbations. A robust GAN consistently generates high-quality, realistic outputs even under uncertainty or adversarial inputs. This property ensures the model's reliability and generalizability across a range of datasets and conditions, and guards against pitfalls such as mode collapse or sensitivity to small changes in input data.
Fig. 7 Comparison of the loss values of QGAN models with an RBF discriminator and those with FC and CONV discriminators under noisy data input.
To evaluate the robustness of our algorithm, we selected the digit "0" from the MNIST dataset for model training and systematically introduced various levels of noise to test the model's resilience. The resulting robustness comparison, depicted in Fig. 7, assesses the model's performance under diverse noise conditions.
Our evaluation combines qualitative observations with quantitative analysis, using cross-entropy loss as the performance metric. Specifically, we examined the performance of the hybrid QGAN with each of the three discriminator variants under varying degrees of noise. Figure 7 shows that under Gaussian noise of strength 0.4, uniform noise of strength 0.1, and salt-and-pepper noise of strength 0.015, the RBF-QGAN remains robust: its loss values increase only minimally despite the noise. These results confirm the model's consistency across noise levels, reaffirming its robustness in adverse conditions.
Such robustness is pivotal for the algorithm's efficacy in real-world scenarios. Prior work by Cheng et al. addressed robustness concerns in image synthesis with QGANs on NISQ devices [44]. However, whereas that work focused on developing a new quantum generator tailored for pure quantum circuits, this study enhances the robustness of image synthesis in hybrid models with classical data input, achieved through modifications to the classical discriminator component.
5.3 Comparison of Similar Works in Image Generation
In our comparative analysis, we evaluate our architecture (shown in Fig. 8(d)) against pertinent prior works in the QGAN domain. Huang et al. [19] introduced a novel patch quantum generative adversarial network: MNIST images are downscaled to a compact 8 × 8 format and subdivided into four distinct sections, each processed by a dedicated 5-qubit quantum generator. The final image is synthesized by combining the outputs of the four generators, as illustrated in Fig. 8(a).
Moreover, Ref. [45] presented a versatile quantum GAN framework designed to generate images with intricate high-dimensional features. The approach leverages quantum superposition to train multiple examples concurrently, a significant step forward in quantum image generation. Its notable achievement is the successful learning and generation of real-world handwritten digit images on a superconducting quantum processor, as shown in Fig. 8(b).
Fig. 8 Comparison of our generated MNIST dataset images with those of other works.
Additionally, we compared our image generation with QuGAN [46], displayed in Fig. 8(c), a pure quantum GAN architecture whose discriminator and generator operate on quantum state fidelity, computing quantum-based loss values via swap tests on qubits. In contrast, our hybrid framework is better adapted to the current development stage of NISQ devices and better positioned to leverage quantum advantages.
Overall, the images generated by RBF-QGAN exhibit distinct handwritten digits with well-defined edges and minimal distortion. This comparative examination underscores the diversity of methodologies in the growing field of QGANs and the continuing innovation driving advances in quantum image synthesis and generation.