In this proposed research work, the architectural design of Wave pipeline Polar codes is implemented using Xilinx 9.2i ISE and Model Sim 6.3. The program was written in VHDL coding and the encoded output is shown Fig 5. The WP – Polar architecture is also synthesized in both Xilinx and Synopsys tools. The above implementation provides us a detailed analysis on area and power consumptions. WP – Polar reduces the area overheads and reduced power dissipation due to the absence of intermediate registers.
In the conventional 4 bit encoding method (Fig 6) the system uses 10 intermediate registers. Obviously 10 clock cycle were required to move the inputs to the output. In the wave pipelining method the number of tasks (xor operation) were executed in parallel (computed concurrently). Thus, minimum number of clock cycles were expected here for a wave to propagate through the wave pipeline before being latched by the output register(fig 7).
Due to absence of the intermediate register and pipeline stages the reduce in the static and dynamic power hence SoC power gets reduced. Here latency is calculated by the time taken by any encoder design to generate its output after the clock signal is applied. Table 1 provides the device utilization summary of two, three stage and Wave pipelining is given. As the stages of pipe-line increases, the incremental gate count and number of flip flops also increases due to the intermediate registers. Gate count and flip flops have increased from 794 in 2-stage pipeline to 1702 in 3-stage pipeline. where as it has been observed that there is a significant reduction in number of flip flops and gate count used for wave pipelining. As the table showcase, efficient utilization of equivalent gate count and the number of slices is achieved by wave pipelined architecture with a count of 302 and 12 respectively. The gate count increases with the increase in the number of pipeline stages because of the insertion of additional registers.
Table 1: Device Utilization summary(area)
The bar chart (Fig. 8) is plotted considering device utilization without pipeline and with different pipeline stages.
The summary of timing analysis is given in Table 2. Here the timing analysis is based on the maximum time required by the output calculated in nanoseconds (ns). The time required by the two stages and three stage encoders is 4.89 ns and 4.04 ns respectively. The time required by the wave pipelined architecture is 3.939 ns. In normal pipelining stages, intermediate data are stored in registers, whereas in wave pipelining the data are handled by delays.
Table 2: Summary of Timing Analysis
The architecture of wave pipelining was synthesized using Synopsys where utilization of area and power was obtained. Fig 9 indicates the cell area utilization of pipe-lined and wave pipelined architecture.
Due to the absence of intermediate registers in the pipelined stages, Cell area utilized by wave pipelined Polar encoder is much lesser compared to other pipelined architectures. Fig 10 shows the cell area utilization of (8,4) Polar Encoder, where the area is measured in μm2 . The total cell area occupied by normal design, two stage, three stage and wave pipelining are 43,660.715 μm2 The below chart showcases the power Utilization of different pipelined architecture of Polar encoder. Analysis is made using Synopsys tool which is measured in milliwatt (mW) (Fig. 10).
The power utilized by normal design, two stage and three stage pipelines are 4.125 mW, 5.05 mW and 5.9 mW respectively, which is effectively reduced by using wave pipelined structure and the power utilized is 3.8 mW. Field-Programmable Gate Array (FPGAs) is explicitly intended to address the issues of high volume, cost-delicate purchaser electronics applications. The five-part family offers densities going from 100,000 to 1.6 million framework gates. The Spartan-3E family. expands on the accomplishment of the prior spartan 3E family by expanding the measure of rationale per I/O, essentially reducing the expense per logic cell. New highlights improve framework execution and diminish the expense of arrangement. These Spartan-3E FPGA improvements, joined with cutting edge 90 nm measure innovation, convey more usefulness and transfer speed per dollar than already conceivable, setting new guidelines in the programmable logic industry. The Spartan 3E family is a better option than mask modified ASICs. This result is obtained from the synthesis report from Synopsys.