“…Both the Gold and Bially [5] and Bi and Jones [7] architectures take this form, albeit with considerable differences at the detailed logic level. The architecture proposed by Hui et al [8,9] also has similar properties, although, here both data, and output values are generated in reverse order, requiring circuitry to perform the data re-ordering. Each of these architectures displays various attributes in terms of performance and hardware cost.…”