亚洲知识产权资讯网为知识产权业界提供一个一站式网上交易平台,协助业界发掘知识产权贸易商机,并与环球知识产权业界建立联系。无论你是知识产权拥有者正在出售您的知识产权,或是制造商需要购买技术以提高操作效能,又或是知识产权配套服务供应商,你将会从本网站发掘到有用的知识产权贸易资讯。

Low Latency, Parallel Computing Scheme

详细技术说明
Parallel Computing Harnessing Unary EncodingA new frontier in computing, opened in the last decade, uses unary encoding: the data is not packed, but complex computations such as Gamma correction and edge detection in an image can be performed using very simple logic. This new computing method provides a deterministic parallel bit shuffling network that uses a simple, deterministic thermometer encoding of data to achieve near optimal sub-sampling. The approach results in zero random fluctuation and high accuracy, yet keeps the output bit stream length constant, hence overcoming the impractical large code size in a zero-fluctuation stochastic code. It uses core “stochastic” logic circuits that do not employ constant coefficients, making them significantly smaller than traditional stochastic logic that potentially spend a significant amount of resources to generate such constant coefficients. The low-latency, parallel computation scheme in the context of unary stochastic and unary deterministic computing that enables computing a parallel unary stream (i.e., all bits are generated in a single digital clock cycle, using parallel copies of a stochastic logic, generating the output stream in one clock cycle). It uses a hard-wired, deterministic shuffling network that de-correlates inputs and allows for low errors in general (even zero errors in some cases). Rather than relying on randomness as an essential requirement for stochastic logic input, deterministic shuffling and sub-sampling techniques are described for generating inputs to the core stochastic logic.Smaller Area than Stochastic MethodsConventional binary has been the dominant encoding of data in digital systems due to its compact representation. However, it requires the data to be “unpacked” before computation (e.g., multiplication has to be broken into partial product and accumulation operations). Stochastic computing suffers from random fluctuations and unpredictability of the output. Previous solutions to this problem involve significantly increasing the code size, which is not practical. Stochastic computing and deterministic computing on unary streams use simple “stochastic” logic to perform complex computation. However, one of the major limitations with this method is its long latency and circuit depth. This technology offers a parallel implementation that uses both a thermometer code and a hard-wired deterministic shuffling method, which significantly decreases latency with a moderate increase in area. When compared to previous stochastic computing methods, results on feed-forward and feedback circuits show, on average, an (area × delay) value 7x smaller than of conventional binary and 8x smaller than previous stochastic work at a10-bit binary resolution.Comparison vs. Conventional BinaryNo. of Bits10111213Area-Delay Product8x smaller4x smaller2x smaller1xBENEFITS AND FEATURES:Simple, deterministic thermometer encoding of dataAchieves very good to optimal sub-samplingZero random fluctuation and high accuracyKeeps output bit stream length constantCore “stochastic” logic circuitsLow-latency, parallel computation schemeEnables computing a parallel unary streamHard-wired, deterministic shuffling method de-correlates inputs and allows for low errorsSuperior to binary computing for medium resolutions (8-12 bits)APPLICATIONS:FPGA architectures: FPGAs provide two useful features this method uses: abundant buffered routing resources and flipflops, and the ability to handle large fanoutsASICFull-custom digital chips operating in domains (e.g., image processing and signal processing)Low-power signal/image processing, e.g., in Internet of ThingsMachine learning/deep learning systemsApproximate computing applications and applications that tolerate some degree of uncertainty (e.g., video processing, image tagging)Phase of Development - Conceptual
*Abstract
None
国家/地区
美国

欲了解更多信息,请点击 这里
移动设备