Growing on-chip wire delays, coupled with complexity and power limitations,
have placed severe constraints on the issue-width scaling of centralized
superscalar architectures. As a result, recent microprocessor designs
have backed away from powerful uniprocessors, instead favoring multiple
simpler cores on a single die. Partitioning the chip into a collection of processors communicating via a common memory system mitigates some of the
technology scaling challenges, but increases the burden on software to providemultiple threads to execute concurrently across the cores.
An alternative is to pursue more powerful uniprocessors, but design them
so that they are scalable and tolerant of technology and complexity scaling.
Ideally, such wide-issue processors would be tiled [30], meaning composed
of multiple replicated, communicating design blocks. Because of multicycle
communication delays across these large processors, control must be distributed
across the tiles. We advocate the use of microarchitectural networks
(or micronets) for routing control and data among the tiles. Micronets provide
high-bandwidth, flow-controlled transport for control or data in a wiredominated
processor by connecting the multiple tiles, each of which is a client on one or more micronets. Higher-level microarchitectural protocols direct global control across themicronets and tiles in a manner invisible to software.
In this chapter,we describe the architecture and implementation of the Teraop,
Reliable, Intelligently-adaptive Processing System (TRIPS) processor—a
distributed, tiled microarchitecture. In particular, we discuss TRIPS tile partitioning,micronet connectivity, anddistributedprotocols thatprovide global
services in the TRIPS processor, including distributed fetch, execution, flush,
and commit. Although some of our prior publications have described the
TRIPS approach to exploiting parallelism as well as high-level performance
results [20,3], this chapter examines in detail the intertile connectivity and
protocols that have resulted from reducing the high-level design to silicon.
The key concepts that differentiate TRIPS from other tiled architectures such
as RAW[30] are the dynamic scheduling and execution which require distributed
dynamic hardware protocols to provide the means to extract both
irregular and regular concurrency.
声明:本文内容及配图由入驻作者撰写或者入驻合作网站授权转载。文章观点仅代表作者本人,不代表电子发烧友网立场。文章及其配图仅供工程师学习之用,如有内容侵权或者其他违规问题,请联系本站处理。 举报投诉
全部0条评论
快来发表一下你的评论吧 !