<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Tech Design Forum Techniques</title>
	<atom:link href="http://www.techdesignforums.com/practice/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.techdesignforums.com/practice</link>
	<description>Tech Design Forum Techniques Section</description>
	<lastBuildDate>Thu, 23 May 2013 14:36:59 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	
		<item>
		<title>Facing the verification management challenge</title>
		<link>http://www.techdesignforums.com/practice/technique/verification-management-soc/</link>
		<comments>http://www.techdesignforums.com/practice/technique/verification-management-soc/#comments</comments>
		<pubDate>Thu, 23 May 2013 14:26:05 +0000</pubDate>
		<dc:creator>Luke Collins</dc:creator>
				<category><![CDATA[Design Management]]></category>
		<category><![CDATA[Verification]]></category>
		<category><![CDATA[co-verification]]></category>
		<category><![CDATA[multicore]]></category>
		<category><![CDATA[software debugging]]></category>
		<category><![CDATA[verification IP]]></category>
		<category><![CDATA[verification reuse]]></category>
		<category><![CDATA[virtual prototyping]]></category>

		<guid isPermaLink="false">http://www.techdesignforums.com/practice/?p=5206</guid>
		<description><![CDATA[The growing verification challenge, and how to address it by coordinating multiple debug strategies.]]></description>
				<content:encoded><![CDATA[<p>The integration of multicore CPUs, graphics coprocessors, modems, multimedia and networking facilities in the SoCs that power today’s sophisticated smartphones, tablets, computing and networking devices is creating a new verification challenge. The divide-and-conquer approach that worked well when such systems were built of interconnected chips is less effective now that their functions are integrated on one die, with complex specifications, rapidly increasing software content and relentless pressure to get the design to market.</p>
<p>Today’s verification challenge is immense. One SoC may be defined by more than 30 million lines of design RTL and testbench code. It may embed 10 or more sophisticated communications protocols (think USB, and mobile connectivity on a handset, for example), and have as many as 200 power domains to optimize energy consumption. Verifying such a chip may involve using thousands of servers, many with 150Gbytes or more of memory, writing and testing hundreds of thousands of assertions, and producing and interpreting terabytes of coverage data.</p>
<p>It’s little wonder, then, that the cost of verifying a leading-edge chip has risen 30% a year since 2001, so that it is now three times what it was back then, and that verification engineers outnumber design engineers by two to one.</p>
<div class="article_figure"><a class="figure" title="Figure 1 : The growing cost of verification (Source: Synopsys)" href="http://www.techdesignforums.com/practice/files/2013/05/tdf-snps-verimgmnt-fig2-lrg.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/05/tdf-snps-verimgmnt-fig2-med.jpg" alt="The growing cost of verification (Source: Synopsys)" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Figure 1 </span>The growing cost of verification (Source: Synopsys)</p></div>
<p>To tackle the verification challenge, design and verification managers must find ways to achieve a series of order-of-magnitude productivity improvements in their verification strategies. This may mean greater reuse of testbenches, updated methodologies, and the authoring and application of verification IP. They will also have to leverage next-generation technologies that enable significantly more throughput from their server farms, and introduce debug automation and debug strategies that can detect and prevent more bugs, more quickly.</p>
<h2><b>Time to market is key</b></h2>
<p>The relentless competition in consumer and other markets is creating tremendous pressure to tape out designs. This pressure is intensified by the increasing complexity of the core SoCs, which integrate ever more digital functions, are adding analog functions, implement sophisticated power-management strategies, and embed a rapidly growing amount of software. For design and verification teams, this means that the challenge is evolving from taping out a functional hardware design to delivering that hardware with correctly functioning software onboard as well. This added complexity now increasingly drives decisions about verification strategies.</p>
<h2><b></b><b>Performance </b><b>matters</b></h2>
<p>Due to the complexity of today’s SoCs, leading-edge chip design teams must apply a variety of verification techniques to gain the confidence to tape out a design.</p>
<p>Exhaustive simulation demands large server farms, and some companies are spending millions, if not billions, of dollars to build and run the IT infrastructure necessary to do high-performance verification. The total cost of ownership for these server farms includes the cost of acquiring the hardware and regularly updating it, powering and cooling it, and administering the whole set-up. No company wants to incur these costs, but they know that every improvement they make in verification performance will have a material effect on their bottom line. They also know that the cost of missing a market opportunity because of a faulty chip is so punitive that teams must run as many simulation cycles, and get as close to full chip coverage, as they can.</p>
<p>To achieve improved verification performance, companies are turning to techniques such as constrained random verification. This involves writing complex testbenches that generate random, but meaningful, vectors for the design. These testbenches themselves must be verified before use, a process that can be accelerated using fast constraint solvers. The better the testbench, the more effective the test, and hence the greater the return on the IT infrastructure that runs it. So performance, whether it is measured in pure CPU cycles or the effectiveness of a particular testbench, is critical to meeting the ‘time to tape-out’ and ‘time to software’ challenges.</p>
<h2><b>Debug challenges </b><b>multiply</b></h2>
<p>Surveys and studies suggest that 70 percent of the effort involved in taping out a complex SoC is spent on verification. Half of that, or 35 per cent of the total, is spent on debugging the chip, the interaction between the chip and its software, and the testbenches.</p>
<p>The scope of debug is also growing, as issues such as power management, hardware/software co-design, and the introduction of analog or mixed-signal blocks, create new challenges.</p>
<p>For example, turning a chip’s functional blocks on and off to reduce power consumption introduces thousands of unknown (X) states during simulation, which then have to be sorted into real bugs and simulation artifacts and correlated with the designer’s power intent. X-propagation has always been difficult to debug, but with low-power designs, both the verification and debug processes require the X states to be modeled more accurately and to be propagated more correctly during simulation.</p>
<p>In hardware/software debug, designers need to be able to see the relationship between their C or assembler code and the RTL with which it was interacting when a fault occurred. Debugging the interaction of analog or mixed-signal blocks and digital blocks introduces another set of challenges.</p>
<p>These are just some examples of the way in which the scope of the debug task is broadening. What makes the issue even more challenging is that these debug domains interact &#8211; a faulty analog circuit may create a digital logic bug that causes the chip’s software to misbehave.</p>
<h2><b>Multiple strategies</b></h2>
<p>In the face of these challenges, design and verification managers must develop a set of strategies that, working together, provide them with a level of design coverage and debug insight that gives them the confidence to tape out the chip.</p>
<p>This means making trade-offs between various forms of verification, in terms of their coverage, speed and costs, and various forms of debug, in terms of their efficacy. It also means taking into account the costs of moving from one approach to the next and the possibility of introducing inconsistencies as they progress from approach to approach.</p>
<p>There is no standard verification strategy. For example, in one market sector I have seen two companies making competing chips that use verification strategies that are almost polar opposites.  One company builds FPGA-based prototype boards in order to run as many cycles as possible. The other relies on simulation and coverage management and rejects hardware prototyping.  The right approach is to use both. Essentially, one should use all possible strategies at one’s disposal. There is no one-size-fits-all approach.</p>
<p>The challenge for design and verification managers, and the tools vendors that support them, is to raise confidence in each methodology, and to make it possible to move between approaches very easily. For example, standard server-based simulation offers flexibility, so long as compile times remain within practical limits. Emulation offers greater speed, once the design has been set up on the emulator. An FPGA prototype might be even faster, but it will lack the access to internal signals of a simulator or emulator.</p>
<p>The difficulty for managers is to understand the tradeoffs involved in each of these approaches and how to move between them for the best overall result. For example, different design teams will hand off from simulation to emulation at different points in their process, depending on how confident they are with the revisions they are likely to need to make. In a simulation, the typical change-and-compile cycle is 3 to 4 minutes. In emulation, making the same change takes a bit longer so verification teams need to reach a version of their design that is a bit more stable before moving to emulation.</p>
<p>Design managers also need to marshal the skills that support each strategy, from server-farm management for the simulation phase to design-partitioning and logical-to-physical mapping in the FPGA prototyping phase. The skill here is to know when to make the transition between each verification approach for best overall effect, and then to have the people and resources on hand to make that transition as smooth as possible.</p>
<h2><b>Debug </b><b>roadmap</b></h2>
<p>The broadening frontier of the debug task makes the verification issue more complex. Along with knowing when to move between different verification strategies, and how much verification is enough, design managers also need to develop a consistent debug strategy for the various forms of verification. Currently, there is no systematic unified debugger that works across all these different ways of exercising a design.</p>
<p>As a step towards a more systemic approach to debug, Synopsys has acquired SpringSoft so that we can offer the <i>de facto</i> standard Verdi as an open platform upon which users, third parties and ourselves, can innovate effectively by offering plug-ins and extensions to tackle emerging debug issues.</p>
<div class="article_figure"><a class="figure" title="Figure 2 : Verdi's hardware/software debug facility synchronizes RTL and programmer views (Source: Synopsys)" href="http://www.techdesignforums.com/practice/files/2013/05/tdf-snps-verimgmnt-fig1-lrg.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/05/tdf-snps-verimgmnt-fig1-med.jpg" alt="Verdi's hardware/software debug facility synchronizes RTL and programmer views (Source: Synopsys)" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Figure 2 </span>Verdi's hardware/software debug facility synchronizes RTL and programmer views (Source: Synopsys)</p></div>
<p>Verdi has powerful underlying abilities, such as an understanding of how a design is synthesised from RTL into gates that is expressed in a single database, which others will be able to build upon. We’re already adding ways to show verification coverage in Verdi’s graphical user interface, so that users can relate coverage and debug. We will soon be adding new capabilities, such as hardware/software debug and support for analog and mixed signal debug, over time.</p>
<h2><b>Verification management</b></h2>
<p>Design and verification managers will need technologies with which they can see how the many forms of verification and debug interrelate, so that they can develop better insights into the tradeoffs they are making as a design progresses to completion. Such technologies should be flexible enough to handle many forms of verification. They should also be able to present many types of metrics, and do a good job of turning those metrics into actionable information upon which design managers can make timely decisions that lead to the most efficient and effective overall verification strategy for their designs. And they should enable design and verification managers to explore a variety of verification strategies against key parameters, such as coverage and cost.</p>
<p>There’s one truth that the managers of large SoC projects must live by: time to market is king. Effective verification strategies, be they better ways to write testbenches, greater reuse, or a well thought-out approach to debug, are vital to getting big designs out of the door on time and with a reasonable level of confidence.</p>
<p><em>Michael Sanie is senior director, verification marketing, Synopsys</em></p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.techdesignforums.com/practice/technique/verification-management-soc/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Building an RTL sign-off flow</title>
		<link>http://www.techdesignforums.com/practice/technique/rtl-sign-off/</link>
		<comments>http://www.techdesignforums.com/practice/technique/rtl-sign-off/#comments</comments>
		<pubDate>Tue, 14 May 2013 18:59:17 +0000</pubDate>
		<dc:creator>Luke Collins</dc:creator>
				<category><![CDATA[Assembly & Integration]]></category>
		<category><![CDATA[DFT]]></category>
		<category><![CDATA[Verification]]></category>
		<category><![CDATA[clock domain crossing]]></category>
		<category><![CDATA[formal verification]]></category>
		<category><![CDATA[power gating]]></category>
		<category><![CDATA[resetability]]></category>
		<category><![CDATA[X propagation]]></category>

		<guid isPermaLink="false">http://www.techdesignforums.com/practice/?p=5182</guid>
		<description><![CDATA[RTL sign-off strategies ease SoC design and IP integration by enabling early analysis and optimization of CDC, power, X propagation, timing, and resetability issues.]]></description>
				<content:encoded><![CDATA[<p>As SoCs become more complex, and the cost of errors grows, it becomes increasingly important that engineers ensure their work is as correct as possible as soon as possible in the design process. They cannot afford to carry errors forward from one stage to the next, where their impact is likely to grow while their causes become obscured.</p>
<p>This requirement is driving a shift in design exploration and hand-off to the register transfer level. Using RTL sign-off eases the integration of heterogeneous IP and makes it easier to check the way that blocks are interfacing with the host design, how clocks will cross these interfaces, power requirements, and testability. It also cuts the simulation load, especially when designs are begin exercised in a system context, which vastly increases the number of states necessary to check functionality.</p>
<p>Initial timing constraints and clocking schemes have to be defined to enable earlier analysis and verification. Power estimation and optimization methods are necessary to provide previews of gate-level performance. The impact of inserting test structures to ease testability has to be considered. There is some good news &#8211; working with the design at this level means that each issue can be constrained and addressed by a focused tool, rather than being taken forward to the gate level where they would interact more strongly and hence be more difficult to solve.</p>
<p>What tools are available to improve the quality of RTL code before it reaches the simulation stage? Linting tools have evolved to the point where they can handle full-chip designs of 500 million gates or more, and yet still offer concise reporting. Timing constraints management and checking ensures correct timing for the block and full-chip level, so long as any changes in the RTL are reflected in the SDC files for the design. (The SDC itself needs to be verified for correctness and consistency, and is essential for sign-off grade analyses such as clock design crossing.)</p>
<p>Reset analysis ensures that the design will come in a known good state, and in later iterations of the design may be used to save chip area and routing resources through a more intelligent application of reset signals.</p>
<p>Automatic formal verification techniques can be used to find obscure functional bugs in the RTL, especially in finite state machines, and root out issues, such as bus contention or dead code, which violate the implicit intent of the RTL.</p>
<p>Clock domain crossing analysis, so important in these days of design reuse, IP, and complex power management schemes, can be carried out using a combination of formal and structural methods, which helps trap the corner case combinations of timing and functionality that lead to errors.</p>
<p>Power analysis and optimization techniques address issues such as reset checking, retention flop and isolation-cell analysis and optimization, clock/power gating, and sequential/combinational optimizations. These interventions can be so extensive that it makes sense to go back to the linting stage to recheck the design, and to clear the way for DFT analysis and optimization.</p>
<p>Working at the RTL sign-off level means that even those without DFT expertise can develop DFT strategies and analyze them for the testability that they bring to the design.</p>
<p>As a last step, it is important to manage the way that the simulation and synthesis processes handle the unknown (X) states thrown up by power management strategies that turn blocks on and off, and clocks crossing domains. A proper analysis of this issue can reveal functional bugs that have been hidden at the RTL level by too much optimism about the impact of X states, and reduce the impact of excessive pessimism about the impact of X states after synthesis.</p>
<p>Combining these static verification steps can clean up the RTL and so reduce the simulation burden of testing its function, and the synthesis burden of trying to implement conflicted code. And it means that the design will be as correct as possible as soon as possible, helping meet time to market goals.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.techdesignforums.com/practice/technique/rtl-sign-off/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Eight requirements for 3D-IC design</title>
		<link>http://www.techdesignforums.com/practice/technique/eight-requirements-3d-ic-design/</link>
		<comments>http://www.techdesignforums.com/practice/technique/eight-requirements-3d-ic-design/#comments</comments>
		<pubDate>Wed, 08 May 2013 10:14:05 +0000</pubDate>
		<dc:creator>Chris Edwards</dc:creator>
				<category><![CDATA[DFM]]></category>
		<category><![CDATA[IC Implementation]]></category>
		<category><![CDATA[2.5D-IC]]></category>
		<category><![CDATA[3D IC]]></category>
		<category><![CDATA[TSV]]></category>

		<guid isPermaLink="false">http://www.techdesignforums.com/practice/?p=4975</guid>
		<description><![CDATA[Many design teams are looking at ways in which they can make use of 3D integration. Here are eight requirements for an effective 3D-IC design flow.]]></description>
				<content:encoded><![CDATA[<p>3D-ICs promise “more than Moore” integration by packing a great deal of functionality into small form factors, while improving performance and reducing costs. It&#8217;s no surprise then that many design teams are looking at ways in which they can make use of 3D integration.</p>
<p>3D-IC packages may accommodate multiple heterogeneous die, such as logic, memory, analog, RF, and micro-electrical mechanical systems (MEMS). Each of these die can use different process nodes, such as 28nm for high-speed logic and 130nm for analog. This provides an alternative to system-on-chip (SoC) integration, potentially postponing an expensive move to a new process node for all of the functionality developers want to place in a single package.</p>
<p>While there is great interest in this emerging technology, development is still at an early stage. There are clear requirements that an effective 3D-IC flow must satisfy to bring the cost, integration and performance benefits that the technology promises.</p>
<p>Use of the TSV, for example, introduces design, integration and test challenges. TSVs are copper vias with diameters that may range from 1 to 30um, and they make it possible to integrate multiple stacked dies in a single package.</p>
<p>With 3D-IC, a die containing TSVs can be attached to the package substrate using conventional flip-chip technology. A second die is attached to the first using the TSVs to route connections &#8212; not just to the adjacent IC but to the underlying PCB through the package&#8217;s redistribution layer.</p>
<p>Compared to a wire-bonded system-in-package (SiP), 3D-ICs constructed using TSVs offer reduced RLC parasitics, better performance, more power savings, and a denser implementation. Compared to a 2.5D silicon interposer approach, a vertical 3D die stack offers a higher level of integration, smaller form factor, and faster design cycle. But a 3D stack raises some additional challenges, including thermal, timing, and power management concerns.</p>
<div class="article_figure"><a class="figure" title="Figure 1 : 3D integration can support complex stacking arrangements" href="http://www.techdesignforums.com/practice/files/2013/05/tdf_cdn_3d_stack.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/05/tdf_cdn_3d_stack_med.jpg" alt="3D integration can support complex stacking arrangements" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Figure 1 </span>3D integration can support complex stacking arrangements</p></div>
<p>Although 3D-ICs with TSVs do not require a revolutionary new 3D design system, they do require some new capabilities that need to be added to existing toolsets for digital design, analog/custom design, and IC/package co-design. The end goal is to optimize system cost with the shortest possible turnaround time. If 3D-ICs cannot be both cost and time effective, they will not enjoy widespread adoption. To that end, we have identified and are working toward supporting eight requirements for design tool flows that support 3D-IC:</p>
<h2>1. System-level exploration</h2>
<p>The cost and benefits of 3D-IC can be subtle as there are many tradeoffs that engineering teams can use during design to bring some functions onto a monolithic die that will make economic sense under certain circumstances. In other cases, the desire for flexibility or to reduce design risk can favor the migration of functions to other die within a single package.</p>
<p>Sometimes called ‘pathfinding’, 3D-IC system-level exploration will help users partition designs into separate chips, select the appropriate silicon technology for each chip, determine where functionality goes, choose the best die order in the stack, and optimize connectivity between chips.</p>
<p>Existing system-level exploration tools can provide early power, area, and cost estimates, and allow what-if explorations across architectures, silicon IP choices, and foundry processes. These tools are being to serve stacked die implementations and package considerations and to carry this intent through the design process.</p>
<h2>2. 3D floorplanning</h2>
<p>Once the technology selection and partitioning choices have been, floorplanning is vital. Although the pin-to-pin distances within a package are much shorter than those of PCB traces, the decisions over type of integration and how functions are distributed will have direct impacts on performance.</p>
<p>For example, TSVs are very large compared to logic gates and other circuit features. Thus, the number and location of TSVs is crucial. With too many TSVs, the wire length goes up. TSVs cause coupling, which can be reduced by adding space to their keep-out zones – but this adds to the area. TSVs also cause mechanical stress, which can impact the performance of nearby devices.</p>
<p>Given these considerations, a TSV-aware 3D floorplanning capability is quite challenging. It must provide an abstraction level to capture all the die, and provide a unified representation of intent for placement and routing tools.</p>
<p>A 3D floorplanner should work in the X, Y, and Z directions, and should have visibility into the top and bottom of each die. This helps optimize the placement of blocks, TSVs, and micro-bumps, and shortens interconnect distances, thus improving performance and power. For continuous design convergence, micro-bump and TSV assignments should take into account the floorplans on adjacent die.</p>
<p>Ideally, a 3D floorplan will be aware of thermal issues and will help avoid hotspots. It will also help users determine the optimal placement of die into stacks. The order of the stack is important. Die in the middle are most susceptible to thermal problems.</p>
<h2>3. Implementation</h2>
<p>The techniques used by tools for synthesis, placement, and routing change in the 3D-IC environment. For example, there are new layout rules that may be driven by features on adjacent die. The back-side redistribution layer (RDL) is a new layout layer. And given their size, TSVs themselves are a significant new layout feature.</p>
<p>A digital implementation system that supports 3D-ICs must be “double-sided aware,” taking into account both the top and bottom of each die. This may call for a new modeling and database infrastructure, TSV-specific tools, and support for a variety of stacking styles.</p>
<p>Power planning for a single-die IC is hard enough. With a 3D stack, it gets more complex. Designers need to provide enough power to drive all of the die, including the top-most die. Designers must manage vertical voltage drop and reliably simulate system power consumption. Tools need to support power distribution for TSVs and micro-bumps. A unified representation of power intent should be carried across the entire 3D-IC design.</p>
<p>Place and route tools should include thermal constraints to avoid hot spots. Routing tools need to handle TSVs and micro-bumps properly, route signals across multiple die, and verify the bump alignment between adjacent die.</p>
<p>Managing clocks across multiple die while avoiding skew is another challenge. If there are different clocks for different die, the designer must figure out how to synchronize them.</p>
<p>Analog implementation environments also need to add support for 3D-ICs. Examples of useful capabilities include multi-chip visualization with background views; support for bump, TSV, and reverse-side routing; and connectivity extraction maintained through TSV connections.</p>
<p>Throughout the design convergence process, design intent must be maintained and checked, and the necessary abstraction techniques must be applied for proper implementation and analysis.</p>
<h2>4. Extraction and analysis</h2>
<p>Extraction and analysis tools are crucial for design convergence in any IC-design environment. However, existing extraction and analysis tools need to be extended for 3D-ICs. For example, the tools must consider RLC parasitics for TSVs, micro-bumps, and interposer routing. Further, analysis tools must be 3D-aware. Timing, signal integrity, power, and thermal gradients must be analyzed across multiple die. Multi-die static timing must be validated, with an understanding of interactions between multiple die and with the package.</p>
<p>Because the metal stack creates heat gradients, thermal analysis and signoff is critical, especially for die located in the middle of the stack. Further, the substrate thinning required for 3D stacks results in relatively poor heat dissipation. After placement and routing, thermal signoff is needed to ensure hot spots are below specified limits, and that thermal effects do not have a negative impact on performance or leakage.</p>
<p>Signoff raises new questions with 3D-IC stacks. For example, can design rule checking (DRC) and layout-versus-schematics (LVS) run on the entire stack? Can timing be verified for the entire stack? Is there any crosstalk between die?</p>
<p>Electromagnetic interference (EMI) is a possible concern for 3D-ICs, raising a potential need for analysis tools. A multi-die package offers less shielding than a single-die package, and thus offers more likelihood that emissions could escape.</p>
<p>Finally, to facilitate TSV connections, the wafer is thinned to implement a 3D-IC. This causes stress and adds susceptibility to thermal changes, which need to be accounted for in the design.</p>
<h2>5. Design for test (DFT)</h2>
<p>Test raises many challenges for 3D-ICs, including access to die inside a stack and proper handling of thinned wafers. Both new standards and tool support are required to help validate that design intent is maintained once 3D-IC silicon is realized, and to diagnose issues properly if the system doesn’t behave as intended.</p>
<p>Like conventional single-die IC test, 3D-IC test must be considered at two levels – wafer test (for the silicon die), and package test (after die assembly into the package). The difference is that in the 3D-IC fabrication, there are many more intermediate steps, such as die stacking and TSV bonding. This provides many more opportunities for wafer test before final assembly and packaging.</p>
<div class="article_figure"><a class="figure" title="Figure 2 : Test access for a 3D stack requires pre-planning" href="http://www.techdesignforums.com/practice/files/2013/05/tdf_cdn_3d_test.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/05/tdf_cdn_3d_test_med.jpg" alt="Test access for a 3D stack requires pre-planning" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Figure 2 </span>Test access for a 3D stack requires pre-planning</p></div>
<p>Wafer test is needed for cost optimization. If a die is bad, it can be thrown away before it is placed in a package. If a package-level test fails, the entire package would have to be thrown away. Thus, wafer test is highly desirable, especially early in the product lifecycle while defects may still be relatively high.</p>
<p>But wafer test for 3D-ICs is challenging for three reasons. First, today’s probe technology is unable to handle the finer pitch and dimensions of TSV tips, and is generally limited to handling several hundred probes, whereas the TSVs may have several thousand probes. Second, probe technology leaves scrub marks that can potentially cause problems with the downstream bonding step. Finally, wafer test requires the creation of a known-good die (KGD) stack. To stack known-good die, the wafer must be thinned by about 75 per cent so the tips of the TSVs can be exposed. However, as the thinned wafer is contacted by a wafer probe, there’s a danger of damaging the wafer.</p>
<p>3D-ICs also introduce new intra-die defects. These may be introduced by new manufacturing steps such as wafer thinning, or by bonding the top of a TSV to another wafer. Thermal effects are another potential sources of defects, because excessive heat may be generated from the densely packed stack of dies. Thermo-mechanical stress is caused by different thermal coefficients of the various materials in the stack. Despite the differences in the manufacturing steps, the resulting faults (shorts, opens, delay defects) appear to be similar to what we see in conventional ICs. It is possible that new fault models may be required as we get more empirical data.</p>
<p>Modeling defects through TSV-based interconnects is a new area. These defects may be introduced in the fabrication or the bonding of TSVs. Fortunately, defects introduced through TSVs can be mapped to existing fault models, such as opens, shorts, static, delay, and bridging faults. However, a methodology is needed to map TSV defects to known fault types.</p>
<p>A sound methodology for 3D-IC test includes a DFT architecture that provides efficient ways to control and observe individual die from the chip I/Os, while providing different test access modes (such as a mode for a known good die test or a known good stack test). Conventional DFT architectural approaches and techniques such as on-chip compression, boundary scan, memory built-in self-test (MBIST), reduced pin count testing, and on-chip clocking for at-speed test are broadly applicable, and need to be configured and optimized to meet 3D controllability and observability goals. The trick is one of making an intelligent allocation of DFT resources across the multiple die to minimize the area overhead, while meeting constraints for test cost and shipped product quality.</p>
<h2>6. IC/package co-design</h2>
<p>Developers of 3D-ICs need to remember that any electronic product includes three different fabrics – chips, packages, and boards. Designing the chip first and throwing it “over the wall” to package and board designers will not result in design convergence on an optimal, cost-effective solution.</p>
<p>If the chip, package and board are not designed co-operatively, the interconnect will not be optimized, and extra vias will be needed to handle signals that cross from one point to another. As a result, performance will be reduced, additional board layers may be needed, and board and package costs may rise. Further, without co-design, timing, power, and signal integrity will not be optimized.</p>
<p>IC/package co-design is important for 3D-ICs because there are a large number of I/Os, and because the cost of packaging goes higher with multiple die in one package. Without co-optimization, the package could end up costing more than the silicon die. Important capabilities include I/O feasibility planning, connectivity management, 3D visualization, SiP layout, and support for multi-fabric analog and RF circuitry. To ensure complete design convergence, the packaging tool must understand the IC and package design intent, and should effectively abstract the IC design database to provide constraint-driven layout of the package substrate.</p>
<p>The board must be considered as well. 3D die stacks result in additional interconnect that will have to find its way down to the board. As more connectivity is handled inside the package, there’s less complexity on the board. The board designer needs to know what’s going to be positioned near the 3D package. By positioning and rotating components properly, the designer can reduce the number of layers required for the board.</p>
<p>Some companies drive co-design from the board up. They know where components will go on the board, and those locations are fixed. They then design the package that contains the stacked die in order to optimize connectivity, and to allow the minimum number of layers on the PCB. But it’s not so important where co-design starts – what’s important is that it is done to assure convergence for the 3D-IC silicon-realization process.</p>
<h2>7. A flexible ecosystem</h2>
<p>To be successful, 3D-ICs need to be designed and produced in a cost-effective way, with sufficient turnaround time to meet market windows. This will be possible only with a robust and well-defined supply chain ecosystem, including semiconductor design companies, EDA vendors, IP suppliers, foundries, and outsourced semiconductor assembly and test (OSAT) providers.</p>
<p>Boundaries between the different players may start to blur. For example, when are TSVs created and who is responsible? Possible implementation steps include:</p>
<ul>
<li>Via first – wafer processing starts with TSVs and is done by the foundry.</li>
<li>Via middle – TSVs are created after transistors, but before back end of line (BEOL), by the foundry.</li>
<li>Via last – TSVs are created after BEOL, probably by the OSAT.</li>
</ul>
<p>Since one size does not fit all with 3D-ICs, the supply chain needs to be adaptable to customer needs.</p>
<p>Many customers will want to line up a second-source 3D packaging service provider before production is started. Additionally, key strategic alliances between memory suppliers, logic IDMs, foundries, and packaging subcontractors will need to be forged.</p>
<div class="article_figure"><a class="figure" title="Figure 3 : Effective 3D-IC design requires collaboration" href="http://www.techdesignforums.com/practice/files/2013/05/tdf_cdn_3d_collab.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/05/tdf_cdn_3d_collab_med.jpg" alt="Effective 3D-IC design requires collaboration" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Figure 3 </span>Effective 3D-IC design requires collaboration</p></div>
<p>Foundries are establishing design rules, create models and libraries, and provide process design kits (PDKs) and reference flows. One example of a design rule is to avoid placing TSVs too close to active devices, because they cause mechanical stress that may change the performance of the device. Tools must be aware of recommended TSV diameters and pitches. They need to understand distances between TSVs as well as the width of metal routing to TSVs.</p>
<p>OSATs will play a role in assembling early 3D stacks and interposer configurations, combining die from different foundries, and developing tests for 3D stacks – but in the long term they will have to compete with foundries who are pulling OSAT tasks in-house.</p>
<h2>8. 3D-IC standards</h2>
<p>Standards will become an important part of the 3D-IC ecosystem. An initial standards effort may focus on defining a taxonomy of terms. Down the road, I/O standardization between interfaces such as memory, logic, and interposer layers will be helpful.</p>
<p>Meanwhile, the 3D-IC Alliance is focusing on the manufacturing side, and has released the Intimate Memory Interconnect Standard (IMIS) to standardize vertical interconnect requirements. Another area calling out for standardization is 3D-IC test. Two emerging standards &#8211; IEEE 1149.7 compact JTAG and IEEE P1687 internal JTAG (iJTAG)—can be deployed together to embed test structures in 3D-ICs.</p>
<p>The IEEE 1500 standard for embedded core test makes the pins of an IP core controllable and observable.</p>
<p>The same principle could potentially be used to access individual die in a 3D stack. The IEEE 1500 “core test wrapper” concept places a DFT wrapper around a core. In a 3D-IC, this concept could place an entire die in a wrapper and make it accessible through a product-level I/O interface. The same test patterns could be reused at the package test level.</p>
<h2>Conclusion</h2>
<p>From a design standpoint, the good news is that extensive retooling is not needed for 3D-ICs. Although there are clear requirements for 3D-IC development, the fundamental underpinning for tools is in place. There is no need to acquire a new ‘3D’ design system. There are also no apparent showstoppers in process technology. However, new capabilities are needed in the areas discussed above, such as architectural analysis, floorplanning, place and route, thermal analysis, timing, signal integrity, IC/package co-design, and test.</p>
<p>Above all, a comprehensive solution is needed. Many 3D stacks will combine digital and analog/RF circuitry, requiring a strong analog/mixed-signal capability. Because of the unique packaging requirements of stacked die, an IC/package co-design capability is a must. Additionally, fitting 3D-ICs on a board is challenging, requiring a capable PCB layout system with appropriate analysis tools. Thus, anyone who presents a complete “solution” must provide expertise in digital, analog, IC, package, and PCB design.</p>
<h2>About the author</h2>
<p><em>Steve Carlson is marketing group director for silicon realization at Cadence Design Systems.</em></p>
<h2>Contact</h2>
<p>Cadence Design Systems<br />
2655 Seely Avenue<br />
San Jose, CA 95134</p>
<p>T: +1 (408) 943 1234<br />
W: <a href="http://www.cadence.com" target="_blank">www.cadence.com</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.techdesignforums.com/practice/technique/eight-requirements-3d-ic-design/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Keeping high-speed designs clean with ERC</title>
		<link>http://www.techdesignforums.com/practice/technique/high-speed-designs-erc/</link>
		<comments>http://www.techdesignforums.com/practice/technique/high-speed-designs-erc/#comments</comments>
		<pubDate>Wed, 08 May 2013 07:41:04 +0000</pubDate>
		<dc:creator>Paul Dempsey</dc:creator>
				<category><![CDATA[Design Integrity]]></category>
		<category><![CDATA[PCB Topics]]></category>
		<category><![CDATA[electrical rule checking]]></category>
		<category><![CDATA[EMI/EMC]]></category>
		<category><![CDATA[High Speed Effects]]></category>
		<category><![CDATA[IP]]></category>
		<category><![CDATA[Power Integrity]]></category>
		<category><![CDATA[signal integrity]]></category>

		<guid isPermaLink="false">http://www.techdesignforums.com/practice/?p=5136</guid>
		<description><![CDATA[Electrical rule checks (ERC) are now available to deal with increasing PCB design complexity, speed project delivery and protect the intellectual property within them.]]></description>
				<content:encoded><![CDATA[<p>Designers must confront faster clock speeds and driver edge rates, increasing net densities and a growing number of constrained nets.</p>
<p>There are numerous challenges to ensuring a ‘clean design’. They include signal integrity, reliable timing, well behaved waveforms, acceptable crosstalk, adequate power distribution, and electromagnetic interference (EMI) levels that meet global regulatory requirements. These can no longer be overcome by using outdated design methods and procedures that lack adequate simulation coverage, and performance verification techniques that jeopardize first time design success or profitable performance yields.</p>
<h2>Everything matters</h2>
<p>For today’s high-speed designs, <em>everything matters</em>. That includes the electrical performance characteristics of the connectors, the properties of the PCB interconnect and dielectric, the test and measurement setup, and of course the design tools employed.</p>
<p>Many designs use leading-edge FPGAs to target 40G/100G Ethernet applications. Others use multi-GHz microprocessors for laptops and tablets. And often, they incorporate DDR2/DDR3 wide bus memories. All require very dense interconnect implementations with lightning fast signals. The ‘Everything Matters’ world of design demands an approach that quickly and accurately checks electrical performance robustness as schematic and layout design proceeds.</p>
<p>We can think of such a capability as synonymous with a yearly health checkup that includes X-rays/MRIs and various other tests. Those tests help lead to a diagnosis and hopefully a remedy or care plan to rectify any problem. The equivalent for PCB design is an EDA tool that runs appropriate tests on a layout to identify potential electrical rule violations so that more in-depth simulation and analysis can be proscribed to prevent catastrophic design flaws slipping through the final sign-off gate (Figure 1).</p>
<div class="article_figure"><a class="figure" title="Figure 1 : <i>PCB design performance checkup results</i> (Source: Mentor Graphics)" href="http://www.techdesignforums.com/practice/files/2013/05/tdf-ment-pcb1may-fig1-lg.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/05/tdf-ment-pcb1may-fig1-med.jpg" alt="<i>PCB design performance checkup results</i> (Source: Mentor Graphics)" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Figure 1 </span><i>PCB design performance checkup results</i> (Source: Mentor Graphics)</p></div>
<h2>Design intellectual property</h2>
<p>In their efforts to manage complexity, teams rely heavily on master designers with extensive experience and know-how. They help to develop sets of guidelines, blueprints for success that can be carefully followed throughout the design process.</p>
<p>Many companies consider this special design knowledge and know-how proprietary. Often, they consider it to be among their most sensitive intellectual property (IP). Converting this IP into usable guidelines poses several challenges and risks.</p>
<p>It end-users typically have a broad range of skills and understanding themselves that must be considered during guideline development. The guidelines must avoid any ambiguous and be complete. Nevertheless, the intent of the master designer is often misunderstood or the rule is applied incorrectly. This issue is further complicated once individual design/CAD teams need to interpret and derive actionable physical and electrical design rules to be followed during circuit design and layout.</p>
<p>Physical design rules tend to be straightforward. They often deal with common routing issues like trace spacing, width, via clearances, and so on. Constructing electrical rules can be more challenging.</p>
<p>Each electrical rule has an impact on the final geometrical layout but is based upon a much deeper understanding of the electrical behavior of certain physical representations. Electrical rules are intended to help the layout team avoid inadvertently creating EMI, electromagnetic compatibility (EMC) and electromagnetic coupling. They are also meant to ensure signal integrity (SI) and power integrity (PI). As one would expect, arranging, say, the physical layout of components, signal interconnects, and power planes to comply with electrical rules is not intuitive. It usually depends on visual inspection and measurement to confirm rules are being followed.</p>
<p>As in-house guidelines are more and more frequently augmented by others from targeted IC vendors, many resulting designs are ultimately coming to include some significant deviation from the combined rules in one way or another. As design complexity and performance increase, the risk of such deviation also logically increases. Senior managers struggle with this dilemma every day.</p>
<h2>Electrical rules-driven performance verification</h2>
<p>Faithfully following electrical rules derived from design guidelines provides an adequate, but not always mandatory, condition for a successful design. Often, design tradeoffs exist where overdesign to meet one rule saves enough design margin for another rule to pass without violations.</p>
<p>This common occurrence may lead you to believe that not all guidelines are necessary and that some may even mask the real performance condition of the design. The goal for any electrical rule checking (ERC)-driven performance verification flow should be to verify if overall electrical performance meets expectations, not the enforcement of each individual physical routing rule.</p>
<p>One new approach to consider is a design flow that allows design teams to &#8216;DARE&#8217; &#8211; that is, to easily Detect, Analyze, Repair, and Evaluate potential electrical design flaws early and often as the layout hardens (Figure 2).</p>
<p>Once the design requirements have been determined and pre-layout circuit design starts wrapping up, layout begins. Here, performance-critical portions of the design can be among the first layout structures to be checked and verified against performance objectives. Using HyperLynx ERC early and often to guide layout designers towards performance-friendly physical implementations will reduce the rework required as more of the layout topology hardens.</p>
<p>Running ERC to detect potential performance design flaws enables the layout team to iterate immediately as they seek to resolve issues through layout alternatives. However, some SI/PI design flaws do require a deeper understanding. These are candidates for simulation under HyperLynx SI, HyperLynx PI, or HyperLynx 3D EM to analyze and quantify the severity of the violation.</p>
<p>Critical design flaws are then flagged and passed back to the layout team for further repair. This simulation triage process continues as the layout matures and until all ERC violations have been addressed either through layout repair or an engineering-led violation mitigation design practice. At that point, a final performance evaluation ERC run should be run as part of larger electrical rule signoff procedure to double check that no catastrophic flaws have been missed.</p>
<div class="article_figure"><a class="figure" title="Figure 2 : <i>ERC-driven performance verification flow</i> (Source: Mentor Graphics)" href="http://www.techdesignforums.com/practice/files/2013/05/tdf-ment-pcb1may-fig2-lg.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/05/tdf-ment-pcb1may-fig2-med.jpg" alt="<i>ERC-driven performance verification flow</i> (Source: Mentor Graphics)" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Figure 2 </span><i>ERC-driven performance verification flow</i> (Source: Mentor Graphics)</p></div>
<p>Adopting this type of ERC-driven performance verification methodology truly reduces the chances of hard-to-detect electrical design flaws passing manufacturing signoff. Furthermore, overall design quality is improved as are design performance margins. And every design still benefits from a master designer’s quality review before release.</p>
<h2>Keep it clean with ERC</h2>
<p>ERC not only helps improve simulation coverage, but also provides a secure method for capturing and applying sensitive master designer know-how.</p>
<p>The Mentor Graphics HyperLynx ERC tool helps keep designs clean of hard-to-detect electrical design flaws. It supports custom rule definition using standard scripting languages.</p>
<p>Rule developers leverage work converting design guidelines to actionable layout rule modes by capturing intent in the form of programmable electrical rule descriptions. The IP contained within each description can be encrypted, locking the contents from outside tampering. The encrypted rule set can then safely be shared with internal design teams or with business partners and customers (see Figure 3).</p>
<div class="article_figure"><a class="figure" title="Figure 3 : <i>Secure deployment of electrical rule sets</i> (Source: Mentor Graphics)" href="http://www.techdesignforums.com/practice/files/2013/05/tdf-ment-pcb1may-fig3-lg.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/05/tdf-ment-pcb1may-fig3-med.jpg" alt="<i>Secure deployment of electrical rule sets</i> (Source: Mentor Graphics)" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Figure 3 </span><i>Secure deployment of electrical rule sets</i> (Source: Mentor Graphics)</p></div>
<p>Encryption opens the door for ERC developers to consider enriching rule sets with sensitive design know-how in the form of algorithmically-rich computational light rules. This fills a gap between those design teams that do not use any simulation tools and those that rely on SI/PI experts for help with performance verification. In the former case, design teams lack the expertise to adequately operate sophisticated SI/PI tools but do desire to improve design performance predictability. In the latter case, there typically are not enough SI/PI experts available to the team or enough time to exhaustively simulate the entire design.</p>
<p>These performance indicator rules are really lightweight analysis engines. They are not intended to replace the SI/PI analysis tool, but rather to be used to better screen potential electrical design faults by reporting a quantitative measure of ‘goodness’ against SI/PI metrics. Faults detected by these types of rule sets help the design team focus on precise locations that warrant deeper SI/PI simulation and analysis. Or better yet, they can direct the layout team to iterate, attempting to resolve violations through alternative layouts without requiring help from an SI/PI expert.</p>
<p>Only rule violations that cannot be resolved through layout iterations reach the SI/PI expert, where deeper simulation and analysis is applied to help quantify the seriousness of the violation and determine if other rule violation mitigation techniques should be employed. Using this methodology helps guarantee that the complete design is being checked and only mission-critical performance issues are being passed on to the SI/PI experts for resolution.</p>
<h2>Taking ERC to the next level</h2>
<p>Most designers will see the value in an ERC-driven performance verification methodology. But some may ask, “How does this new way of driving performance analysis fit into the real world way of doing design that entails multiple PCBs, multiple geographically dispersed design teams, and chipsets originating from multiple vendors?”  The answer is two-fold:</p>
<ol>
<li>The pervasive availability of separate private (internal-use only) and public (customer/partner use) rules sets helps ensures designers are checking the right things.</li>
<li>The adoption of an ERC solution that supports multi-rule sets, multi-user, and multi-engine models is one that fits the preferred design strategies prevalent among many enterprise-wide ventures.</li>
</ol>
<p>As IC vendors, ODM vendors, and others realize that an encrypted ERC rule set is an elegant way of delivering lightweight analysis engines for performance checking, the number and variety of commercially available rule sets will increase. Vendors, design centers and customer marketing/support teams can use them to ensure that chipset-centric design guidelines are observed during the layout process. This will help make sure that a PCB moves to manufacturing on time and the forecast delivery for IC vendor chipsets is not disrupted.</p>
<p>In cases where certain PCBs within a multi-board system are outsourced to an ODM, ERC rule sets can augment/replace visual inspection methods used during typical incoming inspection and design reviews. The ODMs certainly possess extensive design knowledge for most contracted designs. Each ODM can develop an encrypted ERC rule set to be used as part of standard in-house design practices and it can be passed on to customers to help speed up their incoming inspection and acceptance of the design.</p>
<p>Encrypted HyperLynx ERC rule sets enable universities, consultants, material suppliers and component providers in the PCB design and manufacturing food chain to deliver their own electrical rule IP securely to PCB engineers. This will become increasingly important as PCB speeds increase and thereby require more exhaustive performance verification.</p>
<p>Component manufacturers especially can help their customers ensure proper use of their products by providing rule sets that determine the power delivery/isolation rules or recommended interconnect topologies for critical signals that should followed. The types and varieties of rule sets that improve overall design quality and performance margins are left to the clever minds of the rule set developer.</p>
<div class="article_figure"><a class="figure" title="Figure 4 : <i>Pervasive HyperLynx ERC rule sets</i> (Source: Mentor Graphics)" href="http://www.techdesignforums.com/practice/files/2013/05/tdf-ment-pcb1may-fig4-lg.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/05/tdf-ment-pcb1may-fig4-med.jpg" alt="<i>Pervasive HyperLynx ERC rule sets</i> (Source: Mentor Graphics)" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Figure 4 </span><i>Pervasive HyperLynx ERC rule sets</i> (Source: Mentor Graphics)</p></div>
<p>In cases where multiple PCBs are designed across geographically dispersed design teams, you need a work flow where ERC occurs concurrently with layout. Under this model, layout engineers can choose to run multiple rule sets on a network of computers. A concurrent analysis tool manager (Figure 5) enables many different ERC runs based upon multiple rule sets to be assigned to different network servers as layout progresses. As the number of PCBs under design, the number of designers actively performing layout, and design sizes and rule set complexity increase, there will be a growing need to launch multiple rule checks and to manage violation resolution concurrently with the layout.</p>
<div class="article_figure"><a class="figure" title="Figure 5 : Multi-user, multi-rule, multi-engine concurrent HyperLynx ERC (Source: Mentor Graphics)" href="http://www.techdesignforums.com/practice/files/2013/05/tdf-ment-pcb1may-fig5-lg.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/05/tdf-ment-pcb1may-fig5-med.jpg" alt="Multi-user, multi-rule, multi-engine concurrent HyperLynx ERC (Source: Mentor Graphics)" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Figure 5 </span>Multi-user, multi-rule, multi-engine concurrent HyperLynx ERC (Source: Mentor Graphics)</p></div>
<h2>Electrical rule checks help ensure a clean design</h2>
<p>The electrical performance of all PCBs comprising an entire system can now be assessed in a timely manner for compliance against original design performance goals. As rule violations are detected, the simulation triage described earlier is applied to ensure only critical errors receive more detailed SI/PI simulations to evaluate whether or not each violation requires repair. Higher quality designs with better design margins that reach production volume sooner are the benefits to those companies that &#8216;DARE&#8217; to change the way they verify electrical performance.</p>
<h2>About the author</h2>
<p><em>Rod Dudzinski is responsible for IE3D market and business development at Mentor Graphics. He has a successful history of leading the development and introduction of new IC place and route, timing/circuit simulation and modeling, and EM Synthesis products into the EDA marketplace. Rod’s experience spans 25 years. Prior to Mentor, Rod held positions at Zeland Software, Lorentz Solution, Cooper &amp; Chyan Technologies and Cadence Design Systems.</em></p>
<h2>Contact</h2>
<p><em>Mentor Graphics</em><br />
<em>Corporate Office<br />
</em><em>8005 SW Boeckman Rd<br />
</em><em>Wilsonville<br />
</em><em>OR 97070<br />
</em><em>USA</em></p>
<p><em>T: +1 800 547 3000<br />
</em><em>W: <a href="http://www.mentor.com" target="_blank">www.mentor.com</a></em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.techdesignforums.com/practice/technique/high-speed-designs-erc/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Better analysis helps improve design quality</title>
		<link>http://www.techdesignforums.com/practice/technique/analysis-reduce-design-complexity/</link>
		<comments>http://www.techdesignforums.com/practice/technique/analysis-reduce-design-complexity/#comments</comments>
		<pubDate>Tue, 07 May 2013 21:36:06 +0000</pubDate>
		<dc:creator>Luke Collins</dc:creator>
				<category><![CDATA[Verification]]></category>
		<category><![CDATA[assertions]]></category>
		<category><![CDATA[clock domain crossing]]></category>
		<category><![CDATA[clock gating]]></category>
		<category><![CDATA[methodology]]></category>

		<guid isPermaLink="false">http://www.techdesignforums.com/practice/?p=5112</guid>
		<description><![CDATA[Better upfront analysis can help avoid propagating errors from RTL into the netlist, and reveal a number of ways to improve the quality of your final design.]]></description>
				<content:encoded><![CDATA[<p>Today’s systems on chip (SoC) are deeply complex in new ways. A dozen or so years ago, a state-of-the-art processor such as the Intel Pentium 4 used 42 million transistors, was built on a 180nm process and relied upon discrete chips to handle its system interfaces. Scroll forward, and the Westmere processor that Intel introduced in 2012 uses 2.6 billion transistors and is built on a 32nm process. The chip includes ten 64bit x86 cores, L3 cache, graphics processing, DDR3 interfaces, virtualisation support and more. This trend to massive integration is even stronger in the mobile space, where SoCs bring together complex computing, communications and entertainment functions on one die.</p>
<p>It’s no longer possible to design all the subsystems of an SoC from scratch and expect to get the chip out in a reasonable timeframe, so today’s SoCs are complex integrations of new logic, IP blocks brought forward from previous designs, and functional and interface IP licensed in from third parties. Some companies are even using third-party IP to build their system interconnect, on the basis of that its communications management support and interfaces to other IP blocks will help get a design out more quickly. In effect, an SoC is a sea of interfaces.</p>
<p>The use of IP enables complex products such as the iPad, which is good for the consumer, but brings integration challenges for SoC designers and the verification teams that ensure their designs will work as planned. Individual IP blocks may be as complex as entire SoCs of five years ago, and may have internal clocking and power-management strategies which SoC designers need to be aware of. The integration of these complex blocks means that clock signals may have to negotiate up to 100 asynchronous clock domains as they cross block interfaces. Similarly, systemic power management strategies may involve coordinating power management within a block and among many blocks.</p>
<p>Managing the verification of such complex systems is challenging. The designs are large, so designers need tools with very high capacities. They need to be able to control the rising tide of uncertainty caused by clock signals that cross domains, and power-management strategies that create unknown (X) logic states when they blocks are turned on and off. Most of all, designers need these tools to tackle such problems at the highest level of abstraction possible, to speed up the verification process and stop the issues multiplying and becoming more obscure as the RTL design is decomposed to gates. Clock domain crossing (CDC) tools, engineered to recognize and analyze crossings for problems, are essential to help control the verification complexity involved in tackling a full SoC.</p>
<p>With the right tools, overcoming these issues can mean more than just a problem solved: by applying the right analytic tools, the design can actually be functionally improved. For example, linting tools that give early indications of potential design-for-test problems help avoid synthesizing untestable logic. Tools that understand the X states that are created at domain boundaries in complex designs, and how they propagate, can help avoid too much optimism about their impact at the RTL level (where X states can hide real issues), and too much pessimism about their impact at the gate level. The result: better analysis leads to better designs.</p>
<p>The same is true with ‘resetability analysis’, which considers whether it is necessary to take a reset signal to every node that the design says needs one, or whether it is possible to restrict the distribution of reset signals to a subset of critical nodes. If an early analysis shows that some nodes can do without a reset signal without affecting the logical function, which means less routing and lower power consumption on the final chip. Again: better analysis leads to a better design.</p>
<p>As SoCs have become more complex, so has the task of verifying that what is implemented on chip is what the designer intended. No single verification approach can deliver the certainty that design teams need to tape out, but a suite of tools that each address a particular issue can help build that confidence. Applying the tools may also do more than avoid errors: better analysis early in the design process can avoid issues propagating, and, by highlighting which issues matter and which can be safely ignored, give designers the freedom to improve their designs.</p>
<h2>Contact</h2>
<address>Real Intent</address>
<address>990 Almanor Avenue, Suite 220<br />
Sunnyvale, CA 94085<br />
phone: 408-830-0700<br />
fax: 408-737-1962<br />
web: www.realintent.com<br />
email: info@realintent.com</address>
<address> </address>
]]></content:encoded>
			<wfw:commentRss>http://www.techdesignforums.com/practice/technique/analysis-reduce-design-complexity/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Physical verification of finFET and FD-SOI devices</title>
		<link>http://www.techdesignforums.com/practice/technique/physical-verification-design-finfet-fd-soi/</link>
		<comments>http://www.techdesignforums.com/practice/technique/physical-verification-design-finfet-fd-soi/#comments</comments>
		<pubDate>Thu, 02 May 2013 12:43:38 +0000</pubDate>
		<dc:creator>Luke Collins</dc:creator>
				<category><![CDATA[Verification]]></category>
		<category><![CDATA[FD-SOI]]></category>
		<category><![CDATA[fin quantization]]></category>
		<category><![CDATA[finFET]]></category>
		<category><![CDATA[physical verification]]></category>

		<guid isPermaLink="false">http://www.techdesignforums.com/practice/?p=4956</guid>
		<description><![CDATA[A look at some of the design and physical verification challenges of working with finFET and FD-SOI devices, including their impact on layout, DRC and LVS.  ]]></description>
				<content:encoded><![CDATA[<p>As the minimum dimension of planar transistors  has fallen below 90nm, their effectiveness as a true On/Off switch has been undermined by the increasing proximity of the source and drain at either end of the device channel. In such small devices, the length of the transistor’s conduction channel underneath the gate has become comparable to the width of the depletion layer around the source and drain regions, as shown.</p>
<div class="article_figure"><a class="figure" title="Figure 1 : A short-channel device's channel length is  comparable to the depletion widths associated with the drain and source, so edge effects cannot be neglected" href="http://www.techdesignforums.com/practice/files/2013/05/tdf-snpspvff-fdsoi-1lrg.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/05/tdf-snpspvff-fdsoi-1med.jpg" alt="A short-channel device's channel length is  comparable to the depletion widths associated with the drain and source, so edge effects cannot be neglected" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Figure 1 </span>A short-channel device's channel length is  comparable to the depletion widths associated with the drain and source, so edge effects cannot be neglected</p></div>
<p>This creates a number of ‘short-channel’ effects, including:</p>
<p>- <em>Threshold voltage roll-off:</em> The threshold voltage of the device is reduced because the depletion regions of the source and drain intrude into the channel, change its effective geometry and therefore make it easier for current to flow from source to drain. The lowered threshold voltage makes it more difficult to turn the transistor fully Off, increasing its leakage current and therefore power consumption.</p>
<p>- <em>Drain-induced barrier lowering (DIBL):</em> As the source and drain get closer together, they become electrostatically coupled, such that the drain bias can affect the potential barrier to current flow at the source junction, increasing the threshold voltage roll-off, and hence the leakage.</p>
<p>- <em>Charge mobility degradation:</em> Atomistic effects such as surface scattering, velocity saturation, impact ionization, and hot-electron effects, reduce the mobility of both electron and hole charge carriers. Device architects have tried various ways to strain the crystal lattice of the channel to make it easier for charge carriers to flow through it, hence restoring its conductivity.</p>
<p>- <em>Threshold voltage variation:</em> This is due to statistical irregularities in dopant concentration, which become significant when the devices are so small.</p>
<p>Short-channel effects have negated many of the benefits of scaling transistors from 28nm to 20nm and have led to concerns about the economic viability of planar semiconductor processes at 20nm and below.</p>
<h2><b>Alternative transistor designs </b></h2>
<p>The industry has responded by suggesting two alternate device architectures, known as finFET and FD-SOI, that mitigate the effects. The finFET moves the channel out of the bulk silicon into a vertical fin and wraps it on three sides with a gate electrode, improving the electrostatic control of the channel. The silicon-on-insulator (SOI) transistor is a planar device whose channel is built in such a thin (shallow) silicon layer that the gate electrode can exercise full electrostatic control of the charge carriers in it.</p>
<h3><b>FinFETs</b></h3>
<p>FinFETs <a href="http://www.techdesignforums.com/practice/guides/finfets/"><i><span style="text-decoration: underline">(</span></i><em><span style="text-decoration: underline">Guide</span><span style="text-decoration: underline">)</span></em></a> are built as shown in this diagram.<b></b></p>
<div class="article_figure"><a class="figure" title="Figure 2 : A finFET’s wrap-around gate is more effective in switching the transistor off, thereby reducing leakage current (Source: Synopsys)" href="http://www.techdesignforums.com/practice/files/2013/05/tdf-snpspvff-fdsoi-2lrg.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/05/tdf-snpspvff-fdsoi-2med.jpg" alt="A finFET’s wrap-around gate is more effective in switching the transistor off, thereby reducing leakage current (Source: Synopsys)" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Figure 2 </span>A finFET’s wrap-around gate is more effective in switching the transistor off, thereby reducing leakage current (Source: Synopsys)</p></div>
<p>The strong electrostatic control afforded by ensuring that the gate is close to all areas of the channel makes it easier to control the device’s threshold voltage, reducing leakage between source and drain, as well as making it possible to switch it more quickly.</p>
<p>FinFETs can be made on regular silicon wafers, or on SOI wafers, as shown.</p>
<div class="article_figure"><a class="figure" title="Figure 3 : FinFETs can be made on bulk silicon or on SOI wafers (Source: Synopsys)" href="http://www.techdesignforums.com/practice/files/2013/05/tdf-snpspvff-fdsoi-3lrg.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/05/tdf-snpspvff-fdsoi-3med.jpg" alt="FinFETs can be made on bulk silicon or on SOI wafers (Source: Synopsys)" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Figure 3 </span>FinFETs can be made on bulk silicon or on SOI wafers (Source: Synopsys)</p></div>
<p>Building finFETs on SOI is simpler than building them in bulk, because the etch process that forms the fin comes to a natural stop when it reaches the buried oxide layer of SOI, rather than having to be controlled by a timed etch process, subject to process variation, in bulk silicon. Building finFETs on SOI may also lead to more rectangular fins with more regular heights, since on bulk silicon the shallow trench isolation oxide that is necessary to insulate one fin from the next can’t be planarized, leading to variability in the height of the fin above the oxide.</p>
<p>The main manufacturing challenges for finFETs are:</p>
<p>- controlling the etch along the edges of these tall structures to generate uniform fin widths and achieve good edge verticality</p>
<p>- uniformly doping the resulting complex 3D surfaces</p>
<p>- depositing all the films used in the gate stack so that they conform to the surface of the fin</p>
<p>Meeting these challenges requires new materials and more processing steps than used in bulk or SOI devices, making them more expensive to produce.</p>
<p>From an electrical perspective, the gate length (L) of a finFET is equal to the thickness of the gate (t<sub>fin</sub>), while the gate width (W<sub>eff</sub>) is the periphery of the gate along both sides on the fin plus the thickness of the fin: W = 2 * H + t, as shown.</p>
<div class="article_figure"><a class="figure" title="Figure 4 : The electrical dimensions of a finFET, showing the effective gate width W = 2 * H + t (Source: Synopsys)" href="http://www.techdesignforums.com/practice/files/2013/05/tdf-snpspvff-fdsoi-4lrg.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/05/tdf-snpspvff-fdsoi-4med.jpg" alt="The electrical dimensions of a finFET, showing the effective gate width W = 2 * H + t (Source: Synopsys)" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Figure 4 </span>The electrical dimensions of a finFET, showing the effective gate width W = 2 * H + t (Source: Synopsys)</p></div>
<p>The fin’s thinness enables strong electrostatic control over the channel’s charge carriers, but limits the number of carriers and hence the channel’s drive strength. To overcome this problem, multiple fins can be connected in parallel, all driven by the same gate. The consequence of the finFET’s topology is that the choice of device width is restricted to multiples of W<sub>eff</sub>.</p>
<h3><b>Benefits and risks of finFETs</b></h3>
<p>&nbsp;</p>
<table style="margin-left: 120px" border="1" cellspacing="0" cellpadding="40">
<tbody>
<tr>
<td width="221">
<p style="text-align: center" align="center"><strong>Strengths</strong></p>
</td>
<td style="text-align: center" width="216">
<p align="center"><strong>Weaknesses</strong></p>
</td>
</tr>
<tr>
<td width="221">Significant reduction in power consumption (~50% over 32nm)</td>
<td width="216">Very restrictive design options, especially for analog &#8211; Transistor drive strength is quantized to multiples of a single fin width</td>
</tr>
<tr>
<td width="221">Faster switching speed</td>
<td width="216">Fin width variability and edge quality leads to variability in threshold voltage VT</td>
</tr>
<tr>
<td width="221">Effective speed/power trade-off possible with multi-Vt</td>
<td width="216">Extra manufacturing complexity and expense (~+3% according to Intel)</td>
</tr>
<tr>
<td width="221">Availability of strain engineering</td>
<td width="216"></td>
</tr>
<tr>
<td width="221"></td>
<td width="216"></td>
</tr>
<tr>
<td width="221">
<p align="center"><strong>Opportunities</strong></p>
</td>
<td width="216">
<p align="center"><strong>Threats</strong></p>
</td>
</tr>
<tr>
<td width="221">Low power makes 20nm technology deployable for mobile applications</td>
<td width="216">The potentially superior electrical performance and simpler manufacturing of fully depleted SOI</td>
</tr>
<tr>
<td width="221">Increase CPU speeds beyond 4GHz</td>
<td width="216"></td>
</tr>
</tbody>
</table>
<p style="text-align: center"><b> </b></p>
<h3><b>Fully depleted silicon-on-insulator</b></h3>
<p>Another way to control short-channel effects in deep submicron transistors is to make planar devices on SOI wafers, forming the channel in a thin silicon layer above the buried oxide. As discussed, the silicon layer is so thin that the gate can influence the whole channel and so completely shut off conduction.</p>
<p>There are two types of SOI devices: partially depleted (PD-SOI) and fully depleted SOI (FD-SOI).</p>
<p>In PD-SOI, the silicon film is 50nm to 90nm thick, so the gate’s influence cannot reach through the channel’s full depth, making the PD-SOI device behave like a slightly better bulk MOSFET.</p>
<p>In FD-SOI<a href="http://www.techdesignforums.com/practice/guides/fd-soi/"><i><span style="text-decoration: underline"> (</span></i><em><span style="text-decoration: underline">Guide</span><span style="text-decoration: underline">)</span></em></a> the silicon film is 5nm to 20nm thick, so that the gate’s influence penetrates the full depth of the channel. The conduction channel doesn’t need to be doped, which eliminates a source of process variability, and the fact that it sits on an insulating layer means there is nowhere for unwanted current paths to form. This in turn relaxes the performance necessary from the gate, and so means that a thicker gate oxide can be used, reducing gate leakage. One advantage of FDSOI over finFETs is that it is possible to back-bias the channels through substrate contacts and so gain even greater control over the threshold voltage.</p>
<p>The primary disadvantage of FD-SOI is the cost of SOI wafers, which is due to the difficulty of accurately controlling the silicon film thickness across the entire wafer. The film thickness is important because the threshold voltage is dependent upon it, so if the film thickness varies across the wafer, so will the device performance. Some manufacturers claim to be able to control the film thickness across a 300mm wafer to within +/-0.5nm.</p>
<p>One drawback of FD-SOI is that it is harder to implement multiple V<sub>t</sub> circuitry, which is useful for creating areas of higher-performance or lower-power devices, because it is not possible to alter the channel doping to affect the devices’ threshold voltage. Instead, the threshold voltage is controlled by tuning the gate-stack materials. However, a combination of doping and active biasing can be used to alter the threshold of a target group of transistors that have a common sub-oxide backplane. Another disadvantage of FD-SOI is that it is not possible to selectively strain the channel to improve carrier mobility.</p>
<h3><b>Benefits and risks of FD-SOI</b></h3>
<p>&nbsp;</p>
<table style="margin-left: 120px" border="1" cellspacing="0" cellpadding="40">
<tbody>
<tr>
<td width="221">
<p align="center"><strong>Strengths</strong></p>
</td>
<td valign="top" width="216">
<p align="center"><strong>Weaknesses</strong></p>
</td>
</tr>
<tr>
<td valign="top" width="221">Significant reduction in power consumption</td>
<td valign="top" width="216">High cost of initial wafers (~+10% over regular wafers, according to Intel)</td>
</tr>
<tr>
<td width="221">Faster switching speed</td>
<td width="216">Limited number of wafer suppliers</td>
</tr>
<tr>
<td width="221">Easier, standard manufacturing process</td>
<td width="216">Variability in VT due to variations in the thickness of silicon thin-film</td>
</tr>
<tr>
<td width="221">Availability of back-biasing to control VT</td>
<td width="216">Multi- VT more complex to implement</td>
</tr>
<tr>
<td width="221">No doping variability</td>
<td width="216">Lack of strain engineering</td>
</tr>
<tr>
<td width="221">Layout library compatible with existing bulktechnologies</td>
<td width="216">Thin channel limits drive strength</td>
</tr>
<tr>
<td width="221"></td>
<td width="216"></td>
</tr>
<tr>
<td width="221">
<p align="center"><strong>Opportunities</strong></p>
</td>
<td width="216">
<p align="center"><strong>Threats</strong></p>
</td>
</tr>
<tr>
<td width="221">Simpler and more flexible alternative to finFETs if wafer cost issue can be overcome<b></b></td>
<td width="216">High wafer cost threatens economic viability for wider market adoption<b></b></td>
</tr>
<tr>
<td width="221">Better controllability for analog applications</td>
<td width="216"></td>
</tr>
</tbody>
</table>
<h2></h2>
<h2></h2>
<p>&nbsp;</p>
<h2><b>Designing with finFETs </b></h2>
<h3><b><i>Layout</i></b></h3>
<p>Early results suggest that the impact on digital design of using finFETs need not be that great if conservative approaches are taken. It should be possible to migrate circuits created for advanced planar processes to finFETs, using a lot of modeling and simulation to recheck the circuit performance.</p>
<p>One key difference between finFET-based design and that using conventional planar devices is that the freedom to choose the device’s drive strength is reduced as the drive strength can only be increased during layout by adding fins. The effective width of the device becomes quantized, and the quantization effect is worse for smaller transistors for which the next step up from the minimum-size device is one that is twice as wide.</p>
<p>In terms of optimization for power, the finFET provides circuit designers with the opportunity to trade leakage for switching speed and create fast/medium/slow versions of the base transistor.</p>
<h3><b><i>Design rule checking</i></b></h3>
<p>DRC for finFETs implies fin-related checks &#8211; such as of fin width and fin-to-fin spacing &#8211; that are similar to regular planar checks. There are also a number of checks that are specific to finFET processes, such as checking that fins and poly are confined to regular, lithographically-friendly pitches in a fixed orientation.</p>
<p>A typical set of finFET physical verification rules contains the elements listed in Table 1 and illustrated in Figure 9:</p>
<div class="article_figure"><a class="figure" title="Table 1 : Example of classes of DRC checks for finFET spacing (Source: Synopsys)" href="http://www.techdesignforums.com/practice/files/2013/05/tdf-snpspvff-fdsoi-table1-lrg.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/05/tdf-snpspvff-fdsoi-table1-med.jpg" alt="Example of classes of DRC checks for finFET spacing (Source: Synopsys)" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Table 1 </span>Example of classes of DRC checks for finFET spacing (Source: Synopsys)</p></div>
<p>&nbsp;</p>
<p>RX and RXFIN are drawn layers. RX is drawn as in prior technologies except for some gridding constraints imposed by RXFIN.</p>
<p>RXFIN over RX is ‘active’. RXFIN not over RX is ‘dummy’. An RXFIN will often have portions that are active and portions that are dummy.</p>
<p>Active portions must be surrounded by three dummies. This is enforced through the RXFIN_enclosure rules.</p>
<p>RXFIN_enclosure is a generated shape that encloses a region of fins with a common pitch.</p>
<div class="article_figure"><a class="figure" title="Figure 5 : <b> </b>Typical physical parameters used for DRC verification of finFETs (Source: Synopsys)" href="http://www.techdesignforums.com/practice/files/2013/05/tdf-snpspvff-fdsoi-5lrg.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/05/tdf-snpspvff-fdsoi-5med.jpg" alt="<b> </b>Typical physical parameters used for DRC verification of finFETs (Source: Synopsys)" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Figure 5 </span><b> </b>Typical physical parameters used for DRC verification of finFETs (Source: Synopsys)</p></div>
<h3><b><i>Layout vs schematic</i></b></h3>
<p>Device parameter extraction for finFETs is similar to planar devices, including the need to consider layout-dependent effects. Unlike planar devices, the key feature of gate width cannot be measured directly from the two-dimensional layout. The effective gate width depends on the fin height, the distance that the fin protrudes above the oxide surface. However, all the fins are the same height, so Hfin can be entered as a process  parameter and used to calculate effective device width, as before.</p>
<p>Another LVS concern for finFETs is the need to accurately extract the source/drain resistance, which forms a significant component of the transistor’s parasitic series resistance.</p>
<h2><b>Designing with FD-SOI</b></h2>
<h3><b><i>Layout</i></b></h3>
<p>In principle, it is possible to directly port a cell library from a bulk process on to an FD-SOI process. However, the results will not be as good as dedicated cell libraries designed to take advantage of the different balance of capacitances in FD-SOI devices and the reduced variability of undoped channels.</p>
<p>The <a href="http://www.soiconsortium.com"><span style="text-decoration: underline">SOI Consortium</span></a> has proposed a ‘safe’ way to adopt FD-SOI, using hybrid FD-SOI/bulk co-integration. In this approach, IP cores that are deemed too risky to redesign are placed on to areas of the wafer that expose the bulk silicon, while synthesized logic optimized to use FD-SOI structures is placed on areas of the wafer that retain the buried oxide layer.</p>
<p>The use of FD-SOI could have a significant impact on design if designers take up the opportunity to dynamically back-bias the channel, which gives better control over VT variability and so improves device switching speed.</p>
<h3><b><i>DRC/LVS</i></b></h3>
<p>Since FD-SOI is a planar technology, DRC and LVS checking is the same as for existing bulk technologies.</p>
<h2><b>Conclusion </b></h2>
<p>Short-channel effects severely limit the performance of bulk planar transistors at process nodes below 90nm.Two alternative devices are being proposed for process nodes below 20nm: finFETs and FD-SOI. Both are practical solutions that solve the major short-channel difficulties, both have been proven on real designs, and both are backed by major semiconductor companies. It is not yet clear which technology will predominate or if they will continue to co-exist.</p>
<p>In terms of physical verification, both processes are well supported by Synopsys’ physical verification tools. All new physical checks are well within the capabilities of Synopsys’ DRC solution, and the new device parameters can be extracted with Synopsys’ LVS tools.</p>
<p><em>(To read what attendees at the December 2012 International Electron Device Meeting felt about the relative merits of finFETs and FD-SOI in practice, <a href="http://www.techdesignforums.com/blog/2012/12/11/fd-soi-vs-finfet/">click here</a>.)</em></p>
<h2><b>Authors</b></h2>
<p><em>Marc Swinnen received a Master’s Degree in Electrical Engineering from the Catholic University of Leuven in Belgium and an MBA from San Jose State University. He has over 20 years of experience in Marketing and Technical Support at various EDA companies. Marc has been at Synopsys for 7 years, where he is Senior Product Marketing Manager working with IC Validator.</em></p>
<p><em>Ron Duncan is a Sr. Manager at Synopsys in Mountain View, CA.  His team supports IC Validator and PrimeYield. Ron has worked at Hewlett Packard, ISS and Avant!, prior to Synopsys. His semiconductor and EDA experience spans physical design and verification, circuit simulation, device design, mask synthesis, TCAD and semiconductor manufacturing.  He holds electrical engineering degrees from MIT and Cornell. </em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.techdesignforums.com/practice/technique/physical-verification-design-finfet-fd-soi/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Knock down the wall to SoC integration</title>
		<link>http://www.techdesignforums.com/practice/technique/soc-integration-embedded-software/</link>
		<comments>http://www.techdesignforums.com/practice/technique/soc-integration-embedded-software/#comments</comments>
		<pubDate>Tue, 30 Apr 2013 12:04:18 +0000</pubDate>
		<dc:creator>Luke Collins</dc:creator>
				<category><![CDATA[Embedded Topics]]></category>
		<category><![CDATA[Integration & Debug]]></category>
		<category><![CDATA[debug]]></category>
		<category><![CDATA[embedded software]]></category>
		<category><![CDATA[SoC]]></category>
		<category><![CDATA[virtual prototyping]]></category>

		<guid isPermaLink="false">http://www.techdesignforums.com/practice/?p=4988</guid>
		<description><![CDATA[SoC integration can be accelerated by using virtualization to make the benefits of emulation more accessible to both hardware and software engineers.    ]]></description>
				<content:encoded><![CDATA[<p>Software engineers are too often treated as an afterthought in the SoC integration and verification process, in which hardware-software integration remains a rigidly serial process, despite software&#8217;s <a href="http://www.deepchip.com/items/0522-01.html" target="_blank">increasing domination</a> of the overall system function. Traditional verification technologies that are supposed to enable such integration actually prevent it.</p>
<p>System-on-chip (SoC) verification environments have not been very software-aware, limiting the productivity and quality gains available from better hardware-software integration. Software engineers face the same time-to-market pressures as their hardware counterparts, and sometimes greater ones if design-in customers want early access to the chip. We must knock down the wall between the two disciplines.</p>
<p>There is now a productized technology that can help do just that by making hardware models available early to software engineers. Hardware and software can be designed, verified, and improved in unison, boosting productivity.</p>
<h2><b>The Great Wall</b></h2>
<h3><i>FPGA prototyping</i></h3>
<p>FPGA prototypes are widely used but impose a serial flow. The software team cannot get a development board until the RTL is finished. If the design is a moving target and is a multi-processor SoC with more than a few million gates, this inserts unnecessary delay into HW/SW integration.</p>
<p>Then, when designs become too large to fit onto one FPGA board, mapping the RTL becomes a real problem. The typical performance of FPGA software means it can take many hours or days to compile and then place-and-route a design on the full prototype. ECOs expand the turnaround time further before verification can resume.</p>
<p>Add to that the difficulties in gaining enough visibility into a design for effective debugging with standard FPGA verification techniques, and the whole process becomes yet more difficult and unproductive.</p>
<h3><i>Accelerated simulation with VIP</i></h3>
<p>Accelerated simulation using verification IP speeds up hardware design. But it does not bring the software and integration teams into the loop.</p>
<p>Although testbench-based simulation environments (e.g., SystemVerilog/UVM for hardware verification) are ideal for testing and debugging RTL IP, software developers cannot easily run their tests and use their tools concurrently in this environment. We need an alternative approach to system-level verification.</p>
<h3><i>Emulation</i></h3>
<p>Emulation can make a difference, but in-circuit emulation (ICE) effectively strands the emulator in the lab. Access becomes a special privilege, largely confined to hardware engineers. Software teams get very limited access or companies must buy and maintain multiple emulator setups.</p>
<p>Multiple emulators—each perhaps needing multiple hardware speed adaptors to model protocol host and peripheral connections—fill the lab with fault-vulnerable cables, present pin limitations, and incur high costs.</p>
<h2><b>Fenced in</b></h2>
<p>Thus, there remains an unnecessarily high wall between hardware and software engineers: different verification silos; different languages; related but disconnected problems.</p>
<p>The verification process must reflect the importance of software, since it constitutes a large and increasing share of any SoC, is updated more frequently than hardware, and may be being worked upon by ten times as many engineers as the hardware.</p>
<p>In consumer electronics, hardware will often undergo a respin every few months. This leaves almost no time for software teams to do their work if they need to wait for hardware prototypes.</p>
<h2><b>Tear down the wall</b></h2>
<p>A new virtual lab use-model for emulation helps tear down the wall, shortens development time, and enhances verification robustness.</p>
<p>It replaces physical hardware models with virtual peripherals that run target hardware protocols in software, moving the emulator out of the lab and into the datacenter. The emulator can be accessed like a server by everyone in the company. Virtual labs provide stable 24/7 emulation environments in which hardware and software meet and evolve together.</p>
<div class="article_figure"><a class="figure" title="Figure 1 : <i>With VirtuaLAB, the software environment host and peripheral protocols </i><i>run in software</i> (Source: Mentor Graphics)" href="http://www.techdesignforums.com/practice/files/2013/04/tdf-ment-socint-fig1-lrg.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/04/tdf-ment-socint-fig1-med.jpg" alt="<i>With VirtuaLAB, the software environment host and peripheral protocols </i><i>run in software</i> (Source: Mentor Graphics)" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Figure 1 </span><i>With VirtuaLAB, the software environment host and peripheral protocols </i><i>run in software</i> (Source: Mentor Graphics)</p></div>
<p>Virtual models can be connected or disconnected in software by choosing a particular configuration of a compiled design under test (DUT) rather than by connecting different pieces of hardware with cables. The emulator can be accessed simultaneously by software, hardware, and integration engineers without disruption. This approach can be applied both to a single project and to multiple projects running at the same time.</p>
<p>Where ICE typically provides access for a handful of engineers per emulator, a virtual lab can provide access for hundreds. That’s a dramatic expansion of a valuable resource.</p>
<p>With 24/7 remote access and lower downtime due to fewer physical connection issues (no more kicked cables or running out of pins), the cost-benefit advantages over ICE become even greater.</p>
<p>Plus, because users can run host connections through a virtual machine, they don’t have to worry about crashing the verification process if they address a memory that does not yet exist.</p>
<p>With a virtual lab, the global software team gets access to hardware while it is still in RTL, running against multiple peripherals, on a single emulator. Developers can target it from remote PCs, using the real OS, drivers, stack, and applications rather than hardware targets running on speed adaptors.</p>
<p>A virtual lab using independently validated protocol models, and running on a Mentor Graphics <a href="http://www.mentor.com/products/fv/emulation-systems/" target="_blank">Veloce</a> emulator, brings the SoC verification environment closer to the real, externally-connected one with which the target device will ultimately interact. The protocol models are the same IP and same software, delivering the same functionality as ICE-based speed adapter solutions. But they are now delivered as software products, or Veloce <a href="http://www.mentor.com/products/fv/emulation-systems/virtual-devices" target="_blank">VirtuaLAB</a> components.</p>
<div class="article_figure"><a class="figure" title="Figure 2 : <i>VirtuaLAB components provide models for Flash memories, DDRs, USB, PCIe, Ethernet, SATA, HDMI, and other AV standards</i> (Source: Mentor Graphics)" href="http://www.techdesignforums.com/practice/files/2013/04/tdf-ment-socint-fig2-lrg.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/04/tdf-ment-socint-fig2-med.jpg" alt="<i>VirtuaLAB components provide models for Flash memories, DDRs, USB, PCIe, Ethernet, SATA, HDMI, and other AV standards</i> (Source: Mentor Graphics)" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Figure 2 </span><i>VirtuaLAB components provide models for Flash memories, DDRs, USB, PCIe, Ethernet, SATA, HDMI, and other AV standards</i> (Source: Mentor Graphics)</p></div>
<p>As more embedded software is verified with hardware in virtual labs, the productivity of SoC verification immediately increases. Going forward, the technique will raise our understanding of how software debug can be improved, and thereby also improve the efficiency of hardware-software integration.</p>
<h2><b>Further enhancing the virtual lab</b></h2>
<p>Multicore SoCs are encountering limitations in traditional hardware-based debug connections, such as JTAG. This is also driving interest in software-based alternatives.</p>
<p>JTAG is intrusive because it suspends the target processor during interactive debug, performing thousands of additional clock cycles while the state of the rest of the design advances. In a multiprocessor design, this means other processors continue clocking normally, becoming out of sync with the target processor. What is debugged would not occur in the real software-hardware environment, rendering the process ineffective.</p>
<p>Mentor Graphics’ Codelink software provides a non-intrusive debug connection that passively streams data from the design (i.e., monitoring rather than interrupting the process as JTAG does). Codelink offers a more transparent way of performing multiprocessor design debug, and the virtual lab is a key enabler both for using it in emulation and delivering it to teams worldwide.</p>
<h2><b>Conclusion</b></h2>
<p>The virtual lab method puts emulation into the datacenter, providing server-like access. It is all about bringing software engineering into the heart of verification. Software and integration engineers have been knocking on the wall long enough; it’s time we knocked it down.</p>
<h2><b>Author</b><b> </b></h2>
<p><em>Richard Pugh is the product marketing manager for Mentor’s Emulation Division.</em></p>
<h2>Contact</h2>
<p><em>Mentor Graphics</em><br />
<em>Corporate Office<br />
</em><em>8005 SW Boeckman Rd<br />
</em><em>Wilsonville<br />
</em><em>OR 97070<br />
</em><em>USA</em></p>
<p><em>T: +1 800 547 3000<br />
</em><em>W: <a href="http://www.mentor.com/" target="_blank">www.mentor.com</a></em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.techdesignforums.com/practice/technique/soc-integration-embedded-software/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>IP-to-SoC prototyping demands consistency</title>
		<link>http://www.techdesignforums.com/practice/technique/ip-to-soc-fpga-prototyping-consistency/</link>
		<comments>http://www.techdesignforums.com/practice/technique/ip-to-soc-fpga-prototyping-consistency/#comments</comments>
		<pubDate>Wed, 24 Apr 2013 12:19:13 +0000</pubDate>
		<dc:creator>Paul Dempsey</dc:creator>
				<category><![CDATA[Assembly & Integration]]></category>
		<category><![CDATA[EDA Topics]]></category>
		<category><![CDATA[IP Topics]]></category>
		<category><![CDATA[Verification]]></category>
		<category><![CDATA[FPGA-based prototypes]]></category>
		<category><![CDATA[IP]]></category>
		<category><![CDATA[SoC]]></category>

		<guid isPermaLink="false">http://www.techdesignforums.com/practice/?p=4939</guid>
		<description><![CDATA[Many problems arise during the IP-to-SoC phase of FPGA-based prototyping due to the mix-and-match nature of the prototypes not the actual designs.]]></description>
				<content:encoded><![CDATA[<p>Porting your IP to the FPGA-based prototype of an SoC design is hazardous. Particularly for new blocks, the process often involves some form of translation that can introduce errors. Sometimes, the system design team finds itself repeating verification already undertaken by the IP’s developers. These are just two of the more common concerns.</p>
<p>Overall, engineers need an infrastructure that reduces risk, reflects shrinking project deadlines and brings greater consistency to the IP-to-SoC flow. Let’s look at why you should and how you can achieve that.</p>
<h2>The risks of mix-and-match</h2>
<p>It is often the case that the IP-to-SoC FPGA-based prototyping process will involve different tools and different boards across the two stages. Bridging this divide is where many of the problems arise. They can be illustrated by looking at the main obstacles presented in terms of how the IP is delivered to the SoC team. Typically, this is done in one of three ways:</p>
<ol>
<li> As RTL</li>
<li> Encapsulated as a ‘physical’ hardware deliverable</li>
<li> As both an IP block and an FPGA binary</li>
</ol>
<h3>1. RTL deliverables</h3>
<p>The RTL option has the benefits that it represents a portable ‘soft’ delivery and provides such flexibility that the code can be tailored to fit any SoC platform.</p>
<p>But SoC teams will often find themselves duplicating much of the earlier IP prototyping and verification effort.</p>
<p>If the platform and tools on which the IP was developed differ from those being used by the SoC team, there is a greater likelihood of model mismatches. This mismatch issue is obviously a concern where working with third-party IP suppliers.</p>
<p>However, a further issue arises for third-party partners: the lack of sufficient protection for the IP. Externally and internally, some encryption is desirable.</p>
<h3>2. Physical hardware deliverables</h3>
<p>A physical board offers the advantages of validated delivery in a format that should have been tuned to the configuration of the prototype. So, it should only involve a single prototyping effort, shouldn’t it?</p>
<p>Well, it might, but it is also expensive: there are now two prototyping boards where only one was probably necessary. However, this may be the design team’s preferred way of working. Still, there are other factors to consider.</p>
<p>In our inconsistent mix-and-match world, connecting the IP board to the SoC board typically entails extra work creating adapters and custom interfaces, both of which then need their own debug. There is also no guarantee of clock synchronization.</p>
<h3>3. FPGA binaries</h3>
<p>An FPGA binary looks like an all-round winner. It’s validated, it’s tuned and it involves a single prototyping effort.</p>
<p>But you can only feed forward these binaries where you are using the same FPGA platform at both the IP and SoC prototyping stages. Any mix-and-match of in-house and third-party boards makes it effectively impossible to do.</p>
<h2>Consistency with HAPS-70</h2>
<p>The obvious answer is to align your IP and SoC prototyping environments as closely as possible.</p>
<p>We’ve discussed juggling the use of different types of board and design software at the stages in your IP-to-SoC prototyping flow. Beyond that though, it’s fair to ask how many of the tools needed to realize both the IP and the SoC the prototyping board vendor provides. Often, the beyond-silicon support is limited and the gaps are significant.</p>
<p>Synopsys has developed an integrated flow based around existing tools and its Xilinx Virtex-7 2000T-based family of <a href="http://www.synopsys.com/Systems/FPGABasedPrototyping/haps70/Pages/default.aspx" target="_blank">HAPS-70 boards</a>, embracing both the hardware and the software needed to provide consistency. You can thereby minimize risk and reduce prototyping time.</p>
<p>Let’s look again at the three IP delivery models but in the context of HAPS-70.</p>
<h3>1. HAPS-70 and RTL deliverables</h3>
<p>IP can be developed and prepared for FPGA-based prototyping utilizing Synopsys’ <a href="http://www.synopsys.com/Systems/FPGABasedPrototyping/Pages/Certify.aspx" target="_blank">Certify</a> and <a href="http://www.synopsys.com/tools/implementation/fpgaimplementation/fpgasynthesis/pages/synplifypremier.aspx" target="_blank">Synplify Premier</a> software tools. These tools are applicable to usage at both the individual IP level and higher SoC level. This enables direct script reuse, saves time and increases confidence in the block being fed through.</p>
<p>The target hardware is also consistent at both stages. HAPS-70 has a modular and symmetrical system architecture to smooth the transfer from, say, a single to a multi-FPGA-based system. Moving daughter boards and related connection pin-outs has a minimal effect on timing and performance as the HAPS-70 I/O connector banks directly map to the super logic regions (SLRs) found on the 2000T. The 1:1 SLR-to-bank-to-HapsTrak connector ensures uniform timing characteristics.</p>
<div class="article_figure"><a class="figure" title="Figure 1 :  Failing to map directly to SLRs can lead to crossing delays" href="http://www.techdesignforums.com/practice/files/2013/04/tdf-haps70-boards-fig1-lg.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/04/tdf-haps70-boards-fig1-med.jpg" alt=" Failing to map directly to SLRs can lead to crossing delays" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Figure 1 </span> Failing to map directly to SLRs can lead to crossing delays</p></div>
<h3>2. HAPS-70 and physical deliverables</h3>
<p>Simply put, you’re now working withing the same IP-to-SoC flow, the same environment. There’s a single IP FPGA bit image file for both the IP and SoC stages. Connection between the two systems is over the common HapsTrak interface. Translation and debug risks are thus mitigated. Clocks and resets are synchronized by the inbuilt HAPS-70 capabilities.</p>
<p>The system allows for high-speed time-domain multiplexing between systems to raise FPGA-based prototype performance. Configuration over UMRBus is quick and easy. And the IP can now be locked and encrypted.</p>
<div class="article_figure"><a class="figure" title="Figure 2 :  Common interfaces reduce risks in developing custom alternatives and synchronize timing" href="http://www.techdesignforums.com/practice/files/2013/04/tdf-haps70-boards-fig2-lg.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/04/tdf-haps70-boards-fig2-med.jpg" alt=" Common interfaces reduce risks in developing custom alternatives and synchronize timing" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Figure 2 </span> Common interfaces reduce risks in developing custom alternatives and synchronize timing</p></div>
<h3>3. HAPS-70 and FPGA binaries</h3>
<p>Here again, the use of the same consistent platform overcomes the otherwise near insuperable obstacle facing this IP-to-SoC prototyping model.</p>
<p>In addition to your ability to feed through the same FPGA bit file of your IP, there are the further benefits that both daughter board placement and the connectors are identical.</p>
<h2>What we all really need</h2>
<p>By the time that a typical SoC project has reached FPGA-based prototyping, the design team wants to focus on issues that could affect the user experience in the target design. Do the various IP blocks in the design function as intended and fit together in the whole? What happens when you expose them to real-world inputs and multi-processor software?</p>
<p>If the prototyping infrastructure itself introduces delays and irrelevant bugs, the technique’s advantages are greatly diminished. The rational answer then is to adopt an IP-to-SoC infrastructure that offers most consistency from initial IP development through to final silicon.</p>
<p><a href="http://www.synopsys.com/Systems/FPGABasedPrototyping/haps70/Pages/default.aspx" target="_blank">HAPS-70</a> has been configured to do just that. With IP and SoC teams working in the same environment, communication between them will improve. And your engineers can focus on the needs of the final silicon as quickly and as efficiently as possible.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.techdesignforums.com/practice/technique/ip-to-soc-fpga-prototyping-consistency/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The rush to open source tools</title>
		<link>http://www.techdesignforums.com/practice/technique/the-rush-to-open-source-tools/</link>
		<comments>http://www.techdesignforums.com/practice/technique/the-rush-to-open-source-tools/#comments</comments>
		<pubDate>Wed, 24 Apr 2013 10:02:02 +0000</pubDate>
		<dc:creator>Paul Dempsey</dc:creator>
				<category><![CDATA[Architecture & Design]]></category>
		<category><![CDATA[Embedded Topics]]></category>
		<category><![CDATA[Integration & Debug]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[User Experience]]></category>
		<category><![CDATA[Open Source Tools]]></category>

		<guid isPermaLink="false">http://www.techdesignforums.com/practice/?p=4929</guid>
		<description><![CDATA[Mind how you go. The only truly free thing about open source tools is the download itself. There is, however, a 'third way', matching professional support to these often useful options.]]></description>
				<content:encoded><![CDATA[<p>Everyone is familiar with open source. Even the most conservative developers &#8211; and embedded software engineers almost invented the phrase, &#8220;If it ain&#8217;t broke, don&#8217;t fix it&#8221; &#8211; have considered using open source solutions to help them meet cost and time-to-market goals. Over the years, embedded versions of Linux have proved a good fit for many applications, particularly where real-time response and memory footprint are not serious constraints. Now, there is an increasing move toward using open source development tools.</p>
<p>Until recently, there were two main ways to set up embedded software development tools:</p>
<ol>
<li>Purchase proprietary tools from one of the specialist vendors. Such tools are not cheap, as they offer specialist functionality to a limited market (compared with desktop software development tools, for example). But good quality support is generally available, which is reassuring to a developer on a tight deadline.</li>
<li>Obtain open source tools by downloading them at no cost.</li>
</ol>
<p>The second option sounds attractive. For many, ‘open source’ means ‘free of charge’. However, this is only true really for the software download. The real costs are down the line. There are many issues to resolve. All take time and time = money.</p>
<ul>
<li>Which specific tools do you need?</li>
<li>Have you got the right desktop tools to build the embedded tools?</li>
<li>Which versions of the toolchain components do you need?</li>
<li>Which patches should be applied?</li>
<li>Do you know what works with what? (<a href="http://kegel.com/crosstool/crosstool-0.43/buildlogs/" target="_blank">This table</a> gives some clues as to the challenge)</li>
<li>How do you configure the toolchain components? (There are thousands of options!)</li>
<li>What CPUs do you need to support?</li>
<li>Do you need big- or little-endian support or both?</li>
<li>Have you got the test suites to validate the tools?</li>
</ul>
<p>An understanding of the cost of resolving these issues has led many developers to seek a ‘Third Way’. A number of specialist vendors supply embedded toolchains. These are based on open source technology and pre-configured ready for use. Value is added by including other tools and offering professional grade support instead of necessitating hours posting on forums.</p>
<p>This is a cost effective approach that offers the economy of open source software with the security and backup of a commercial product. More information can be found in <a href="http://www.mentor.com/embedded-software/resources/overview/gnu-toolchain-for-embedded-development-build-or-buy-5ad09fc9-82d2-49e8-88cc-15ad16ea5012" target="_blank">this whitepaper</a>.</p>
<p><em>Colin&#8217;s regular blog is located at: <a title="Colin Walls' blog at mentor.com" href="http://blogs.mentor.com/colinwalls" target="_blank">http://blogs.mentor.com/colinwalls</a>. He can be reached by email at colin underscore walls at mentor dot com.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.techdesignforums.com/practice/technique/the-rush-to-open-source-tools/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The five key challenges of 20nm custom and analog design</title>
		<link>http://www.techdesignforums.com/practice/technique/five-key-challenges-20nm-custom-design/</link>
		<comments>http://www.techdesignforums.com/practice/technique/five-key-challenges-20nm-custom-design/#comments</comments>
		<pubDate>Mon, 22 Apr 2013 09:53:05 +0000</pubDate>
		<dc:creator>Chris Edwards</dc:creator>
				<category><![CDATA[DFM]]></category>
		<category><![CDATA[IC Implementation]]></category>
		<category><![CDATA[14nm]]></category>
		<category><![CDATA[16nm]]></category>
		<category><![CDATA[20nm]]></category>
		<category><![CDATA[design rules]]></category>
		<category><![CDATA[double patterning]]></category>
		<category><![CDATA[finFET]]></category>
		<category><![CDATA[layout dependent effects]]></category>

		<guid isPermaLink="false">http://www.techdesignforums.com/practice/?p=4880</guid>
		<description><![CDATA[The arrival of the 20nm and finFET-based sub-20nm processes bring with them challenges for custom IC design. These are the five key areas and a methodology that can address them.]]></description>
				<content:encoded><![CDATA[<p>The new generation of 20nm and sub-20nm process nodes offer greatly improved circuit density and power/performance characteristics. At the same time, they raise new concerns for SoC integration, particularly with respect to mixed-signal and custom design flows. At 20nm and below, an increased amount of silicon IP must be obtained from multiple sources, and there will be increasing concerns about mixed-signal design, integration, and verification. Mixed-signal interactions will increase as more and more digital control circuitry is used, and as analog and digital components come into close proximity.</p>
<p>Of most concern to custom/analog designers are the challenges that arise from manufacturing complexity. On top of increasing timing, power, and area challenges, the 20nm process generation brings with it the deep and complex interdependency of manufacturing and variability. The five main challenges of 20nm custom and analog design comprise the use of double patterning; layout-dependent effects (LDE); the use of new types of local interconnect layers; a massively increased collection of design rules; and increasing device complexity and variation. Concerns about manufacturability issues vary according to design style. For example, analog and I/O designers are most concerned about LDE, circuit specifications, and area versus performance tradeoffs. Memory and standard cell designers are very concerned about density, and as such, double patterning and local interconnect are key concerns.</p>
<p>The solution to 20nm custom/analog challenges lies not only in new point tools, but in a new custom design methodology. In this methodology, circuit (schematic) and layout designers will work in close collaboration, and will have the ability to rapidly exchange information. Circuit designers will be able to obtain early parasitic estimates before the layout is completed. The flow will use in-design, signoff-quality engines as opposed to attempting to fix everything during the final signoff stage. And all tools will be “double patterning-aware” and ready for 20nm.</p>
<h2>1. Double Patterning</h2>
<p>The most-discussed manufacturability issue at 20nm is double patterning. This technology splits a layer into two separate masks so that 193nm lithography can print structures that are too close together to resolve with a single mask. This technology is needed to get current 193nm lithography equipment to print correctly when metal pitches are below 80nm, which will be the case for at least some of the metal layers for almost any 20nm design.</p>
<p>When double patterning is used, each mask is exposed separately, and the exposures overlap to create features that are half the pitch that would otherwise be printable with 193nm lithography.</p>
<p>The concept may be simple, but managing double patterning is difficult. It requires a two-color layout decomposition process in which alternate colors (such as red and green) are used to indicate which features will be placed on which mask. This results in added design rules that restrict the placement and proximity of layout features. For example, traces that are the same color can’t be placed too closely together.</p>
<div class="article_figure"><a class="figure" title="Figure 1 : Attempts to solve double patterning issues can easily lead to loops" href="http://www.techdesignforums.com/practice/files/2013/04/cdns_20nm_dp1.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/04/cdns_20nm_dp1_med.jpg" alt="Attempts to solve double patterning issues can easily lead to loops" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Figure 1 </span>Attempts to solve double patterning issues can easily lead to loops</p></div>
<p>It is very easy to create a design-rule checking (DRC) “loop,” which is a coloring conflict that cannot converge on a solution that works. And in many cases, it will be necessary to trace back a number of steps to unravel how the loop was created.</p>
<p>Handling double patterning properly is a major concern for custom designers of standard cells, memories, and I/Os. These designers must be cognizant of coloring as they create layouts to optimize area. It is difficult to achieve a high density while making the design decomposable. According to some reports, standard cells that previously took four hours to lay out sometimes take a week at 20nm, because designers have to keep re-running verification as they try to pack decomposable cells as tightly as possible.</p>
<p>Analog designers are concerned about the mismatches that an additional mask can cause. Double patterning impacts electrical performance because different masks on a given layer will shift during the manufacturing process. This mask shift causes variations that have a direct impact on RC and the interconnect. As a result, parasitic matching can become very challenging between elements that are rendered using different masks.</p>
<div class="article_figure"><a class="figure" title="Figure 2 : Mask shifts can occur with double patterning" href="http://www.techdesignforums.com/practice/files/2013/04/cdns_20nm_dp2.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/04/cdns_20nm_dp2_med.jpg" alt="Mask shifts can occur with double patterning" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Figure 2 </span>Mask shifts can occur with double patterning</p></div>
<p>EDA tools can help automate the colorized decomposition process and can help ensure correctness. In a 20nm-aware toolset, all physical design tools should be double patterning‒aware, including placement, routing, extraction, and physical verification. For example, extraction must be able to predict the capacitance variation resulting from mask shift.</p>
<p>Another capability that’s needed is automated color-aware layout. Here, color conflicts are avoided as the layout designer draws shapes and places cells. Once a shape is dropped into place, the tool automatically makes any needed coloring changes. Locking color choices down in the design phase sets a constraint for the manufacturer and helps to ensure that matching is correct by placing pairs of nets or devices on the same mask.</p>
<p>Instead of running signoff verification once every four hours, a quick double patterning check should be run after every editing command. In this way errors can be fixed quickly, and designers don’t end up with DRC loops that may take many steps to unwind.</p>
<h2>2. Layout-Dependent Effects</h2>
<p>At 20nm, it’s not enough to model the performance of a transistor or cell in isolation. The environment around a device will change its behavior, a phenomenon captured by the term LDE. Although LDE became apparent at 28nm, the problem has grown significantly worse at 20nm, where cells are much closer together. At 20nm, up to 30 per cent of device performance can be attributed to the layout ‘context’. For example, the voltage threshold of a transistor will change based on how close the device is to the edge of a well – this is called the well proximity effect.</p>
<p>Other modifiers of device performance include the distance between gates – including dummy poly – which has a direct effect on the drain current of the transistor. This is known as the poly spacing effect. Another major cause of LDE is mechanical stress, which is often intentionally induced to improve CMOS transistor performance. For example, a dual stress liner is a silicon nitride (SiN) capping layer that is intentionally deposited to be compressive on PMOS and tensile on NMOS. Such stresses improve the performance of either form of transistor. Nonetheless, it results in variability that may make it difficult to close timing. Stress is also unintentionally induced through technologies such as shallow trench isolation, which isolates transistors and determines active-to-active spacing.</p>
<p>LDE cannot be modeled in Pcells (parametized cells) or device models. As a result, it is no longer enough to create a schematic, pick a topology, run a simulation, and throw it over the wall to a layout designer. At 20nm, circuit designers have to consider layout context as well as device topology, and they need to simulate with layout effects prior to layout completion. This may sound paradoxical, but it is possible to identify devices that are sensitive to LDE and thus allow the designer to recognize which components need greater care in layout. Less sensitive devices can be placed closer to well boundaries, for example. Circuit designers can also make use of LDE simulation using partial layouts and LDE-aware layout module generators. Layout engineers can use LDE hotspot detection and fixing.</p>
<div class="article_figure"><a class="figure" title="Figure 3 : The increase in layout-dependent effects with reductions in geometry" href="http://www.techdesignforums.com/practice/files/2013/04/cdns_20nm_lde.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/04/cdns_20nm_lde_med.jpg" alt="The increase in layout-dependent effects with reductions in geometry" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Figure 3 </span>The increase in layout-dependent effects with reductions in geometry</p></div>
<p>The underlying technology that is needed is context-driven placement and optimization that can determine how different cells are going to interact and how the layout context affects timing and power. The design tool should take care of LDE during both schematic and layout phases. But it’s not just about tools. Circuit and layout designers have to learn to work together in new ways, with a much higher level of cooperation.</p>
<h2>3. Interconnect layers</h2>
<p>For many custom designers, 20nm is all about density. Local interconnect (LI) layers – also called middle-of-line (MOL) layers – offer a way to achieve very dense local routing below the first metal layer. There may be several of these layers available in a given process, and most don’t use contacts. Instead, they connect by shape overlap without any need for a cut layer. Getting rid of contacts makes routing denser because contacts are bigger than nets, and can’t be placed too close to nets.</p>
<div class="article_figure"><a class="figure" title="Figure 4 : New types of local interconnect enable increased density at 20nm" href="http://www.techdesignforums.com/practice/files/2013/04/cdns_20nm_ic.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/04/cdns_20nm_ic_med.jpg" alt="New types of local interconnect enable increased density at 20nm" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Figure 4 </span>New types of local interconnect enable increased density at 20nm</p></div>
<p>However, designers need to be aware that LI layers have their own restrictive design rules. For example, LI shapes can only be rectangles, and they often have fixed directions with length constraints.</p>
<h2>4. Design rules</h2>
<p>At 20nm, there may be more than 5,000 design rule checks. Some 1,000 new rules have been added since the 90nm process node, and double patterning alone requires 30 to 40 checks. The 20nm node adds another 400 new advanced rules such as wrong width and wrong spacing, discrete width and spacing, and special rules for pin access on cells. Custom and mixed-signal designers will face directional orientation rules, specific rules regarding length/width and transistor proximity, and new rules governing legal inter-digitation patterns.</p>
<p>An important point to remember is that 20nm doesn’t just bring in more rules; it brings in more complex rules. Simple rules of thumb or memorization won’t work any more. What’s needed is a design system that can check design rules as the design progresses, rather than waiting for a final signoff check.</p>
<h2>5. Device complexity and variation</h2>
<p>Transistors at 20nm are very small and very fast, and variation is a constant challenge. Transistors are sensitive to channel length and channel doping, and transistor behavior is subject to short-channel effects. Custom designers must minimize leakage, ensure reliability, and achieve reasonable yields.</p>
<p>The reality is that 20nm transistors were designed for achieving high densities in digital design, not for optimizing leakage or gain in custom/analog design. A limited set of device sizes are available for design. Width and length parameters are limited to a small set of values, and with fewer choices, manually tuning transistors to meet specs (such as gain) is difficult.</p>
<p>Below 20nm many, if not most, designs will use a new type of transistor called a FinFET. In a FinFET, the transistor gate wraps around three sides of the transistor’s elevated channel, or ‘fin’. This forms conducting channels on three sides of the vertical fin structure, providing much more control over current than planar transistors. Multiple fins can be used to provide additional current. FinFETs promise greatly reduced power at a given level of performance.</p>
<p>However, FinFETs raise some challenges for custom/analog designers. One constraint is that all the fins on the transistors on a given chip must be the same width and height. Designers can add fins to increase the width, but this can only be done in discrete increments—you can add 2 fins or 3 fins, but not 2.75 fins. In contrast, planar transistors can be adjusted to any channel width in order to meet specifications.</p>
<p>Other challenges include additional design rules, manufacturing variations in the width and height of fins, and metal and via resistance. SPICE models with additional parameters will be needed for the FinFETs, and simulators must be able to interpret them. Extraction tools must be aware of the capacitance and resistance that arises from 3D transistor structures. Layout tools will have to be optimized to handle FinFETs. And like any new technology, FinFETS will require ecosystem support including EDA tools, process design kits (PDKs), physical IP, and silicon-proven manufacturing processes.</p>
<h2>A new methodology</h2>
<p>To resolve the challenges described in the above sections, every tool in the custom/analog flow needs to be aware of the changes that 20nm brings, including double patterning, LDE, local interconnect, complex design rules, device variation, and FinFETs. But it is not enough to just improve tools. What is needed is a new methodology that provides a higher level of automation than existing flows. In this methodology, circuit and layout designers will exchange and share information, ”partial layout” will provide early estimates of parasitics and LDE, and an in-design signoff approach will greatly shorten final signoff runs.</p>
<p>The traditional custom/analog flow is a manual, “throw it over the wall” approach. Circuit designers do schematic entry and run an ideal simulation without layout parasitics. The design is tossed to layout designers who handle device creation, manual placement, and manual routing. Next comes physical verification, extraction, and a final simulation. It’s a time-consuming, serial methodology in which issues are exposed very late in the design process, and many design iterations may occur. Without changing this approach, blocks that took four hours to lay out at 28nm could easily take a week at 20nm.</p>
<div class="article_figure"><a class="figure" title="Figure 5 : A sub-25nm custom design methodology allows rapid information exchange between schematic and layout" href="http://www.techdesignforums.com/practice/files/2013/04/cdns_20nm_flow.jpg"><img src="http://www.techdesignforums.com/practice/files/2013/04/cdns_20nm_flow_med.jpg" alt="A sub-25nm custom design methodology allows rapid information exchange between schematic and layout" /></a></div><div class="article_figure"><p class="figure_wrapper"><span class="figure_title">Figure 5 </span>A sub-25nm custom design methodology allows rapid information exchange between schematic and layout</p></div>
<p>What is required is a more automated and collaborative methodology. Here, circuit designers draw schematics, just like they always have. But they also pass constraints to the layout designers, and run a pre-layout parasitic and LDE estimation. On the right side of the diagram, both circuit designers and layout designers can use Modgens – a Cadence term for automatic module generators – to quickly generate layouts for structures such as differential pairs, current mirrors, and resistor arrays. Although not final “DRC-clean” layouts, these automatically generated layouts allow accurate physical effects to be extracted, analyzed, and simulated. Modules can then be fed into an analog placer and assembled into a floorplan.</p>
<p>Layout engineers then perform device placement, routing, in-design signoff, and extraction. In short, the basic roles of circuit designers and layout designers remain the same, but there is an ongoing and rapid exchange of information and a high degree of collaboration. Constraints flow easily between the schematic and layout environments.</p>
<p>Automatic module generators help enable “partial layout,” which is the ability to quickly generate an extracted layout view so circuit designers can run simulations with real parasitic and LDE information. In this way, the electrical and mismatch problems that might be caused by layout effects can be spotted and remedied early in the design cycle. The module generators, however, must be ready for 20nm, with place-and-route engine support for 20nm design rules, complex abutment support, array-based FinFET configurations, coloring for double patterning, and LDE awareness.</p>
<p>Another approach for bringing physical effects into initial simulations is incremental design. Here, designers lay out the pieces they care about with assisted or full automation, and gather as much physical and electrical information as they can. This results in a partial layout extraction. The emphasis is on placement rather than routing. The point is that designers are not taking the time to do a full layout, just what’s necessary to generate the desired parasitic information.</p>
<p>Finally, in-design signoff is a must at 20nm. If a designer makes a mistake during layout, the feedback should come immediately, not four hours or a month later. Otherwise many design iterations may be needed for each block, with each iteration taking longer than it took to do the original design. There are two types of interactive editing checks that can help avoid those iterations. One is “in-edit checking,” which warns of errors while geometry is being created. Another is “post-edit signoff-quality checking,” which exercises a more robust check after each edit is completed.<br />
The key to in-design signoff is having “signoff-quality” engines that run during the design flow. This does not remove the need for a final signoff check, but it greatly reduces the amount of time and potentially the number of licenses that a final check might consume.</p>
<h2>Conclusion</h2>
<p>The performance, power, and density advantages of the sub-25nm nodes will provide competitive advantages to those who adopt them. Although there are a number of design and manufacturing challenges, the good news is that the challenges are manageable – but only if we look beyond individual tools and rethink the ways that custom and analog circuits are designed.</p>
<p>What’s needed is a new custom/analog methodology in which schematic and layout designers work in close cooperation, schematic designers can run prototype layouts to gather parasitic and LDE information, and signoff- quality engines permit “in-design signoff” that catches the vast majority of errors before the final signoff phase. Cadence Design Systems fully supports such a methodology today.</p>
<h2>About the author</h2>
<p><em>Steve Carlson is marketing group director for silicon realization at Cadence Design Systems.</em></p>
<h2>Contact</h2>
<p>Cadence Design Systems<br />
2655 Seely Avenue<br />
San Jose, CA 95134</p>
<p>T: +1 (408) 943 1234<br />
W: <a href="http://www.cadence.com" target="_blank">www.cadence.com</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.techdesignforums.com/practice/technique/five-key-challenges-20nm-custom-design/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
<!-- This Quick Cache file was built for (  www.techdesignforums.com/practice/feed/ ) in 0.64123 seconds, on May 24th, 2013 at 6:15 am PDT. -->
<!-- This Quick Cache file will automatically expire ( and be re-built automatically ) on May 24th, 2013 at 7:15 am PDT -->