EE115C Spring 2013 Digital Electronic Circuits. Lecture 19: Timing Analysis

Similar documents
Lecture 8: Skew Tolerant Design (including Dynamic Circuit Issues)

Lecture 20: Sequential Circuits. Sequencing

Lecture 8: Skew Tolerant Domino Clocking

PUF RO (RING OSCILLATOR)

A Heuristic Method for Statistical Digital Circuit Sizing

Statistical Static Timing Analysis: How simple can we get?

Physical Unclonable Functions (PUFs) and Secure Processors. Srini Devadas Department of EECS and CSAIL Massachusetts Institute of Technology

Sequential Gates. Gate Level Design. Young Won Lim 3/15/16


Chapter 7. Registers & Register Transfers. J.J. Shann. J. J. Shann

Statistical Skew Modeling for General Clock Distribution Networks in Presence of Process Variations

Moving PUFs out of the lab

The good, the bad and the statistical

FPGA PUF Based on Programmable LUT Delays

Modeling Logic Gates with Delay- Part#1

A Physical Unclonable Function based on Capacitor Mismatch in a Charge-Redistribution SAR-ADC

Practice 10: Ratioed Logic

TOSHIBA Transistor Silicon NPN Epitaxial Planar Type 2SC5086. Characteristics Symbol Test Condition Min Typ. Max Unit

TOSHIBA Transistor Silicon NPN Epitaxial Planar Type 2SC5066. Characteristics Symbol Test Condition Min Typ. Max Unit

Bitline PUF:! Building Native Challenge-Response PUF Capability into Any SRAM. Daniel E. Holcomb Kevin Fu University of Michigan

TB General description. 2. Features and benefits. 3. Applications. 4. Pinning information. 5. Ordering information

Soft Response Generation and Thresholding Strategies for Linear and Feed-Forward MUX PUFs

TOSHIBA Transistor Silicon NPN Epitaxial Type TMBT3904

Ultimate Control. Maxeler RiskAnalytics

Delay Budgeting in Sequential Circuit with Application on FPGA Placement

TOSHIBA Field Effect Transistor Silicon N Channel Junction Type 2SK mw

BLF7G20L-160P; BLF7G20LS-160P

Pathloss and Link Budget From Physical Propagation to Multi-Path Fading Statistical Characterization of Channels. P r = P t Gr G t L P

Passivated, sensitive gate thyristors in a SOT54 plastic package. General purpose switching and phase control.

EXERCISES ON PERFORMANCE EVALUATION

Finite state machines (cont d)

Secure and Energy Efficient Physical Unclonable Functions

ISI, RIN and MPN Modeling: Some Clarifications

± 0.2 ppm/ C ± 3 ppm/ C. ± 2.0 ppm/ C

PMBFJ111; PMBFJ112; PMBFJ113

Microprocessor Based Physical Unclonable Function

description/ordering information

50 V, 3 A PNP low VCEsat (BISS) transistor

Introduction. Introduction. Six Steps of PERT/CPM. Six Steps of PERT/CPM LEARNING OBJECTIVES

BAT54 series 1. Product profile 2. Pinning information Schottky barrier diodes 1.1 General description 1.2 Features and benefits

Anne Bracy CS 3410 Computer Science Cornell University

IMPROVED PRODUCT VCS331, VCS332

TOSHIBA Transistor Silicon NPN Epitaxial Type (PCT process) 2SC2712

DS9638 RS-422 Dual High Speed Differential Line Driver

TOSHIBA Field Effect Transistor Silicon N Channel MOS Type RFM07U7X

SELECTABLE GTL VOLTAGE REFERENCE

RN1114, RN1115, RN1116, RN1117, RN1118

A PUF Design for Secure FPGA-Based Embedded Systems

RN1401, RN1402, RN1403 RN1404, RN1405, RN1406

Employing Process Variation for Building Chip Identifiers

TOSHIBA Transistor Silicon NPN Epitaxial Type (PCT Process) (Bias Resistor built-in Transistor) RN1910, RN1911

AN021: RF MODULES RANGE CALCULATIONS AND TEST

RN1421, RN1422, RN1423, RN1424 RN1425, RN1426, RN1427

SN54HCT08, SN74HCT08 QUADRUPLE 2-INPUT POSITIVE-AND GATES

COS 318: Operating Systems. CPU Scheduling. Jaswinder Pal Singh Computer Science Department Princeton University

Winston Way and Trevor Chan NeoPhotonics, USA

COMP417 Introduction to Robotics and Intelligent Systems. Reinforcement Learning - 2

Analytics in 10 Micro-Seconds Using FPGAs. David B. Thomas Imperial College London

ITI LIMITED Phone: Palakkad Plant, Kanjikode West

DISCRETE SEMICONDUCTORS DATA SHEET

FSM Optimization. Outline. FSM State Minimization. Some Definitions. Methods 10/14/2015

Ultrafast epitaxial rectifier diode in a SOT226 (I2PAK) plastic package. Discontinuous Current Mode (DCM) Power Factor Correction (PFC)

A PUF Design for Secure FPGA-Based Embedded Systems

3.6V / 2600mAh Primary Lithium x 0.85 (60mm x 21mm) 1.0 oz (28 gr) -30 C to +77 C. Bluetooth Low Energy dBm. +5dBm. 1Mbit/s / 2Mbit/s*

Case Material: Molded Plastic. UL Flammability Classification Low Reverse Leakage Current

PMT High Voltage Power Supply Status

Secure Payment Transactions based on the Public Bankcard Ledger! Author: Sead Muftic BIX System Corporation

4. Absolute Maximum Ratings (Note) (Unless otherwise specified, T a = 25 ) Symbol. Note. V RM V R I O I FSM T j T stg.

ECE 5325/6325: Wireless Communication Systems Lecture Notes, Fall Link Budgeting. Lecture 7. Today: (1) Link Budgeting

Accelerating Financial Computation

TOSHIBA Transistor Silicon PNP Triple Diffused Type 2SA1941

TOSHIBA Field Effect Transistor Silicon N Channel Junction Type 2SK330

... OUTPUT VOLTAGE RANGE, ALL OUTPUTS

Active and Passive Side-Channel Attacks on Delay Based PUF Designs

AC Line Rated Ceramic Disc Capacitors Class X1, 400 V AC / Class Y4, 125 V AC

EELE 5414 Wireless Communications. Chapter 4: Mobile Radio Propagation: Large-Scale Path Loss

Option Valuation (Lattice)

Textbook: pp Chapter 11: Project Management

Project Management Chapter 13

AFRL-RI-RS-TR

Path Loss Models and Link Budget

Is Now Part of To learn more about ON Semiconductor, please visit our website at

Equivalence between Masterpact M and the new range NT/NW

Multilevel Tree Fusion for Robust Clock Networks

Chapter 1: Data Storage

How Computers Work Lecture 12

Data sheet acquired from Harris Semiconductor SCHS049C Revised October 2003

LTE RF Planning Training LTE RF Planning, Design, Optimization Training

AC Line Rated Ceramic Disc Capacitors Class X1, 760 V AC / Class Y1, 500 V AC

Efficient Power Conversion Corporation

3.6V / 2600mAh Primary Lithium x 0.85 (6 cm x 2.1 cm) 1.0 oz (28 gr) -25 C to 65 C. Bluetooth Low Energy dbm.

High Reliability PUF using Hot-Carrier Injection Based Response Reinforcement

CS4311 Design and Analysis of Algorithms. Lecture 14: Amortized Analysis I

CGTA: Current Gain-based Timing Analysis for Logic Cells

Performance Comparison Based on SMG 2 Evaluation Reports: WCDMA vs. WB-TDMA/CDMA

Physically Unclonable Functions: a Study on the State of the Art and Future Research Directions.

ITM1010 Computer and Communication Technologies

Decision Trees: Booths

Simple Descriptive Statistics

Data sheet acquired from Harris Semiconductor SCHS072B Revised July 2003

Transcription:

EE115C Spring 2013 Digital Electronic Circuits Lecture 19: Timing Analysis

Outline Timing parameters Clock nonidealities (skew and jitter) Impact of Clk skew on timing Impact of Clk jitter on timing Flip-flop- vs. Latch-based timing Clock distribution EE115C Spring 2013 2

Synchronous Timing CLK In R Combinational 1 R Logic 2 C in C out Out EE115C Spring 2013 3

Datapath and Timing Parameters In D R1 Q Combinational Logic R2 D Q CLK t CLK1 t CLK2 t c - q t c - q, cd t su, t hold t logic t logic, cd R1 and R2 can be latches or flip-flops EE115C Spring 2013 4

Latch Parameters D Q Clk Clk D PW m T H T SU Q T Clk-Q T D-Q Delays can be different for rising and falling data transitions EE115C Spring 2013 5

Flip-Flop Parameters D Q Clk Clk D PW m T H T SU Q T Clk-Q Delays can be different for rising and falling data transitions EE115C Spring 2013 6

Timing Constraints (Cycle Time & Race Margin) In D R1 Q Combinational Logic R2 D Q CLK t CLK1 t CLK2 t c - q t c - q, cd t su, t hold t logic t logic, cd Cycle time: T Clk > t c-q + t logic + t su Race margin: t hold < t c-q,cd + t logic,cd EE115C Spring 2013 7

Outline Timing parameters Clock nonidealities (skew and jitter) Impact of Clk skew on timing Impact of Clk jitter on timing Flip-flop- vs. Latch-based timing Clock distribution EE115C Spring 2013 8

Clock Nonidealities Clock skew Spatial variation in temporally equivalent clock edges Deterministic + random, t SK Clock jitter Temporal variations in consecutive edges of the clock signal Modulation + random noise Cycle-to-cycle (short-term) t JS Long term t JL Variation of the pulse width for level-sensitive clocking EE115C Spring 2013 9

Clock Skew and Jitter Clk t SK Clk t JS Both skew and jitter affect the effective cycle time Only skew affects the race margin EE115C Spring 2013 10

Clock Skew # of registers Earliest occurrence of Clk edge Nominal T sk /2 Latest occurrence of Clk edge Nominal + T sk /2 Insertion delay Clk delay T sk Max Clk skew EE115C Spring 2013 11

Sources of Skew and Jitter 4 Power Supply Devices 2 3 Interconnect 6 Capacitive Load 1 Clock Generation 5 Temperature 7 Coupling to Adjacent Lines EE115C Spring 2013 12

Positive Skew T CLK + d CLK1 1 d T CLK 3 CLK2 2 4 d + t h Launching edge arrives before the receiving edge EE115C Spring 2013 13

Negative Skew T CLK + d CLK1 1 T CLK 3 CLK2 2 d 4 Receiving edge arrives before the launching edge EE115C Spring 2013 14

Positive and Negative Skew In R1 D Q Combinational Logic R2 D Q Combinational Logic R3 D Q CLK t CLK1 t CLK2 t CLK3 delay delay (a) Positive skew In R1 D Q Combinational Logic R2 D Q Combinational Logic R3 D Q t CLK1 t CLK2 t CLK3 delay delay CLK (b) Negative skew EE115C Spring 2013 15

Outline Timing parameters Clock nonidealities (skew and jitter) Impact of Clk skew on timing Impact of Clk jitter on timing Flip-flop- vs. Latch-based timing Clock distribution EE115C Spring 2013 16

Timing Constraints (Quick Refresh) In D R1 Q Combinational Logic R2 D Q CLK t CLK1 t CLK2 t c - q t c - q, cd t su, t hold t logic t logic, cd Cycle time: T Clk > t c-q + t logic + t su Race margin: t hold < t c-q,cd + t logic,cd EE115C Spring 2013 17

Impact of Clock Skew on Timing: Cycle Time (Long Path) Clk t c-q t logic T Clk t su δ Arrival of next cycle t c-q + t logic + t su < T Clk + d T Clk > t c-q + t logic + t su - d Neg skew demands longer T Clk EE115C Spring 2013 18

Impact of Clock Skew on Timing: Race Margin (Short Path) Clk t c-q,cd t logic,cd Clk t hold δ Data must not arrive before this time t c-q,cd + t logic,cd > t hold + d t hold + d < t c-q,cd + t logic,cd Pos skew eats up t hold EE115C Spring 2013 19

Impact of Clock Skew on Timing Positive skew improves performance T Clk > t c-q + t logic + t su - d Negative skew improves race margin t hold + d < t c-q,cd + t logic,cd Worst-case skew (t skew = d ) really matters T Clk > t c-q + t logic + t su + t skew t hold + t skew < t c-q,cd + t logic,cd Implications: Longer T Clk Smaller race margin EE115C Spring 2013 20

How to Counter Clock Skew? Negative Skew REG REG. REG log Out In REG Positive Skew Clock Distribution EE115C Spring 2013 21

Outline Timing parameters Clock nonidealities (skew and jitter) Impact of Clk skew on timing Impact of Clk jitter on timing Flip-flop- vs. Latch-based timing Clock distribution EE115C Spring 2013 22

Impact of Clock Jitter T C LK CLK -t jitter t jitter In REGS CLK t c-q, t c-q, cd t su, t hold t jitter Combinational Logic t log ic t log ic, cd EE115C Spring 2013 23

Impact of Clock Jitter on Timing: Cycle Time (Late-Early Problem) t jitter Clk t c-q t logic T Clk t su Latest point of launching Earliest arrival of next cycle t c-q + t logic + t su < T Clk t jitter t jitter T Clk > t c-q + t logic + t su + 2 t jitter EE115C Spring 2013 24

Impact of Clock Skew and Jitter: Cycle Time (Late-Early Problem) t jitter + d Clk t c-q t logic T Clk t su Latest point of launching Earliest arrival of next cycle t c-q + t logic + t su < T Clk t jitter t jitter + d T Clk > t c-q + t logic + t su + t skew + 2 t jitter EE115C Spring 2013 25

Impact of Clock Skew and Jitter: Race Margin (Early-Late Problem) Earliest point of launching Clk t c-q,cd t logic,cd Clk Latest arrival of next cycle t jitter + d t hold Nominal clock edge Data must not arrive before this time t c-q,cd + t logic,cd t jitter > t hold + t jitter + d t hold + 2 t jitter + t skew < t c-q,cd + t logic,cd EE115C Spring 2013 26

Impact of Skew and Jitter on Timing Cycle time Positive skew improves performance Negative skew reduces performance Jitter reduces performance T Clk > t c-q + t logic + t su + t skew + 2 t jitter Race Margin Skew reduces race margin Jitter reduces acceptable skew t cu : clock uncertainty t hold + 2 t jitter + t skew < t c-q,cd + t logic,cd t cu : clock uncertainty EE115C Spring 2013 27

Outline Timing parameters Clock nonidealities (skew and jitter) Impact of Clk skew on timing Impact of Clk jitter on timing Flip-flop- vs. Latch-based timing Clock distribution EE115C Spring 2013 28

Flip-Flop Based Timing Logic delay Skew Flip-flop delay Flip -flop Logic T SU = 0 T Clk-Q = 1 [Horowitz96] EE115C Spring 2013 29

Latch-Based Timing Static logic Skew L1 Latch Logic L2 Latch = 1 L2 latch L1 latch Logic Can tolerate skew! Long path = 0 Short path EE115C Spring 2013 30

Perspective: Clock Distribution

Clock Distribution H-tree CLK Clock is distributed in a tree-like fashion EE115C Spring 2013 32

Clock Distribution CLOCK H-Tree Network Observe: Only Relative Skew is Important EE115C Spring 2013 33

More Realistic H-Tree [Restle98] EE115C Spring 2013 34

Clock Network with Distributed Buffering Local Area Module Module secondary clock drivers Module Module Module Module main clock driver CLOCK Reduces absolute delay, and makes Power-Down easier Sensitive to variations in Buffer Delay EE115C Spring 2013 35

Driver The Grid System Driver GCLK GCLK Driver GCLK No RC-matching Large power Driver GCLK EE115C Spring 2013 36

Example: Dec Alpha 21164 Clock Frequency: 300 MHz, 9.3 Million Transistors Total Clock Load: 3.75 nf Power in Clock Distribution Network: 20 W (out of 50 W) Uses Two Level Clock Distribution: Single 6-stage driver at center of chip Secondary buffers drive left and right side Clock grid in Metal-3 and Metal-4 Total driver size: 58 cm! EE115C Spring 2013 37

21164 Clocking (EV5), 1995 t rise = 0.35ns t cycle = 3.3ns Clock waveform final drivers pre-driver Location of clock driver on die t skew = 150ps Single-phase clocking 2 distributed driver channels Reduced RC delay/skew Improved thermal distribution 3.75 nf clock load 58 cm final driver width Local inverters for latching Conditional clocks in caches to reduce power More complex race checking Device variation EE115C Spring 2013 38

21164 Clocking (EV5), 1995 Clock Drivers EE115C Spring 2013 39

Clock Skew in Alpha Processor EE115C Spring 2013 40

EV6 (Alpha 21264) Clocking 600 MHz, 0.35mm CMOS, 1998 t cycle = 1.67ns t rise = 0.15ns Global clock waveform t skew = 50ps PLL Multiple conditional buffered clocks 2.8 nf clock load 40 cm final driver width Reduced load/skew Reduced thermal issues Multiple clocks complicate race checking EE115C Spring 2013 41

21264 Clocking EE115C Spring 2013 42

EV6 Clock Results ps 5 10 15 20 25 30 35 40 45 50 ps 300 305 310 315 320 325 330 335 340 345 GCLK Skew (at Vdd/2 Crossings) GCLK Rise Times (20% to 80% Extrapolated to 0% to 100%) EE115C Spring 2013 43

EV7 Clock Hierarchy, 2002 152 million transistors, 15/137 logic/memory Active Skew Management and Multiple Clock Domains DLL NCLK (Mem Ctrl) DLL DLL + widely dispersed drivers + DLLs compensate static and low-frequency variation + divides design and verification effort L2L_CLK (L2 Cache) GCLK (CPU Core) PLL L2R_CLK (L2 Cache) - DLL design and verification is added work SYSCLK + tailored clocks EE115C Spring 2013 44

Alpha Processors Case Study EV4 (21064) 0.75mm, 200 MHz ~ 1992 Single global clock driver, 5 levels of buffering 35 cm driver, 3.25 nf, 40% power EV5 (21164) 0.5mm, 300 MHz ~ 1995 One central, two side clock drivers 58 cm driver, 3.75 nf, 40% power EV6 (21264) 0.35mm, 600 MHz ~ 1998 Clock grid, 4 window panes, hierarchical, gated clock domains 40 cm driver, 2.8 nf EV7 0.18mm, 1.2 GHz ~ 2002 Multiple clock domains, DLLs EE115C Spring 2013 45