

**Technical Document 1099** June 1987

# NAVAL OCENNSYSTEMS CENTER San Diego, California 921525000 **Integrated Circuit Design**

V. S. Wong and J. Grinberg **Hughes Research Laboratories** 



Approved for public release distribution is unlimited

The views and conclusions contained in this report are those of the authors and should not be interpreted as representing the official policies, either expressed or implied of the Naval Ocean Systems Center or the U.S. government

#### NAVAL OCEAN SYSTEMS CENTER

San Diego, California 92152-5000

E. G. SCHWEIZER, CAPT, USN Commander R. M. HILLYER Technical Director

#### ADMINISTRATIVE INFORMATION

This work was performed for the Department of Defense, Ft. Meade, MD 20755. Contract N66001-84-C-0104 was carried out by Hughes Research Laboratories, 3011 Malibu Canyon Road, Malibu. CA 90265, under the direction of W.H. McKnight, Code 743, NAVOCEANSYSCEN.

Released by J.M. Alsup, Head Image Processing and Display Branch Under authority of R.L. Petty, Head Electromagnetic Systems and Technology Division

MA

### UNCLASSIFIED

•

.

١

•

.

|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |                                 | RI                      | EPORT DOCUM                        | ENTATION PA                                          | GE                          |                               |              |
|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------|-------------------------|------------------------------------|------------------------------------------------------|-----------------------------|-------------------------------|--------------|
| IN REPORT SECUR                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | TY CLASSIFICATION               |                         |                                    | 16 RESTRICTIVE MANKING                               |                             |                               |              |
| UNCLASSIFIED                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |                                 |                         |                                    |                                                      |                             |                               |              |
| Za SECUNITY CLASSIFICATION AUTHONITY                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |                                 |                         | J DISTRIBUTION/AVAILAB             |                                                      |                             |                               |              |
| 26 DECLASSIFICATION DOWNGRADING SCHEDULE                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |                                 |                         | Approved for pub                   | lic release; dist                                    | tribution is unli           | mited.                        |              |
| 4 PERFORMING ON                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | GANIZATION REPORT NUMBE         | R(S)                    |                                    | 5 MONITORING ORGANIZA                                | TION REPORT NUMBER          | (\$)                          | ····         |
|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |                                 |                         | NOSC TD 1099                       |                                                      |                             |                               |              |
| Se. NAME OF PERFORMING ORGANIZATION BL. OFFICE SYMBOL (# Applicable)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |                                 |                         | OL OFFICE SYMBOL<br>(# applicable) | 74. NAME OF MONTORING                                | ORGANIZATION                |                               |              |
| Hughes Rese                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               | earch Laboratories              |                         |                                    | Naval Ocean Sy                                       | items Center                |                               |              |
| Sc ADDRESS (City, S                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       | State and DP Codes              |                         |                                    | 7b. ADDRESS /Cay, State an                           | d ZIP Codej                 |                               |              |
| 3011 Malibu<br>Malibu, CA                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 | Canyon Road<br>90265            |                         |                                    | Image Processing<br>San Diego, CA                    | r and Display<br>92152-5000 |                               |              |
| BE MANE OF FUNDI                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          | NG/SPONSORING ORGANIZA          | TION                    | B. OFFICE SYMBOL                   | 9 PROCUREMENT INETRUS                                | NENT IDENTIFICATION N       | UMBER                         |              |
| Denetari                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  | A Defen                         |                         |                                    |                                                      |                             |                               |              |
| Department                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                | or Delense                      |                         | DoD-K913                           | N66001-84-C-010                                      | •                           |                               |              |
| BC ADORESS /Cay                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | State and ZIP Cadal             |                         |                                    | 10. SOURCE OF FUNDING N                              |                             | TARK NO                       | AGENCY       |
| 9800 Savage                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               | Road                            |                         |                                    |                                                      |                             |                               | ACCESSION NO |
| Ft. Meade,                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                | MD 20755                        |                         |                                    | RDDA                                                 | NSA                         | 740-EE93                      | DN488 839    |
| 12 PERSONAL AUT                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | and J. Grinberg                 | 13. THE COVER           | ED                                 | 14 DATE OF REPORT (Year.                             | Manth. Days                 | 15. PAGE COU                  | MT           |
| Final                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |                                 | FROM Dec                | 83 to Apr 84                       | June 1987                                            |                             | 62                            |              |
| 16 SUPPLEMENTAR                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |                                 |                         |                                    |                                                      |                             |                               |              |
| 17 COSATI CODES                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |                                 |                         | 18. SUBJECT TEMMS (Continue        | on reverse if necessary and ide                      | ntily by block number(      |                               |              |
| PIELD                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | GAOUP                           | SUE-GROUP               | Field Oxide, dope                  | d N-Substrate, epitaxial silicon, sapphire subtrate, |                             |                               |              |
|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |                                 |                         | silicon-on-sapphire                | (SOS) device, com                                    | plementary me               | tal-oxide semico              | aductor      |
| 19 ABSTRACT (Cons                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         | imue an reverse il nepilicary a | nd edgewady by blank in |                                    |                                                      |                             |                               |              |
| This report describes the results of a study on the chip design of a low-power filter, using state-of-the-art CMOS technology. The filter is for speech applications and is specified to have 1024 taps with programmable weights and linear phase. The chip implementation is to have a word length of 8 to 12 bits and consume a maximum of 2.0 mA at 3.6V. Included are current capabilities of CMOS/SOS and CMOS/bulk, technologics, and in particular, the Hughes VHSIC CMOS process. The architecture of the filter is discussed and estimates are made for the power consumption, speed, device count, and projected chip size of the filter implementation. A comparison of a multiple taps and a single taps implementation of the filter is also presented in terms of power consumption and operational speed. |                                 |                         |                                    |                                                      |                             |                               |              |
| 20 DISTRIBUTION/A                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         | VALABLITY OF ABSTRACT           |                         |                                    | 21 ADDITACT SECURITY                                 | CLASSIFICATION              |                               |              |
|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | PED/UNUMITED                    | SAME AS RET             |                                    | UNCLASSIFIE                                          | D                           |                               |              |
| 22a name of resp<br>W.H. McKni                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            | ONSIBLE INDIVIDUAL              |                         |                                    | 22b TELEPHONE (make)<br>(619)225-7439                | Ares Codal                  | 22c OFFICE SYMBOL<br>Code 743 |              |
| DD FORM 1473, 84 JAN BI AM EDITION MAY BE USED UNTIL EXHAUSTED UNCLASSIFIED UNCLASSIFIED                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |                                 |                         |                                    |                                                      |                             |                               |              |

ALL OTHER EDITIONS ARE OBSOLETE

UNCLASSIFIED

DD FORM 1473, 84 JAN

UNCLASSIFIED SECURITY CLASSIFICATION OF THIS PAGE (Then Data Entered)

UNCLASSIFIED

SECURITY CLASSIFICATION OF THIS PAGE (Then Date Entered)

#### TABLE OF CONTENTS

| SECTION   |      |                              | PAGE |
|-----------|------|------------------------------|------|
| 1         | REPO | ORT SYNOPSIS                 | 1    |
|           | A.   | Introduction                 | 1    |
|           | в.   | Summary of Results           | 2    |
|           | c.   | Conclusion                   | 6    |
| 2         | TECH | INICAL ISSUES                | 9    |
|           | A.   | Introduction                 | 9    |
|           | в.   | Technology Issues            | 9    |
|           | c.   | Architectural Issues         | 26   |
|           | D.   | Other Design Considerations  | 49   |
|           | E.   | Cost of Fabrication          | 49   |
|           | REFE | RENCES                       | 53   |
| APPENDICE | S    |                              |      |
|           | Α.   | Toshiba 256K CMOS Static RAM | 55   |
|           | в.   | Hitachi 64K CMOS Static RAM  | 57   |

•

.

#### LIST OF ILLUSTRATIONS

| FIGURE |                                                                                         | PAGE |
|--------|-----------------------------------------------------------------------------------------|------|
| 1      | Silicon-on-sapphire device structure                                                    | 11   |
| 2      | CMOS/bulk device structure                                                              | 12   |
| 3      | Total rise and fall time for interconnect vs minimum feature                            | 14   |
| 4      | Operating voltage range vs gate length for scaled CMOS/SOS inverter                     | 16   |
| 5      | Scaling relationships of various parasitic capacitance ratios to gate oxide capacitance | 19   |
| 6      | Interconnect resistance scaling                                                         | 21   |
| 7      | CCD memory partitioned into four segments                                               | 25   |
| · 8    | Single-tap implementation of low power filter                                           | 27   |
| 9      | VHSIC full adder cell                                                                   | 30   |
| 10     | Accumulator for adding 1024 b-bit words                                                 | 32   |
| 11     | Pipelined book multiplier                                                               | 35   |
| 12     | Reducing the power consumption of data storage by positioning into four segments        | 40   |
| 13     | Output driver configuration                                                             | 41   |
| 14     | Four-tap approach for low power filter                                                  | 47   |
| 15     | Tentative schedule for processing low power filter chip                                 | 51   |

#### SECTION 1

#### REPORT SYNOPSIS

#### A. INTRODUCTION

We report here the results of a study on implementing a low power filter using state-of-the-art CMOS technology. The basic goal is to design a 1024 tap filter with programmable weights, that has linear phase, operates at a sample rate of 8KHz, and consumes a maximum of 2.0 mA at 3.6 V (7.2 mW power). Input data word length has been specified as between 8 and 12 bits. The work performed here is based in part on an earlier study<sup>1</sup> on a low power filter done at HRL for NOSC.

In Section 1.B we present a summary of the results of the study, with conclusions in Section 1.C. Section 2 contains the technical supporting details of the report. The study is divided into two main parts - Technology Issues (Section 2.B) and Architectural Issues (Section 2.C). Under Technology Issues we will review the latest developments in CMOS technology and compare the performance of CMOS/SOS and CMOS/bulk technologies. We will look at the voltage requirements of these technologies to see if 3.6 V is an acceptable power source level. In particular, the Hughes VHSIC CMOS/SOS process will be examined. In the Architectural Issues section we will analyze each component of the low power filter in terms of power consumption, speed, and gate count when implemented with the Hughes CMOS/SOS process. We will also review state-of-the-art memory chips to see if one could be suitably used for the low power filter. Lastly, in Section 2.E, we provide a preliminary estimate of the cost of fabricating the low power filter chip using Hughes CMOS/SOS technology.

#### B. SUMMARY OF RESULTS

#### 1. <u>Technology</u>

Since an important consideration of this filter chip is extremely low power, CMOS technology is a natural candidate for implementation. CMOS circuitry dissipates mainly dynamic switching power ( $CV^2f$  power) and negligible guiescent power. Of the two CMOS technologies to choose from - CMOS/SOS and CMOS/bulk - CMOS/SOS clearly provides better speed/power performance above 1.5 µm channel length, but below 1.0 µm there is evidence that CMOS/bulk performance becomes comparable with CMOS/SOS. As channel lengths approach 1  $\mu$ m and below, the lower line-to-substrate capacitance advantage held by CMOS/SOS is lost as line-to-line interconnect capacitance becomes significant. Progress is still being made to improve the speed/power performance of both SOS and bulk technologies, and it is unclear if CMOS/SOS would still be significantly better than CMOS/bulk at submicron feature sizes.

As to supply voltage requirements for state-of-the-art CMOS devices, we have reviewed the relevant literature and conclude that the current Hughes VHSIC CMOS/SOS technology will be able to operate from a 3.6 V power source.

We have also investigated using CCD technology for meeting the data storage requirements of the filter. We used a scheme that partitions the data for storage into 4 CCD registers and runs each register at 8 MHz, but only for one-fourth of the time, so that the effective operational rate is 2 MHz. When the 1500b bits (b is the word size) of data and coefficient storage is implemented using current CCD technology, we calculated that power dissipation would be 3.6b mW. This assumes that the lowest acceptable clocking voltage - 6V, is used. From this analysis, we see that both power consumption and required operating voltage would exceed filter specifications. Hence, we do not recommend the use of CCD technology for data storage.

#### 2. Architectural Issues and Power Consumption

Assuming a single tap implementation for the low power filter operating at 8 MHz, the major components of the filter, along with power dissipation and device count for each component are listed in Table 1. The power calculations are based in part on a low-power Toshiba 256K CMOS static RAM chip announced at the ISSCC conference in February, 1984. The power consumption for this chip was scaled down to meet the data and coefficient storage requirements of the filter. The Hughes VHSIC CMOS/SOS process parameters were used to calculate power dissipation in the processor section.  $CV^2f$  dynamic power is assumed to be the primary source of power dissipation in this section. C is the total capacitance of each component in the processor section, V is taken as 3.6 V, and f is 8 MHz. In Table 1 the parameter b is the word size, specified as between 8 and 12 bits. The total power dissipation for the filter can be obtained by adding the power dissipated within each component, resulting in

 $P_{total} = 1.74b^2 + 246b + 18.3 \mu W.$ 

Similarly, the total device count is obtained by adding the devices for each component, resulting in

$$D_{total} = 32b^2 + 9265b + 340.$$

The device count and power dissipation broken down by major components for a 10-bit filter is shown in Table 2. Data and coefficient storage requirements contribute to 89% of the total power consumption and 96% of device count. The total power consumption, device count and projected chip size for an 8, 10 and 12-bit filter are shown in Table 3. The projected chip size for the filter is obtained by estimating the area occupied by RAM and by random logic, and is given by

 $S = 51.6b^2 + 949b + 548 \text{ mil}^2$ .

|               | Device Count          | Power (µW)              |
|---------------|-----------------------|-------------------------|
| Storage       | 9200b                 | 241b                    |
| Adder/Acc.    | 42b + 340             | 2.265 + 18.3            |
| Multr.        | b(32b + 19)           | $1.74b^2 + 2.16b$       |
| Output Driver | 4b                    | 0.536                   |
| Total         | $32b^2 + 9265b + 340$ | $1.74b^2 + 246b + 18.3$ |

Table 1. Device Count and Power for Single Tap Filter

Table 2. Device Count and Power Breakdown for 10-bit Filter

|               | Device Count<br>(% of total) | Power (µ₩)<br>(% of total) |
|---------------|------------------------------|----------------------------|
| Storage       | 92 K (96%)                   | 2.4 (89%)                  |
| Adder/Acc.    | 760 (0.8%)                   | 0.041 (1.5%)               |
| Multr.        | 3.4 K (3.5%)                 | 0.20 (7.3%)                |
| Output Driver | 40 (0.04%)                   | 0.005 (0.19%)              |
| Total         | 96.2 K (100%)                | 2.7 (100%)                 |

Table 3. Total Power Consumption and Device Count for an 8, 10 and 12 bit Filter

| b bits/word | Power Consumption<br>(µW) | Device<br>Count | Projected Chip Size<br>(mil <sup>2</sup> )/(% memory) |
|-------------|---------------------------|-----------------|-------------------------------------------------------|
| 8           | 2.1                       | 76.5 K          | 11.4 K (59%)                                          |
| 0 (         | 2 . 7                     | 96.2 K          | 15.2 K (56%)                                          |
| 12          | 3.2                       | 116 K           | 19.4 K (52%)                                          |

~

There is a trade-off in terms of power consumption between a single tap approach and a multi-tap approach. A multi-tap approach operates at lower speed but requires more gates for implementation, whereas a single tap approach would have to operate at higher speed, but requires fewer gates. By analyzing the computational requirements of a multi-tap approach we find that the power consumption as a function of the number of taps, N, is

 $P(b,N) = 243b/N + 1.73b^2 + 2.43b + 14 uW.$ 

Here we see that only the power associated with data and coefficient storage (the first term of the equation) decreases with N. The power consumed in the processor section (the remainder of the terms) remains constant because even though the processor section is operating at lower speeds as N increases, the number of processors operating simultaneously increases with N.

The total power consumption, device count and projected chip size for a 10-bit filter is shown for 1 through 4 taps in Table 4. The device count for an N-tap filter is given by

 $D(b,N) = 9229b + 80 + N(32b^2 + 45b + 260)$ ,

and the projected chip size is

 $S(b,N) = 876b + 129 + N(51.6b^2 + 72.6b + 419) mil^2$ .

The first term in D(b,N) and S(b,N) arises primarily from the data and coefficient storage section, and the bracketed term in both equations represents contributions from the processor section.

When using a multi-tap approach, chip yield must be considered. From Table 4, one sees that going from a single to a two-tap implementation reduces the power consumption by 46%, but also increases chip size by 41%. This increase in chip size

| Number of tap,<br>N | Power<br>(µW) | Consumption<br>(% dec) | Devic<br>‡    | ce Count<br>(% inc.) | Chip<br>mil <sup>2</sup> ( | Size<br>% inc.) |
|---------------------|---------------|------------------------|---------------|----------------------|----------------------------|-----------------|
| 1                   | 2.6           |                        | 96.2 K        |                      | 15.2 K                     |                 |
| 2                   | 1.4           | 46                     | 100 <b>K</b>  | 4.3                  | 21.5 K                     | 41              |
| 3                   | 1.0           | 62                     | 104 K         | 8.4                  | 27.8 K                     | 83              |
| 4                   | 0.82          | 68                     | 10 <b>8 K</b> | 12.8                 | 34.1 K                     | 124             |

Table 4. Power Consumption, Device Count and Chip Size for a 10-bit Filter Implemented with N Taps

may translate into a sizable reduction in fabrication yields. To improve chip yields, the filter implementation may be partitioned into two or more chips. This approach, however, would require communications between chips at processor speeds. The power consumed by chip drivers for handling inter-chip communications would in all likelihood not be compensated for by operating the filter at lower speeds.

#### C. CONCLUSION

The choice of a CMOS process for implementing the filter is important. The two components of the filter having special processing requirements are the RAM for data and coefficient storage and the A/D converter (if it is placed on the same chip). Choice of a technology for developing a low-power integrated RAM design is important because the RAM portion of the filter consumes most of the power on the chip. Also, the large size of the RAM makes it desirable to use a high device density technology. Our power and device density calculations for data storage were based on a 256K static RAM from Toshiba. The technology used is a two-level polysilicon, two-level metal, p-well CMOS process, with channel lengths comparable to the Hughes VHSIC SUS process. In order to meet the low power specification for the filter, a similar CMOS process may have to be used to develop a low power RAM for data storage in the filter.

In the area of A/D converter designs, significant leakage currents in SOS devices can make it difficult to achieve millivolt precision in CMOS/SOS A/D converters. However, recent work at RCA has reduced the leakage currents of SOS devices significantly. Hence it may now be possible to develop high precision A/D converters using this improved CMOS/SOS technology.

We have also examined the trade-offs between a single tap and a multi-tap approach for the filter. A single tap approach requires a higher operational speed but fewer gates to implement, whereas a multi-tap approach requires a lower operational speed but more gates to implement. A multi-tap approach would result in power savings if all the components could be integrated on one chip, but would require much higher power if two or more chips were needed for the implementation. The yield of fabricated chips becomes an important consideration when chip size is increased for multi-tap implementations. We estimate that going from a single tap to two taps for a 10-bit filter would increase chip size by 41%, but also decrease power consumption by 46%.

Given the state-of-the-art of current CMOS technology, the development of the low power filter chip must be viewed as a research effort. The low power requirement for the filter makes it desirable to implement the filter on a single chip. However, the large number of devices necessary for implementing the filter makes chip yield a primary concern. Also, to successfully meet the low power requirements of the filter, it will be necessary to integrate a high density, low power RAM technology with logic circuitry. This RAM technology, requiring advanced processing techniques, is not widely available at this time.

#### SECTION 2

#### TECHNICAL ISSUES

#### A. INTRODUCTION

In the preliminary report we assumed that CMOS/SOS would be the technology for implementing the low power filter. In this report we will consider the trade-offs between CMOS/SOS and CMOS/bulk more critically. We will also examine the operating voltage requirements of CMOS technologies to see if 3.6 V is an acceptable operating voltage. The Hughes VHSIC CMOS/SOS process is a good example of what is possible in state-of-the-art CMOS technologies, so we will present the performance parameters of this process as being representative of what can be achieved in CMOS/SOS technologies today. We will also analyze power consumption requirements if CCD technology is used for data storage on the filter chip.

Using the performance parameters for the Hughes VHSIC CMOS/SOS process, we will obtain speed, power consumption and device count estimates for the adder/accumulator, multiplier and output drivers on the filter chip. Hughes does not have a current effort to develop a 1.25  $\mu$ m static RAM chip, so to obtain power and device count estimates for data and coefficient storage on the filter chip, we will examine CMOS static RAM chips currently available in the market. We will also look at the trade-offs between using a single tap and a multi-tap implementation for the filter chip.

#### B. TECHNOLOGY ISSUES

In the following sections we will review state-of-the-art CMOS technologies and compare CMOS/SOS and CMOS/bulk in terms of power consumption, speed and device technology. We will also examine operating voltage requirements for these technologies,

and interconnect parasitic capacitances as feature sizes shrink to submicron dimensions to see how significant they are in comparison to gate capacitances. Specifically, we will present the performance characteristics for the Hughes VHSIC CMOS/SOS process. Finally, we will investigate the use of CCD technology for data storage.

#### 1. CMOS Technology

In the preliminary study it was assumed that CMOS/SOS would be used for fabricating the low-power filter. It is generally acknowledged that CMOS/SOS is lower power and faster than CMOS/bulk technology at channel lengths greater than about 1.0 um. Below 1.0 um, however, there is evidence that CMOS/bulk becomes competitive with CMOS/SOS. Traditionally CMOS/SOS is used for radiation-hard applications, whereas CMOS/bulk is usually used where chip yield and cost effectiveness is a factor. In choosing a technology for implementing the low power filter, some consideration should be given to developing a lowpower, high-density staticRAM and an A/D converter (if it is to be on the same chip) using that technology. These are two crucial components of the filter, and the successful implementation of these components coulddepend on the choice of atechnology.

As is well known, CMOS/SOS technology derives its superiority from the fact that there is virtually no capacitance from interconnects to the non-conducting sapphire substrate (Figure 1). This low capacitance translates into higher switching speeds and lower dynamic power dissipation - two important considerations in choosing a technology. CMOS/bulk, on the other hand, does exhibit considerable interconnect capacitance to the substrate (Figure 2). First, line-to-substrate capacitance arises from metal and poly-silicon wiring over field oxide. This capacitance is usually about an order of magnitude less than gate oxide capacitance. Second, there is diffused



8114-2

Figure 1. Silicon-on-sapphire device structure.

12563-10R1



Figure 2. CMOS/bulk device structure.

line capacitance, primarily at the sidewall adjoining the field oxide, with a typical value of about  $4x10^{-4}$  pF/µm. The presence of greater capacitance in CMOS/bulk circuits results in lower performance of bulk circuits at device dimensions greater than 1.5 µm. This is evident in Figure 3 which compares the switching speed of a CMOS/SOS and a CMOS/bulk inverter as a function of minimum feature size.<sup>2</sup>

As feature sizes are decreased to less than 1.5  $\mu$ m, the performance of CMOS/bulk circuits become comparable to that of CMOS/SOS circuits. First, the mobility of carriers in bulk devices are greater than those of SOS devices. Because silicon grown on sapphire contains many more defects than bulk silicon, the scattering of carriers reduces their mobilities in SOS devices to less than that of bulk devices. The effect of this is a higher drive current for bulk devices than SOS devices at equal channel lengths. Second, interconnect line-to-line capacitance becomes significant as feature sizes decrease below 14 (see Section 2.B.). Since line-to-line capacitance is present in both SOS and bulk technologies, the advantage of lower line-to-substrate capacitance enjoyed by SOS circuits is lost at submicron dimensions. As shown in Figure 3, the switching speed of a CMOS/bulk inverter is comparable to that of an equivalent SOS inverter at less than 1.5 µm. Moreover, the study at Hewlett-Packard<sup>2</sup> comparing the performance of ring oscillators on SOS and bulk technologies at a channel length of 1.3 µm concluded that both speed and power dissipation were about the same.

Improvements are still being made in both CMOS/SOS and CMOS/bulk technologies. New crystal growth techniques have reduced the defects in silicon grown on sapphire, thereby increasing the mobilities of SOS devices.<sup>7</sup> The latchup problem, particularly severe in CMOS/bulk circuits as feature sizes approach 1 µm, is also being solved. Recent studies at Toshiba indicate that CMOS/SOS circuits may still enjoy a speed/power advantage over CMOS/bulk circuits at submicron dimensions.<sup>8</sup> How much of a performance advantage CMOS/SOS



F = Leff. MICRON



Figure 3. Total rise and fall time for interconnect vs minimum feature.

circuits will still have at submicron dimensions is unclear at this time. The Hughes VHSIC program has interest in both CMOS/SOS and CMOS/bulk technologies, although current CMOS/SOS development at 1.2  $\mu$ m channel lengths is at a more advanced stage.

#### 2. Power Supply Voltage for CMOS Filter Chip

One of the specifications for the low-power filter was that it operate from a 3.6V power supply. Current VHSIC CMOS technology is targeted at 5.0V operation. This may be partly because of a desire to maintain voltage compatibility with other digital logic families (particularly TTL), most of which operate at 5.0V. In this section we will consider whether it is possible to operate current and future VHSIC CMOS technologies at 3.6V.

Some work has been done within the Hughes VHSIC program in studying the operating voltage range of CMOS circuits as device dimensions are scaled down. The result of this work is shown in Figure 4. As device dimensions are scaled down, there are several phenomena that limit the operating voltage range of CMOS circuits. Among these are device punchthrough, oxide breakdown, junction breakdown, device turn-on threshold, and excess thermal generation. From the VHSIC study, the two factors that limited MOSFET operation when device dimensions were optimally scaled were identified.

At the high end, junction breakdown occured when drain-tosubstrate potentials exceeded the breakdown voltage. This phenomenon is well understood as avalanche breakdown in reverse biased pn junctions. When electric fields within the junction reach a critical value (around 3 x  $10^5$  V/cm), carrier impact ionization will cause a rapid increase in current flow through the junction. Because substrate doping tends to increase as device dimensions are scaled down (for threshold compensation and to decrease depletion widths), critical breakdown fields are



Figure 4. Operating voltage range vs gate length for scaled CMOS/SOS inverter.

reached at lower drain potentials. Hence, operating voltages will have to be decreased as device dimensions are scaled down to prevent junction breakdown.

At the low end, device operation is limited by the turn-on voltage of devices. This parameter can be controlled to some extent by changing the substrate doping by ion implantation and gate oxide thickness. To maintain a satisfactory noise margin, however, it is desirable to use an operating voltage above the turn-on voltage by several times the thermal voltage, kT/q.

In Figure 4 the limits of operating voltage for CMOS/SOS technology is plotted against the gate length of devices. The shaded area represents the acceptable operating voltage range for circuits. As can be seen from the figure, current VHSIC technology at 1.25  $\mu$ m can be operated safely at 3.6V. Moreover, we believe that future VHSIC submicron technologies down to 0.5  $\mu$ m feature sizes can be operated at 3.6V without any problem.

## 3. Interconnect Parasitic Capacitances as a Function of Peature Size

In the preliminary study on the low power filter, it was assumed that the parasitic capacitances arising from interconnects were negligible compared to gate capacitances. We will examine this issue in greater detail in this section.

In a study performed for NOSC in 1980 entitled "Develop Submicron Devices,"<sup>3</sup> the parasitic capacitance arising from wiring interconnects as device feature sizes were scaled down was considered. In this study it was assumed that all dimensions scale linearly with x, the field oxide thickness was 12.5 times the gate oxide thickness (t<sub>g</sub>), and the width and spacing of wiring was 1.5 times the gate length,  $L_{ch}$ . Under these conditions the gate capacitance, given by,

decreases directly with L ch.

Figure 5 shows the interconnect parasitic capacitances relative to gate capacitances as device dimensions are scaled down. There are two primary sources of wiring capacitances line-to-substrate ( $C_{a}$ ) and line-to-line capacitances ( $C_{m}$ ). Let us first consider the case of short wiring interconnects within cells, assuming they have an average length of  $8L_{ch}$ . As seen from the figure, the ratio of  $C_{s}(8L_{ch})/C_{q}$  would be constant, since the lengths of these wires would scale directly with gate lengths. This ratio is found to be almost unity. Therefore, line-to-substrate capacitance is significant in CMOS/bulk technology. This capacitance, however, is negligible in CMOS/SOS technology because there is no conducting substrate. From the figure, the ratio  $C_m(8L_{ch})/C_q$  for short wires is seen to be an order of magnitude less than I for gate lengths above 1  $\mu$ m, and increase to exceed 1 below 1  $\mu$ m. From this we can conclude that for CMOS/SOS technology gate capacitance would be dominant at a VHSIC gate length of 1.2  $\mu$ m, but below 1  $\mu$ m lineto-line capacitance becomes significant.

Now let us consider the case of wiring interconnects with dimensions comparable to chip size, L. These long interconnects are evident in regular chip designs such as memory chips and programmable logic arrays. As can be seen in Figure 5, long wiring capacitances tend to dominate over other capacitances. In the line-to-substrate case, the ratio  $C_{g}(L_{c})/C_{q}$  is about 100 at 5  $\mu m$  gate lengths and increases rapidly to exceed 104 at submicron dimensions. In the line-toline case,  $C_m(L_c)/C_q$  is about 1 at 5  $\mu m$  and increases to be comparable to  $C_{g}(L_{c})/C_{g}$  at submicron dimensions. At 1.2  $\mu m$  $C_{g}(L_{c})$  is about 10<sup>3</sup> greater than  $C_{q}$  and  $C_{m}(L_{c})$  is about 10<sup>2</sup> greater than  $C_{d}$ . The contribution to total chip capacitance from long wires will be significant in regularized structures where there are many long wires spaced closely together, such as in memory chips. In random logic chips, however, the collective contribution from gate capacitances usually dominates total chip

9472-17R1



Figure 5. Scaling relationships of various parasitic capacitance ratios to gate oxide capacitance.

capacitance. Figure 5 shows the total line-to-substrate  $(C_s(chip))$  and line-to-line  $(C_m(chip))$  wiring capacitance for a chip dominated by short wiring interconnects.

In the above analysis it was assumed that all feature sizes scaled linearly with x. In practice, however, this would not be desirable; technology limitations would prevent the scaling down of certain feature sizes before others. For example, as the gate oxide is scaled below 150 A, breakdown mechanisms begin to occur, causing leakage currents across the oxide, and reducing the reliability of devices. Below 50 Å, direct quantum mechanical tunneling of electrons across the oxide occurs. Hence, for MOS devices to be useful at submicron dimensions, the gate and field oxide thickness would have to be scaled down less than linearly. Effectively, gate and wiring capacitances would decrease more than linearly with x. As wire widths are scaled down, however, the resistance of these wires increases (Piqure 6). At submicron dimensions, the resistance of these wires would be great enough to introduce considerable RC time delay in the propagation of signals. Also, as wire widths are scaled down, electromigration failure becomes more prominent. These two phenomena dictate that wire widths (and spacing) would have to be scaled down less than linearly. This would result in greater line-to-substrate capacitance (in CMOS/bulk circuits) and less line-to-line capacitance than if scaling were done linearly.

Another source of interconnect capacitance in CMOS/bulk is diffusion line capacitance. The contribution from diffusion line capacitance tends to increase as feature sizes decrease. This is because as feature sizes decrease, substrate doping is increased for threshold compensation, resulting in smaller depletion regions at diffusion junctions, and hence, greater capacitance. The exact relationship between diffusion capacitance and the scaling down of feature sizes is left for further study.



Figure 6. Interconnect resistance scaling.

This section has summarized the results of a study on wiring capacitances as feature sizes are scaled down linearly. Although feature sizes do not scale linearly in practice, we will use the results of this study in our report and assume that gate capacitance is dominant in CMOS/SOS circuits at the VHSIC gate length of 1.2  $\mu$ m.

#### 4. VHSIC Technology

The current emphasis of the VHSIC program at Hughes is on 1.2  $\mu$ m CMOS/SOS technology. This technology has been demonstrated with fabrication of a 72,000 device correlator chip for the VHSIC program. Research is also underway to develop a submicron SOS process at either 0.75  $\mu$ m or 0.5  $\mu$ m feature size. The projections are that a submicron technology will be available in 1987. There is also interest at Hughes Newport Beach in CMOS/bulk technology where a 3.0  $\mu$ m process is available at this time. Also, a 1.2  $\mu$ m CMOS/bulk process is currently being developed. In this section we will summarize the performance characteristics of current VHSIC 1.2  $\mu$ m zechnology as applied to the low-power filter design.

Table 5 summarizes the feature sizes and electrical parameters of the Hughes 1.2  $\mu$ m CMOS/SOS process.<sup>4</sup> The minimum drawn gate length is 1.4  $\mu$ m, resulting in a channel length of 1.2  $\mu$ m after lateral diffusion at source and drain are taken into account. The threshold voltages of the p and n-channel devices are nearly identical at 1.2V. With gate oxide thickness at 400 Å, the gate oxide capacitance is 8.6 x 10<sup>-4</sup> pF/u<sup>2</sup>. From Section 2.8.3, we can assume that gate capacitance will be the primary source of power dissipation in CMOS/SOS circuits. For a minimum geometry device of 2  $\mu$ m x 1.2  $\mu$ m, the capacitance per gate is

 $C_q = 8.6 \times 10^{-4} (pF/u^2) 2 \times 1.2 (u^2) = 2.1 \times 10^{-3} pF/gate.$ 

Table 5. Hughes VHSIC CMOS/SOS Process Parameters

Minimum Dimensions  $(\mu)$ Transistor Length 1.4 (Leff = 1.2)Transistor Width 2.0 Metal Width 2.4 Metal Spacing 2.6 Polysilicon Width 1.4 Polysilicon Spacing 2.2 Nominal Thickness (Å) Silicon 5000 Poly Silicide 5500 Field Oxide 5000 Metal 7500 Gate Oxide 400 Electrical Parameters Contact Resistances (ohms) for  $2\mu$  x  $2\mu$  contact area  $\leq$ Max Typical Metal/N<sup>+</sup> 100 50 Metal/P<sup>+</sup> 30 10 Metal/Poly 5 2 N<sup>+</sup>/Metal/P<sup>+</sup> 250 100  $(2\mu \times 4\mu \text{ contact area})$ Sheet Resistances N<sup>+</sup> Si P<sup>+</sup> Si 100 40-60 200 100-115 Poly Silicide 5 2.5-4.5 0.05 0.04 Metal Electrical Parameters 1.2 V. -1.2 V. V<sub>TN</sub> V<sub>TP</sub> 8.6 x  $10^{-4}$  pF/ $\mu^2$ Cox  $300 \text{ cm}^2/\text{V-sec}$  $170 \text{ cm}^2/\text{V-sec}$ μ<sub>N</sub> μ<sub>p</sub> 62 µmhos 35 µmhos g<sub>N</sub> g<sub>p</sub> R<sub>ch,n</sub> R<sub>ch,P</sub> 16 Kohms/sg. 28.6 Kohms/sq.

The corresponding dynamic power dissipation for a minimum geometry device operating with a supply voltage of 3.6V at 8MHz is

 $P_g = 2.1 \times 10^{-3} (pF/gate) (3.6V)^2 8 \times 10^6 (MHz) 50% / 2$ = 0.054 uW/gate.

The 50% factor arises from assuming that the device changes state every other cycle. We will use these values for  $C_g$  and  $P_g$  in Section 2.C for calculating the power consumption of components for the low power filter.

From Table 5, the channel resistance in the linear region is 16 KQ/sq for an n-channel device and 28.6 KQ/sq for a p-channel device. These values are much greater than the interconnect and contact resistances shown in the same table. Therefore, in estimating the speed of circuit components for the filter in Section 2.C, we will assume that channel resistance and gate capacitance contribute the most to propagation delay in the circuits.

#### 5. CCDs for Data Storage

In the preliminary study of the low-power filter chip, it was discovered that the memory for storage of data and filter coefficients (about 1.5b Kbits total) consumed a significant amount of power. In this section we will consider the use of CCD shift registers to see if power for data storage can be minimized.

Since most of the power dissipated in a CCD shift register is  $CV^2f$  dynamic power, we will try to minimize this power by assuming a design based on four 256-word shift registers connected as shown in Figure 7. Every 1/(8KHz) second the B switches connect the four shift registers into one long 1024word shift register, and a new datum is inserted at INPUT into the shift register. The B switches are then flipped the other way so that data within each of the four shift registers can





circulate within themselves. However, only one of the 256-word shift register is circulating at one time, and as it does so, data is fed through switch A at 8MHz to the multiplier for correlation. The shift rate for each shift register is 8MHz, although each shift register stops for

$$\frac{256}{8 \text{ MHz}} \times 3 \sim 96 \text{ } \mu \text{s}$$

while data is being fed from the other three shift registers. Effectively, each shift register is working at 2 MHz. The shift register for coefficient storage would be treated in a similar way.

Table 6 lists the parameters for CCD technology developed at Hughes. Using a minimum line width of 2.5  $\mu$ m and a cell size of 10  $\mu$ m x 10  $\mu$ m results in an array area of 1.5 x 10<sup>5</sup>b  $\mu$ m<sup>2</sup> for 1500b bits of data storage. This translates to 50b pP of electrode capacitance. The minimum operating voltage of these CCD circuits is 6V. Hence, the power required to operate this memory array is

P = 50b pF.  $(6.0V)^2$ . 2MHz = 3.6b mW.

For a minimum word size of 8 bits, the power consumed would be approximately 29 mW, exceeding the low power filter requirement. Also, the need to run the CCD array at a minimum of 6V tends to rule out the use of CCD technology for the low power filter.

#### C. ARCHITECTURAL ISSUES

In the preliminary report we advocated a single tap approach for the low power filter (Figure 8). In the following sections we will estimate the operational speed, power dissipation, device count, and chip size of a single tap filter

| Minimum Line Width              | 2.5 µ                           |
|---------------------------------|---------------------------------|
| Cell Size                       | 10 μ x 10 μ                     |
| Oxide Thickness                 | 1000 A                          |
| Total Array Area (1500b bits)   | 1.5 x 10 <sup>5</sup> b $\mu^2$ |
| Total Capacitance               | 50 b p <b>r</b>                 |
| Power Consumption (@ 6V, 2 MHz) | 3.6 b mW                        |

#### Table 6. Hughes CCD Technology

14003-8



Figure 8. Single-tap implementation of low power filter.

when implemented using current Hughes VHSIC CMOS/SOS technology. We will use the technology parameters presented in Section 2.8.4 of this report. In Section 2.0.6 we will examine the trade-offs of a multi-tap implementation for the filter.

In estimating the power dissipation of the filter components, we will make two simplifying assumptions. First, we will assume that parasitic capacitances for CMOS/SOS technology, including line-to-line interconnect capacitances and interconnect crossover capacitance, is small compared with gate capaci-From the data in Figure 5, this is a reasonable tances. assumption. In this figure we see that line-to-line interconnect capacitance at 1.2 µm is an order of magnitude less than gate capacitance. Second, we will assume that in the case of RAMs, power consumption is proportional to the size (number of bits and chip area) of the RAM. This is not strictly true, since in CMOS RAMs a considerable amount of power is dissipated in driving the capacitances of long data and address lines. These capacitances do not scale linearly with RAM size. However, we will scale down power consumption for state-of-the-art CMOS static RAM chips to obtain first order estimates of power dissipation for data and coefficient storage in the filter. Making these two assumptions for power dissipation will enable us to estimate total power consumption for the filter without laying out the components first.

In obtaining power consumption estimates for the filter, we will assume that cell designs utilize minimum geometry devices. Cells in the Hughes VHSIC library are usually designed to drive large capacitive loads, and hence dissipate more power and operate at higher speeds. In estimating the speed and power for a filter design using minimum geometry devices, the VHSIC values for power dissipation and operational speeds will be scaled down accordingly. Also, in calculating the power dissipation of cells in the filter, we will assume that only half the electrical modes in the cell change state during any clock cycle. This provides a conservative estimate for power dissipation, since probably fewer than 50% of the modes change state

every cycle. For the purpose of estimating power dissipation for the filter, however, we will use the conservtive value calculated using 1/2 CV<sup>2</sup>f (50%).

#### 1. Adder/Accumulator Configuration

In this section we will present the basic adder and accumulator configurations to be used in the low-power filter. The adder is a crucial element used repeatedly in the accumulator and multiplier sections of the filter. We will obtain estimates for the power, speed, and silicon area of the adder and accumulator in terms of current VHSIC 1.2  $\mu$ m CMOS/SOS technology.

The full adder cell being used in the Hughes VHSIC program is shown in Figure 9.<sup>5</sup> There are a total of 26 devices in the cell, with channel widths ranging from 5  $\mu$ m to 17  $\mu$ m. For the low power filter, however, we will assume a minimum geometry device design with channel width of 2.0  $\mu$ m. Using the parameters in Table 5 for the VHSIC 1.2  $\mu$ m CMOS/SOS process, the total gate capacitance was calculated to be

> $C = 8.6 \times 10^{-4} (pP/u^2) \times 2u \times 1.2u \times 26$ = 0.054pF.

Total power dissipation for the cell is given by

Power =  $\frac{1}{2}$  CV<sup>2</sup>f.(50% duty cycle), C=0.054pF, V=3.6V, f=8MHz

= 1.4 uW.

The 50% duty cycle arises from assuming that only half the nodes in the cell change state every cycle.

The performance of this cell has been simulated as part of the VHSIC effort. From the simulations it was found that the propagation delay from input to the SUM output was approximately 8.5ns. Propagation delay from  $C_{in}$  to  $C_{out}$  for a cell was 3.6ns. These propagation delay times would be longer if minimum geometry devices were used in the cell.

14003



VHSIC full adder cell. Figure 9.

#### Accumulator

The output of the multiplier has to be summed for 1024 cycles to obtain one convolution point. To perform this summation, an accumulator with b+10 adders is provided, as shown in Figure 10. The outputs of these adders are captured in shift registers and fed back to the adders every clock cycle to be added to the next output from the multiplier. The maximum carry propagation through the accumulator using VHSIC device geometries for b=12 bits is

 $T_{2} = 22 \times 3.6 ns = 79.2 ns,$ 

much less than the cycle time for the filter. This propagation delay would increase if minimum geometry devices are used to implement the adders. If this delay becomes longer than the filter cycle time, carry-lookahead techniques may be used to decrease the delay. Table 7 lists the number of devices in the accumulator and the power dissipation when minimum geometry devices are used. The total power dissipated by the adder/accumulator section is

 $P_{acc} = 2.26b + 18.3 uW,$ 

and the total device count is

 $D_{acc} = 425 + 340$  .

#### 2. Multiplier Configuration

In this section we will consider the multiplier for the low-power filter in terms of the Hughes VHSIC 1.2 µm technology. There are several possible multiplier configurations (tree, array, ROM-based), but the configuration based on a modified Booth's algorithm seems to be best from the standpoint of power





Figure 10. Accumulator for adding 1024 b-bit words.

| Table 7. Device Count as | nd Power | Dissipation | tor | Accumulator |
|--------------------------|----------|-------------|-----|-------------|
|--------------------------|----------|-------------|-----|-------------|

| Cell Type | <pre>   Cells </pre> | Devices<br>per cell | Total 🛊<br>Devices | Power (µW)   |
|-----------|----------------------|---------------------|--------------------|--------------|
| Adder     | ъ +10                | 26                  | 26 (b + 10)        | 1.4 (5 + 10) |
| SR        | 2 (b + 5)            | 8                   | 16 (b + 5)         | 0 86 (b + 5) |

dissipation, device count, speed, and layout geometry. This algorithm uses a radix 4 method to examine the multiplier word 3 overlapping bits at a time. Partial products are accumulated in half the number of steps as necessary in other schemes. Moreover, negative numbers are handled as well, in 2's complement form. Table 8 shows the method for accumulating partial products based on examining 3 bits of the multiplier.

The Booth multiplier, implemented in pipeline fashion, is shown in Figure 11. The incoming multiplier words are stored in shift registers at the right, and decoded 3 overlapping bits at a time by the Booth Decoders (BD's). Depending on the value of the 3 bits, the control lines to the Select circuits are activated to add either 0, X, 2X, -X or -2X to the partial product. The actual addition is performed in ripple carry form using the full adder described in the previous section. Note that only (b/2-1) rows of adders are needed to accumulate the partial products.

The worst case propagation delay through one stage of the pipeline is the sum of set-up time for the Booth decoders, select circuits, and carry propagation through the adders:

T<sub>stage</sub> = T<sub>bd</sub> + T<sub>sel</sub> + 12\*T<sub>carry</sub>.

This propagation delay through one stage would be 63ns if VHSIC geometry devices were used in the design, but would be longer if minimum geometry devices were used. We do not anticipate that one pipeline stage delay will exceed the filter cycle time even if minimum geometry devices are used in the design.

Table 9 lists the number of cells and devices used in the pipelined Booth multiplier. The power dissipation shown for each cell type is based on  $CV^2f(50t)/2$  dynamic power, where C is the total gate capacitance in the cell, V is 3.6V, and f is 8MHz. The 50t factor is included assuming only half the nodes in the cell change state each cycle. The total power consumed by the multiplier is

| $y_{i+1} y_i y_{i-1}$ | Add to Partial Product |
|-----------------------|------------------------|
| 0 0 0                 | 0                      |
| 001                   | x                      |
| 0 1 0                 | x                      |
| 0 1 1                 | 2X                     |
| 100                   | -2X                    |
| 1 0 1                 | -X                     |
| 1 1 0                 | -X                     |
| 1 1 1                 | 0                      |

#### Table 8. Modified Booth's Algorithm for Accumulating Partial Products

X - Multiplicand

Table 9. Device Count and Power Consumption for Components of Pipelined Booth Multiplier

|                | Devices<br>Per Cell | <pre># Cells</pre>             | Total #<br>of Devices | Power<br>(µW)            |
|----------------|---------------------|--------------------------------|-----------------------|--------------------------|
| Shift Register | 8                   | $\frac{b}{2}(\frac{5}{2} b-1)$ | $4b(\frac{5}{2}b-1)$  | $0.22b(\frac{5}{2}b-1)$  |
| Select         |                     | $b^2/2$                        | $9b^2$                | $0.48b^2$                |
| Booth's        | 98                  | b/2                            | 49b                   | 3.8b                     |
| Full Adder     | 26                  | $b(\frac{b}{2} - 1)$           | 26b( <u>b</u> - 1)    | 1.4b $(\frac{b}{2} - 1)$ |



Figure 11. Pipelined booth multiplier.

#### $P_{mult} = b(1.74b + 2.16) uW$ ,

and the total device count for the multiplier is

 $D_{mult} = b(32b + 19).$ 

#### 3. Data and Coefficient Storage

In the preliminary study of the low-power filter it was estimated that the data storage section would consume the most power on chip. This estimate was based on a Hughes 16K static RAM fabricated using 2.5  $\mu$ m CMOS/SOS technology about 3 years ago. For a more accurate estimate of speed and power dissipation of current CMOS memory chips, we surveyed the papers presented at the most recent ISSCC conference (held February 1984). These papers are representative of what can be achieved in memory design today.

Table 10 shows in summarized form the characteristics of the low power RAMs presented at the ISSCC conference.<sup>6</sup> Most of these RAMs use CMOS technology, and the effective gate lengths are comparable to current Hughes VHSIC gate lengths (1.2  $\mu$ m). The access times of the static RAMs are in general better than those for dynamic RAMs because charge sensing and refreshing of dynamic memory cells require a longer cycle time. The access times of the static RAMs would meet the requirements of the low power filter easily, but the longer cycle times of the dynamic RAMs would be a problem.

The lowest power RAM in Table 10 is a 256K CMOS static RAM developed by Toshiba (Appendix A). The gate lengths are 1.2  $\mu$ m for n-channel devices and 1.5  $\mu$ m for p-channel devices (current Hughes VHSIC technology uses 1.2  $\mu$ m for both n and p-channel devices). Access time for this chip is 46ns, much less than the 125ns required in the low power filter. Active power dissipation measured at 1MHz is 10mW, and standby power is 0.03mW. If we scale the active power according to the requirements for the

Table 10. Performance Characteristics of Current RAM's

| Company      | Size  | Tech | Leff<br>(n-ch/p-ch)<br>(µ) | Cell Size<br>{µ) | Access<br>Time<br>(ns) | Cycle<br>Time<br>(ns) | Active Power<br>(mi) | Standby<br>Power<br>(mH) | Power/Kb<br>(aw) |
|--------------|-------|------|----------------------------|------------------|------------------------|-----------------------|----------------------|--------------------------|------------------|
| Static RAMS  |       |      |                            |                  |                        |                       |                      |                          |                  |
| Toshiba      | 256Kb | CHOS | 1.2/1.5                    | 11 × 13.5        | 46                     |                       | 10 8 1 MHz           | 0.03                     | 0.04 8 1 MHz     |
| Innos        | 64Kb  | CMOS | 1.5/1.7                    | 12.2 x 23.9      | 30                     |                       | 250                  | 60                       | 3.9              |
| NEC          | 64Kb  | CMOS | 1.5/1.8                    | 14.9 x 19.3      | 25                     |                       | 350                  | 15                       | 5.5              |
| Hitachi      | 64Kb  | CMOS | 1.3/1.3                    | 8.0 × 16.0       | 22                     | _                     | 70 @ 1 MHz           | ţ                        | 1.1 ê 1 MHz      |
| Toshiba      | 64Kb  | CMOS | 1.2/1.5                    | 18.0 x 20.0      | 28                     |                       | 225                  | 110                      | 3.5              |
|              |       |      |                            |                  |                        |                       |                      |                          |                  |
| Dynamic RAMS |       |      |                            |                  |                        |                       |                      |                          |                  |
| NTT Atsugi   | IMb   | CNOS | 0.5/0.9                    | 3.7 × 5.4        | 140                    | 350                   | 250                  | 2                        | 0.24             |
| NEC          | IMb   | SOMN | 1.0                        | 5.5 × 8.0        | 120                    | 300                   | 290 8 3 MHz          | 15                       | 0.28             |
| Hitachi      | 288Kb | CMOS | 2.0                        | 6.8 × 13.6       | 100                    | 100                   | 400 8 10 MHz         | 10                       | 1.4 @ 10 MHz     |
| IBM          | 256Kb | SOMN | 1,8                        | 5.7 × 18         | 80                     | 180                   | 300 8 3 MHz          | 25                       | 1.2 8 3 MHz      |
| Hitachi      | dMI   | NMOS | 1.6                        | 3.0 × 7.0        | 06                     | 260                   | 300 8 3.8 MH2        | 10                       | 0.29 @ 3.8 MHz   |

low power filter (data storage = 1.5b Kbits, voltage supply =
3.6V, speed = 8Mhz), we obtain

$$P = \frac{1.5b}{256} \cdot \left(\frac{3.6}{5.0}\right)^2 \cdot \frac{3}{1} \cdot 10 \text{ mW} = 0.243b \text{ mW}.$$

From this calculation, we can see that a static RAM with 1500b bits of storage could be designed using state-of-the-art CMOS technology that would meet the specifications of the low power filter. Such a RAM would require approximately 9000b devices to implement, assuming a design utilizing 6 devices/cell.

We should provide a note of caution here. The technology used for fabricating the low power Toshiba RAM chip is a twolevel polysilicon, two-level metal, p-well CMOS process. Each memory cell utilizes 4 transistors and 2 polysilicon resistor loads. These polysilicon resistors require tight processing tolerances so that the resistances would be relatively temperature invariant, and uniform resistances are maintained across the wafer. This requires fairly advanced processing techniques that may not be widely available yet. Without a CMOS process similar to this, it may be difficult to develop a RAM that would meet the low power specifications of the filter.

In comparison, the next lowest power consuming RAM presented at the ISSCC conference was a 64K CMOS static RAM from Hitachi (Appendix B). The gate length for both n- and p-channel devices in this chip was 1.3  $\mu$ m. The power consumption measured for this chip at 8MHz was about 150mW (Figure 5, Appendix B). If we scale this down for the low power filter's requirements, we obtain

$$P = \frac{1.5b}{64} \cdot \left(\frac{3.6}{5.0}\right)^2 \cdot 150 \text{ mW}$$

= 1.85 mW.

Obviously, this would exceed the filter specification for power consumption. We stress here again that developing a RAM that would meet the low power specification for the filter is critically dependent on the availability of a suitable CMOS technology.

Another scheme for lowering the power consumption of the data storage section further is to partition the memory into segments and activate one segment at a time, when data is needed from it. This scheme is illustrated in Figure 12. Since dynamic power is proportional to capacitance, C, and assuming that capacitance is proportional to area, A, then decreasing the area of the memory activated at one time by 4 would result in lowering the power consumption by a factor of 4 as well. Each segment of the partitioned memory still operates at 8MHz when that segment is activated, but as far as power consumption is concerned, the effective operational rate is 2MHz.

#### 4. Output Drivers

Output drivers can consume a significant amount of dynamic power because of the relatively large off-chip capacitance they have to drive. In this section we will consider the power dissipated by output drivers in terms of current Hughes VHSIC technology. This will determine whether it is possible to divide the implementation of the filter into two or more chips and yet maintain low power consumption.

Figure 13 shows a typical CMOS output driver handling an off-chip capacitance of 20pF. The drive ratio of the output devices have to be large enough to ensure that rise and fall times of the off-chip signal is satisfactory. Here, they are shown with 100:1 ratios, adequate to drive 20pF loads in less than 30ns. Another beefed-up inverter is used to drive the considerable capacitance at gates P1 and N1 (calculated to be about 0.25pF). The drive ratios of P2 and N2 are chosen to be 10:1, presenting a gate capacitance of about 0.025pF to the previous stage.

1400.1 4



Figure 12. Reducing the power consumption of data stor age by partitioning into four segments.



Figure 13. Output driver configuration.

Dynamic power dissipation is calculated using CV+f/2. For the case where the filter is partitioned into two or more chips (for example, if the RAM were on a chip separate from the multiplier accumulator), the output drivers handling signals between these chips would have to operate at processor speed, (i.e., BMHz for a single tap implemention). The power consumed by a single such driver would then be

P = 20.275 pF,  $(3.6V)^2 = 8MHz = (50% duty cycle) / 2 = 0.53mW$ 

Twelve such drivers operating in parallel (actually more than 12 would be needed to handle communications between data storage and processor for a 12-bit implementation) would consume more than 6mW. Comparatively, if the filter were totally integrated on one chip, the output drivers would operate at BKHz, and the power consumed by each would be 0.53uW. Also, considerably fewer output drivers would be needed in an integrated chip approach, since no inter-chip communication would be necessary. From this analysis, we strongly favor the single chip approach.

5. Total Power Requirements, Device Count and Chip Size

In this section we will summarize the results of preceding sections and obtain estimates for total power consumption, device count and chip size for the low power filter. The cell types used in the filter are listed in Table 11, together with the power dissipation and device count for each cell. Table 12 shows the breakdown for the major components of the filter in terms of cell types. The power consumption and device count of each major component is summarized in Table 1. Total power consumption for the filter is obtained by adding up the power consumption for the filter is obtained by adding up the power

Protat = 1,7462 + 2466 + 18 4 GW

4.2

| Cells          | Power @ 8 MHz<br>(µW) | Device Count |
|----------------|-----------------------|--------------|
| Shift Register | 0.43                  | 8            |
| Full Adder     | 1.4                   | 26           |
| Select Cell    | 0.96                  | 18           |
| Booth Decoder  | 7.5                   | 98           |
| RAM            | 0.16                  | 6            |

# Table 11. Power Dissipation and Device Count forCells Used in Low Power Filter

Table 12. Power Dissipation and Device Count for Functional Blocks of Filter, SR-Shift Register, FA-Full Adder, SEL-Select, BD-Booth Decoder, RAM-Random Access Memory, OD-Output Driver

|                        |                                              |                    | 1                 |        | ł     |    |                        |                         |
|------------------------|----------------------------------------------|--------------------|-------------------|--------|-------|----|------------------------|-------------------------|
|                        |                                              | t of               | Cellg             |        |       |    |                        | c                       |
| Functional Block       | SR                                           | FA                 | SEL               | ය<br>ස | RAM   | ao | Count                  | (Juw)                   |
| Data Storage           |                                              |                    |                   |        | 1024b |    | <b>6</b> 100b          | 160b                    |
| Coefficient Storage    |                                              |                    |                   |        | 512b  |    | 3100b                  | 81b                     |
| Adder/Accumulator      | 2(b + 5)                                     |                    |                   |        |       |    | 16(b + 5)              | 0.86(b + 5)             |
|                        |                                              | b + 10             |                   |        |       |    | 26(b + 10)             | 1.4(b + 10)             |
|                        | $\left[\frac{b}{2}(\frac{5}{2}b - 1)\right]$ |                    |                   |        |       |    | $4b(\frac{5}{2}b - 1)$ | $0.22b(\frac{5}{2}b-1)$ |
|                        |                                              | $p(\frac{b}{2}-1)$ |                   |        |       |    | $26b(\frac{b}{2} - 1)$ | $1.4b(\frac{b}{2}-1)$   |
| Multipiter             |                                              |                    | b <sup>2</sup> /2 |        |       |    | 9b <sup>2</sup>        | 0.48b <sup>2</sup>      |
|                        |                                              |                    |                   | b/2    |       |    | <b>4</b> 9b            | 3.8b                    |
| Outpur Drivers (8 kHz) |                                              |                    |                   |        |       | a  | <b>4</b> b             | 0.53b                   |

Similarly, the total device count is obtained by adding up the total number of devices in each component:

 $D_{total} = 32b^2 + 9265b + 340$  devices.

This device count does not include circuitry for timing, control, and memory address generation. To allow for these extra components, an extra 5 to 10% must be added to the total device count calculated above. The estimate for total power consumption must be similarly increased to account for the extra circuitry.

The power consumption and device count for a 10-bit filter broken down by major components is shown in Table 2. One should note that data and coefficient storage make up about 90% of the device count and power consumed. Table 3 lists the total power consumption and device count for an 8, 10 and 12 bit filter using the equations for  $P_{total}$  and  $D_{total}$  just derived.

To estimate the size of the filter chip we have to estimate the area taken up by RAM and the area taken up by random logic. To estimate the device density for random logic, we note that the Hughes VHSIC correlator chip contains 72,000 devices and measures 315 x 368 mil<sup>2</sup>. The resulting device density for this chip is 0.62 device/mil<sup>2</sup>. We will assume that this is a representative density for random logic fabricated using 1.2  $\mu$ m technology. To estimate the device density for memory arrays, we note that the Toshiba 256K RAM mentioned in Section 2.2.4 contains approximately

256K cells . 4 devices/cell > 10% devices,

on a chip measuring 6.68 x 8.86 mm<sup>2</sup> (263 x 349 mil<sup>2</sup>). The device density for this chip is  $\{0.9 \text{ devices/mil^2}\}$ . The chip size for the filter can be estimated by adding the area occupied by RAM and the area occupied by the processor:

 $s = 9200b/10.9 + (32b^2 + 65b + 340)/0.62 mil<sup>2</sup>$ = 51.6b<sup>2</sup> + 949b + 548 mil<sup>2</sup>.

The projected chip size for an 8, 10 and 12-bit filter is listed in Table 3. Approximately half the chip area is taken up by data and coefficient storage.

#### 6. Single Tap Versus Multiple Taps

In our preliminary study we chose a single tap implementation (Figure 8) for the low power filter. A single tap implementation involves buffering the incoming data samples in memory and then multiplexing the data and filter coefficients at high speed into the multiplier/accumulator portion of the filter. Whereas the data samples are acquired at BKHz, this scheme requires the processor section to be run at 8MHz. A multi-tap approach could be run at lower speeds, but would require more gates to implement. Since dynamic power dissipation in CMOS technology is directly proportional to total capacitance and speed of operation, there is a trade-off between using a multi-tap and a single tap approach.

We will now try to estimate the trade-off between using a single tap and a multi-tap approach in terms of power consemption. Figure 14 shows a four-tap implementation for the low power filter. From this figure we see that even though data and coefficient storage become partitioned as the number of taps, N, increases, total data and coefficient storage remains the same. Power consumed by the RAM can be estimated in the same way as in Section 2.C.3:

$$P = \frac{1.5b}{256} \cdot \left(\frac{3.6}{5.0}\right)^2 \cdot \frac{f}{N} \cdot 10 \text{ mW} = 0.243 \text{ b/N mW}.$$

Here f is the operational speed for the single tap approach, 3MHz, and f/N is the operational speed for an N-tap filter. The number of multipliers and adders, would increase with the number of taps, so the total capacitance for these components would be



Figure 14. Four-tap approach for low power filter.

.....

 $N(C_m + C_a)$ , where  $C_m$  and  $C_a$  are the total capacitances associated with a multiplier and an adder, respectively. The power dissipated by these components would be

$$P = N(C_m + C_a)V^2 (f/N)(50)/2 = 25.6 \times 10^6 (C_m + C_a).$$

Combining this with power dissipation for the RAM and results from sections 2.C.1 and 2.C.2, total power for an N-tap filter is given by

 $P(b,N) = 243b/N + 1.73b^2 + 2.43b + 14 uW.$ 

Hence, total power for an N-tap filter shows a decrease only with the data and coefficient storage component. This component, however, consumes 90% of filter power, and any power savings here would be significant.

Table 4 lists the power consumption, device count and chip size for a 10-bit filter implemented with 1 through 4 taps. Device count is obtained with the help of Table 12,

 $D(b,N) = 9220b + 80 + N(32b^2 + 45b + 260)$ ,

and chip size obtained in a similar manner to section 2.C.5,

 $S(b,N) = 9200b/10.9 + [20b + 80 + N(32b^2 + 45b + 260)]/0.62$ = 876b + 129 + N(51.6b<sup>2</sup> + 72.6b + 419) mil<sup>2</sup>.

The percent increase or decrease in power consumption, device count and chip size over the single tap approach is also noted in Table 4. Going from a single tap to a two-tap filter would decrease power consumption by 46%, but would also increase chip size by 41%.

Chip yield must be considered when using a multi-tap approach. The low power filter is a substantial size chip even

when a single tap approach is used (Section 2.0.5). If extra multipliers and adders are to be added for a multi-tap approach, then fabrication yields would decrease as chip size increases. A multi-chip implementation may be used, but this approach requires inter-chip communication, and as shown in Section 2.0.4, significant power is consumed by output drivers handling off-chip capacitances. We believe that this power will not be compensated for by operating at a lower speed.

#### D. OTHER DESIGN CONSIDERATIONS

So far this report has assumed that the inputs are provided in digital form, and the input signal does not contain aliasing. This assumes that the original analog signal has been "conditioned"; that is, it has been sent through an automatic gain control (AGC) circuit and pre-filtered with a cut-off at about 4KHz to prevent aliasing. Furthermore, the conditioned signal has to be digitized. Whether the analog-to-digital (A/D) converter is to be included as part of the low power filter chip has to be considered. We have not examined the speed and power requirements of the A/D converter in this report, but it is an important part of the filter and needs further study. We believe, however, that the device count and power dissipation in an A/D converter would be small compared with the total device count and power consumption for the filter.

#### E. COST OF FABRICATION

A preliminary estimate of the cost of fabricating the low power filter chip using Hughes VHSIC CMOS/SOS technology has been made by our Industrial Electronics Group (IEG) at Carlsbad. This cost estimate is presented in Table 13, and includes processing 2 lots of wafers — almost a necessity for a chip of the filter's complexity. Errors in the first design will be eliminated in the second lot. A tentative schedule for fabrication of the filter chip over a 24 month period is shown in Figure 15.

| Design                                                                       |            | \$           |
|------------------------------------------------------------------------------|------------|--------------|
| Design and Layout                                                            | 300K       |              |
| Simulation                                                                   | 35K        |              |
| Generate CALMA tape, PG tape, mask set<br>Test (test bardware, test program, | <u>37k</u> | 37 <b>2K</b> |
| generate test vectors)                                                       |            | 65K          |
| Reiteration (redesign and new masks)                                         |            | 40K          |
| Processing (2 lots, parametric measure and probe)                            |            | 70 <b>K</b>  |
| Assembly and Test                                                            |            | 25K          |
| Project Engineer                                                             |            | 50K          |
| Program Management and Administration                                        |            | 40K          |
|                                                                              | Total      | 662K         |
|                                                                              | G&A        | 73K          |
|                                                                              | COM        | 10K          |

# Table 13. Cost of Fabrication for Filter Using the Hughes VHSIC CMOS/SOS Power Process

|                                        |   |    |         |      |    |   |     |   |    |    |     |           |    |    |    |     |    |                 |    |    | -  | 100 | -  |
|----------------------------------------|---|----|---------|------|----|---|-----|---|----|----|-----|-----------|----|----|----|-----|----|-----------------|----|----|----|-----|----|
| 0 0 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 |   |    |         |      |    |   |     |   |    | ž  | INC | HS        |    |    |    |     |    |                 |    |    |    |     |    |
| MILESIONES                             | - | 2  | 3       |      | 8  | - | 8   | • | 10 | Ξ  | 12  | 13        | 14 | 15 | 16 | 17  | 18 | <u>.</u><br>161 | 8  | 21 | 22 | 23  | 24 |
| DESIGN                                 |   |    |         |      |    |   |     |   |    |    |     |           |    |    |    |     |    | <u> </u>        |    |    |    |     |    |
| A-D CELLS/CIRCUIT                      |   | HH | Hh      | -    | H  | T |     |   |    |    |     |           |    |    |    | -11 | П  |                 |    |    |    |     |    |
| SIMULATION                             |   |    | <b></b> | -1   | ┥┝ | H | ┥┝  |   |    |    |     |           |    |    |    |     |    |                 |    |    |    |     |    |
| RAM CELLS/CIRCUIT                      |   |    | H       | HH   | H  |   | -11 |   |    |    |     |           |    |    |    |     |    |                 |    |    |    |     |    |
| SIMULATION                             |   |    |         |      | HH |   |     | T |    |    |     |           |    |    |    |     |    |                 |    |    |    |     |    |
| LAYOUT                                 |   |    |         | Ш    | H  |   |     |   |    |    |     |           |    |    |    |     | Т  |                 |    |    |    |     |    |
| CHECK                                  |   |    |         |      |    |   |     | Ш | ┛┟ | ┥┝ |     |           |    |    |    |     |    | _               |    |    |    |     |    |
| PG TAPE/CHECK                          |   |    |         |      |    |   |     |   |    | Щ  |     |           |    |    |    |     |    |                 |    |    |    |     |    |
| MASK FAB/CHECK                         |   |    |         |      |    |   |     |   |    |    | _   | $\square$ |    |    |    |     | щ  |                 |    |    |    |     |    |
| TEST PROGRAM                           |   |    |         |      |    |   |     | Ш |    |    |     |           |    |    |    |     |    |                 |    |    |    |     |    |
| TEST HARDWARE                          |   |    |         |      |    |   |     |   |    |    |     |           |    |    |    |     |    |                 |    |    |    |     |    |
| PROCESSING/MEASUREMENT                 |   |    |         |      |    |   |     |   |    |    |     |           |    |    |    |     |    |                 |    |    |    |     |    |
| LOT NO. 1                              |   |    |         |      |    |   |     |   |    |    |     |           |    | ٦ŀ |    | Т   |    |                 |    |    |    |     |    |
| LOT NO. 2                              |   |    |         |      |    |   |     |   |    |    |     |           |    |    |    |     |    | -11             | HH | ۲ŀ | П  |     |    |
| PACKAGING/TEST                         |   |    |         |      |    |   |     |   |    |    |     |           |    |    |    |     |    | П               |    |    | ш  | HH  |    |
|                                        |   |    |         |      |    |   |     |   |    |    |     |           |    |    |    |     |    |                 |    |    |    |     |    |
|                                        |   |    |         | <br> |    |   |     |   |    |    |     |           |    |    |    |     |    |                 |    |    |    |     |    |
|                                        |   | 1  | 1       |      | 1  |   |     |   |    |    |     |           |    | 1  | 1  | 1   | 1  |                 | 1  | 1  | 1  | 1   | ]  |

Tentative schedule for processing low power filter chip. Figure 15.

#### REFERENCES

- 1. J.G. Nash and G.R. Nudd, "Ultra-Low Power Digital Filter", NOSC Contract No. N66001-82-C-0504, Oct. 1982.
- E. Sun, "Performance Comparison of MUS Transistors and Circuits Fabricated Using Bulk and SOS Technology", Hewlett-Packard Technical Report, May 1980.
- R.C. Henderson, et al. "Develop Submicron Devices", HRL Technical Report, NOSC Contract No. N00123-79-C-0271, Oct. 1980.
- D. Leong, "VHSIC Revision 7 Layout Rules", Hughes IDC, Feb. 1984.
- 5. Hughes VHSIC Cell Library, 1984.
- Digest of Technical Papers, ISSCC Conference, Feb. 1984.
- 7. D.C. Mayer et al., "A short-channel CMOS/SOS Technology in Recrystallized 0.3 µm-Thick Silicon-on-Sapphire Films," HRL Technical Report, to be published in Electron Device Letters.
- 8. S. Taguchi et al., "Feasibility study of SOS VLSI: Capacitance Analysis in Downward Scaling and Improvement of Thin Films by a Solid-Phase Epitaxy," 1981 Symposium on VLSI Technology, Sept. 1981, pp. 92-93.

#### APPENDIX A

#### TOSHIBA 256 K CMOS STATIC RAM

#### ISSCC 84 / THURSDAY, FEBRUARY 23, 1984 / CONTINENTAL BALLROOMS 5-9 / THPM 15.1

[See page 340 for Future 1.]

#### SESSION XV: STATIC RAMS

#### THPM 15.1 A 46ns 256K CMOS RAM

Chairman R. Inard Pastilev Inter Corp Santa Clara, CA

Mitsuo isobe, Junichi Matsunega, Tekayasu Sekurai, Tekayuki Ohtani Kazuhiro Sawada, Hiroshi Nozawa, Tetsuya lizuka, Susumu Kohyama Toshiba Samiconductor Device Engineering Laboratory Kawasaki, Japan

DEMANUS FOR HEH DENSITY high speed and low power dissipation are increasing with recent static RAMs. Actually as storage capacity on a chip increases, reduction in delay time and power consumption within the core area, i.e., word line. Difficient and sensing curruit delays becomes vitally important to meet the requirements. This paper will describe an asynchroicus 250 AD (MOS static RAM, which utilizes a double word use technique)<sup>21</sup> automatic power down function, and interruals or order of the

The stip microphotograph and the typical characteristics are shown in Figure 1 and Table 1. The row decoder is placed on the efficience of the memory arrays. The cell size is 11 x 1.1 June and the hip measures 0.68 x 8.86mm, which fits into a standard 28pm DIP. The RAM offers typically 40ns at the limb operating power and 30µW standby power

For an eving high packing density and improved performation with this sub-micron channel length VLSI memory of the Power board double polysition CMOS technology has term texeloged, which includes a narrow field isolation, and 11, type transitions for 5V operation. A 1-2µm ground rule waynow with the forget density regions. Table 2 lists the ten at the evided double parameters, in comparison with the preied by generation device. Double level A1 structure is characcontent is a codue, ow temperature interlayer formation and 2 at a table to thought, combined with a hillock suppressed metal aver deposition.

The point of agram of the PAM is shown in Figure 2. A press transition detector surputs are employed to generate the solution is solution of particular pulses and an automatic mean point. The only activating pulses are used to equal the third point. The only activating pulses are used to equal the third point. The only activating pulses are used to equal the third point. The only activating pulses are used to equal the third point. The only activating pulses are used to equal the third point. The only activating pulses are used to equal the third point. The only activating pulses are used to equal activation and the read out operation. The pulse descents a charter the read out operation. The pulse descents a charter the read out operation. Therefore, the active power is the order of a charter of a complete the pulse of a shown in Figure 1.

Sakura, T. et al. Methods for Analyzing a Word Fine Telas act. Their Applications. The Community of Solid state weights of IbCE of Japan. SSD82:72, p. 15-21. Oct. 1482.

Trainimana M., et al. A 54KD Full (MOS RAM with Divided Auro Structure - USSIC DIVEST OF TECHNICAL PAPERS p. 58.59 Feb. 1983

<sup>5</sup> Marsunaga 2 et al. 1.22m Process Design for CMOS 91.955 - Electrochem, Soc. Meeting San Francisco, Extended Abstracts 93(1): p. 566, Max. 1983.

\* Konimi S. et al. 1 A 64K5 (MOS RAM.) ISSEC DIGEST F TECHNICAL PAPERS \$ 258,259 Feb. 1982 To reduce both the word one delay and the active power dissipation, a double word one structure is introduced. Row lines consist of aluminum main word lines which select one of 512 rows, and polysilicon section word line is activated by the main word line and a column select one at local by the memory relis connected to one section word line are accessed in a cycle column current flows only in a selected section. In addition, an RC time delay of each section word line is reduced to 1.256 compared with conventional arrangements. Therefore the total word line delay is reduced to 8.5% from 30ns as is the case of conventional 4 block word line configurations. The circuit design is realized by utilizing double aluminum structure.

A schematic diagram of a memory cell and peripheral circuits is illustrated in Figure 4. A two-stage current mirror type CMOS sense amplifier is used to achieve high speed read operation. The first stage amplifies a small signal from one of the four bitline pairs. The second stage amplifies the first stage output signal to a large swing level.

The bit lines and the first sense amplifier output are equalized by the chip activating pulse before the read operation. To improve fabrication vield, a redundancy circuit is employed without any speed degradation.

The oscillugraph of the address input and data output signal waveforms at Vdd (5V with 100pF load rapacitance is shown in Figure 5, which indicates a 40ns address access time

#### Acknowledgments

The authors wish to thank Y. Nishi and Y. Uchida for their continuous support and encouragement, and H. Xaneko and K. Sato for their great efforts in processing and evaluationa



FIGURE 5-Oscillograph of address input and data output waveforms.

[Right] FIGURE 2- Block diagram of the 256Kb CMOS RAM

© ---: 1987. Betricted, with genra lists, these this information with 1997 of the second state of the definition of the period of the second state of the second state

#### ISSCC 84 / THURSDAY, FEBRUARY 23, 1984 / CONTINENTAL BALLROOMS 5-8 / 1:30 P.M.

| PARAMETERS           | 64K-CMOS RAM                           | 256K-CMOS RAM      |
|----------------------|----------------------------------------|--------------------|
| 290CESS              | DOUBLE-LEVEL POLY-S<br>SINGLE LEVEL AT | OUBLE-LEVEL POLY-S |
| GATE LENGTH INMOST   | 2.) µm                                 | 1 2 µm             |
| PMOS                 | . 2 µm                                 | 5 µm               |
| GATE OXIDE THICKNES  | s 450å                                 | 2501               |
| JUNCT ON DEPTH INTI  | 0.25µm                                 | 0.20µm             |
| (P*)                 | 05 µm                                  | 0 35 µm            |
| POLY-SI-WIDTH/SPACIN | G) 2µm/2µm                             | 1 2 mm/ 1 2 mm     |
| st A. WIDTH/SPACIN   | m بير 2 (m                             | ⊨ 2µm/+ 6µm        |
| ST CONTACT HOLE      | n#2µm∡2                                | 2µm≭÷2µm           |
| 2nd AL EWIDTH SPACEN | G)                                     | 2 Oµm/2 Oµm        |
| 2 NE CONTACT HOLE    |                                        | <br>               |









| OPERATION      | FULLY ASYNCHRONOUS<br>ADDRESS ACTIVATED<br>CLOCKED OPERATION)<br>AUTO POWER DOWN FUNCTION |
|----------------|-------------------------------------------------------------------------------------------|
| ORGANIZATION   | 32K WORDS + 8BIT                                                                          |
| REDUNDANCY     | 4 SPARE ROWS                                                                              |
| CHIP SIZE      | 6 68 ± 8 86 mm                                                                            |
| CELL SIZE      | 11 × 13 5µm                                                                               |
| 1/0 INTERFACE  | TTE COMPATIBLE                                                                            |
| ADDRESS ACCESS | TIME 46 ns                                                                                |
| ACTIVE POWER   | 0 mW (1MHz)                                                                               |
| STANDBY POWER  | 30 µ W                                                                                    |
| PACKAGE        | STANDARD 28P'N DIP                                                                        |

TABLE 1-Typical characteristics of the 256Kb CMOS RAM.

/Left:Top/ TABLE 2-Design rules and device parameters of the 256Kb CMOS RAM.

[Left] FIGURE 3-Supply current versus operating frequencies.

[Below] FIGURE 4-Schematic of memory cell and peripheral circuita.



DIGEST OF TECHNICAL PAPERS . 215

#### HITACHI 64 K CMOS STATIC RAM

#### ISSCC 84 / THURSDAY, FEBRUARY 23, 1984 / CONTINENTAL BALLROOMS 5-9 / 3:45 P.M.

[See page 343 for Figure 3.]

#### SESSION XV: STATIC RAMS

#### THPM 15.5: A 20ns 64K CMOS SRAM

Osamu Minato, Toshiak, Masuhara, Toshio Sasaki, Yoshio Sakar, Tetsuva Havashida Hitachi, Ltd Tokvo, Japan

IN RECENT YEARS, several circuit techniques have been combined with scaling to realize MOS static RAMs, having a speed comparable to bipolar RAMs<sup>1/2</sup>. It has also been possible to realize bipolar RAMs with densities comparable to MOS devices<sup>2</sup>. This paper will report on a 64K x-1b CMOS static RAM with a 20ns typical address access time and 70mW active power dissipation.

The RAM performance has been achieved by the development of a bolked word-line (PWL) technique and double P well bipotar (MOS (BLMOS) circuitry. Also, gate length of MOS transitions has been sealed down to 1 3µm.

The puised word-line technique is illustrated in Figure 1. The schematic of the part of the RAM controlled by clocks is shown in Figure 2. The RAM circuitry is activated by the internal riock formed by detecting all address,  $\overline{WF}$  and  $\overline{CS}$ transitions. The basic flock for the RAM circuitry operation is XD which controls word lines through the X-decoder and sense amplitiers. For precharge and equilibration of data lines. precharge more PC generated by XD and equivalent to XD, , is used. When the basic clock XD becomes high, a selected word. line becomes high because the X-decoder is activated. At the same time, the sense amplitiers are activated by XD. At this moment, since the PC clock is high, the precharge transistor works as the usual data line load. Consequently, small differentral data of the selected cell appear on the data lines. These data of the cell are transferred to the sense amplifier which consists of two stages of two single-ended active load differential amplifiers, and then to the output buffer. This operation is completely static and all operations begin after the data lines are equilibrated. Then precharge clock PC goes low, cutting all data incloads. From this transition no de current is consumed by the cell. After the signal is transferred to the output buffer. the data is latened to the output buffer by the DL clock Forther, this clock pulls down all word lines to the low state 32 this moment, data lines are unmediately precharged by the PC wen to prepare for subsemient data read. This circuit tech

19 Conner, K.J., et al., "A 5ns 4K x 1 NMOS Static RAM" ISSOC, DIGEST, OF, TECHNICAL PAPERS, p. 104-105, Feb., 1983.

<sup>1</sup> Minato, O. - 'HI-CMOSIE 4K, Static RAM'', ISSCC, DIGEST "PETECHNICAL PAPERS, p. 14-15, Eeb., 1981

3 Aledmann 3 K et al. "A 25ns RK x 85 Static MTL 121 RAM 1 ISSCC DIGEST IJF TECHNICAL PAPERS 0 110-111 Feb. 1983

<sup>4</sup> Tanumote K et al. "A 64K ± 15 NMOS Static RAM" ISSEE DIGEST OF TECHNICAL PAPERS D 6665" Feb 1983

<sup>6</sup>K man, S. H. B. A 64KH (M/)S RAM (155(C.DH/)FST (FFCHNC(A) PAPERS D.2\*R 254 Feb. 1982) nique differs from existing data line equilibration techniques<sup>4</sup> and latched column techniques<sup>5</sup> in that the word line is kept high when a specific address is read. The former suffers from large current through memory cells, although last access is achieved. The latter achieves low power by latching the signal at the column by pulling one of the bit lines to a completely low state resulting in slow access time and large bit line recovery time. Pulsed-word-line (PWL) techniques make it possible to reduce current through the transmission gate of the cell by the PC clock and to obtain fast access time by static operation during which both XD and PC are high. Also the PWL technique reduces the data line recovery time.

The ND, PC clock generators and output buffer use a bipolar. CMOS (BCMOS) configuration to assure fast risetime. The bipolar transistor is formed in a thin P well to realize high  $t_T$ . Thus, this technology utilizes double P wells: one for NMOS transistors and the other for high performance bipolar transistors. Risetime capability of the bipolar device is 0.013ns/V - pF. This is three times greater than that of bipolar devices formed in the usual P well.

Third generation CMOS (Hi-CMOSIII) technology has been developed. Used are N- and P-channel MOS transistors having 1-3 $\mu$ m typical gate length and 1-3 $\mu$ m design rule. Basically, this technology is a 70 percent reduction in size, both horizon-tally and vertically of the original Hi-CMOSIII technology which utilizes a 2 $\mu$ m design rule.

The memory cell is a cross-coupled four NMOS flipflop with high resistance loads. The cell is  $128\mu m^2/3\mu m \times 16\mu m$ 

A photomulograph of the chip is shown in Figure 3. The die measures 3 10mm x 6.0mm. To achieve (ast access time, the RAM is organized so that the array is split into four planes of 64 columns x 256 rows. Corresponding to these arrays, tour some amplifiers are laid out and power switched according to address A14 and A15.

A typical 20ns address access time was achieved with a 70mW nominal power dissipation at 1MHz cycle time. Thip select access time is 22ns. The RAM output waveforms for a typical 30pE load capacitance ar: shown in Figure 4. Supply current versus operating frequencies are shown in Figure 5. Active power dissipation at low operating frequencies is reduced by the aid of the pulsed word line (PWE) techniques. Typical features of the RAM are summarized in Table 1. This RAM has realized a speed comparable to bipolar 16K and 10.04K. RAMs even though it consume much less power.

#### Acknowledgments

- The authors wish to thank M. Kubo, S. Asar, S. Yonesama,
- Y Kosa and T. Yasui for their guidance, and N. Hashimoto.
- Y. Nagai, S. Yamamoto for the device fabrication

⑥ Head (1997). Reserves of wate december (e.g. dream 1998) Physicitatic traditional of the material traditional of the material traditional statements. The reserve of 4, 19994, 19994. And the reserve of 7 data statements.

#### ISSCC 84 / THURSDAY, FEBRUARY 23, 1984 / CONTINENTAL BALLROOMS 5-9 / THPM 15.5



DIGEST OF TECHNICAL PAPERS . 223

# END DATE FILMED

