US10606484B2 - NAND flash storage device with NAND buffer - Google Patents

NAND flash storage device with NAND buffer Download PDF

Info

Publication number
US10606484B2
US10606484B2 US15/816,447 US201715816447A US10606484B2 US 10606484 B2 US10606484 B2 US 10606484B2 US 201715816447 A US201715816447 A US 201715816447A US 10606484 B2 US10606484 B2 US 10606484B2
Authority
US
United States
Prior art keywords
chip
nand flash
write
future
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
US15/816,447
Other versions
US20180373440A1 (en
Inventor
Monish Shah
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Google LLC
Original Assignee
Google LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Google LLC filed Critical Google LLC
Assigned to GOOGLE LLC reassignment GOOGLE LLC ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: SHAH, MONISH
Priority to US15/816,447 priority Critical patent/US10606484B2/en
Priority to PCT/US2018/022683 priority patent/WO2018236440A1/en
Priority to KR1020197035081A priority patent/KR102276350B1/en
Priority to JP2019566261A priority patent/JP7234144B2/en
Priority to GB1804693.8A priority patent/GB2563713B/en
Priority to TW107111866A priority patent/TWI727160B/en
Priority to DE102018109929.0A priority patent/DE102018109929A1/en
Priority to EP18169162.7A priority patent/EP3418897B1/en
Priority to DE202018102304.7U priority patent/DE202018102304U1/en
Priority to CN201810391996.1A priority patent/CN109117085B/en
Publication of US20180373440A1 publication Critical patent/US20180373440A1/en
Publication of US10606484B2 publication Critical patent/US10606484B2/en
Application granted granted Critical
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0655Vertical data movement, i.e. input-output transfer; data movement between one or more hosts and one or more storage devices
    • G06F3/0658Controller construction arrangements
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F12/00Accessing, addressing or allocating within memory systems or architectures
    • G06F12/02Addressing or allocation; Relocation
    • G06F12/0223User address space allocation, e.g. contiguous or non contiguous base addressing
    • G06F12/023Free address space management
    • G06F12/0238Memory management in non-volatile memory, e.g. resistive RAM or ferroelectric memory
    • G06F12/0246Memory management in non-volatile memory, e.g. resistive RAM or ferroelectric memory in block erasable memory, e.g. flash memory
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F12/00Accessing, addressing or allocating within memory systems or architectures
    • G06F12/02Addressing or allocation; Relocation
    • G06F12/08Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
    • G06F12/0802Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
    • G06F12/0804Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches with main memory updating
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F12/00Accessing, addressing or allocating within memory systems or architectures
    • G06F12/02Addressing or allocation; Relocation
    • G06F12/08Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
    • G06F12/0802Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
    • G06F12/0866Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches for peripheral storage systems, e.g. disk cache
    • G06F12/0868Data transfer between cache memory and other subsystems, e.g. storage devices or host systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0602Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
    • G06F3/061Improving I/O performance
    • G06F3/0611Improving I/O performance in relation to response time
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0629Configuration or reconfiguration of storage systems
    • G06F3/0635Configuration or reconfiguration of storage systems by changing the path, e.g. traffic rerouting, path reconfiguration
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0646Horizontal data movement in storage systems, i.e. moving data in between storage devices or systems
    • G06F3/0652Erasing, e.g. deleting, data cleaning, moving of data to a wastebasket
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0655Vertical data movement, i.e. input-output transfer; data movement between one or more hosts and one or more storage devices
    • G06F3/0656Data buffering arrangements
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0655Vertical data movement, i.e. input-output transfer; data movement between one or more hosts and one or more storage devices
    • G06F3/0659Command handling arrangements, e.g. command buffers, queues, command scheduling
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0668Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/0671In-line storage system
    • G06F3/0673Single storage device
    • G06F3/0679Non-volatile semiconductor memory device, e.g. flash memory, one time programmable memory [OTP]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0668Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/0671In-line storage system
    • G06F3/0683Plurality of storage devices
    • G06F3/0688Non-volatile semiconductor memory arrays
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2212/00Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
    • G06F2212/21Employing a record carrier using a specific recording technology
    • G06F2212/214Solid state disk
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2212/00Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
    • G06F2212/22Employing cache memory using specific memory technology
    • G06F2212/222Non-volatile memory
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2212/00Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
    • G06F2212/72Details relating to flash memory management
    • G06F2212/7203Temporary buffering, e.g. using volatile buffer or dedicated buffer blocks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2212/00Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
    • G06F2212/72Details relating to flash memory management
    • G06F2212/7205Cleaning, compaction, garbage collection, erase control

Definitions

  • a swap device can serve as extended memory space, sometimes referred to as swap space, in a computing system.
  • the swap device can represent a tradeoff between the faster speed of RAM and the lower cost of alternative memory/storage mediums such as a hard disk drive (HDD) or a solid state drive (SSD).
  • HDD hard disk drive
  • SSD solid state drive
  • the NAND flash storage device includes a plurality of NAND flash chips and a controller.
  • the controller is configured to receive data over an input/output (I/O) bus.
  • the controller is configured to write the received data to a first NAND flash chip of the plurality of NAND flash chips and a second NAND flash chip of the plurality of NAND flash chips successively such that write operations to each NAND flash chip do not overlap in time.
  • the controller is configured to receive a read command over the I/O bus.
  • the controller is configured to, responsive to receiving the read command, read data from whichever of the first NAND flash chip or the second NAND flash chip is not currently executing a write operation such that read operations are not queued behind write operations.
  • the controller can be further configured to designate the first NAND flash chip as a secondary write chip, the second NAND flash chip as a future secondary write chip, a third NAND flash chip as a primary write chip, a fourth NAND flash chip as a future primary write chip, and any remaining NAND flash chips as read-only.
  • the controller can be configured to write the received data to the primary write chip and the secondary write chip successively such that write operations to each chip do not overlap in time.
  • the controller can be configured to copy valid data from the future primary write chip to the future secondary write chip.
  • the controller can be configured to, following the writing of the received data and the copying of the valid data, designate the first NAND flash chip as read-only, the second NAND flash chip as the secondary write chip, the third NAND flash chip as the future secondary write chip, and the fourth NAND flash chip as the primary write chip.
  • the controller can be further configured to, following the writing of the received data and the copying of the valid data, designate a fifth NAND flash chip as the future primary write chip.
  • the controller can be configured to write second data received over the I/O bus to the primary write chip and the secondary write chip successively such that write operations to each chip do not overlap in time.
  • the controller can be configured to copy second valid data from the future primary write chip to the future secondary write chip.
  • the controller can be configured to, following the writing of the second received data and the copying of the second valid data, designate the second NAND flash chip as read-only, the third NAND flash chip as the secondary write chip, the fourth NAND flash chip as the future secondary write chip, and the fifth NAND flash chip as the primary write chip.
  • the copy operation can include copying the valid data from the future primary write chip to the future secondary write chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the future secondary write chip.
  • the copying operation can include copying the valid data from the future secondary write chip back to the future primary write chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the future primary write chip.
  • the copying operation can include erasing the memory spaces of the future primary write chip and the future secondary write chip occupied by the free space.
  • the write operation and the copy operation are performed during overlapping time periods
  • the NAND flash storage device includes a plurality of buffer NAND flash chips, a plurality of storage NAND flash chips, and a controller.
  • the controller is configured to receive incoming data over an input/output (I/O) bus.
  • the controller is configured to write the received incoming data to a first buffer NAND flash chip of the plurality of storage NAND flash chips and a first storage NAND flash chip of the plurality of buffer NAND flash chips successively such that write operations to each NAND flash chip do not overlap in time.
  • the controller is configured to receive a read command over the I/O bus.
  • the controller is configured to, responsive to receiving the read command, read data from whichever of the first buffer NAND flash chip or the first storage NAND flash chip is not currently executing a write operation such that read operations are not queued behind write operations.
  • the controller can be configured to designate the first storage NAND flash chip as a primary write chip, a second storage NAND flash chip as a future primary write chip, the first buffer NAND flash chip as a secondary write chip, a second buffer NAND flash chip as a future secondary write chip, and any remaining NAND flash chips as read-only.
  • the controller can be configured to write the received data to the primary write chip and the secondary write chip successively such that write operations to each chip do not overlap in time.
  • the controller can be configured to copy valid data from the future primary write chip to the future secondary write chip.
  • the controller can be configured to following the writing of the received data and the copying of the valid data, designate the first storage NAND flash chip as read-only, the second storage NAND flash chip as the primary write chip, the first buffer NAND flash chip as the future secondary write chip, and the second buffer NAND flash chip as the secondary write chip.
  • the controller can be configured to, following the writing of the received data and the copying of the valid data, designate a third storage NAND flash chip as the future primary write chip.
  • the controller can be configured to write second data received over the I/O bus to the primary write chip and the secondary write chip successively such that write operations to each chip do not overlap in time.
  • the controller can be configured to copy second valid data from the future primary write chip to the future secondary write chip.
  • the controller can be configured to, following the writing of the second received data and the copying of the second valid data, designate the second storage NAND flash chip as read-only, the third storage NAND flash chip as the primary write chip, the first buffer NAND flash chip as the secondary write chip, and the second buffer NAND flash chip as the future secondary write chip.
  • the copy operation can include copying the valid data from the future primary write chip to the future secondary write chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the future secondary write chip.
  • the copy operation can include copying the valid data from the future secondary write chip back to the future primary write chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the future primary write chip; and erasing the memory spaces of the future primary write chip and the future secondary write chip occupied by the free space.
  • the buffer NAND flash chips can be single-level cell (SLC) NAND flash chips, and the storage NAND flash chips can be triple-level cell (TLC) NAND flash chips.
  • SLC single-level cell
  • TLC triple-level cell
  • At least one aspect is directed to method of buffering a NAND flash storage device.
  • the method includes receiving data over an input/output (I/O) bus.
  • the method includes writing the received data to a first NAND flash chip of the plurality of NAND flash chips and a second NAND flash chip of the plurality of NAND flash chips successively such that write operations to each NAND flash chip do not overlap in time.
  • the method includes receiving a read command over the I/O bus.
  • the method includes, responsive to receiving the read command, reading data from whichever of the first NAND flash chip or the second NAND flash chip is not currently executing a write operation such that read operations are not queued behind write operations
  • the method can include designating the first NAND flash chip as a secondary write chip, the second NAND flash chip as a future secondary write chip, a third NAND flash chip as a primary write chip, a fourth NAND flash chip as a future primary write chip, and any remaining NAND flash chips as read-only.
  • the method can include writing the received data to the primary write chip and the secondary write chip successively such that write operations to each chip do not overlap in time.
  • the method can include copying valid data from the future primary write chip to the future secondary write chip.
  • the method can include, following the writing of the received data and the copying of the valid data, designating the first NAND flash chip as read-only, the second NAND flash chip as the secondary write chip, the third NAND flash chip as the future secondary write chip, and the fourth NAND flash chip as the primary write chip.
  • the method can include, following the writing of the received data and the copying of the valid data, designating a fifth NAND flash chip as the future primary write chip.
  • the method can include writing second data received over the I/O bus to the primary write chip and the secondary write chip successively such that write operations to each chip do not overlap in time.
  • the method can include copying second valid data from the future primary write chip to the future secondary write chip.
  • the method can include, following the writing of the second received data and the copying of the second valid data, designating the second NAND flash chip as read-only, the third NAND flash chip as the secondary write chip, the fourth NAND flash chip as the future secondary write chip, and the fifth NAND flash chip as the primary write chip.
  • the copy operation can include copying the valid data from the future primary write chip to the future secondary write chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the future secondary write chip.
  • the copy operation can include copying the valid data from the future secondary write chip back to the future primary write chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the future primary write chip.
  • the copy operation can include erasing the memory spaces of the future primary write chip and the future secondary write chip occupied by the free space.
  • the writing operation and the copying operation are performed during overlapping time periods.
  • At least one aspect is directed to method of buffering a NAND flash storage device.
  • the method includes receiving data over an input/output (I/O) bus.
  • the method includes writing the received incoming data to a first buffer NAND flash chip of the plurality of storage NAND flash chips and a first storage NAND flash chip of the plurality of buffer NAND flash chips successively such that write operations to each NAND flash chip do not overlap in time.
  • the method includes receiving a read command over the I/O bus.
  • the method includes, responsive to receiving the read command, reading data from whichever of the first buffer NAND flash chip or the first storage NAND flash chip is not currently executing a write operation such that read operations are not queued behind write operations.
  • the method includes designating the first storage NAND flash chip as a primary write chip, a second storage NAND flash chip as a future primary write chip, the first buffer NAND flash chip as a secondary write chip, a second buffer NAND flash chip as a future secondary write chip, and any remaining NAND flash chips as read-only.
  • the method includes writing the received data to the primary write chip and the secondary write chip successively such that write operations to each chip do not overlap in time.
  • the method includes copying valid data from the future primary write chip to the future secondary write chip.
  • the method includes following the writing of the received data and the copying of the valid data, designating the first storage NAND flash chip as read-only, the second storage NAND flash chip as the primary write chip, the first buffer NAND flash chip as the future secondary write chip, and the second buffer NAND flash chip as the secondary write chip.
  • the method includes, following the writing of the received data and the copying of the valid data, designating a third storage NAND flash chip as the future primary write chip.
  • the method includes writing second data received over the I/O bus to the primary write chip and the secondary write chip successively such that write operations to each chip do not overlap in time.
  • the method includes copying second valid data from the future primary write chip to the future secondary write chip.
  • the method includes, following the writing of the second received data and the copying of the second valid data, designating the second storage NAND flash chip as read-only, the third storage NAND flash chip as the primary write chip, the first buffer NAND flash chip as the secondary write chip, and the second buffer NAND flash chip as the future secondary write chip.
  • the method includes copying the valid data from the future primary write chip to the future secondary write chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the future secondary write chip.
  • the method includes copying the valid data from the future secondary write chip back to the future primary write chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the future primary write chip.
  • the method includes erasing the memory spaces of the future primary write chip and the future secondary write chip occupied by the free space.
  • the buffer NAND flash chips can be single-level cell (SLC) NAND flash chips, and the storage NAND flash chips can be triple-level cell (TLC) NAND flash chips.
  • SLC single-level cell
  • TLC triple-level cell
  • FIG. 1 is block diagram of an example computer system including a NAND flash storage device with a NAND buffer, according to an illustrative implementation
  • FIG. 2 is a block diagram of an example NAND flash storage device with a NAND buffer, according to an illustrative implementation
  • FIG. 3 is a flowchart showing an example method of using a NAND flash storage device with a NAND buffer, according to an illustrative implementation
  • FIG. 4 is a block diagram of an example NAND flash storage device with a NAND buffer and additional NAND storage, according to an illustrative implementation
  • FIG. 5 shows example operations of a NAND flash storage device with a NAND buffer and additional NAND storage, according to an illustrative implementation
  • FIG. 6 is a flowchart showing an example method of using a NAND flash storage device with a NAND buffer and additional NAND storage, according to an illustrative implementation
  • FIG. 7 is block diagram of an example NAND flash storage device with a rotating NAND buffer for wear leveling, according to an illustrative implementation
  • FIG. 8 shows example operations of a NAND flash storage device with a rotating NAND buffer for wear leveling, according to an illustrative implementation
  • FIG. 9 is a flowchart showing an example method of using a NAND flash storage device with a rotating NAND buffer for wear leveling, according to an illustrative implementation.
  • the storage device can be a solid-state drive (SSD) or a swap device.
  • a swap device is a secondary storage device that can function as an extension of Random Access Memory (RAM). When the amount of data to be retained in memory exceeds the capacity of RAM, some of the data can be moved to the swap device in an operation sometimes referred to as “paging.”
  • the secondary device could be a hard disk drive (HDD). Reading and writing data from an HDD is very slow relative to RAM, however, making an HDD impractical as a swap device in settings where retrieval speed is important.
  • NAND flash-based SSDs although still slower than RAM, exhibit faster retrieval times relative to HDDs. NAND SSDs also tend to be less expensive than RAM, thus reducing RAM capacity in favor of SSD capacity can lead to cost savings. More recently developed low-latency NAND (LLNAND), such as Samsung's Z-NAND, are designed with special circuitry that improves its typical read latency. LLNAND-based SSDs are even faster, and thus practical for paging and other temporary data storage in some server applications.
  • LLNAND low-latency NAND
  • NAND flash SSDs have other drawbacks, however. While NAND read time is very fast, on the order of 10s of microseconds, erase and write operations are much slower, on the order of milliseconds each. And, because a NAND flash chip cannot execute a read operation and write operation at the same time, a read operation queued behind an erase/write operation can exhibit a latency increase of two or three orders of magnitude from normal. This is called “tail latency.” While it is possible to buffer data in RAM for writing to a NAND flash chip, the overhead can be cost prohibitive. For example, a device having ten NAND flash chips will require RAM equivalent to one tenth of the total NAND capacity as a buffer. Because DRAM can be four times as expensive as NAND storage, the cost overhead may be 40%.
  • This disclosure therefore proposes employing a NAND buffer to improve tail latency at a lower cost overhead than that of a RAM buffer.
  • the system works by writing each page of data into two chips in sequence: first into a first chip, and then into a second chip.
  • the second chip may be considered a NAND buffer.
  • the write operations do not overlap in time. Read operations can be executed by the chip that is presently not executing a write operation. Thus, the read operations need not be queued behind write operations, allowing the NAND flash storage device to have more predictable latency.
  • the first chip (and, by extension, the second chip) are filled, and two new chips are chosen as write buffers.
  • the first chip can be switched to read only and, because the first chip can perform read operations without interruptions from write operations, the copy of the data stored on the second chip is no longer necessary.
  • the second chip is now free for future primary write operations.
  • Two new chips, a third chip and a fourth chip, can be selected for writing.
  • the fourth chip may be considered a NAND buffer.
  • the two new chips selected for writing can contain duplicate data. Accordingly, prior to receiving new data, existing data on the third chip can first be copied to the fourth chip.
  • the copying operation involves “garbage collection,” an operation memory devices perform to free up space occupied by data that has been invalidated (is no longer in use). Garbage collection can be time consuming and, like erase/write operations, can prevent read operations from executing until it is complete. Therefore, the garbage collection and copying operations are executed such that initially, only the fourth chip is being written and third chip is available for reading. Later, the garbage collected version is copied back from the fourth chip to the third chip such that the fourth chip is available for reading.
  • garbage collection and copy operations between third and fourth chip are overlapped in time with writing into the first and second chip. As long as the garbage collection and copy operation takes less time than the writing operations, the third and fourth chips will be ready to take over execution of write operations when the first and second chips are filled. A new pair of chips will then be selected for garbage collection and copy operation.
  • two buffer chips contain a duplicate of the data in two other chips. Accordingly, a device having ten NAND flash chips for storage can have two additional NAND flash buffer chips. The cost overhead will thus be 20%, which can be half the cost overhead of using a DRAM buffer.
  • Additional buffer NAND flash chips can be employed to improve the write bandwidth of the NAND flash storage device.
  • the write bandwidth of a NAND flash storage device refers to the amount of data the NAND flash storage device can receive and store in a given amount of time.
  • the write bandwidth of a NAND flash storage device can be limited by the time it takes to write data to a particular NAND flash chip. Therefore, write bandwidth can be increased by adding more buffer NAND flash chips, and distributing the write operations among them.
  • the NAND flash storage device can include additional buffer chips (i.e., 4, 6, 8, or more) to improve the write bandwidth.
  • the NAND flash storage device can employ wear leveling.
  • a NAND flash chip can only perform a finite number of write operations before exhibiting excessive bit errors. Eventually, after performing many write operations, the NAND flash chip will wear out and begin to fail.
  • the NAND flash storage device can rotate which NAND flash chips are designated as buffer chips for write operations. The designation can be rotated to equalize or substantially equalize wear among some or all of the NAND flash chips in the device, as opposed to concentrating buffering write operations on a small number of chips. In this manner, the wear can be spread, or “leveled,” across many NAND flash chips of the device.
  • FIG. 1 is block diagram of an example computer system 100 including a NAND flash storage device with a NAND buffer, according to an illustrative implementation.
  • the system 100 can be a computing device; for example and without limitation, a personal computing device such as a laptop or desktop computer, a shared computing device such as a server or mainframe, or component of the aforementioned or other computing device.
  • the system 100 includes a processor such as a central processing unit (CPU) 110 , RAM 120 such as DRAM, an input/output (I/O) bus 130 such as a Peripheral Component Interconnect Express (PCIe), a data storage device 140 such as a hard disk drive (HDD) or a solid-state drive (SSD) for long-term data storage, and a NAND flash storage device 150 .
  • the RAM 120 may be a main memory and can be directly accessible to the CPU 110 over, for example, an address bus that carries a memory address of the desired location, and a data bus used to read or write data from/to the specified memory address.
  • the NAND flash storage device 150 can be a swap device providing a swap space to the system 100 .
  • the kernel or operating system can employ a memory/storage abstraction such as virtual memory, which provides applications executing on the CPU 110 with transparent and consistent access to one or more memory devices that can vary in type and location.
  • virtual memory can allow applications executing on the CPU 110 to access an extended memory space.
  • the extended memory space can exceed the capacity of the RAM 120 .
  • the virtual memory function can perform paging, by which the system 100 can store data in a secondary memory device such as an HDD or a SSD.
  • This extended memory space can be referred to as swap space
  • the storage device providing the swap space can be referred to as a swap device.
  • NAND flash storage devices are much faster than HDDs, and thus become useful for a wider range of applications.
  • NAND flash storage devices have the additional advantage of being less expensive than some common types of RAM such as DRAM.
  • Newer types of NAND such as low-latency NAND (LLNAND) can be even faster, exhibiting latency for read operations of approximately 10 ⁇ s versus approximately 70 ⁇ s regular NAND.
  • LLNANDs also exhibit less latency for program (write) operations; i.e., 100-200 ⁇ s versus a few milliseconds for regular NAND.
  • a write operation generally involves both a program operation and an erase operation.
  • LLNANDs may still suffer from slow erase times, which can take on the order of 5 ms.
  • a NAND flash chip or die can generally only perform one operation at a time. For example, if a NAND flash chip is executing an erase/program sequence of a write operation, any incoming read commands will be queued until completion. Thus, some read operations may require up to several milliseconds to complete despite a nominal latency of tens of microseconds.
  • tail latency Both the duration and the unpredictability of this tail latency can present difficulties when using NAND flash.
  • Some systems can use a RAM to cache the contents of a NAND flash chip being written to such that the RAM can perform the read operation while the NAND flash chip executes write operations—see, for example, U.S. patent application Ser. No. 14/936,486. Because RAM can be more expensive that NAND flash, however, the RAM adds a cost overhead that can be unacceptable for some applications. Therefore, this disclosure proposes providing one or more additional NAND flash chips to provide a cache for data residing in NAND flash chips executing write operations.
  • a NAND flash-based buffer is described in more detail below with regard to FIG. 2 .
  • FIG. 2 is a block diagram of an example NAND flash storage device 250 with a NAND buffer, according to an illustrative implementation.
  • the NAND flash storage device 250 can serve as the NAND flash storage device 150 .
  • the NAND flash storage device 250 can employ double buffering using two NAND flash chips 270 a and 270 b (collectively, “NAND flash chips 270 ”).
  • a controller 260 can handle storage access commands received over the I/O bus 230 .
  • the controller 260 can include programmable logic such as an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA), a microcontroller, or a microprocessor.
  • ASIC application-specific integrated circuit
  • FPGA field-programmable gate array
  • microcontroller or a microprocessor.
  • the controller 260 can be integral with, or external to the NAND flash storage device 250 .
  • the controller 260 can orchestrate read and write operations of the NAND flash storage device 250 such that the CPU 110 need not be aware of, or itself execute, the buffering operations of the NAND flash storage device 250 .
  • the controller 260 is configured to receive data and/or commands over the I/O bus 230 , and write the received data first to one of the NAND flash chips 270 , and then to a second of the NAND flash chips 270 .
  • the controller 260 executes the write operations successively—either autonomously or under the direction of CPU 110 —such that they do not overlap in time; thus, at any given time, the NAND flash chip 270 that is not presently performing a write operation can promptly perform read operations.
  • the controller 260 executes each write operation, two of the NAND flash chips 270 can contain the same data.
  • the controller 260 can execute read commands by reading from whichever of the first NAND flash chip or the second NAND flash chip is not currently executing a write operation such that read operations are not queued behind write operations. In this manner, tail latency can be improved.
  • the NAND flash storage device 250 can include additional NAND flash chips 270 c and 270 d to further improve write bandwidth.
  • a NAND flash storage device 250 having four NAND flash chips 270 a through 270 d can split write operations between two NAND flash chips 270 ; e.g., NAND flash chips 270 a and 270 b . After the NAND flash chips 270 a and 270 b complete their respective write operations, those write operations can be repeated on the NAND flash chips 270 c and 270 d . In this manner, the write bandwidth of the NAND flash storage device 250 can be approximately doubled.
  • the NAND flash storage device 250 can employ additional NAND flash chips 270 for further increases in write bandwidth.
  • the NAND device 250 can include some RAM 265 to serve as a cache or buffer to store data while it is written to each NAND flash chip 270 .
  • This RAM 265 need not have the capacity to buffer the entire contents of a NAND flash chip 270 , but rather can be only large enough to buffer a single block or page of data. Accordingly, it can be much smaller than a dedicated RAM buffer, and thus not contribute appreciably to the cost overhead of the NAND device 250 .
  • FIG. 3 is a flowchart showing an example method 300 of using a NAND flash storage device with a NAND buffer, according to illustrative implementations.
  • the NAND flash storage device can include any of the NAND flash storage devices 150 , 250 , 450 , or 750 described herein.
  • the method 300 includes receiving data over an input/output (I/O) bus (stage 310 ).
  • the method 300 includes writing the received data to a first NAND flash chip and a second NAND flash chip successively such that the write operations to each NAND flash chip do not overlap in time (stage 320 ).
  • the method includes receiving a read command over the I/O bus (stage 330 ).
  • the method 300 includes reading data from whichever of the first NAND flash chip or the second NAND flash chip is not currently executing a write operation such that read operations are not queued behind write operations (stage 340 ).
  • the method 300 includes receiving data over an I/O bus (stage 310 ).
  • the NAND flash storage device can receive a write command over the I/O bus, such as one of the I/O busses 130 or 230 previously described with respect to the NAND flash storage devices 150 or 250 , respectively.
  • the write command can include data to be stored by the NAND flash storage device.
  • the method 300 includes writing the received data to a first NAND flash chip and a second NAND flash chip successively such that the write operations to each NAND flash chip do not overlap in time (stage 320 ).
  • the write operations can be executed by a controller of the NAND flash storage device, such as the CPU 110 or the controller 260 previously described.
  • the controller can execute the write operation on one of two NAND flash chips, such as the NAND flash chip 270 a or 270 b . While the controller executes a write operation on one of the NAND flash chips 270 , that particular NAND flash chip will not be able to process other operations. Accordingly, the controller will execute the write operations on the two NAND flash chips successively such that they do not overlap in time.
  • the receiving stage 310 and the writing stage 320 can be repeated until the first and second NAND flash chips are full or substantially full.
  • the NAND flash chips are low-latency NAND (LLNAND) chips.
  • the method includes receiving a read command over the I/O bus (stage 330 ).
  • the method 300 includes reading data from whichever of the first NAND flash chip or the second NAND flash chip is not currently executing a write operation such that read operations are not queued behind write operations (stage 340 ).
  • the NAND flash storage device may receive a read command over the I/O bus.
  • the controller can direct the read command to whichever of the NAND flash chips is not currently executing a write operation.
  • stages 330 and 340 can proceed concurrently with stages 310 and 320 , and read operations can be processed promptly without being queued behind slower write operations.
  • the stages of the method 300 can repeat as additional data and read commands are received over the I/O bus.
  • the role played by each particular NAND flash chip can cycle, with different NAND flash chips performing different buffer and storage functions at different times. Examples of different manners of NAND flash chip buffering and storage are described below with reference to FIGS. 4 through 9 .
  • FIG. 4 is a block diagram of an example NAND flash storage device 450 with a NAND buffer and additional NAND storage, according to illustrative implementations.
  • the NAND flash storage device 450 includes a controller 460 for executing read and write commands received over the I/O bus 430 .
  • the controller 460 can include RAM 465 , or be in communication with the RAM 465 via a memory bus.
  • the controller 460 can include programmable logic such as an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA), a microcontroller, or a microprocessor.
  • ASIC application-specific integrated circuit
  • FPGA field-programmable gate array
  • the controller 460 can be integral with, or external to the NAND flash storage device 450 .
  • the NAND flash storage device 450 can include two buffer NAND flash chips 470 a and 470 b (collectively, “buffer NAND flash chips 470 ”). In some implementations, the NAND flash storage device 450 can include additional buffer NAND flash chips 470 , as described further below. In some implementations, the NAND flash storage device 450 can use the RAM 465 for buffering data received over the I/O bus 430 while it is being written to one or more of the buffer NAND flash chips 470 .
  • the NAND flash storage device 450 also includes storage NAND flash chips 475 a through 475 e (collectively, “storage NAND flash chips 475 ”) that provide additional memory capacity over that of the buffer NAND flash chips 470 . Data received over the I/O bus 430 can be buffered in the buffer NAND flash chips 470 before being copied to one of the storage NAND flash chips 475 .
  • the buffer NAND flash chips 470 can be single level cell (SLC) NAND flash chips or low-latency NAND (LLNAND) flash chips while the storage NAND flash chips 475 may be multi-level cell (MLC) or triple-level cell (TLC) NAND flash chips.
  • the buffer NAND flash chips 470 can be of smaller capacity than the storage NAND flash chips 475 .
  • a group of buffer NAND flash chips 470 can have a capacity equal to the capacity of a single storage NAND flash chip 475 .
  • the group of buffer NAND flash chips 470 can be used collectively as a single buffer chip.
  • the write bandwidth of NAND flash storage devices can be limited due to the relatively slow pace of performing write operations. However, in some implementations, write bandwidth can be improved by designating more buffer NAND flash chips 470 as write buffers.
  • the NAND flash storage device 450 can include additional buffer NAND flash chips 470 ; for example, buffer NAND flash chips 470 c and 470 d . In some implementations, the NAND flash storage device 450 can include even more buffer NAND flash chips 470 . The controller 460 can thus spread write operations among the buffer NAND flash chips 470 to multiply the write bandwidth.
  • the NAND flash storage device 450 can potentially face two drawbacks. First, the buffer NAND flash chips 470 will eventually reach capacity. Once all data in a buffer NAND flash chip 470 has been copied to one of the storage NAND flash chips 475 , all data in the buffer NAND flash chip 470 can be deleted. While the buffer NAND flash chip 470 performs erase operations, it cannot perform a write operation. Therefore, the write bandwidth will be limited by the unavailability of that buffer NAND flash chip 470 .
  • the second potential drawback relates to the finite number of write operations that a NAND flash chip can handle before exhibiting excessive bit errors.
  • NAND flash chips experience wear from processing write operations. Eventually, after performing many write operations, the NAND flash chip will wear out and begin to exhibit failures.
  • the challenge of limited write bandwidth can be addressed by increasing the number of buffer NAND flash chips 470 as described above, and the NAND wear problem can be addressed by using higher endurance chips as buffer NAND flash chips 470 .
  • the NAND wear problem can be addressed by rotating the particular NAND flash chips designated as buffers for incoming data. Such a wear-leveling scheme is described further below with reference to FIGS. 7 through 9 .
  • FIG. 5 below illustrates example operations of the NAND flash storage device 450 .
  • FIG. 5 shows example operations 500 of a NAND flash storage device with a NAND buffer and additional NAND storage, according to an illustrative implementation. These operations can be performed by a NAND flash storage device such as the NAND flash storage device 450 previously described.
  • the operations 500 include stage 510 , in which incoming data is written to the buffer NAND flash chip 470 a and the storage NAND flash chip 475 a , and data is copied with garbage collection from the storage NAND flash chip 475 b to the buffer NAND flash chip 470 b and subsequently copied back to the storage NAND flash chip 475 b .
  • the operations 500 include stage 520 , in which incoming data is written to the buffer NAND flash chip 470 b and the storage NAND flash chip 475 b , and data is copied with garbage collection from the storage NAND flash chip 475 c to the buffer NAND flash chip 470 a and subsequently copied back to the storage NAND flash chip 475 c .
  • the operations 500 include stage 530 , in which incoming data is written to the buffer NAND flash chip 470 a and the storage NAND flash chip 475 c , and the buffer NAND flash chip 470 b receives data copied from the storage NAND flash chip 475 d .
  • the buffer NAND flash chips 470 are indicated by a buffer 580 box.
  • each NAND flash chip 470 or 475 is currently performing is indicated by an acronym.
  • PW stands for “primary write” and indicates that the storage NAND flash chip 475 is currently designated for receiving data and performing write operations.
  • SW stands for “secondary write” and indicates that the buffer NAND flash chip 470 is also currently designated for receiving data and performing write operations.
  • FPW stands for “future primary write” and indicates that the NAND flash chip 475 is designated for being prepared for future service receiving data and performing write operations.
  • FSW stands for “future secondary write” and indicates that the NAND flash chip 470 is designated for being prepared for future service receiving data and performing write operations.
  • RO stands for “read-only” and indicates that the controller 460 will only call upon the RO storage NAND flash chips 475 to execute read operations. These labels are merely arbitrary, and are assigned only for the purpose of describing the operation of the various chips at particular points in time. Between each state of the operations 500 , the role of each NAND flash chip 470 or 475 changes as different NAND flash chips 470 or 475 receive data and perform write operations.
  • the buffer NAND flash chip 470 a and the storage NAND flash chip 475 a receive and write incoming data, as indicated by the arrows.
  • the buffer NAND flash chip 470 a and the storage NAND flash chip 475 a receive and write the same data, as the SW and the PW chips, respectively; however, they can perform the write operations during non-overlapping time periods.
  • both PW and the SW chips contain substantially the same data, and a NAND flash chip that is not performing a write operation can respond to a read command and perform the read operation promptly without having to wait for completion of the relatively long write operation.
  • the PW and SW chips can continue receiving data and performing write operations until they are full or substantially full.
  • the FPW storage NAND flash chip 475 b copies its data to the FSW buffer NAND flash chip 470 b in preparation for receiving external data in the next stage 520 .
  • the NAND flash storage device 450 can perform garbage collection on the FPW and the FSW chips. When fragmented data is copied from the FPW chip to the FSW chip, it is consolidated so that it is written to the FSW chip in a continuous or substantially continuous memory space, while free space is similarly consolidated in a second continuous or substantially continuous memory space. This makes the data representation more compact and leaves space at the end for new data to be written at the next stage 520 .
  • the operations 500 can continue with the next stage 520 .
  • the NAND flash storage device 450 may from time to time receive a read request.
  • the controller 460 may handle the read request in one of three ways depending on whether the requested data is on a RO chip, one of the PW or SW chips, or one of the FPW or FSW chips. If the requested data is on a RO chip, the controller 460 can simply execute the read operation on the RO chip and returns the data—there is no write operation occurring that would interfere with or delay execution of the read operation. If the requested data is on the PW or SW chip, the controller 460 can execute the read operation on whichever of the PW or SW chips is not currently executing a write operation such that read operations are not queued behind write operations.
  • the controller 460 can execute the read operation on whichever of the FPW or FSW is not currently executing a write operation such that read operations are not queued behind write operations. That is, if data is currently being copied with garbage collection from the FPW chip to the FSW chip, the controller 460 will be executing write operations on the FSW chip, and so can execute the read operation on the FPW chip. When the data is being copied back from the FSW chip to the FPW chip, the controller 460 can execute the read operation on the FSW chip.
  • the NAND flash device 450 can read and return the requested data without undue delay caused by queuing the read operation behind a write operation.
  • the storage NAND flash chip 475 b is designated as the PW chip
  • the buffer NAND flash chip 470 b is designated as the SW chip
  • the storage NAND flash chip 475 c is designated as the FPW chip
  • the buffer NAND flash chip 470 a is designated as the FSW chip
  • the storage NAND flash chip 475 a is designated RO.
  • the functions of the PW, SW, FPW, and FSW chips are similar to functions performed by the chips with the same designations in the previous stage 510 , but with different NAND flash chips performing each designated role.
  • the SW and the PW chips receive and write the same data as indicated by the arrows; however, they can perform the write operations during non-overlapping time periods so that they can respond to a read command and perform the read operation promptly without having to wait for completion of the relatively long write operation.
  • the FPW chip copies its data to the FSW chip in preparation for receiving external data in the next stage 530 .
  • the FPW and the FSW chips can perform garbage collection operations.
  • the operations 500 can continue with the next stage 530 .
  • the storage NAND flash chip 475 c is designated as the PW chip
  • the buffer NAND flash chip 470 a is designated as the SW chip
  • the storage NAND flash chip 475 d is designated as the FPW chip
  • the buffer NAND flash chip 470 b is designated as the FSW chip
  • the storage NAND flash chip 475 b is designated RO.
  • the functions of the PW, SW, FPW, and FSW chips are similar to functions performed by the chips with the same designations in the previous stages 510 and 520 , but with different NAND flash chips performing each designated role.
  • the SW and the PW chips receive and write the same data as indicated by the arrows; however, they can perform the write operations during non-overlapping time periods so that they can respond to a read command and perform the read operation promptly without having to wait for completion of the relatively long write operation.
  • the FPW chip copies its data to the FSW chip in preparation for receiving external data in the next stage.
  • the FPW and the FSW chips can perform garbage collection operations.
  • the operations 500 can continue with the next stage. The operations 500 can continue in this manner until all of the storage NAND flash chips 475 have been written to, at which time the operations 500 can repeat starting at the stage 510 .
  • FIG. 6 is a flowchart showing an example method 600 of using a NAND flash storage device with a NAND buffer and additional NAND storage, according to illustrative implementations.
  • the method 600 can provide for NAND flash storage space beyond the storage space provided by the buffer NAND chips.
  • the method 600 can be performed by the NAND flash storage device 450 previously described with reference to FIG. 4 .
  • the NAND flash storage device 450 can include buffer NAND flash chips 470 and storage NAND flash chips 475 .
  • the method 600 includes designating a first storage NAND flash chip as a primary write (PW) chip, a second storage NAND flash chip as a future primary write (FPW) chip, a first buffer NAND flash chip as a secondary write (SW) chip, a second buffer NAND flash chip as a future secondary write (FSW) chip, and any remaining NAND flash chips as read-only (RO).
  • the method 600 includes receiving data over an input/output (I/O) bus (stage 620 ).
  • the method 600 includes writing the received data to the PW chip and the SW chip successively such that the write operations to each NAND flash chip do not overlap in time (stage 630 ).
  • the method 600 includes receiving a read command over the I/O bus (stage 640 ).
  • the method 600 includes, responsive to receiving the read command, reading data from a NAND flash chip that is not currently executing a write operation such that read operations are not queued behind write operations (stage 650 ).
  • the method 600 includes copying valid data from the FPW chip to the FSW chip (stage 660 ).
  • the method 600 includes, following the writing of the received data and the copying of the valid data, designating the first storage NAND flash chip as RO, the second storage NAND flash chip as the PW chip, the first buffer NAND flash chip as the FSW chip, and the second buffer NAND flash chip as the SW chip (stage 670 ).
  • the method 600 includes designating a first storage NAND flash chip as a primary write chip, a second storage NAND flash chip as a future primary write chip, a first buffer NAND flash chip as a secondary write chip, a second buffer NAND flash chip as a future secondary write chip, and any remaining NAND flash chips as read-only (RO).
  • a first storage NAND flash chip as a primary write chip
  • a second storage NAND flash chip as a future primary write chip
  • a first buffer NAND flash chip as a secondary write chip
  • a second buffer NAND flash chip as a future secondary write chip
  • any remaining NAND flash chips as read-only (RO).
  • the method 600 includes receiving data over an I/O bus (stage 620 ).
  • Stage 620 can be similar to stage 310 described previously.
  • the method 600 includes writing the received data to the PW chip and the SW chip successively such that the write operations to each NAND flash chip do not overlap in time (stage 630 ).
  • Stage 630 can be similar to stage 320 described previously.
  • the receiving stage 620 and the writing stage 630 can be repeated until the PW chip and the SW chip are full or substantially full.
  • the method 600 includes receiving a read command over the I/O bus (stage 640 ).
  • Stage 630 can be similar to stage 430 described previously.
  • the method 600 includes, responsive to receiving the read command, reading data from a NAND flash chip that is not currently executing a write operation such that read operations are not queued behind write operations (stage 650 ).
  • Stage 650 can be similar to stage 340 described previously.
  • the requested data may reside on a RO chip, one of the PW or SW chips, or one of the FPW or FSW chips.
  • the controller may handle the read request in one of three ways depending on where the requested data resides. If the requested data is on the PW or SW chip, the controller can execute the read operation on whichever of the PW or SW chips is not currently executing a write operation such that read operations are not queued behind write operations, and the method 600 can proceed to the next stage.
  • the controller can simply execute the read operation on the RO chip and returns the data—there is no write operation occurring that would interfere with or delay execution of the read operation. If the requested data is on the FPW or FSW chip, the controller can execute the read operation on whichever of the FPW or FSW is not currently executing a write operation such that read operations are not queued behind write operations. That is, if data is currently being copied with garbage collection from the FPW chip to the FSW chip, the controller 460 will be executing write operations on the FSW chip, and so can execute the read operation on the FPW chip. When the data is being copied back from the FSW chip to the FPW chip, the controller can execute the read operation on the FSW chip.
  • stages 640 and 650 can proceed concurrently with stages 620 and 630 , and read operations can be processed promptly without being queued behind slower write operations.
  • the stages 620 through 650 can repeat as additional data and read commands are received over the I/O bus.
  • the method 600 includes copying valid data from the FPW chip to the FSW chip (stage 660 ).
  • the FPW and FSW chips can perform garbage collection to consolidate the valid data and the free space, respectively.
  • the NAND flash storage device can first copy the valid data from the FPW chip to the FSW chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the FSW chip.
  • the NAND flash storage device can copy the valid data from the FSW chip back to the FPW chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the FPW chip.
  • the NAND flash storage device can erase the memory spaces of the FPW and FSW chips occupied by the free space.
  • stage 660 can be performed during an overlapping time period with the buffering operations of stages 620 and 630 .
  • read commands can be received and executed by whichever of the FPW and the FSW is not currently executing write operations as part of the copy operations.
  • the copying operations of stage 660 can also be performed during an overlapping time period with the read operations of stages 640 and 650 .
  • the method 600 includes, following the writing of the received data and the copying of the valid data, designating the first storage NAND flash chip as RO, the second storage NAND flash chip as the PW chip, the first buffer NAND flash chip as the FSW chip, and the second buffer NAND flash chip as the SW chip (stage 670 ).
  • the stages of the method 600 can repeat as additional data and read commands are received over the I/O bus.
  • the role played by each particular NAND flash chip can cycle, with different NAND flash chips performing different buffer and storage functions at different times.
  • FIG. 7 is block diagram of an example NAND flash storage device 750 with a rotating NAND buffer for wear leveling, according to illustrative implementations.
  • the NAND flash storage device 750 includes a controller 760 for processing read and write commands received over the I/O bus 730 .
  • the controller 760 can include RAM 765 , or be in communication with the RAM 765 via a memory bus.
  • the controller 760 can include programmable logic such as an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA), a microcontroller, or a microprocessor.
  • ASIC application-specific integrated circuit
  • FPGA field-programmable gate array
  • the controller 760 can be integral with, or external to the NAND flash storage device 750 .
  • the NAND flash storage device 750 includes NAND flash chips 770 a to 770 f (collectively, “NAND flash chips 770 ”). In some implementations, the NAND flash storage device 750 can include more or fewer NAND flash chips 770 . In some implementations, the NAND flash storage device 750 can use the RAM 765 for buffering data received over the I/O bus 730 while it is being written to one or more of the NAND flash chips 770 . The NAND flash storage device 750 can employ wear leveling by occasionally changing the NAND flash chips 770 used for buffering. For example, during a first stage of operation, NAND flash chips 770 a and 770 b can serve as the NAND buffers, with the controller 760 writing data to each one in turn. Example operations of the NAND flash storage device 750 are described in more detail below with reference to FIG. 8 .
  • FIG. 8 shows example operations 800 of a NAND flash storage device with a rotating NAND buffer employing wear leveling, according to illustrative implementations. These operations can be performed by a NAND flash storage device such as the NAND flash storage device 750 previously described.
  • the operations 800 include stage 810 , in which incoming data is written to the NAND flash chips 770 a and 770 c , while data is copied with garbage collection from the NAND flash chip 770 d to the NAND flash chip 770 b and subsequently copied back to the NAND flash chip 770 d .
  • the operations 800 include stage 820 , in which incoming data is written to the NAND flash chips 770 b and 770 d , while data is copied with garbage collection from the NAND flash chip 770 e to the NAND flash chip 770 c and subsequently copied back to the NAND flash chip 770 e .
  • the operations 800 include stage 830 , in which incoming data is written to the NAND flash chips 770 c and 770 e , while data is copied with garbage collection from the NAND flash chip 770 f to the NAND flash chip 770 d and subsequently copied back to the NAND flash chip 770 f .
  • the secondary write and the future secondary write NAND flash chips 770 are considered the buffer NAND flash chips, and are indicated by a buffer 880 box.
  • the operation each NAND flash chip 770 is currently performing is indicated by an acronym.
  • PW stands for “primary write” and indicates that the NAND flash chip 770 is currently receiving data and performing write operations.
  • SW stands for “secondary write” and indicates that the NAND flash chip 770 is currently receiving data and performing write operations as one of the buffer NAND flash chips.
  • FPW stands for “future primary write” and indicates that the NAND flash chip 770 is being prepared for future service as the primary write chip by copying its data to the future secondary write chip.
  • FSW Future secondary write
  • RO read-only
  • the NAND flash chip 770 a serves as the SW chip
  • the NAND flash chip 770 b serves as the FSW chip
  • the NAND flash chip 770 c serves as the PW chip
  • the NAND flash chip 770 d serves as the FPW chip
  • the remaining NAND flash chips serve as RO chips.
  • Incoming data is written to the NAND flash chips 770 a and 770 c —the SW and PW chips, respectively. Meanwhile, data is copied with garbage collection from the NAND flash chip 770 d to the NAND flash chip 770 b —the FPW and FSW chips, respectively—and subsequently copied back to the NAND flash chip 770 b .
  • the buffer 880 includes the NAND flash chips 770 a and 770 b .
  • the NAND flash chip 770 b receives a copy of the data from the NAND flash chip 770 d , so it may be at least partially unavailable for executing read operations, and may be affirmatively prevented by the controller 760 from executing read or write operations not associated with the copy operation. Garbage collection may be performed during the copy operation. This means that only valid data is copied and any gaps left by invalid data are removed in that process. This makes the data representation more compact and leaves space at the end for new data to be written at the next stage. In some implementations, this compacted form of data in NAND flash chip 770 b may be copied back to NAND flash chip 770 d .
  • the NAND flash chips 770 e and 770 f can serve as RO chips.
  • the controller 760 can update the NAND flash chip 770 designations, and proceed to stage 820 of the operations 800 . This can be when the buffer NAND flash chips 770 a and 770 c become full or substantially full, after a predetermined number of write operations have been performed, or following a predetermined interval of time.
  • the NAND flash storage device 750 may from time to time receive a read request.
  • the controller 760 may handle the read request in one of three ways depending on whether the requested data is on a RO chip, one of the PW or SW chips, or one of the FPW or FSW chips. If the requested data is on a RO chip, the controller 760 can simply execute the read operation on the RO chip and returns the data—there is no write operation occurring that would interfere with or delay execution of the read operation. If the requested data is on the PW or SW chip, the controller 760 can execute the read operation on whichever of the PW or SW chips is not currently executing a write operation such that read operations are not queued behind write operations.
  • the controller 760 can execute the read operation on whichever of the FPW or FSW is not currently executing a write operation such that read operations are not queued behind write operations. That is, if data is currently being copied with garbage collection from the FPW chip to the FSW chip, the controller 760 will be executing write operations on the FSW chip, and so can execute the read operation on the FPW chip. When the data is being copied back from the FSW chip to the FPW chip, the controller 760 can execute the read operation on the FSW chip.
  • the NAND flash device 750 can read and return the requested data without undue delay caused by queuing the read operation behind a write operation.
  • the NAND flash chip 770 b serves as the SW chip
  • the NAND flash chip 770 c serves as the FSW chip
  • the NAND flash chip 770 d serves as the PW chip
  • the NAND flash chip 770 e serves as the FPW chip
  • the remaining NAND flash chips serve as RO chips.
  • Incoming data is written to the NAND flash chips 770 b and 770 d , and data is copied with garbage collection from the NAND flash chip 770 e to the NAND flash chip 770 c , and subsequently copied back to the NAND flash chip 770 e .
  • the NAND flash chip 770 a is designated as an RO chip, and is available to execute read operations without interruption from write operations.
  • the buffer 880 now includes the NAND flash chips 770 b and 770 c . Because the NAND flash chip 770 c previously served as the PW chip, the data it contains will not be unique because the data had previously been written to the previous SW as well. Accordingly, the NAND flash chip 770 c can receive duplicate data from the NAND flash chip 770 e in preparation for being designated SW in the next stage of operation. During the copying operation, the NAND flash chips 770 c and 770 e can perform garbage collection operations.
  • the NAND flash chip 770 c serves as the SW chip
  • the NAND flash chip 770 d serves as the FSW chip
  • the NAND flash chip 770 e serves as the PW chip
  • the NAND flash chip 770 f serves as the FPW chip
  • the remaining NAND flash chips serve as RO flash chips.
  • Incoming data is written to the NAND flash chips 770 c and 770 e , while data is copied with garbage collection from the NAND flash chip 770 f to the NAND flash chip 770 d , and subsequently copied back to the NAND flash chip 770 f .
  • the NAND flash chip 770 b is designated as an RO chip, and is available to execute read operations without interruptions from write operations.
  • the buffer 880 now includes the NAND flash chips 770 c and 770 d . Because the NAND flash chip 770 d previously served as the PW chip, the data it contains will not be unique because the data had previously been written to the previous SW as well. Accordingly, the NAND flash chip 770 d can receive duplicate data from the NAND flash chip 770 f in preparation for being designated SW in a subsequent stage of operation.
  • Additional stages can progress in this manner indefinitely to spread wear among some or all of the available NAND flash chips 770 .
  • the NAND flash storage device 750 will write each received page of data to the NAND flash chips 770 twice, doubling the amount of wear inflicted; however, when the wear is spread evenly across many NAND flash chips 770 , this amount of additional wear may be acceptable.
  • FIG. 9 is a flowchart showing an example method 900 of using a NAND flash storage device with a rotating NAND buffer for wear leveling, according to illustrative implementations.
  • the method 900 can be performed by the NAND flash storage device 750 previously described with reference to FIG. 7 .
  • the method 900 can rotate through available NAND flash chips, designating each in turn as a buffer NAND chip, in order to distribute the wear caused by executing write operations and avoid failures potentially caused by focusing write operations on a small number of NAND flash chips.
  • the method 900 includes designating a first NAND flash chip as a secondary write (SW) chip, a second NAND flash chip as a future secondary write (FSW) chip, a third NAND flash chip as a primary write (PW) chip, and a fourth NAND flash chip as a future primary write (FPW) chip (stage 910 ).
  • the method 900 includes receiving data over an input/output (I/O) bus (stage 920 ).
  • the method 900 includes writing the received data to the PW chip and the SW chip successively such that write operations to each chip do not overlap in time (stage 930 ).
  • the method 900 includes receiving a read command over the I/O bus (stage 940 ).
  • the method 900 includes, responsive to receiving the read command, reading data from a NAND flash chip that is not currently executing a write operation such that read operations are not queued behind write operations (stage 950 ).
  • the method 900 includes copying valid data from the FPW chip to the FSW chip (stage 960 ).
  • the method 900 includes, following the writing of the received data and the copying of the valid data: designating the first NAND flash chip as read-only, the second NAND flash chip as the SW chip, the third NAND flash chip as the FSW chip, and the fourth NAND flash chip as the PW chip (stage 970 ).
  • the method 900 includes designating the first NAND flash chip as the SW chip, the second NAND flash chip as the FSW chip, the third NAND flash chip as the PW chip, and the fourth NAND flash chip as the FPW chip (stage 910 ). The designations were described previously with reference to FIG. 8 .
  • the method 900 includes receiving data over the I/O bus (stage 920 ).
  • Stage 920 is similar to stages 310 and 610 described previously.
  • the method 900 includes writing the received data to the PW chip and the SW chip successively such that write operations to each chip do not overlap in time (stage 930 ).
  • Stage 930 is similar to stages 320 and 620 described previously.
  • the method 900 may continue performing receive operations of stage 920 and the write operations of stage 930 until the PW chip and the SW chip are full or substantially full before advancing to stage 950 .
  • the operations of stages 940 and 950 described below, can be performed concurrently with the operations of stages 920 and 930 .
  • the method 900 includes receiving a read command over the I/O bus (stage 940 ).
  • Stage 940 is similar to stages 330 and 640 described previously.
  • the method 900 includes, responsive to receiving the read command, reading data from a NAND flash chip that is not currently executing a write operation such that read operations are not queued behind write operations (stage 950 ).
  • Stage 950 is similar to stages 340 and 650 previously described.
  • the requested data may reside on a RO chip, one of the PW or SW chips, or one of the FPW or FSW chips.
  • the controller may handle the read request in one of three ways depending on where the requested data resides. If the requested data is on the PW or SW chip, the controller can execute the read operation on whichever of the PW or SW chips is not currently executing a write operation such that read operations are not queued behind write operations, and the method 600 can proceed to the next stage.
  • the controller can simply execute the read operation on the RO chip and returns the data—there is no write operation occurring that would interfere with or delay execution of the read operation. If the requested data is on the FPW or FSW chip, the controller can execute the read operation on whichever of the FPW or FSW is not currently executing a write operation such that read operations are not queued behind write operations. That is, if data is currently being copied with garbage collection from the FPW chip to the FSW chip, the controller 460 will be executing write operations on the FSW chip, and so can execute the read operation on the FPW chip. When the data is being copied back from the FSW chip to the FPW chip, the controller can execute the read operation on the FSW chip.
  • stages 940 and 950 can proceed concurrently with stages 920 and 930 , and read operations can be processed promptly without being queued behind slower write operations.
  • the stages 920 through 950 can repeat as additional data and read commands are received over the I/O bus.
  • the method 900 includes copying valid data from the FPW chip to the FSW chip (stage 960 ). To prepare two new NAND flash chips to serves as buffer NAND chips in a later stage, the method 900 can copy data from one NAND flash chip to another NAND flash chip.
  • the copy source can be designated the FPW chip and the copy destination can be designated the FSW chip.
  • the copy operation can be bidirectional; that is, data can be harmonized between the FPW chip and the FSW chip such that they each contain a duplicate of the other's data.
  • the FSW chip and the FPW chip can perform garbage collection operations to consolidate data and free space of data that is invalid or no longer needed.
  • stage 960 can be performed during an overlapping time period with the buffering operations of stages 920 and 930 .
  • read commands can be received and executed by whichever of the FPW and the FSW is not currently executing write operations as part of the copy operations.
  • the copying operations of stage 960 can also be performed during an overlapping time period with the read operations of stages 940 and 950 .
  • the method 900 includes, following the writing of the received data and the copying of the valid data: designating the first NAND flash chip as a read-only chip, the second NAND flash chip as the SW chip, the third NAND flash chip as the FSW chip, and the fourth NAND flash chip as the PW chip (stage 970 ). Once the FPW chip and the FSW chip contain duplicate data, they will be ready to take over as the PW chip and the SW chip, respectively. As long as the copying operations can be performed while the buffer NAND flash chips are still handling write operations, the NAND flash storage device 770 can switch designations without delay and continue executing write operations without a significant loss in write bandwidth.
  • the rotation can continue.
  • the controller 760 can designate a fifth NAND flash chip 770 e as the FPW chip.
  • the controller 760 can write second data received over the I/O bus 730 to the PW chip and the SW chip such that write operations to each chip do not overlap in time.
  • the controller 760 can copy second valid date from the FPW chip to the FSW chip.
  • the controller 760 can designate the second NAND flash chip as read-only, the third NAND flash chip as the SW chip, the fourth NAND flash chip as the FSW chip, the fifth NAND flash chip as the PW chip, and a sixth NAND flash chip as the FPW chip.
  • references to “or” may be construed as inclusive so that any terms described using “or” may indicate any of a single, more than one, and all of the described terms.
  • the labels “first,” “second,” “third,” and so forth are not necessarily meant to indicate an ordering and are generally used merely to distinguish between like or similar items or elements.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Memory System (AREA)
  • Read Only Memory (AREA)
  • Memory System Of A Hierarchy Structure (AREA)

Abstract

At least one aspect is directed to a NAND flash storage device including a plurality of NAND flash chips and a controller. The controller is configured to receive data over an input/output (I/O) bus and write the received data to a first NAND flash chip of the plurality of NAND flash chips and a second NAND flash chip of the plurality of NAND flash chips. The write operations to each NAND flash chip do not overlap in time. The controller is configured to read data from whichever of the first NAND flash chip or the second NAND flash chip is not currently executing a write operation such that read operations are not queued behind write operations.

Description

RELATED APPLICATIONS
The present application claims the benefit of and priority to U.S. Provisional Patent Application No. 62/524,016, titled “NAND FLASH STORAGE DEVICE WITH NAND BUFFER,” and filed on Jun. 23, 2017, the entire contents of which are hereby incorporated by reference.
BACKGROUND
Storage devices based on NAND flash have good average latency; however, the worst-case latency, referred to as “tail latency” can be much longer. Providing more predictable latency in a NAND flash storage device may be beneficial in some applications of NAND flash technology; for example, in NAND flash-based storage devices and swap devices. A swap device can serve as extended memory space, sometimes referred to as swap space, in a computing system. The swap device can represent a tradeoff between the faster speed of RAM and the lower cost of alternative memory/storage mediums such as a hard disk drive (HDD) or a solid state drive (SSD). An SSD based on NAND flash, while faster than an HDD, may suffer from unpredictable tail latency due to the way NAND flash chips operate.
SUMMARY
At least one aspect is directed to a NAND flash storage device. The NAND flash storage device includes a plurality of NAND flash chips and a controller. The controller is configured to receive data over an input/output (I/O) bus. The controller is configured to write the received data to a first NAND flash chip of the plurality of NAND flash chips and a second NAND flash chip of the plurality of NAND flash chips successively such that write operations to each NAND flash chip do not overlap in time. The controller is configured to receive a read command over the I/O bus. The controller is configured to, responsive to receiving the read command, read data from whichever of the first NAND flash chip or the second NAND flash chip is not currently executing a write operation such that read operations are not queued behind write operations.
In some implementations, the controller can be further configured to designate the first NAND flash chip as a secondary write chip, the second NAND flash chip as a future secondary write chip, a third NAND flash chip as a primary write chip, a fourth NAND flash chip as a future primary write chip, and any remaining NAND flash chips as read-only. The controller can be configured to write the received data to the primary write chip and the secondary write chip successively such that write operations to each chip do not overlap in time. The controller can be configured to copy valid data from the future primary write chip to the future secondary write chip. The controller can be configured to, following the writing of the received data and the copying of the valid data, designate the first NAND flash chip as read-only, the second NAND flash chip as the secondary write chip, the third NAND flash chip as the future secondary write chip, and the fourth NAND flash chip as the primary write chip.
In some implementations, the controller can be further configured to, following the writing of the received data and the copying of the valid data, designate a fifth NAND flash chip as the future primary write chip. The controller can be configured to write second data received over the I/O bus to the primary write chip and the secondary write chip successively such that write operations to each chip do not overlap in time. The controller can be configured to copy second valid data from the future primary write chip to the future secondary write chip. The controller can be configured to, following the writing of the second received data and the copying of the second valid data, designate the second NAND flash chip as read-only, the third NAND flash chip as the secondary write chip, the fourth NAND flash chip as the future secondary write chip, and the fifth NAND flash chip as the primary write chip.
In some implementations, the copy operation can include copying the valid data from the future primary write chip to the future secondary write chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the future secondary write chip. The copying operation can include copying the valid data from the future secondary write chip back to the future primary write chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the future primary write chip. The copying operation can include erasing the memory spaces of the future primary write chip and the future secondary write chip occupied by the free space.
In some implementations, the write operation and the copy operation are performed during overlapping time periods
At least one aspect is directed to a NAND flash storage device. The NAND flash storage device includes a plurality of buffer NAND flash chips, a plurality of storage NAND flash chips, and a controller. The controller is configured to receive incoming data over an input/output (I/O) bus. The controller is configured to write the received incoming data to a first buffer NAND flash chip of the plurality of storage NAND flash chips and a first storage NAND flash chip of the plurality of buffer NAND flash chips successively such that write operations to each NAND flash chip do not overlap in time. The controller is configured to receive a read command over the I/O bus. The controller is configured to, responsive to receiving the read command, read data from whichever of the first buffer NAND flash chip or the first storage NAND flash chip is not currently executing a write operation such that read operations are not queued behind write operations.
In some implementations, the controller can be configured to designate the first storage NAND flash chip as a primary write chip, a second storage NAND flash chip as a future primary write chip, the first buffer NAND flash chip as a secondary write chip, a second buffer NAND flash chip as a future secondary write chip, and any remaining NAND flash chips as read-only. The controller can be configured to write the received data to the primary write chip and the secondary write chip successively such that write operations to each chip do not overlap in time. The controller can be configured to copy valid data from the future primary write chip to the future secondary write chip. The controller can be configured to following the writing of the received data and the copying of the valid data, designate the first storage NAND flash chip as read-only, the second storage NAND flash chip as the primary write chip, the first buffer NAND flash chip as the future secondary write chip, and the second buffer NAND flash chip as the secondary write chip.
In some implementations, the controller can be configured to, following the writing of the received data and the copying of the valid data, designate a third storage NAND flash chip as the future primary write chip. The controller can be configured to write second data received over the I/O bus to the primary write chip and the secondary write chip successively such that write operations to each chip do not overlap in time. The controller can be configured to copy second valid data from the future primary write chip to the future secondary write chip. The controller can be configured to, following the writing of the second received data and the copying of the second valid data, designate the second storage NAND flash chip as read-only, the third storage NAND flash chip as the primary write chip, the first buffer NAND flash chip as the secondary write chip, and the second buffer NAND flash chip as the future secondary write chip.
In some implementations, the copy operation can include copying the valid data from the future primary write chip to the future secondary write chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the future secondary write chip. The copy operation can include copying the valid data from the future secondary write chip back to the future primary write chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the future primary write chip; and erasing the memory spaces of the future primary write chip and the future secondary write chip occupied by the free space.
In some implementations, the buffer NAND flash chips can be single-level cell (SLC) NAND flash chips, and the storage NAND flash chips can be triple-level cell (TLC) NAND flash chips.
At least one aspect is directed to method of buffering a NAND flash storage device. The method includes receiving data over an input/output (I/O) bus. The method includes writing the received data to a first NAND flash chip of the plurality of NAND flash chips and a second NAND flash chip of the plurality of NAND flash chips successively such that write operations to each NAND flash chip do not overlap in time. The method includes receiving a read command over the I/O bus. The method includes, responsive to receiving the read command, reading data from whichever of the first NAND flash chip or the second NAND flash chip is not currently executing a write operation such that read operations are not queued behind write operations
In some implementations, the method can include designating the first NAND flash chip as a secondary write chip, the second NAND flash chip as a future secondary write chip, a third NAND flash chip as a primary write chip, a fourth NAND flash chip as a future primary write chip, and any remaining NAND flash chips as read-only. The method can include writing the received data to the primary write chip and the secondary write chip successively such that write operations to each chip do not overlap in time. The method can include copying valid data from the future primary write chip to the future secondary write chip. The method can include, following the writing of the received data and the copying of the valid data, designating the first NAND flash chip as read-only, the second NAND flash chip as the secondary write chip, the third NAND flash chip as the future secondary write chip, and the fourth NAND flash chip as the primary write chip.
In some implementations, the method can include, following the writing of the received data and the copying of the valid data, designating a fifth NAND flash chip as the future primary write chip. The method can include writing second data received over the I/O bus to the primary write chip and the secondary write chip successively such that write operations to each chip do not overlap in time. The method can include copying second valid data from the future primary write chip to the future secondary write chip. The method can include, following the writing of the second received data and the copying of the second valid data, designating the second NAND flash chip as read-only, the third NAND flash chip as the secondary write chip, the fourth NAND flash chip as the future secondary write chip, and the fifth NAND flash chip as the primary write chip.
In some implementations, the copy operation can include copying the valid data from the future primary write chip to the future secondary write chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the future secondary write chip. The copy operation can include copying the valid data from the future secondary write chip back to the future primary write chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the future primary write chip. The copy operation can include erasing the memory spaces of the future primary write chip and the future secondary write chip occupied by the free space.
In some implementations, the writing operation and the copying operation are performed during overlapping time periods.
At least one aspect is directed to method of buffering a NAND flash storage device. The method includes receiving data over an input/output (I/O) bus. The method includes writing the received incoming data to a first buffer NAND flash chip of the plurality of storage NAND flash chips and a first storage NAND flash chip of the plurality of buffer NAND flash chips successively such that write operations to each NAND flash chip do not overlap in time. The method includes receiving a read command over the I/O bus. The method includes, responsive to receiving the read command, reading data from whichever of the first buffer NAND flash chip or the first storage NAND flash chip is not currently executing a write operation such that read operations are not queued behind write operations.
In some implementations, the method includes designating the first storage NAND flash chip as a primary write chip, a second storage NAND flash chip as a future primary write chip, the first buffer NAND flash chip as a secondary write chip, a second buffer NAND flash chip as a future secondary write chip, and any remaining NAND flash chips as read-only. The method includes writing the received data to the primary write chip and the secondary write chip successively such that write operations to each chip do not overlap in time. The method includes copying valid data from the future primary write chip to the future secondary write chip. The method includes following the writing of the received data and the copying of the valid data, designating the first storage NAND flash chip as read-only, the second storage NAND flash chip as the primary write chip, the first buffer NAND flash chip as the future secondary write chip, and the second buffer NAND flash chip as the secondary write chip.
In some implementations, the method includes, following the writing of the received data and the copying of the valid data, designating a third storage NAND flash chip as the future primary write chip. The method includes writing second data received over the I/O bus to the primary write chip and the secondary write chip successively such that write operations to each chip do not overlap in time. The method includes copying second valid data from the future primary write chip to the future secondary write chip. The method includes, following the writing of the second received data and the copying of the second valid data, designating the second storage NAND flash chip as read-only, the third storage NAND flash chip as the primary write chip, the first buffer NAND flash chip as the secondary write chip, and the second buffer NAND flash chip as the future secondary write chip.
In some implementations, the method includes copying the valid data from the future primary write chip to the future secondary write chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the future secondary write chip. The method includes copying the valid data from the future secondary write chip back to the future primary write chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the future primary write chip. The method includes erasing the memory spaces of the future primary write chip and the future secondary write chip occupied by the free space.
In some implementations, the buffer NAND flash chips can be single-level cell (SLC) NAND flash chips, and the storage NAND flash chips can be triple-level cell (TLC) NAND flash chips.
These and other aspects and implementations are discussed in detail below. The foregoing information and the following detailed description include illustrative examples of various aspects and implementations, and provide an overview or framework for understanding the nature and character of the claimed aspects and implementations. The drawings provide illustration and a further understanding of the various aspects and implementations, and are incorporated in and constitute a part of this specification.
BRIEF DESCRIPTION OF THE DRAWINGS
The accompanying drawings are not intended to be drawn to scale. Like reference numbers and designations in the various drawings indicate like elements. For purposes of clarity, not every component may be labeled in every drawing. In the drawings:
FIG. 1 is block diagram of an example computer system including a NAND flash storage device with a NAND buffer, according to an illustrative implementation;
FIG. 2 is a block diagram of an example NAND flash storage device with a NAND buffer, according to an illustrative implementation;
FIG. 3 is a flowchart showing an example method of using a NAND flash storage device with a NAND buffer, according to an illustrative implementation;
FIG. 4 is a block diagram of an example NAND flash storage device with a NAND buffer and additional NAND storage, according to an illustrative implementation;
FIG. 5 shows example operations of a NAND flash storage device with a NAND buffer and additional NAND storage, according to an illustrative implementation;
FIG. 6 is a flowchart showing an example method of using a NAND flash storage device with a NAND buffer and additional NAND storage, according to an illustrative implementation;
FIG. 7 is block diagram of an example NAND flash storage device with a rotating NAND buffer for wear leveling, according to an illustrative implementation;
FIG. 8 shows example operations of a NAND flash storage device with a rotating NAND buffer for wear leveling, according to an illustrative implementation; and
FIG. 9 is a flowchart showing an example method of using a NAND flash storage device with a rotating NAND buffer for wear leveling, according to an illustrative implementation.
DETAILED DESCRIPTION
Following below are descriptions of various concepts related to, and implementations of, a predictable-latency NAND flash storage device with NAND buffer and methods of using the same. The various concepts introduced above and discussed in greater detail below may be implemented in any of numerous ways, as the described concepts are not limited to any particular manner of implementation. Examples of specific implementations and applications are provided primarily for illustrative purposes.
This disclosure generally relates to a predictable-latency NAND flash storage device with a NAND buffer. The storage device can be a solid-state drive (SSD) or a swap device. A swap device is a secondary storage device that can function as an extension of Random Access Memory (RAM). When the amount of data to be retained in memory exceeds the capacity of RAM, some of the data can be moved to the swap device in an operation sometimes referred to as “paging.” Traditionally, the secondary device could be a hard disk drive (HDD). Reading and writing data from an HDD is very slow relative to RAM, however, making an HDD impractical as a swap device in settings where retrieval speed is important. More recent SSDs, particularly NAND flash-based SSDs, although still slower than RAM, exhibit faster retrieval times relative to HDDs. NAND SSDs also tend to be less expensive than RAM, thus reducing RAM capacity in favor of SSD capacity can lead to cost savings. More recently developed low-latency NAND (LLNAND), such as Samsung's Z-NAND, are designed with special circuitry that improves its typical read latency. LLNAND-based SSDs are even faster, and thus practical for paging and other temporary data storage in some server applications.
NAND flash SSDs have other drawbacks, however. While NAND read time is very fast, on the order of 10s of microseconds, erase and write operations are much slower, on the order of milliseconds each. And, because a NAND flash chip cannot execute a read operation and write operation at the same time, a read operation queued behind an erase/write operation can exhibit a latency increase of two or three orders of magnitude from normal. This is called “tail latency.” While it is possible to buffer data in RAM for writing to a NAND flash chip, the overhead can be cost prohibitive. For example, a device having ten NAND flash chips will require RAM equivalent to one tenth of the total NAND capacity as a buffer. Because DRAM can be four times as expensive as NAND storage, the cost overhead may be 40%.
This disclosure therefore proposes employing a NAND buffer to improve tail latency at a lower cost overhead than that of a RAM buffer. The system works by writing each page of data into two chips in sequence: first into a first chip, and then into a second chip. The second chip may be considered a NAND buffer. The write operations do not overlap in time. Read operations can be executed by the chip that is presently not executing a write operation. Thus, the read operations need not be queued behind write operations, allowing the NAND flash storage device to have more predictable latency.
In some implementations, the first chip (and, by extension, the second chip) are filled, and two new chips are chosen as write buffers. The first chip can be switched to read only and, because the first chip can perform read operations without interruptions from write operations, the copy of the data stored on the second chip is no longer necessary. The second chip is now free for future primary write operations. Two new chips, a third chip and a fourth chip, can be selected for writing. In this case, the fourth chip may be considered a NAND buffer.
For such a system to work, it is helpful for the two new chips selected for writing to contain duplicate data. Accordingly, prior to receiving new data, existing data on the third chip can first be copied to the fourth chip. The copying operation involves “garbage collection,” an operation memory devices perform to free up space occupied by data that has been invalidated (is no longer in use). Garbage collection can be time consuming and, like erase/write operations, can prevent read operations from executing until it is complete. Therefore, the garbage collection and copying operations are executed such that initially, only the fourth chip is being written and third chip is available for reading. Later, the garbage collected version is copied back from the fourth chip to the third chip such that the fourth chip is available for reading. Further, the garbage collection and copy operations between third and fourth chip are overlapped in time with writing into the first and second chip. As long as the garbage collection and copy operation takes less time than the writing operations, the third and fourth chips will be ready to take over execution of write operations when the first and second chips are filled. A new pair of chips will then be selected for garbage collection and copy operation.
In such a system, at any given time, two buffer chips contain a duplicate of the data in two other chips. Accordingly, a device having ten NAND flash chips for storage can have two additional NAND flash buffer chips. The cost overhead will thus be 20%, which can be half the cost overhead of using a DRAM buffer.
Additional buffer NAND flash chips can be employed to improve the write bandwidth of the NAND flash storage device. The write bandwidth of a NAND flash storage device refers to the amount of data the NAND flash storage device can receive and store in a given amount of time. In some cases, the write bandwidth of a NAND flash storage device can be limited by the time it takes to write data to a particular NAND flash chip. Therefore, write bandwidth can be increased by adding more buffer NAND flash chips, and distributing the write operations among them. Accordingly, in some implementations, the NAND flash storage device can include additional buffer chips (i.e., 4, 6, 8, or more) to improve the write bandwidth.
In some implementation, the NAND flash storage device can employ wear leveling. A NAND flash chip can only perform a finite number of write operations before exhibiting excessive bit errors. Eventually, after performing many write operations, the NAND flash chip will wear out and begin to fail. To prolong the life of the NAND flash storage device, the NAND flash storage device can rotate which NAND flash chips are designated as buffer chips for write operations. The designation can be rotated to equalize or substantially equalize wear among some or all of the NAND flash chips in the device, as opposed to concentrating buffering write operations on a small number of chips. In this manner, the wear can be spread, or “leveled,” across many NAND flash chips of the device.
FIG. 1 is block diagram of an example computer system 100 including a NAND flash storage device with a NAND buffer, according to an illustrative implementation. The system 100 can be a computing device; for example and without limitation, a personal computing device such as a laptop or desktop computer, a shared computing device such as a server or mainframe, or component of the aforementioned or other computing device. The system 100 includes a processor such as a central processing unit (CPU) 110, RAM 120 such as DRAM, an input/output (I/O) bus 130 such as a Peripheral Component Interconnect Express (PCIe), a data storage device 140 such as a hard disk drive (HDD) or a solid-state drive (SSD) for long-term data storage, and a NAND flash storage device 150. The RAM 120 may be a main memory and can be directly accessible to the CPU 110 over, for example, an address bus that carries a memory address of the desired location, and a data bus used to read or write data from/to the specified memory address.
In some implementations, the NAND flash storage device 150 can be a swap device providing a swap space to the system 100. In some computing environments, the kernel or operating system can employ a memory/storage abstraction such as virtual memory, which provides applications executing on the CPU 110 with transparent and consistent access to one or more memory devices that can vary in type and location. For example, virtual memory can allow applications executing on the CPU 110 to access an extended memory space. The extended memory space can exceed the capacity of the RAM 120. When the data sought to be stored in memory exceed the capacity of the RAM 120, the virtual memory function can perform paging, by which the system 100 can store data in a secondary memory device such as an HDD or a SSD. This extended memory space can be referred to as swap space, and the storage device providing the swap space can be referred to as a swap device.
Different types of memory and storage devices can vary greatly in access speed, with RAM generally being much faster than HDDs, and NAND flash falling in the middle. The slow access times of HDDs make them impractical for applications requiring fast access to data. NAND flash storage devices are much faster than HDDs, and thus become useful for a wider range of applications. NAND flash storage devices have the additional advantage of being less expensive than some common types of RAM such as DRAM. Newer types of NAND such as low-latency NAND (LLNAND) can be even faster, exhibiting latency for read operations of approximately 10 μs versus approximately 70 μs regular NAND. LLNANDs also exhibit less latency for program (write) operations; i.e., 100-200 μs versus a few milliseconds for regular NAND. A write operation, however, generally involves both a program operation and an erase operation. LLNANDs may still suffer from slow erase times, which can take on the order of 5 ms. As an additional challenge, a NAND flash chip or die can generally only perform one operation at a time. For example, if a NAND flash chip is executing an erase/program sequence of a write operation, any incoming read commands will be queued until completion. Thus, some read operations may require up to several milliseconds to complete despite a nominal latency of tens of microseconds. These outlier latencies are referred to as “tail latency.” Both the duration and the unpredictability of this tail latency can present difficulties when using NAND flash. Some systems can use a RAM to cache the contents of a NAND flash chip being written to such that the RAM can perform the read operation while the NAND flash chip executes write operations—see, for example, U.S. patent application Ser. No. 14/936,486. Because RAM can be more expensive that NAND flash, however, the RAM adds a cost overhead that can be unacceptable for some applications. Therefore, this disclosure proposes providing one or more additional NAND flash chips to provide a cache for data residing in NAND flash chips executing write operations. A NAND flash-based buffer is described in more detail below with regard to FIG. 2.
FIG. 2 is a block diagram of an example NAND flash storage device 250 with a NAND buffer, according to an illustrative implementation. The NAND flash storage device 250 can serve as the NAND flash storage device 150. To avoid tail latency associated with read operations being queued behind write operations, the NAND flash storage device 250 can employ double buffering using two NAND flash chips 270 a and 270 b (collectively, “NAND flash chips 270”). A controller 260 can handle storage access commands received over the I/O bus 230. The controller 260 can include programmable logic such as an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA), a microcontroller, or a microprocessor. The controller 260 can be integral with, or external to the NAND flash storage device 250. The controller 260 can orchestrate read and write operations of the NAND flash storage device 250 such that the CPU 110 need not be aware of, or itself execute, the buffering operations of the NAND flash storage device 250. Alternatively, it is possible to design a simpler controller that relies on the CPU 110 to orchestrate the operations, while the controller performs the lower level work of completing each operation. Accordingly, the controller 260 is configured to receive data and/or commands over the I/O bus 230, and write the received data first to one of the NAND flash chips 270, and then to a second of the NAND flash chips 270. The controller 260 executes the write operations successively—either autonomously or under the direction of CPU 110—such that they do not overlap in time; thus, at any given time, the NAND flash chip 270 that is not presently performing a write operation can promptly perform read operations. Once the controller 260 executes each write operation, two of the NAND flash chips 270 can contain the same data. The controller 260 can execute read commands by reading from whichever of the first NAND flash chip or the second NAND flash chip is not currently executing a write operation such that read operations are not queued behind write operations. In this manner, tail latency can be improved.
In some implementations, the NAND flash storage device 250 can include additional NAND flash chips 270 c and 270 d to further improve write bandwidth. For example, a NAND flash storage device 250 having four NAND flash chips 270 a through 270 d can split write operations between two NAND flash chips 270; e.g., NAND flash chips 270 a and 270 b. After the NAND flash chips 270 a and 270 b complete their respective write operations, those write operations can be repeated on the NAND flash chips 270 c and 270 d. In this manner, the write bandwidth of the NAND flash storage device 250 can be approximately doubled. The NAND flash storage device 250 can employ additional NAND flash chips 270 for further increases in write bandwidth.
In some implementations, the NAND device 250 can include some RAM 265 to serve as a cache or buffer to store data while it is written to each NAND flash chip 270. This RAM 265 need not have the capacity to buffer the entire contents of a NAND flash chip 270, but rather can be only large enough to buffer a single block or page of data. Accordingly, it can be much smaller than a dedicated RAM buffer, and thus not contribute appreciably to the cost overhead of the NAND device 250.
FIG. 3 is a flowchart showing an example method 300 of using a NAND flash storage device with a NAND buffer, according to illustrative implementations. The NAND flash storage device can include any of the NAND flash storage devices 150, 250, 450, or 750 described herein. The method 300 includes receiving data over an input/output (I/O) bus (stage 310). The method 300 includes writing the received data to a first NAND flash chip and a second NAND flash chip successively such that the write operations to each NAND flash chip do not overlap in time (stage 320). The method includes receiving a read command over the I/O bus (stage 330). The method 300 includes reading data from whichever of the first NAND flash chip or the second NAND flash chip is not currently executing a write operation such that read operations are not queued behind write operations (stage 340).
The method 300 includes receiving data over an I/O bus (stage 310). The NAND flash storage device can receive a write command over the I/O bus, such as one of the I/O busses 130 or 230 previously described with respect to the NAND flash storage devices 150 or 250, respectively. The write command can include data to be stored by the NAND flash storage device.
The method 300 includes writing the received data to a first NAND flash chip and a second NAND flash chip successively such that the write operations to each NAND flash chip do not overlap in time (stage 320). The write operations can be executed by a controller of the NAND flash storage device, such as the CPU 110 or the controller 260 previously described. The controller can execute the write operation on one of two NAND flash chips, such as the NAND flash chip 270 a or 270 b. While the controller executes a write operation on one of the NAND flash chips 270, that particular NAND flash chip will not be able to process other operations. Accordingly, the controller will execute the write operations on the two NAND flash chips successively such that they do not overlap in time. In some implementations, the receiving stage 310 and the writing stage 320 can be repeated until the first and second NAND flash chips are full or substantially full. In some implementations, the NAND flash chips are low-latency NAND (LLNAND) chips.
The method includes receiving a read command over the I/O bus (stage 330). The method 300 includes reading data from whichever of the first NAND flash chip or the second NAND flash chip is not currently executing a write operation such that read operations are not queued behind write operations (stage 340). During execution of the write operations, the NAND flash storage device may receive a read command over the I/O bus. The controller can direct the read command to whichever of the NAND flash chips is not currently executing a write operation. In this manner, stages 330 and 340 can proceed concurrently with stages 310 and 320, and read operations can be processed promptly without being queued behind slower write operations. The stages of the method 300 can repeat as additional data and read commands are received over the I/O bus. In some implementations, the role played by each particular NAND flash chip can cycle, with different NAND flash chips performing different buffer and storage functions at different times. Examples of different manners of NAND flash chip buffering and storage are described below with reference to FIGS. 4 through 9.
FIG. 4 is a block diagram of an example NAND flash storage device 450 with a NAND buffer and additional NAND storage, according to illustrative implementations. The NAND flash storage device 450 includes a controller 460 for executing read and write commands received over the I/O bus 430. The controller 460 can include RAM 465, or be in communication with the RAM 465 via a memory bus. The controller 460 can include programmable logic such as an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA), a microcontroller, or a microprocessor. The controller 460 can be integral with, or external to the NAND flash storage device 450. The NAND flash storage device 450 can include two buffer NAND flash chips 470 a and 470 b (collectively, “buffer NAND flash chips 470”). In some implementations, the NAND flash storage device 450 can include additional buffer NAND flash chips 470, as described further below. In some implementations, the NAND flash storage device 450 can use the RAM 465 for buffering data received over the I/O bus 430 while it is being written to one or more of the buffer NAND flash chips 470. The NAND flash storage device 450 also includes storage NAND flash chips 475 a through 475 e (collectively, “storage NAND flash chips 475”) that provide additional memory capacity over that of the buffer NAND flash chips 470. Data received over the I/O bus 430 can be buffered in the buffer NAND flash chips 470 before being copied to one of the storage NAND flash chips 475.
In some implementations, the buffer NAND flash chips 470 can be single level cell (SLC) NAND flash chips or low-latency NAND (LLNAND) flash chips while the storage NAND flash chips 475 may be multi-level cell (MLC) or triple-level cell (TLC) NAND flash chips. In some implementations, the buffer NAND flash chips 470 can be of smaller capacity than the storage NAND flash chips 475. In some implementations, a group of buffer NAND flash chips 470 can have a capacity equal to the capacity of a single storage NAND flash chip 475. The group of buffer NAND flash chips 470 can be used collectively as a single buffer chip.
The write bandwidth of NAND flash storage devices can be limited due to the relatively slow pace of performing write operations. However, in some implementations, write bandwidth can be improved by designating more buffer NAND flash chips 470 as write buffers. In some implementations, and similar to the NAND flash storage device 250, the NAND flash storage device 450 can include additional buffer NAND flash chips 470; for example, buffer NAND flash chips 470 c and 470 d. In some implementations, the NAND flash storage device 450 can include even more buffer NAND flash chips 470. The controller 460 can thus spread write operations among the buffer NAND flash chips 470 to multiply the write bandwidth.
The NAND flash storage device 450 can potentially face two drawbacks. First, the buffer NAND flash chips 470 will eventually reach capacity. Once all data in a buffer NAND flash chip 470 has been copied to one of the storage NAND flash chips 475, all data in the buffer NAND flash chip 470 can be deleted. While the buffer NAND flash chip 470 performs erase operations, it cannot perform a write operation. Therefore, the write bandwidth will be limited by the unavailability of that buffer NAND flash chip 470.
The second potential drawback relates to the finite number of write operations that a NAND flash chip can handle before exhibiting excessive bit errors. NAND flash chips experience wear from processing write operations. Eventually, after performing many write operations, the NAND flash chip will wear out and begin to exhibit failures.
In some implementations, the challenge of limited write bandwidth can be addressed by increasing the number of buffer NAND flash chips 470 as described above, and the NAND wear problem can be addressed by using higher endurance chips as buffer NAND flash chips 470. In some implementations, the NAND wear problem can be addressed by rotating the particular NAND flash chips designated as buffers for incoming data. Such a wear-leveling scheme is described further below with reference to FIGS. 7 through 9. FIG. 5 below illustrates example operations of the NAND flash storage device 450.
FIG. 5 shows example operations 500 of a NAND flash storage device with a NAND buffer and additional NAND storage, according to an illustrative implementation. These operations can be performed by a NAND flash storage device such as the NAND flash storage device 450 previously described. The operations 500 include stage 510, in which incoming data is written to the buffer NAND flash chip 470 a and the storage NAND flash chip 475 a, and data is copied with garbage collection from the storage NAND flash chip 475 b to the buffer NAND flash chip 470 b and subsequently copied back to the storage NAND flash chip 475 b. The operations 500 include stage 520, in which incoming data is written to the buffer NAND flash chip 470 b and the storage NAND flash chip 475 b, and data is copied with garbage collection from the storage NAND flash chip 475 c to the buffer NAND flash chip 470 a and subsequently copied back to the storage NAND flash chip 475 c. The operations 500 include stage 530, in which incoming data is written to the buffer NAND flash chip 470 a and the storage NAND flash chip 475 c, and the buffer NAND flash chip 470 b receives data copied from the storage NAND flash chip 475 d. The buffer NAND flash chips 470 are indicated by a buffer 580 box. The operation each NAND flash chip 470 or 475 is currently performing is indicated by an acronym. “PW” stands for “primary write” and indicates that the storage NAND flash chip 475 is currently designated for receiving data and performing write operations. “SW” stands for “secondary write” and indicates that the buffer NAND flash chip 470 is also currently designated for receiving data and performing write operations. “FPW” stands for “future primary write” and indicates that the NAND flash chip 475 is designated for being prepared for future service receiving data and performing write operations. “FSW” stands for “future secondary write” and indicates that the NAND flash chip 470 is designated for being prepared for future service receiving data and performing write operations. “RO” stands for “read-only” and indicates that the controller 460 will only call upon the RO storage NAND flash chips 475 to execute read operations. These labels are merely arbitrary, and are assigned only for the purpose of describing the operation of the various chips at particular points in time. Between each state of the operations 500, the role of each NAND flash chip 470 or 475 changes as different NAND flash chips 470 or 475 receive data and perform write operations.
During stage 510, the buffer NAND flash chip 470 a and the storage NAND flash chip 475 a receive and write incoming data, as indicated by the arrows. The buffer NAND flash chip 470 a and the storage NAND flash chip 475 a receive and write the same data, as the SW and the PW chips, respectively; however, they can perform the write operations during non-overlapping time periods. In this manner, both PW and the SW chips contain substantially the same data, and a NAND flash chip that is not performing a write operation can respond to a read command and perform the read operation promptly without having to wait for completion of the relatively long write operation. In some implementations, the PW and SW chips can continue receiving data and performing write operations until they are full or substantially full. Meanwhile, the FPW storage NAND flash chip 475 b copies its data to the FSW buffer NAND flash chip 470 b in preparation for receiving external data in the next stage 520. During the copy operation, the NAND flash storage device 450 can perform garbage collection on the FPW and the FSW chips. When fragmented data is copied from the FPW chip to the FSW chip, it is consolidated so that it is written to the FSW chip in a continuous or substantially continuous memory space, while free space is similarly consolidated in a second continuous or substantially continuous memory space. This makes the data representation more compact and leaves space at the end for new data to be written at the next stage 520. After all valid data is copied from the FPW chip to the FSW chip, the free space on the FPW is erased, and the valid data is copied back from the FSW chip to the FPW chip. When the PW and SW chips have completed their write operations and the FPW and the FSW chips have completed their copying and garbage collection, the operations 500 can continue with the next stage 520.
During stage 510 (and stages 520 and 530 as well) the NAND flash storage device 450 may from time to time receive a read request. The controller 460 may handle the read request in one of three ways depending on whether the requested data is on a RO chip, one of the PW or SW chips, or one of the FPW or FSW chips. If the requested data is on a RO chip, the controller 460 can simply execute the read operation on the RO chip and returns the data—there is no write operation occurring that would interfere with or delay execution of the read operation. If the requested data is on the PW or SW chip, the controller 460 can execute the read operation on whichever of the PW or SW chips is not currently executing a write operation such that read operations are not queued behind write operations. If the requested data is on the FPW or FSW chip, the controller 460 can execute the read operation on whichever of the FPW or FSW is not currently executing a write operation such that read operations are not queued behind write operations. That is, if data is currently being copied with garbage collection from the FPW chip to the FSW chip, the controller 460 will be executing write operations on the FSW chip, and so can execute the read operation on the FPW chip. When the data is being copied back from the FSW chip to the FPW chip, the controller 460 can execute the read operation on the FSW chip. In this manner, regardless of which NAND flash chip contains the requested data and which NAND flash chip[s] is/are executing write operations, the NAND flash device 450 can read and return the requested data without undue delay caused by queuing the read operation behind a write operation.
During stage 520, the storage NAND flash chip 475 b is designated as the PW chip, the buffer NAND flash chip 470 b is designated as the SW chip, the storage NAND flash chip 475 c is designated as the FPW chip, the buffer NAND flash chip 470 a is designated as the FSW chip, and the storage NAND flash chip 475 a is designated RO. The functions of the PW, SW, FPW, and FSW chips are similar to functions performed by the chips with the same designations in the previous stage 510, but with different NAND flash chips performing each designated role. The SW and the PW chips receive and write the same data as indicated by the arrows; however, they can perform the write operations during non-overlapping time periods so that they can respond to a read command and perform the read operation promptly without having to wait for completion of the relatively long write operation. Meanwhile, the FPW chip copies its data to the FSW chip in preparation for receiving external data in the next stage 530. During the copy operation, the FPW and the FSW chips can perform garbage collection operations. When the PW and SW chips have completed their write operations and the FPW and the FSW chips have completed their copying and garbage collection, the operations 500 can continue with the next stage 530.
During stage 530, the storage NAND flash chip 475 c is designated as the PW chip, the buffer NAND flash chip 470 a is designated as the SW chip, the storage NAND flash chip 475 d is designated as the FPW chip, the buffer NAND flash chip 470 b is designated as the FSW chip, and the storage NAND flash chip 475 b is designated RO. The functions of the PW, SW, FPW, and FSW chips are similar to functions performed by the chips with the same designations in the previous stages 510 and 520, but with different NAND flash chips performing each designated role. The SW and the PW chips receive and write the same data as indicated by the arrows; however, they can perform the write operations during non-overlapping time periods so that they can respond to a read command and perform the read operation promptly without having to wait for completion of the relatively long write operation. Meanwhile, the FPW chip copies its data to the FSW chip in preparation for receiving external data in the next stage. During the copy operation, the FPW and the FSW chips can perform garbage collection operations. When the PW and SW chips have completed their write operations and the FPW and the FSW chips have completed their copying and garbage collection, the operations 500 can continue with the next stage. The operations 500 can continue in this manner until all of the storage NAND flash chips 475 have been written to, at which time the operations 500 can repeat starting at the stage 510.
FIG. 6 is a flowchart showing an example method 600 of using a NAND flash storage device with a NAND buffer and additional NAND storage, according to illustrative implementations. The method 600 can provide for NAND flash storage space beyond the storage space provided by the buffer NAND chips. For example, the method 600 can be performed by the NAND flash storage device 450 previously described with reference to FIG. 4. The NAND flash storage device 450 can include buffer NAND flash chips 470 and storage NAND flash chips 475. The method 600 includes designating a first storage NAND flash chip as a primary write (PW) chip, a second storage NAND flash chip as a future primary write (FPW) chip, a first buffer NAND flash chip as a secondary write (SW) chip, a second buffer NAND flash chip as a future secondary write (FSW) chip, and any remaining NAND flash chips as read-only (RO). The method 600 includes receiving data over an input/output (I/O) bus (stage 620). The method 600 includes writing the received data to the PW chip and the SW chip successively such that the write operations to each NAND flash chip do not overlap in time (stage 630). The method 600 includes receiving a read command over the I/O bus (stage 640). The method 600 includes, responsive to receiving the read command, reading data from a NAND flash chip that is not currently executing a write operation such that read operations are not queued behind write operations (stage 650). The method 600 includes copying valid data from the FPW chip to the FSW chip (stage 660). The method 600 includes, following the writing of the received data and the copying of the valid data, designating the first storage NAND flash chip as RO, the second storage NAND flash chip as the PW chip, the first buffer NAND flash chip as the FSW chip, and the second buffer NAND flash chip as the SW chip (stage 670).
The method 600 includes designating a first storage NAND flash chip as a primary write chip, a second storage NAND flash chip as a future primary write chip, a first buffer NAND flash chip as a secondary write chip, a second buffer NAND flash chip as a future secondary write chip, and any remaining NAND flash chips as read-only (RO).
The method 600 includes receiving data over an I/O bus (stage 620). Stage 620 can be similar to stage 310 described previously.
The method 600 includes writing the received data to the PW chip and the SW chip successively such that the write operations to each NAND flash chip do not overlap in time (stage 630). Stage 630 can be similar to stage 320 described previously. In some implementations, the receiving stage 620 and the writing stage 630 can be repeated until the PW chip and the SW chip are full or substantially full.
The method 600 includes receiving a read command over the I/O bus (stage 640). Stage 630 can be similar to stage 430 described previously.
The method 600 includes, responsive to receiving the read command, reading data from a NAND flash chip that is not currently executing a write operation such that read operations are not queued behind write operations (stage 650). Stage 650 can be similar to stage 340 described previously. In practice, the requested data may reside on a RO chip, one of the PW or SW chips, or one of the FPW or FSW chips. The controller may handle the read request in one of three ways depending on where the requested data resides. If the requested data is on the PW or SW chip, the controller can execute the read operation on whichever of the PW or SW chips is not currently executing a write operation such that read operations are not queued behind write operations, and the method 600 can proceed to the next stage. If the requested data is on a RO chip, the controller can simply execute the read operation on the RO chip and returns the data—there is no write operation occurring that would interfere with or delay execution of the read operation. If the requested data is on the FPW or FSW chip, the controller can execute the read operation on whichever of the FPW or FSW is not currently executing a write operation such that read operations are not queued behind write operations. That is, if data is currently being copied with garbage collection from the FPW chip to the FSW chip, the controller 460 will be executing write operations on the FSW chip, and so can execute the read operation on the FPW chip. When the data is being copied back from the FSW chip to the FPW chip, the controller can execute the read operation on the FSW chip. In this manner, regardless of which NAND flash chip contains the requested data and which NAND flash chip[s] is/are executing write operations, the NAND flash device can read and return the requested data without undue delay caused by queuing the read operation behind a write operation. In this manner, stages 640 and 650 can proceed concurrently with stages 620 and 630, and read operations can be processed promptly without being queued behind slower write operations. The stages 620 through 650 can repeat as additional data and read commands are received over the I/O bus.
The method 600 includes copying valid data from the FPW chip to the FSW chip (stage 660). In some implementations, during the copy operation, the FPW and FSW chips can perform garbage collection to consolidate the valid data and the free space, respectively. For example, the NAND flash storage device can first copy the valid data from the FPW chip to the FSW chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the FSW chip. The NAND flash storage device can copy the valid data from the FSW chip back to the FPW chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the FPW chip. The NAND flash storage device can erase the memory spaces of the FPW and FSW chips occupied by the free space. The copying operations of stage 660 can be performed during an overlapping time period with the buffering operations of stages 620 and 630. In addition, read commands can be received and executed by whichever of the FPW and the FSW is not currently executing write operations as part of the copy operations. In this manner, the copying operations of stage 660 can also be performed during an overlapping time period with the read operations of stages 640 and 650.
The method 600 includes, following the writing of the received data and the copying of the valid data, designating the first storage NAND flash chip as RO, the second storage NAND flash chip as the PW chip, the first buffer NAND flash chip as the FSW chip, and the second buffer NAND flash chip as the SW chip (stage 670). The stages of the method 600 can repeat as additional data and read commands are received over the I/O bus. In some implementations, the role played by each particular NAND flash chip can cycle, with different NAND flash chips performing different buffer and storage functions at different times.
FIG. 7 is block diagram of an example NAND flash storage device 750 with a rotating NAND buffer for wear leveling, according to illustrative implementations. The NAND flash storage device 750 includes a controller 760 for processing read and write commands received over the I/O bus 730. The controller 760 can include RAM 765, or be in communication with the RAM 765 via a memory bus. The controller 760 can include programmable logic such as an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA), a microcontroller, or a microprocessor. The controller 760 can be integral with, or external to the NAND flash storage device 750. The NAND flash storage device 750 includes NAND flash chips 770 a to 770 f (collectively, “NAND flash chips 770”). In some implementations, the NAND flash storage device 750 can include more or fewer NAND flash chips 770. In some implementations, the NAND flash storage device 750 can use the RAM 765 for buffering data received over the I/O bus 730 while it is being written to one or more of the NAND flash chips 770. The NAND flash storage device 750 can employ wear leveling by occasionally changing the NAND flash chips 770 used for buffering. For example, during a first stage of operation, NAND flash chips 770 a and 770 b can serve as the NAND buffers, with the controller 760 writing data to each one in turn. Example operations of the NAND flash storage device 750 are described in more detail below with reference to FIG. 8.
FIG. 8 shows example operations 800 of a NAND flash storage device with a rotating NAND buffer employing wear leveling, according to illustrative implementations. These operations can be performed by a NAND flash storage device such as the NAND flash storage device 750 previously described. The operations 800 include stage 810, in which incoming data is written to the NAND flash chips 770 a and 770 c, while data is copied with garbage collection from the NAND flash chip 770 d to the NAND flash chip 770 b and subsequently copied back to the NAND flash chip 770 d. The operations 800 include stage 820, in which incoming data is written to the NAND flash chips 770 b and 770 d, while data is copied with garbage collection from the NAND flash chip 770 e to the NAND flash chip 770 c and subsequently copied back to the NAND flash chip 770 e. The operations 800 include stage 830, in which incoming data is written to the NAND flash chips 770 c and 770 e, while data is copied with garbage collection from the NAND flash chip 770 f to the NAND flash chip 770 d and subsequently copied back to the NAND flash chip 770 f. During each operation 800, the secondary write and the future secondary write NAND flash chips 770 are considered the buffer NAND flash chips, and are indicated by a buffer 880 box. The operation each NAND flash chip 770 is currently performing is indicated by an acronym. “PW” stands for “primary write” and indicates that the NAND flash chip 770 is currently receiving data and performing write operations. “SW” stands for “secondary write” and indicates that the NAND flash chip 770 is currently receiving data and performing write operations as one of the buffer NAND flash chips. “FPW” stands for “future primary write” and indicates that the NAND flash chip 770 is being prepared for future service as the primary write chip by copying its data to the future secondary write chip. “FSW” stands for “future secondary write” and indicates that the NAND flash chip 770 is receiving data from the FPW chip in preparation for future service as a SW buffer NAND flash chip. “RO” stands for “read-only” and indicates that the controller 760 will only call upon the RO NAND flash chip 770 to execute read operations. These labels are merely arbitrary, and are assigned only for the purpose of describing the function of the various chips during a given stage of operation 800. Between each stage of the operations 800, the role of each NAND flash chip 770 can change, as indicated by the labels below each chip and the buffer 880 box rotating through the available NAND flash chips 770.
During stage 810, the NAND flash chip 770 a serves as the SW chip, the NAND flash chip 770 b serves as the FSW chip, the NAND flash chip 770 c serves as the PW chip, the NAND flash chip 770 d serves as the FPW chip, and the remaining NAND flash chips serve as RO chips. Incoming data is written to the NAND flash chips 770 a and 770 c—the SW and PW chips, respectively. Meanwhile, data is copied with garbage collection from the NAND flash chip 770 d to the NAND flash chip 770 b—the FPW and FSW chips, respectively—and subsequently copied back to the NAND flash chip 770 b. During stage 810, the buffer 880 includes the NAND flash chips 770 a and 770 b. The NAND flash chip 770 b receives a copy of the data from the NAND flash chip 770 d, so it may be at least partially unavailable for executing read operations, and may be affirmatively prevented by the controller 760 from executing read or write operations not associated with the copy operation. Garbage collection may be performed during the copy operation. This means that only valid data is copied and any gaps left by invalid data are removed in that process. This makes the data representation more compact and leaves space at the end for new data to be written at the next stage. In some implementations, this compacted form of data in NAND flash chip 770 b may be copied back to NAND flash chip 770 d. The NAND flash chips 770 e and 770 f, as well as any other NAND flash chips 770 not pictured, can serve as RO chips. At an appropriate time, the controller 760 can update the NAND flash chip 770 designations, and proceed to stage 820 of the operations 800. This can be when the buffer NAND flash chips 770 a and 770 c become full or substantially full, after a predetermined number of write operations have been performed, or following a predetermined interval of time.
During stage 810 (and stages 820 and 830 as well) the NAND flash storage device 750 may from time to time receive a read request. The controller 760 may handle the read request in one of three ways depending on whether the requested data is on a RO chip, one of the PW or SW chips, or one of the FPW or FSW chips. If the requested data is on a RO chip, the controller 760 can simply execute the read operation on the RO chip and returns the data—there is no write operation occurring that would interfere with or delay execution of the read operation. If the requested data is on the PW or SW chip, the controller 760 can execute the read operation on whichever of the PW or SW chips is not currently executing a write operation such that read operations are not queued behind write operations. If the requested data is on the FPW or FSW chip, the controller 760 can execute the read operation on whichever of the FPW or FSW is not currently executing a write operation such that read operations are not queued behind write operations. That is, if data is currently being copied with garbage collection from the FPW chip to the FSW chip, the controller 760 will be executing write operations on the FSW chip, and so can execute the read operation on the FPW chip. When the data is being copied back from the FSW chip to the FPW chip, the controller 760 can execute the read operation on the FSW chip. In this manner, regardless of which NAND flash chip contains the requested data and which NAND flash chip[s] is/are executing write operations, the NAND flash device 750 can read and return the requested data without undue delay caused by queuing the read operation behind a write operation.
During stage 820, the NAND flash chip 770 b serves as the SW chip, the NAND flash chip 770 c serves as the FSW chip, the NAND flash chip 770 d serves as the PW chip, the NAND flash chip 770 e serves as the FPW chip, and the remaining NAND flash chips serve as RO chips. Incoming data is written to the NAND flash chips 770 b and 770 d, and data is copied with garbage collection from the NAND flash chip 770 e to the NAND flash chip 770 c, and subsequently copied back to the NAND flash chip 770 e. The NAND flash chip 770 a is designated as an RO chip, and is available to execute read operations without interruption from write operations. The buffer 880 now includes the NAND flash chips 770 b and 770 c. Because the NAND flash chip 770 c previously served as the PW chip, the data it contains will not be unique because the data had previously been written to the previous SW as well. Accordingly, the NAND flash chip 770 c can receive duplicate data from the NAND flash chip 770 e in preparation for being designated SW in the next stage of operation. During the copying operation, the NAND flash chips 770 c and 770 e can perform garbage collection operations.
During stage 830, the NAND flash chip 770 c serves as the SW chip, the NAND flash chip 770 d serves as the FSW chip, the NAND flash chip 770 e serves as the PW chip, the NAND flash chip 770 f serves as the FPW chip, and the remaining NAND flash chips serve as RO flash chips. Incoming data is written to the NAND flash chips 770 c and 770 e, while data is copied with garbage collection from the NAND flash chip 770 f to the NAND flash chip 770 d, and subsequently copied back to the NAND flash chip 770 f. The NAND flash chip 770 b is designated as an RO chip, and is available to execute read operations without interruptions from write operations. The buffer 880 now includes the NAND flash chips 770 c and 770 d. Because the NAND flash chip 770 d previously served as the PW chip, the data it contains will not be unique because the data had previously been written to the previous SW as well. Accordingly, the NAND flash chip 770 d can receive duplicate data from the NAND flash chip 770 f in preparation for being designated SW in a subsequent stage of operation.
Additional stages can progress in this manner indefinitely to spread wear among some or all of the available NAND flash chips 770. In performing these operations, the NAND flash storage device 750 will write each received page of data to the NAND flash chips 770 twice, doubling the amount of wear inflicted; however, when the wear is spread evenly across many NAND flash chips 770, this amount of additional wear may be acceptable.
FIG. 9 is a flowchart showing an example method 900 of using a NAND flash storage device with a rotating NAND buffer for wear leveling, according to illustrative implementations. For example, the method 900 can be performed by the NAND flash storage device 750 previously described with reference to FIG. 7. The method 900 can rotate through available NAND flash chips, designating each in turn as a buffer NAND chip, in order to distribute the wear caused by executing write operations and avoid failures potentially caused by focusing write operations on a small number of NAND flash chips. The method 900 includes designating a first NAND flash chip as a secondary write (SW) chip, a second NAND flash chip as a future secondary write (FSW) chip, a third NAND flash chip as a primary write (PW) chip, and a fourth NAND flash chip as a future primary write (FPW) chip (stage 910). The method 900 includes receiving data over an input/output (I/O) bus (stage 920). The method 900 includes writing the received data to the PW chip and the SW chip successively such that write operations to each chip do not overlap in time (stage 930). The method 900 includes receiving a read command over the I/O bus (stage 940). The method 900 includes, responsive to receiving the read command, reading data from a NAND flash chip that is not currently executing a write operation such that read operations are not queued behind write operations (stage 950). The method 900 includes copying valid data from the FPW chip to the FSW chip (stage 960). The method 900 includes, following the writing of the received data and the copying of the valid data: designating the first NAND flash chip as read-only, the second NAND flash chip as the SW chip, the third NAND flash chip as the FSW chip, and the fourth NAND flash chip as the PW chip (stage 970).
The method 900 includes designating the first NAND flash chip as the SW chip, the second NAND flash chip as the FSW chip, the third NAND flash chip as the PW chip, and the fourth NAND flash chip as the FPW chip (stage 910). The designations were described previously with reference to FIG. 8.
The method 900 includes receiving data over the I/O bus (stage 920). Stage 920 is similar to stages 310 and 610 described previously.
The method 900 includes writing the received data to the PW chip and the SW chip successively such that write operations to each chip do not overlap in time (stage 930). Stage 930 is similar to stages 320 and 620 described previously. In some implementations, the method 900 may continue performing receive operations of stage 920 and the write operations of stage 930 until the PW chip and the SW chip are full or substantially full before advancing to stage 950. The operations of stages 940 and 950, described below, can be performed concurrently with the operations of stages 920 and 930.
The method 900 includes receiving a read command over the I/O bus (stage 940). Stage 940 is similar to stages 330 and 640 described previously.
The method 900 includes, responsive to receiving the read command, reading data from a NAND flash chip that is not currently executing a write operation such that read operations are not queued behind write operations (stage 950). Stage 950 is similar to stages 340 and 650 previously described. In practice, the requested data may reside on a RO chip, one of the PW or SW chips, or one of the FPW or FSW chips. The controller may handle the read request in one of three ways depending on where the requested data resides. If the requested data is on the PW or SW chip, the controller can execute the read operation on whichever of the PW or SW chips is not currently executing a write operation such that read operations are not queued behind write operations, and the method 600 can proceed to the next stage. If the requested data is on a RO chip, the controller can simply execute the read operation on the RO chip and returns the data—there is no write operation occurring that would interfere with or delay execution of the read operation. If the requested data is on the FPW or FSW chip, the controller can execute the read operation on whichever of the FPW or FSW is not currently executing a write operation such that read operations are not queued behind write operations. That is, if data is currently being copied with garbage collection from the FPW chip to the FSW chip, the controller 460 will be executing write operations on the FSW chip, and so can execute the read operation on the FPW chip. When the data is being copied back from the FSW chip to the FPW chip, the controller can execute the read operation on the FSW chip. In this manner, regardless of which NAND flash chip contains the requested data and which NAND flash chip[s] is/are executing write operations, the NAND flash device can read and return the requested data without undue delay caused by queuing the read operation behind a write operation. In this manner, stages 940 and 950 can proceed concurrently with stages 920 and 930, and read operations can be processed promptly without being queued behind slower write operations. The stages 920 through 950 can repeat as additional data and read commands are received over the I/O bus.
The method 900 includes copying valid data from the FPW chip to the FSW chip (stage 960). To prepare two new NAND flash chips to serves as buffer NAND chips in a later stage, the method 900 can copy data from one NAND flash chip to another NAND flash chip. The copy source can be designated the FPW chip and the copy destination can be designated the FSW chip. In some implementations, the copy operation can be bidirectional; that is, data can be harmonized between the FPW chip and the FSW chip such that they each contain a duplicate of the other's data. During the copying operation, the FSW chip and the FPW chip can perform garbage collection operations to consolidate data and free space of data that is invalid or no longer needed. The copying operations of stage 960 can be performed during an overlapping time period with the buffering operations of stages 920 and 930. In addition, read commands can be received and executed by whichever of the FPW and the FSW is not currently executing write operations as part of the copy operations. In this manner, the copying operations of stage 960 can also be performed during an overlapping time period with the read operations of stages 940 and 950.
The method 900 includes, following the writing of the received data and the copying of the valid data: designating the first NAND flash chip as a read-only chip, the second NAND flash chip as the SW chip, the third NAND flash chip as the FSW chip, and the fourth NAND flash chip as the PW chip (stage 970). Once the FPW chip and the FSW chip contain duplicate data, they will be ready to take over as the PW chip and the SW chip, respectively. As long as the copying operations can be performed while the buffer NAND flash chips are still handling write operations, the NAND flash storage device 770 can switch designations without delay and continue executing write operations without a significant loss in write bandwidth.
In some implementations, the rotation can continue. For example, the controller 760 can designate a fifth NAND flash chip 770 e as the FPW chip. The controller 760 can write second data received over the I/O bus 730 to the PW chip and the SW chip such that write operations to each chip do not overlap in time. The controller 760 can copy second valid date from the FPW chip to the FSW chip. And, following the writing of the second received data and the copying of the second valid data, the controller 760 can designate the second NAND flash chip as read-only, the third NAND flash chip as the SW chip, the fourth NAND flash chip as the FSW chip, the fifth NAND flash chip as the PW chip, and a sixth NAND flash chip as the FPW chip.
While this specification contains many specific implementation details, these should not be construed as limitations on the scope of any inventions or of what may be claimed, but rather as descriptions of features specific to particular implementations of particular inventions. Certain features that are described in this specification in the context of separate implementations can also be implemented in combination in a single implementation. Conversely, various features that are described in the context of a single implementation can also be implemented in multiple implementations separately or in any suitable sub-combination. Moreover, although features may be described above as acting in certain combinations and even initially claimed as such, one or more features from a claimed combination can in some cases be excised from the combination, and the claimed combination may be directed to a sub-combination or variation of a sub-combination.
References to “or” may be construed as inclusive so that any terms described using “or” may indicate any of a single, more than one, and all of the described terms. The labels “first,” “second,” “third,” and so forth are not necessarily meant to indicate an ordering and are generally used merely to distinguish between like or similar items or elements.
Various modifications to the implementations described in this disclosure may be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other implementations without departing from the spirit or scope of this disclosure. Thus, the claims are not intended to be limited to the implementations shown herein, but are to be accorded the widest scope consistent with this disclosure, the principles and the novel features disclosed herein.

Claims (16)

What is claimed is:
1. A NAND flash storage device comprising:
a plurality of NAND flash chips, including a first NAND flash chip, a second NAND flash chip, a third NAND flash chip, and a fourth NAND flash chip; and
a controller configured to:
designate the first NAND flash chip as a secondary write chip, the second NAND flash chip as a future secondary write chip, the third NAND flash chip as a primary write chip, the fourth NAND flash chip as a future primary write chip, and any remaining NAND flash chips as read-only NAND flash chips;
receive data over an input/output (I/O) bus;
write the received data to the primary write chip and the secondary write chip successively such that write operations to each chip do not overlap in time;
receive a read command over the I/O bus;
responsive to receiving the read command, read data from whichever of the first NAND flash chip or the second NAND flash chip is not currently executing a write operation such that read operations are not queued behind write operations;
copy valid data from the future primary write chip to the future secondary write chip; and
following the writing of the received data and the copying of the valid data, designate the first NAND flash chip as read-only, the second NAND flash chip as the secondary write chip, the third NAND flash chip as the future secondary write chip, and the fourth NAND flash chip as the primary write chip.
2. The NAND flash storage device of claim 1, wherein the controller is configured to:
following the writing of the received data and the copying of the valid data:
designate a fifth NAND flash chip as the future primary write chip;
write second data received over the I/O bus to the primary write chip and the secondary write chip successively such that write operations to each chip do not overlap in time;
copy second valid data from the future primary write chip to the future secondary write chip; and
following the writing of the received second data and the copying of the second valid data:
designate the second NAND flash chip as read-only, the third NAND flash chip as the secondary write chip, the fourth NAND flash chip as the future secondary write chip, and the fifth NAND flash chip as the primary write chip.
3. The NAND flash storage device of claim 1, wherein the copy operation includes:
copying the valid data from the future primary write chip to the future secondary write chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the future secondary write chip;
copying the valid data from the future secondary write chip back to the future primary write chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the future primary write chip; and
erasing the memory spaces of the future primary write chip and the future secondary write chip occupied by the free space.
4. The NAND flash storage device of claim 1, wherein:
the write operation and the copy operation are performed during overlapping time periods.
5. A NAND flash storage device comprising:
a plurality of buffer NAND flash chips, including a first buffer NAND flash chip and a second buffer NAND flash chip;
a plurality of storage NAND flash chips, including a first storage NAND flash chip and a second storage NAND flash chip; and
a controller configured to:
designate the first storage NAND flash chip as a primary write chip, the second storage NAND flash chip as a future primary write chip, the first buffer NAND flash chip as a secondary write chip, the second buffer NAND flash chip as a future secondary write chip, and any remaining NAND flash chips as read-only;
receive incoming data over an input/output (I/O) bus;
write the received data to the primary write chip and the secondary write chip successively such that write operations to each chip do not overlap in time;
receive a read command over the I/O bus;
responsive to receiving the read command, read data from whichever of the first buffer NAND flash chip or the first storage NAND flash chip is not currently executing a write operation such that read operations are not queued behind write operations;
copy valid data from the future primary write chip to the future secondary write chip; and
following the writing of the received data and the copying of the valid data, designate the first storage NAND flash chip as read-only, the second storage NAND flash chip as the primary write chip, the first buffer NAND flash chip as the future secondary write chip, and the second buffer NAND flash chip as the secondary write chip.
6. The NAND flash storage device of claim 5, wherein the controller is configured to:
following the writing of the received data and the copying of the valid data:
designate a third storage NAND flash chip as the future primary write chip;
write second data received over the I/O bus to the primary write chip and the secondary write chip successively such that write operations to each chip do not overlap in time;
copy second valid data from the future primary write chip to the future secondary write chip; and
following the writing of the received second data and the copying of the second valid data, designate the second storage NAND flash chip as read-only, the third storage NAND flash chip as the primary write chip, the first buffer NAND flash chip as the secondary write chip, and the second buffer NAND flash chip as the future secondary write chip.
7. The NAND flash storage device of claim 5, wherein the copy operation includes:
copying the valid data from the future primary write chip to the future secondary write chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the future secondary write chip;
copying the valid data from the future secondary write chip back to the future primary write chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the future primary write chip; and
erasing the memory spaces of the future primary write chip and the future secondary write chip occupied by the free space.
8. The NAND flash storage device of claim 5, wherein:
the plurality of buffer NAND flash chips are single-level cell (SLC) NAND flash chips; and
the plurality of storage NAND flash chips are triple-level cell (TLC) NAND flash chips.
9. A method of buffering a NAND flash storage device comprising:
designating a first NAND flash chip as a secondary write chip, a second NAND flash chip as a future secondary write chip, a third NAND flash chip as a primary write chip, a fourth NAND flash chip as a future primary write chip, and any remaining NAND flash chips as read-only;
receiving data over an input/output (I/O) bus;
writing the received data to the primary write chip and the secondary write chip successively such that write operations to each chip do not overlap in time;
receiving a read command over the I/O bus;
responsive to receiving the read command, reading data from whichever of the first NAND flash chip or the second NAND flash chip is not currently executing a write operation such that read operations are not queued behind write operations;
copying valid data from the future primary write chip to the future secondary write chip; and
following the writing of the received data and the copying of the valid data: designating the first NAND flash chip as read-only, the second NAND flash chip as the secondary write chip, the third NAND flash chip as the future secondary write chip, and the fourth NAND flash chip as the primary write chip.
10. The method of claim 9, comprising:
following the writing of the received data and the copying of the valid data:
designating a fifth NAND flash chip as the future primary write chip;
writing second data received over the I/O bus to the primary write chip and the secondary write chip successively such that write operations to each chip do not overlap in time;
copying second valid data from the future primary write chip to the future secondary write chip; and
following the writing of the received second data and the copying of the second valid data, designating the second NAND flash chip as read-only, the third NAND flash chip as the secondary write chip, the third NAND flash chip as the further secondary write chip, the fourth NAND flash chip as the primary write chip, and the fifth NAND flash chip as the future primary write chip.
11. The method of claim 9, wherein the copy operation includes:
copying the valid data from the future primary write chip to the future secondary write chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the future secondary write chip;
copying the valid data from the future secondary write chip back to the future primary write chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the future primary write chip; and
erasing the memory spaces of the future primary write chip and the future secondary write chip occupied by the free space.
12. The method of claim 9, wherein:
the writing operation and the copying operation are performed during overlapping time periods.
13. A method of buffering a NAND flash storage device comprising:
designating a first storage NAND flash chip as a primary write chip, a second storage NAND flash chip as a future primary write chip, a first buffer NAND flash chip as a secondary write chip, a second buffer NAND flash chip as a future secondary write chip, and any remaining NAND flash chips as read-only;
receiving data over an input/output (I/O) bus;
writing the received data to the primary write chip and the secondary write chip successively such that write operations to each chip do not overlap in time;
receiving a read command over the I/O bus;
responsive to receiving the read command, reading data from whichever of the first buffer NAND flash chip or the first storage NAND flash chip is not currently executing a write operation such that read operations are not queued behind write operations;
copying valid data from the future primary write chip to the future secondary write chip; and
following the writing of the received data and the copying of the valid data: designating the first storage NAND flash chip as read-only, the second storage NAND flash chip as the primary write chip, the first buffer NAND flash chip as the future secondary write chip, and the second buffer NAND flash chip as the secondary write chip.
14. The method of claim 13, comprising:
following the writing of the received data and the copying of the valid data:
designating a third storage NAND flash chip as the future primary write chip;
writing second data received over the I/O bus to the primary write chip and the secondary write chip successively such that write operations to each chip do not overlap in time;
copying second valid data from the future primary write chip to the future secondary write chip; and
following the writing of the received second received data and the copying of the second valid data, designating the second storage NAND flash chip as read-only, the third storage NAND flash chip as the primary write chip, the first buffer NAND flash chip as the secondary write chip, and the second buffer NAND flash chip as the future secondary write chip.
15. The method claim 14, wherein the copy operation includes:
copying the valid data from the future primary write chip to the future secondary write chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the future secondary write chip;
copying the valid data from the future secondary write chip back to the future primary write chip such that the valid data and the free space, respectively, occupy substantially contiguous respective memory spaces of the future primary write chip; and
erasing the memory spaces of the future primary write chip and the future secondary write chip occupied by the free space.
16. The method of claim 13, wherein:
the plurality of buffer NAND flash chips are single-level cell (SLC) NAND flash chips; and
the plurality of storage NAND flash chips are triple-level cell (TLC) NAND flash chips.
US15/816,447 2017-06-23 2017-11-17 NAND flash storage device with NAND buffer Active US10606484B2 (en)

Priority Applications (10)

Application Number Priority Date Filing Date Title
US15/816,447 US10606484B2 (en) 2017-06-23 2017-11-17 NAND flash storage device with NAND buffer
PCT/US2018/022683 WO2018236440A1 (en) 2017-06-23 2018-03-15 Nand flash storage device with nand buffer
KR1020197035081A KR102276350B1 (en) 2017-06-23 2018-03-15 NAND flash storage device with NAND buffer
JP2019566261A JP7234144B2 (en) 2017-06-23 2018-03-15 NAND flash storage device with NAND buffer
GB1804693.8A GB2563713B (en) 2017-06-23 2018-03-23 NAND flash storage device with NAND buffer
TW107111866A TWI727160B (en) 2017-06-23 2018-04-03 Nand flash storage device with nand buffer
DE102018109929.0A DE102018109929A1 (en) 2017-06-23 2018-04-25 NAND flash memory device with NAND buffer
EP18169162.7A EP3418897B1 (en) 2017-06-23 2018-04-25 Nand flash storage device with nand buffer
DE202018102304.7U DE202018102304U1 (en) 2017-06-23 2018-04-25 NAND flash memory device with NAND buffer
CN201810391996.1A CN109117085B (en) 2017-06-23 2018-04-27 NAND flash memory device with NAND buffer

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201762524016P 2017-06-23 2017-06-23
US15/816,447 US10606484B2 (en) 2017-06-23 2017-11-17 NAND flash storage device with NAND buffer

Publications (2)

Publication Number Publication Date
US20180373440A1 US20180373440A1 (en) 2018-12-27
US10606484B2 true US10606484B2 (en) 2020-03-31

Family

ID=61899381

Family Applications (1)

Application Number Title Priority Date Filing Date
US15/816,447 Active US10606484B2 (en) 2017-06-23 2017-11-17 NAND flash storage device with NAND buffer

Country Status (9)

Country Link
US (1) US10606484B2 (en)
EP (1) EP3418897B1 (en)
JP (1) JP7234144B2 (en)
KR (1) KR102276350B1 (en)
CN (1) CN109117085B (en)
DE (2) DE102018109929A1 (en)
GB (1) GB2563713B (en)
TW (1) TWI727160B (en)
WO (1) WO2018236440A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10481834B2 (en) * 2018-01-24 2019-11-19 Samsung Electronics Co., Ltd. Erasure code data protection across multiple NVME over fabrics storage devices
US11048430B2 (en) 2019-04-12 2021-06-29 Netapp, Inc. Object store mirroring where during resync of two storage bucket, objects are transmitted to each of the two storage bucket
KR20200142219A (en) 2019-06-12 2020-12-22 삼성전자주식회사 Electronic device and method of utilizing storage space thereof
JP2021043908A (en) * 2019-09-13 2021-03-18 キオクシア株式会社 Memory system and control method
US11269779B2 (en) 2020-05-27 2022-03-08 Microsoft Technology Licensing, Llc Memory system with a predictable read latency from media with a long write latency

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5930167A (en) 1997-07-30 1999-07-27 Sandisk Corporation Multi-state non-volatile flash memory capable of being its own two state write cache
WO2010071655A1 (en) 2008-12-19 2010-06-24 Hewlett-Packard Development Company, L.P. Redundant data storage for uniform read latency
US20110145475A1 (en) 2009-12-15 2011-06-16 International Business Machines Corporation Reducing access contention in flash-based memory systems
US8301832B1 (en) 2012-03-23 2012-10-30 DSSD, Inc. Storage system with guaranteed read latency
WO2013048451A1 (en) 2011-09-30 2013-04-04 Intel Corporation Presentation of direct accessed storage under a logical drive model
US20140304452A1 (en) 2013-04-03 2014-10-09 Violin Memory Inc. Method for increasing storage media performance
US20150149692A1 (en) * 2013-11-22 2015-05-28 Apple Inc. Efficient reuse of segments in nonoverwrite storage systems
US20150324125A1 (en) * 2014-05-07 2015-11-12 Seagate Technology Llc Storage compute device with tiered memory processing
US20160179422A1 (en) 2014-12-19 2016-06-23 Samsung Electronics Co., Ltd. Method of performing garbage collection and raid storage system adopting the same
US20170068456A1 (en) * 2015-09-04 2017-03-09 Kabushiki Kaisha Toshiba Control unit and control method for controlling semiconductor storage devices
US20170131908A1 (en) 2015-11-09 2017-05-11 Google Inc. Memory Devices and Methods
US20180260136A1 (en) * 2017-03-07 2018-09-13 Samsung Electronics Co., Ltd. SSD with heterogeneous NVM types

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8380909B2 (en) * 2009-04-08 2013-02-19 Google Inc. Multiple command queues having separate interrupts
CN106021147B (en) * 2011-09-30 2020-04-28 英特尔公司 Storage device exhibiting direct access under logical drive model
JP2013109404A (en) * 2011-11-17 2013-06-06 Toshiba Corp Information processing device
KR102039537B1 (en) * 2013-03-15 2019-11-01 삼성전자주식회사 Nonvolatile storage device and os image program method thereof

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5930167A (en) 1997-07-30 1999-07-27 Sandisk Corporation Multi-state non-volatile flash memory capable of being its own two state write cache
WO2010071655A1 (en) 2008-12-19 2010-06-24 Hewlett-Packard Development Company, L.P. Redundant data storage for uniform read latency
US20110258362A1 (en) * 2008-12-19 2011-10-20 Mclaren Moray Redundant data storage for uniform read latency
US20110145475A1 (en) 2009-12-15 2011-06-16 International Business Machines Corporation Reducing access contention in flash-based memory systems
WO2013048451A1 (en) 2011-09-30 2013-04-04 Intel Corporation Presentation of direct accessed storage under a logical drive model
US8301832B1 (en) 2012-03-23 2012-10-30 DSSD, Inc. Storage system with guaranteed read latency
US20140304452A1 (en) 2013-04-03 2014-10-09 Violin Memory Inc. Method for increasing storage media performance
US20150149692A1 (en) * 2013-11-22 2015-05-28 Apple Inc. Efficient reuse of segments in nonoverwrite storage systems
US20150324125A1 (en) * 2014-05-07 2015-11-12 Seagate Technology Llc Storage compute device with tiered memory processing
US20160179422A1 (en) 2014-12-19 2016-06-23 Samsung Electronics Co., Ltd. Method of performing garbage collection and raid storage system adopting the same
US20170068456A1 (en) * 2015-09-04 2017-03-09 Kabushiki Kaisha Toshiba Control unit and control method for controlling semiconductor storage devices
US20170131908A1 (en) 2015-11-09 2017-05-11 Google Inc. Memory Devices and Methods
US20180260136A1 (en) * 2017-03-07 2018-09-13 Samsung Electronics Co., Ltd. SSD with heterogeneous NVM types

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
Combined Search and Examination Report dated Oct. 8, 2018 in United Kingdom Patent Application No. 1804693.8 (6 pages).
Extended European Search Report dated Jun. 22, 2018 in European Patent Application No. 18169162.7.
International Search Report and Written Opinion dated Jun. 22, 2018in International (PCT) Application No. PCT/US2018/022683.
Kgil, Taeho, et al. "Improving NAND Flash Based Disk Caches" IEEE 35th International Symposium on Computer Architecture, ISCA '08, pp. 327-338, Jun. 21, 2008.
Kim, Jonghwa, et al. "Deduplication in SSDs: Model and Quantitative Analysis" IEEE 28th Symposium on Mass Storage Systems and Technologies (MSST), pp. 1-12, Apr. 16, 2012).
Office Action dated Mar. 29, 2019 in Taiwanese Patent Application No. 107111866, and English translation thereof.

Also Published As

Publication number Publication date
WO2018236440A1 (en) 2018-12-27
JP7234144B2 (en) 2023-03-07
GB2563713B (en) 2020-01-15
JP2020524839A (en) 2020-08-20
CN109117085B (en) 2021-08-20
EP3418897B1 (en) 2022-09-14
CN109117085A (en) 2019-01-01
GB2563713A (en) 2018-12-26
EP3418897A1 (en) 2018-12-26
US20180373440A1 (en) 2018-12-27
DE202018102304U1 (en) 2018-10-30
KR102276350B1 (en) 2021-07-12
TW201905921A (en) 2019-02-01
DE102018109929A1 (en) 2018-12-27
GB201804693D0 (en) 2018-05-09
KR20200003055A (en) 2020-01-08
TWI727160B (en) 2021-05-11

Similar Documents

Publication Publication Date Title
US10606484B2 (en) NAND flash storage device with NAND buffer
US20240103769A1 (en) Data Writing Method and Storage Device
US11416161B2 (en) Zone formation for zoned namespaces
CN107092561B (en) Multilevel memory with direct access
WO2016107442A1 (en) Method for writing data to solid state drive and solid state drive
US20200356484A1 (en) Timed Data Transfer between a Host System and a Memory Sub-System
TW201324147A (en) Statistical wear leveling for non-volatile system memory
WO2020263322A1 (en) Hold of write commands in zoned namespaces
US10754785B2 (en) Checkpointing for DRAM-less SSD
US9268681B2 (en) Heterogeneous data paths for systems having tiered memories
US20210294739A1 (en) Controller and memory system
US11334272B2 (en) Memory system and operating method thereof
US11573891B2 (en) Memory controller for scheduling commands based on response for receiving write command, storage device including the memory controller, and operating method of the memory controller and the storage device
US11507289B2 (en) Storage device, controller and method for operating storage device
JP2024513234A (en) Method and apparatus for mitigating NAND die collisions in a solid state drive
US11481136B1 (en) Very low sized zone support for storage devices
CN111722792A (en) Memory system
CN108877862B (en) Data organization of page stripes and method and device for writing data into page stripes
US11281405B2 (en) Controlled die asymmetry during MLC operations for optimal system pipeline
EP3772682A1 (en) Method and apparatus to improve write bandwidth of a block-based multi-level cell non-volatile memory
US20150254011A1 (en) Memory system, memory controller and control method of non-volatile memory
JP2012164114A (en) Non-volatile semiconductor storage device and data processing system
US11797183B1 (en) Host assisted application grouping for efficient utilization of device resources

Legal Events

Date Code Title Description
AS Assignment

Owner name: GOOGLE LLC, CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:SHAH, MONISH;REEL/FRAME:044163/0040

Effective date: 20171116

FEPP Fee payment procedure

Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: FINAL REJECTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE AFTER FINAL ACTION FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS

STPP Information on status: patent application and granting procedure in general

Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT RECEIVED

STPP Information on status: patent application and granting procedure in general

Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED

STCF Information on status: patent grant

Free format text: PATENTED CASE

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment: 4