
JFFS3 design issues
Artem B. Bityutskiy
[email protected]
Version 0.32 (draft)
November 27, 2005
Abstract
JFFS2, the Journalling Flash File System version 2, is widely used in the embedded
systems world. It was designed for relatively small flash chips and has serious problems
when it is used on large flash devices. Unfortunately, these scalability problems are deep
inside the design of the file system, and cannot be solved without full redesign.
This document describes JFFS3, a new flash file system which is designed to be
scalable.
Contents
1 JFFS2 overview 1
2 JFFS3 Requirements 2
3 Introduction to JFFS3 3
3.1 Indexing problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
3.2 Wandering trees . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
3.3 B-trees . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
3.4 Indexing in JFFS3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
3.5 Indexing example . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
3.6 The Journal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
3.7 Garbage collection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
3.8 The superblock . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
4 The tree 13
4.1 Objects . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
4.2 Keys . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
4.2.1 Trivial key scheme . . . . . . . . . . . . . . . . . . . . . . . . . . 15
4.2.2 Keys comparison . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
4.2.3 Key schemes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
4.2.4 Keys compression . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
4.3 Links . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
5 Garbage Collection 20
6 The superblock 20
6.1 The superblock management algorithm . . . . . . . . . . . . . . . . . . . 20
6.2 The length of the chain . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
6.3 The superblock search . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
7 Issues/ideas/to be done 26
8 Definitions 27
9 Symbols 31
10 Abbreviations 32
11 Credits 32
12 References 33
1 JFFS2 overview
JFFS2, the Journalling Flash File System version 2 [1] is widely used in the embedded
systems world. JFFS2 was originally designed for small NOR flashes (less than about
32MB) and the first device with a JFFS2 file system was a small bar-code scanner. Later,
when NAND flashes became widely used, NAND support was added to JFFS2. The first
NAND flashes were also small enough, but they grew in size very quickly and are currently
much larger than 32MB (e.g., Samsung produces 2GB NAND flashes [4]).
JFFS2 has a log-structured design, which basically means that the whole file system
may be regarded as one large log. Any file system modification (i.e., file change, directory
creation, changing a file's attributes, etc.) is appended to the log. The log is the only data
structure on the flash media. Modifications are encapsulated into small data structures
called nodes.
So, JFFS2 is roughly a log, the log consists of nodes, and each node contains a file system
modification. This is basically all the JFFS2 file system is. It is very simple from the
physical layout's standpoint. For more information about the design of JFFS2 and about
the log-structured design, look at [1], [2], and [3].
It is not the goal of this chapter to delve into the details of JFFS2, but it should still
provide enough information to make it clear why JFFS2 has scalability problems and
why JFFS3 is needed. To keep this chapter simple, the generic terms index and indexing
information are used.
The index is a crucial part of any file system as it is used to keep track of everything
that is stored in the file system. For example, the index may help to quickly locate the
addresses of physical blocks which correspond to the specified file at the specified offset,
or it helps to quickly find all the directory entries in a specified directory and so on.
For example, in case of ext2, the inode table, the bitmap and the set of direct, indirect,
doubly indirect and triply indirect pointers may be considered the index. In case of the
FAT file system, the File Allocation Table may be considered the index, and so on.
In traditional file systems the index is usually kept and maintained on the media, but
unfortunately, this is not the case for JFFS2. In JFFS2, the index is maintained in RAM,
not on the flash media. And this is the root of all the JFFS2 scalability problems.
Of course, as the index is kept in RAM, JFFS2 achieves extremely high throughput,
just because it does not need to update the index on flash after something has been
changed in the file system. And this works very well for relatively small flashes, for which
JFFS2 was originally designed. But as soon as one tries to use JFFS2 on large flashes
(starting from about 128MB), many problems come up.
First, it is obvious that JFFS2 needs to build the index in RAM when it mounts
the file system. For this reason, it needs to scan the entire flash partition in order to
locate all the nodes which are present there. So, the larger the JFFS2 partition and the
more nodes it has, the longer it takes to mount it.
Second, it is evident that the index consumes some RAM. And the larger the
JFFS2 file system and the more nodes it has, the more memory is consumed.
To put it differently, if S is the size of the JFFS3 flash partition 1,
- JFFS2 mount time scales as O(S) (linearly);
- JFFS2 memory consumption scales as O(S) (linearly).
1 Note: all the symbols used in this document are summarized in section 9.
So, it may be stated that JFFS2 does not scale. But despite the scalability problems,
JFFS2 has many advantages, for example:
- very economical flash usage - data usually take as much flash space as they actually
need, without wasting a lot of space as in case of traditional file systems for block
devices;
- "on-the-fly" compression which allows fitting a great deal of data onto the
flash; note, there are few file systems which support compression;
- very good file system write throughput (no need to update any on-flash indexing
information as it simply does not exist there);
- robustness against unclean reboots;
- good enough wear-levelling.
It is also worth noting here that there is a patch which is usually referred to as the
"summary patch", that was implemented by Ferenc Havasi and was recently committed
to the JFFS2 CVS. This patch speeds up the JFFS2 mount greatly, especially in case of
NAND flashes. What the patch basically does is put a small "summary" node
at the end of each flash eraseblock. This node, roughly speaking, contains a copy of the
headers of all the nodes in this eraseblock. So, when JFFS2 mounts the file system,
it only needs to glance at the end of each eraseblock and read the summary node. This
means that JFFS2 only needs to read one or a few NAND pages from the end of each
eraseblock. Without the summary, JFFS2 instead reads almost every NAND page
in each eraseblock, because node headers are spread more or less evenly over eraseblocks.
Although the patch helps a lot, it is still not a scalable solution and it only lowers the
coefficient of the JFFS2 mount time's linear dependency. Moreover, it does not lessen
JFFS2 memory consumption.
2 JFFS3 Requirements
The following are the main user-level requirements JFFS3 has to meet.
R01 JFFS3 memory consumption must not depend on the size of the JFFS3 partition, the
number of inodes in the file system, the size of files, directories, and the like. Of course,
JFFS3 must be able to take advantage of the available RAM, but only for
different kinds of caches which may be freed at any time in case of memory pressure.
R02 JFFS3 has to provide very fast file system mount without the need to scan the
whole flash partition.
R03 JFFS3 has to provide good flash wear-levelling.
R04 JFFS3 must guarantee that unclean reboots cannot cause any file system corruption.
R05 JFFS3 must provide good enough performance.
R06 Unlike JFFS2, JFFS3 must implement write-behind caching for better performance.
R07 JFFS3 must gracefully deal with different kinds of data corruptions (flash bit-flips,
bad blocks may appear dynamically, etc).
R08 In case of serious corruptions it should be possible to reconstruct all the data which
were not damaged by means of external tools like ckfs.jffs3.
R09 All the JFFS3 characteristics ought to scale no faster than logarithmically.
JFFS2-like linear dependencies are not acceptable.
R10 JFFS3 must support extended attributes.
R11 JFFS3 must support the Access Control Lists feature (ACL).
R12 JFFS3 has to support on-the-fly compression.
R13 JFFS3 should provide good concurrency, which means that it should be possible
to read the file system during Garbage Collection, to read/write during the
Journal Commit, to read/write the file system simultaneously, and so on.
3 Introduction to JFFS3
The main idea of how to fix JFFS2 and make it scalable is to move the index from RAM to
flash. Unfortunately, this requires a complete JFFS2 redesign and re-implementation, and
the design of JFFS3 is largely different from the design of JFFS2. This section discusses
the base JFFS3 design ideas without any detailed description.
3.1 Indexing problem
There is a large difference between block devices and flash devices in how they allow
updating the contents of a sector. Block devices admit so-called "in-place updates", i.e.,
the update may be written straight to the sector. Flash devices do not allow this unless
the whole eraseblock has been erased beforehand.
Obviously, it is unacceptable to erase the whole eraseblock each time a sector is
updated. Instead, a so-called "out-of-place updates" technique is usually used. This simply
means that no attempts to update sectors in-place are made; instead, updates are
written to some other sector and the contents of the previous sector are afterwards regarded
as garbage.
This "out-of-place writes" property of flash devices implies that JFFS3 also has a
log-structured design, as in JFFS3 any update is written out-of-place. And it seems
natural for any flash file system to have a log-structured design.
It is interesting to notice that in log-structured file systems for block devices (like
the one described in [2]) not every update is "out-of-place". There are always some
fixed-position sectors present. These sectors usually refer to the file system index, admit of
quick file system mount, and they are updated in-place.
But flash devices have a limited number of erase cycles for each eraseblock and it is
impossible to guarantee good wear-levelling if some eraseblocks are reserved for similar
purposes. So, it is important that in JFFS3 there are no in-place updates, as good
wear-levelling is one of the main requirements for JFFS3 (see section 2).
The "out-of-place updates" property makes it difficult to maintain the index on the
flash media. Figure 1 demonstrates why.
Figure 1: JFFS3 indexing problem example.
Suppose the index is kept and maintained on flash and it consists of 4 parts A, B,
C, and D which refer to each other: A refers to B and C, B refers to D, and C refers to D.
This means that A contains the physical flash addresses of B and C and so on.
Suppose D should be updated. Since it is updated out-of-place, the newer version
D1 is written to some other place. But B and C still refer to D, not D1,
and they ought to be updated as well. And when they are updated out-of-place, A will
still refer to the old B and C, and so on. Thus, it is not that trivial to store and maintain
indexing information on the flash media.
3.2 Wandering trees
To address the above problem it is possible to use wandering trees. Figure 2 demonstrates
how wandering trees work.
1. Suppose that the index is a tree and it is stored and maintained on the flash media.
The tree consists of nodes A, B, C, D, E, F, G, and H. Suppose node H should
be updated.
2. At first, the updated version H1 is written. Obviously, F still refers to H.
3. Now the corresponding link in node F is changed and node F1 is written to flash.
F1 refers to H1. But as F1 is also written out-of-place, A still refers to the old node F.
4. Finally, the new root node A1 is written and it refers to F1.
5. Nodes A, F, and H are now treated as garbage and the updated tree is composed of
nodes A1, B, C, D, E, F1, G, and H1.
So, wandering trees are the basic idea of how the indexing information is going to be
maintained on the flash media in JFFS3. And it stands to reason that any tree may
be called a "wandering tree" if any update in the tree requires updating the parent nodes up
to the root. For example, it makes sense to talk about wandering Red-Black trees,
wandering B+-trees and so forth.
Figure 2: Wandering tree example.
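
To make the path-copying idea above more concrete, below is a minimal in-RAM sketch
of a wandering-tree update in C. The node layout and function names are illustrative
assumptions, not JFFS3 code; the point is only that an update re-writes every node on
the path from the changed leaf up to the root, leaving the old copies as garbage.

    #include <stdlib.h>
    #include <string.h>

    #define FANOUT 4

    struct wnode {
        struct wnode *child[FANOUT];   /* NULL in leaf nodes          */
        char data[32];                 /* payload kept in leaf nodes  */
    };

    /* Out-of-place update: never touch the old node, always make a copy. */
    static struct wnode *wnode_copy(const struct wnode *old)
    {
        struct wnode *c = malloc(sizeof(*c));
        memcpy(c, old, sizeof(*c));
        return c;
    }

    /*
     * Update the leaf reached by following 'path' (child indices) for
     * 'depth' steps.  Every node on the path is re-written, so the caller
     * gets a brand new root (A1 in figure 2); the old root and the old
     * path nodes become garbage.
     */
    struct wnode *wtree_update(const struct wnode *root, const int *path,
                               int depth, const char *new_data)
    {
        struct wnode *c = wnode_copy(root);

        if (depth == 0)
            strncpy(c->data, new_data, sizeof(c->data) - 1);
        else
            c->child[path[0]] = wtree_update(root->child[path[0]],
                                             path + 1, depth - 1, new_data);
        return c;
    }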
3.3 B-trees
JFFS3 uses B+-trees and this subsection gives a short introduction to them. There
are plenty of books where one may find more information. There are also many on-line
resources available, e.g., [6].
Informally, a B-tree may be described as a balanced search tree where
each node may have many children. The branching factor or the fanout defines the
maximal number of a node's children. While B-trees may contain both useful data and
keys and links in non-leaf nodes, B+-trees are B-trees which store data only in leaf nodes,
while non-leaf nodes contain only keys and links.
Figure 3: B+-tree example.
Figure 3 demonstrates a B+-tree with branching factor n and L levels.
Note that in JFFS3 levels are numbered starting from the leaf nodes (level 0) and ending
at the root node (level L - 1).
Leaf nodes in the B+-tree contain data which are indexed by keys. Non-leaf nodes do
not contain data, but contain only the indexing information, namely, keys and links.
Figure 4: The structure of a non-leaf node in B+-tree.
Figure 4 depicts the structure of a non-leaf node. There are n links and n - 1 keys
in the node. Links may point to either leaf nodes or other non-leaf nodes. In the former
case, the leaf node will contain data which corresponds to the key which follows the link.
In the latter case, the pointed-to non-leaf node (and the whole subtree rooted at this
non-leaf node) will contain more keys in the corresponding range, e.g., (Key 1, Key 2].
Keys are sorted in ascending order in non-leaf nodes, so it is not that difficult
to look up the data corresponding to any key. Furthermore, the tree is balanced, so the
number of lookup steps does not depend on the key.
When objects are inserted or removed from the tree, re-balancing may be needed. The
tree is re-balanced by means of splitting nodes or merging them, and a simple
enough algorithm exists for this. Please refer to Donald Knuth's books for more information
about re-balancing B+-trees.
B+-trees are widely used when working with block devices (e.g., hard drives). Indeed,
these devices have a fixed input/output unit size (usually referred to as a sector) and it
is natural to use B+-trees with a node size that is a multiple of the sector size in order to
store information on such devices.
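
As a small illustration of the lookup just described, here is a sketch of descending a
B+-tree held in memory. The node structure and the fixed 64-bit keys are assumptions
made for brevity; on-flash JFFS3 nodes look different (see section 3.4).

    #include <stdint.h>

    #define MAX_FANOUT 64

    struct bp_node {
        int      nkeys;                 /* n-1 keys, n links in a non-leaf node */
        int      is_leaf;
        uint64_t keys[MAX_FANOUT - 1];
        void    *links[MAX_FANOUT];     /* children, or data in leaf nodes      */
    };

    /* Descend from the root to the leaf which may hold 'key' (figure 4:
     * link i+1 covers the keys in the range (Key i, Key i+1]). */
    struct bp_node *bp_find_leaf(struct bp_node *node, uint64_t key)
    {
        while (!node->is_leaf) {
            int i = 0;

            while (i < node->nkeys && key > node->keys[i])
                i++;
            node = node->links[i];
        }
        return node;
    }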
3.4 Indexing in JFFS3
The way JFFS3 stores and indexes the file system is similar to the approach used
by the Reiser4 file system (see [5]). All the file system objects (inodes, files, directory
entries, extended attributes, etc.) are kept in one large B+-tree. Effectively, the whole
JFFS3 file system may be regarded as one large B+-tree. This tree is further referred to
just as "the tree".
Every object which is stored in the tree has a key, and objects are found in the tree
by their keys. To make it clearer what object keys are, the following is an example of
how they may look:
- file data key: {inode number, offset};
- directory entry key: {parent directory inode number, direntry name hash}; and the like.
The following are the terms which are used in JFFS3 to refer to nodes of different levels in
the tree:
- nodes of level 0 are leaf nodes;
- nodes of level 1 are twig nodes;
- nodes which are not the root, not leaf, and not twig are branch nodes;
- non-leaf nodes (i.e., the root, branch and twig) are indexing nodes.
Note, the same terminology (except indexing nodes) is used in the Reiser4 file system
[5].
Non-leaf nodes are called "indexing nodes" because they contain only indexing information,
nothing else. No file system data is kept in the indexing nodes. Indexing nodes
have a fixed size which is equivalent to the flash sector size.
It is important to note that somewhat unusual terminology is used in this document.
The smallest input/output unit of the flash chip is called a sector. Since JFFS3 is mainly
oriented towards NAND flashes, the sector is usually the NAND page and is either 512 bytes
or 2 kilobytes. For other flash types the sector may be different. If the flash's minimal
input/output unit is very small (say, one bit as in case of NOR flash), there should be a
layer which emulates larger sectors (say, 512 bytes).
In contrast to indexing nodes, leaf nodes have a flexible size, just like nodes in JFFS2.
So, roughly speaking, the JFFS3 file system may be considered as the JFFS2 file system (leaf
nodes) plus indexing information (indexing nodes) (see figure 5).
Figure 5: The JFFS3 tree.
Similarly to JFFS2, leaf nodes consist of a header and data. The header describes the
node data and contains information like the key of the node, the length, and the like.
Node data contains some file system data, for example a directory entry, a file's contents,
etc.
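
For illustration only, a leaf-node header could look roughly like the structure below; the
field names and sizes here are assumptions, not the actual JFFS3 on-flash format.

    #include <stdint.h>

    struct jffs3_leaf_hdr {
        uint16_t magic;        /* marks the beginning of a JFFS3 leaf node     */
        uint8_t  node_type;    /* data, direntry, attr-data, ...               */
        uint8_t  compr_type;   /* compression algorithm used for the node data */
        uint64_t key;          /* the object key, e.g. {inode #, offset}       */
        uint32_t data_len;     /* length of the (compressed) data that follows */
        uint32_t hdr_crc;      /* header checksum                              */
    } __attribute__((packed));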
Leaf and indexing nodes are physically separated, which means that there are eraseblocks
with only indexing nodes and eraseblocks with only leaf nodes. But of course, this does not
mean that the whole flash partition is divided into two parts; it only means that
indexing and leaf nodes are not mixed in one eraseblock. Figure 6 illustrates this.
Figure 6: An illustration of leaf and indexing nodes separation.
Eraseblocks which contain only indexing nodes are called indexing eraseblocks and
those with leaf nodes are called leaf eraseblocks.
The depth of the tree depends on how many objects are kept in the file system.
The more files, directories, etc. are present in the file system, the deeper the tree.
Fortunately, the number of tree levels grows very slowly with the growing number of file
system objects and the tree lookup scales as O(log_n S) (logarithmically).
The following are advantages of the JFFS3 indexing approach.
- Many different key assignment schemes may be used and this gives flexibility in
how objects are sorted in the tree. Thus, one may optimize JFFS3 for specific
workloads by means of changing the format of the keys.
- Leaf nodes may be compressed, so JFFS3 admits of on-the-fly compression.
- In case of corruption of the indexing information it is possible to re-create it by
means of scanning the leaf nodes' headers.
- There is a clear separation between data and indexing information. This implies
that the indexing information and data may be cached separately, without overlapping
in the same cache lines. This leads to better cache usage as described in the
Reiser4 paper [5].
3.5 Indexing example
This section illustrates how JFFS3 indexing works by means of a simple example.
The example is very rough but it shows JFFS3 indexing in action. It is assumed that
keys of direntries and data objects have the same layout as mentioned in section 3.4.
Suppose that the user does the following:
1. mounts the JFFS3 file system to the "/mnt/jffs3" directory;
2. issues the "ls /mnt/jffs3" command;
3. reads the contents of the "/mnt/jffs3/my_file" file.
The following are comments about what is going on in JFFS3 during the above steps.
1. During mount JFFS3 locates the position of the root node. This is done with the help
of the JFFS3 superblock which will be described later (see section 6).
2. To get the list of directory entries in the root directory, JFFS3 looks up all objects
matching the {2, *} key pattern. Indeed, direntry keys have the {parent inode #, name hash}
format, the root directory inode number is 2 (or another predefined constant), and "*"
stands for a wildcard. Thus, {2, *} will match any direntry in the root directory.
3. To read the "my_file" file, JFFS3 first needs to find out its inode number. The
inode number is stored in the directory entry object. Hence, JFFS3 reads my_file's
direntry using the {2, H("my_file")} key (H() is the hash function).
Then JFFS3 searches for my_file's data objects using {I, offset} keys, where I is the
inode number found in the direntry. Depending on which part of the file should be read,
offset may take different values.
The above description is somewhat simplified; e.g., JFFS3 also needs to read my_file's
attr-data object to fetch the inode length from there (see section 4.1), etc. But the aim
of this section is just to provide an idea of how JFFS3 indexing works.
3.6 The Journal
The JFFS3 tree is both a B+-tree and a wandering tree. Any file system change implies
that a new node is written to the flash media, which in turn means that a number of
indexing nodes must be updated. Namely, the whole path of indexing nodes up to the
root node should be updated (see section 3.2).
Evidently, it is very expensive to update several indexing nodes on each file system
change and the journal provides a mechanism to avoid this.
The journal consists of a set of eraseblocks (the journal eraseblocks) which do not
have a fixed location on flash and are not contiguous on flash. Any flash eraseblock may
be used as a journal eraseblock.
Figure 7: The JFFS3 journal.
When something is changed in the JFFS3 file system, the corresponding leaf node is
written to the journal, but the corresponding indexing nodes are not updated. Instead,
JFFS3 keeps track of file system changes in RAM in a data structure called the journal
tree (see figure 7).
When something is read from the file system, JFFS3 first glances at the in-RAM
journal tree to figure out if the needed data is in the journal. If the data is there, the
journal is read; otherwise JFFS3 performs the usual tree lookup (see figure 8).
Figure 8: Read request processing in JFFS3.
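
The read path of figure 8 boils down to a two-step lookup, sketched below in C. The
structures and helper functions are placeholders standing in for the corresponding JFFS3
subsystems, not a real API.

    #include <stddef.h>

    struct key    { unsigned long ino, off; };
    struct object { void *data; };            /* a decoded file system object */

    /* Stubs standing in for the real lookups. */
    static struct object *journal_tree_lookup(const struct key *k) { (void)k; return NULL; }
    static struct object *main_tree_lookup(const struct key *k)    { (void)k; return NULL; }

    /* Figure 8: consult the in-RAM journal tree first, fall back to the
     * usual (on-flash) tree lookup otherwise. */
    struct object *jffs3_read_object(const struct key *k)
    {
        struct object *obj = journal_tree_lookup(k);

        return obj ? obj : main_tree_lookup(k);
    }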
The journal is committed when it is full or at some other time appropriate for JFFS3.
This means that the indexing nodes corresponding to the journal changes are updated
and written to the flash. The committed journal eraseblocks are then treated as leaf
eraseblocks and new journal eraseblocks are picked by JFFS3 using the common JFFS3
wear-levelling algorithm.
The journal makes it possible to postpone indexing information updates to a later and
potentially more appropriate time. It also allows merging many indexing node updates,
lessening the amount of flash write operations.
When the JFFS3 file system is being mounted, the journal should be read, "replayed",
and the journal tree should be built. So, the larger the journal, the longer it may
take to mount JFFS3. On the other hand, the larger the journal, the more writes
may be deferred and the better performance may be achieved. In other words, there
is a trade-off between the mount time and the performance, and one may vary these
characteristics by means of changing the size of the journal.
3.7 Garbage collection
Garbage collection is a vital part of any log-structured file system. Over time, JFFS3
uses up all the flash free space and it needs to reclaim the flash space occupied by garbage.
The goal of the Garbage Collector is to recycle garbage and reclaim the flash space which
it occupies. Since the only way to reclaim it is to erase the whole eraseblock, the Garbage
Collector works in terms of eraseblocks.
The JFFS2 Garbage Collector is quite simple and works in several steps.
1. To reclaim dirt from an eraseblock, JFFS2 moves all valid nodes from this eraseblock
to another eraseblock.
2. As nodes have changed their positions, the JFFS2 in-RAM index is adjusted.
3. The first eraseblock may be erased and re-used.
Note, JFFS2 (and JFFS3) always reserves several eraseblocks in order to guarantee
that there are always some free eraseblocks available to perform garbage collection.
The JFFS3 Garbage Collector is more complex. When valid data has been moved from
an eraseblock, the corresponding indexing nodes must be updated as well. Depending on
how much space the Garbage Collector has reclaimed and how much space it has spent to
update indexing nodes, it might be that the Garbage Collector produces more garbage than
it reclaims. This problem is demonstrated in figure 9.
Figure 9: JFFS3 garbage collection problem illustration.
The figure depicts a subtree rooted at node A. At the beginning
(snapshot 1), leaf nodes D, E, F, and G are situated in eraseblock number 1. Indexing
nodes A, B, and C are situated in eraseblock number 2. There are also two reserved
eraseblocks. Suppose all nodes have the same size, equivalent to the size of a sector, which is
512 bytes in this example.
At snapshot 2 JFFS3 has decided to reclaim 512 bytes of dirty space from eraseblock
number 1. The Garbage Collector moves all the valid nodes from eraseblock number
1 to one of the reserved eraseblocks. But as indexing nodes B and C still refer to the old
copies of the moved nodes in eraseblock number 1, this eraseblock cannot be erased
so far. Indexing nodes A, B, and C have to be updated first.
At snapshot 3 the Garbage Collector has updated indexing nodes A, B and C, putting
them into one of the reserved eraseblocks. From now on, the old copies of nodes A, B, C, D,
E, F, and G at eraseblocks 1 and 2 comprise garbage. Eraseblock number 1 was
erased and is now free.
But unfortunately, the result is that the Garbage Collector made more dirt than it reclaimed
space. Indeed, GC reclaimed 512 bytes while producing a three times greater
amount of garbage (see the first three sectors at eraseblock 2, snapshot 3). Compare
snapshots 1 and 2.
Hence, it is obvious that garbage collection in JFFS3 must be more complex than
in JFFS2. Chapter 5 discusses the JFFS3 Garbage Collector in detail.
3.8 The superblock
The JFFS3 superblock is a data structure that describes the file system as a whole and
contains important information like the offset of the root node, the journal eraseblocks,
etc. When the file system is being mounted, it first finds and reads the JFFS3 superblock.
In case of traditional file systems the superblock usually resides at a fixed position
on the disk and may be found very quickly. Conversely, due to the "out-of-place write"
flash property it is impossible to assign a fixed position for the JFFS3 superblock. Things
get even more complex because of the need to provide good wear-levelling: it is
incorrect to just reserve several eraseblocks for the superblock unless it is guaranteed
that these eraseblocks will not be worn out earlier than the other eraseblocks.
We have the following two requirements that ought to be met in JFFS3:
- JFFS3 must be able to quickly find the superblock;
- the superblock management techniques must not spoil the overall flash wear-levelling.
In classical file systems the superblock usually contains a lot of static data which
is rarely updated, and the superblock may have any size. In JFFS3, the superblock must
be updated quite often (e.g., each time the journal is committed). This means that to
lessen the amount of I/O, the JFFS3 superblock should be as small as possible,
namely, one sector. And there is no reason to keep any static data in the superblock
(e.g., the size of the file system, its version, etc). For static data, JFFS3 reserves the first
eraseblock of the JFFS3 partition.
Thus, the following terms are used in this document:
- static superblock - contains only static data which are never changed by JFFS3; the
static superblock resides in the static eraseblock; the static eraseblock is the first
non-bad eraseblock of the JFFS3 partition; it is supposed that the contents of the
static eraseblock may only be changed by external user-level tools;
- superblock - contains only dynamic data, is changed quite often and requires special
methods to deal with.
JFFS3 has a rather complicated superblock management scheme which makes it possible
to quickly find the superblock without a full flash scan when the file system is
being mounted. This scheme provides good flash wear-levelling. The superblock lookup
should take a few milliseconds and scale as O(log2(S)). For more detailed information
about the superblock management scheme see section 6.1.
4 The tree
This chapter discusses all the aspects related to the main JFFS3 entity: the tree.
Please, refer to section 3.4 for basic information about the JFFS3 tree.
4.1 Objects
JFFS3 keeps file system objects in the leaf level of the tree (in leaf nodes) and the following
is the list of supported objects.
1. Data objects contain files' data and are kept in data nodes. Each data node holds
one RAM page worth of data (i.e., PAGE_SIZE bytes, which is 4K on most 32-bit
architectures). But of course, in case of small files (less than one RAM page) and
files' tails, less data may be put into the data node.
Figure 10: An illustration of files乫 data representation.
Figure 10 illustrates the correspondence between files' contents and data objects.
Each RAM page-sized piece of a 13K file corresponds to a data node in the JFFS3
tree. The 1K tail of the file also corresponds to a data node. But because of
compression the actual sizes of data nodes are less than the corresponding file fragments.
The division into RAM page-sized fragments relates to the Linux Virtual Memory
Management architecture. Namely, the Linux Page Cache works in terms of RAM
pages, which means that JFFS3 is always asked to read and write files in units of
the RAM page size.
It is worth noting that in order to optimize flash utilization, JFFS3 may store
a multiple of the RAM page size in one data node for static files. This admits of better
compression and leads to several other benefits.
2. Direntry objects contain the correspondence between directory entry names and
inode numbers. Direntry objects are stored in direntry nodes. Every directory
entry in the file system has a corresponding direntry object.
3. Attr-data objects contain attributes of inodes - both standard Unix attributes like
user ID, last modification time, inode length, etc., and JFFS3-specific attributes like
the type of compression, etc. Each inode has only one corresponding attr-data
object.
4. Xentry objects contain the correspondence between names of extended attributes
and xattr IDs. Every extended attribute in the file system has a corresponding
xentry object. This is analogous to direntry objects, but direntries contain a
{direntry name -> inode number} mapping, whereas xentries contain an {xattr name -> xattr ID} mapping.
Each extended attribute in JFFS3 has its own unique number - the xattr ID, just like
every inode has its own unique inode number. And in fact, JFFS3 utilizes the same
space of numbers to enumerate inodes and extended attributes.
Xentry objects are stored in xentry nodes.
5. Xattr-data objects contain the data of extended attributes. The way xattr-data
objects are kept in the tree is equivalent to the way data objects are kept there.
Xattr-data objects are stored in xattr-data nodes.
6. Acl objects contain Access Control Lists (ACLs) of inodes (information about ACLs
may be found in [7]). Acl objects are stored in acl nodes.
In real-world systems a vast number of files have equivalent ACLs while only a few
files have unique ACLs. For the former group of files (or, more strictly, inodes)
JFFS3 makes use of shared acl objects. This means that there is only one acl
object instance for all of these inodes. Shared acls are referred to from the attr-data
objects of these inodes. If a shared acl is written to, a new acl object is created
(copy-on-write mechanism). Conversely, for the latter group there is a distinct acl
object per inode.
4.2 Keys
Each object has its own key and may be quickly looked up in the tree by its key. As
there are 6 object types in JFFS3, there are also 6 key types:
1. data keys - index data objects;
2. direntry keys - index direntry objects;
3. attr-data keys - index attr-data objects;
4. xentry keys - index xentry objects;
5. xattr-data keys - index xattr-data objects;
6. acl keys - index acl objects.
4.2.1 Trivial key scheme
Let us start discussing JFFS3 keys with an example of a simple key layout which is further
referred to as the trivial key scheme. All keys in this scheme have the same length of
55 bits (32 + 3 + 20, see figure 11).
- Data keys consist of the 32-bit inode number the data belongs to, the unique 3-bit
key type identifier, and the 20-bit data offset.
- Direntry keys consist of the 32-bit parent directory inode number, the unique 3-bit
key type identifier, and the 20-bit direntry name hash value.
- Attr-data keys consist of the 32-bit inode number the attributes belong to, and the
unique 3-bit key type identifier.
- Xentry keys consist of the 32-bit inode number the extended attribute belongs to,
the unique 3-bit key type identifier, and the 20-bit extended attribute name hash
value.
- Xattr-data keys consist of the 32-bit xattr ID, the unique 3-bit key type identifier,
and the 20-bit extended attribute data offset.
- Acl keys consist of the 32-bit inode number the acl object belongs to, and the unique
3-bit key type identifier.
The following is the list of key type identifiers:
1. Data keys - 0 (000 bin);
2. Direntry keys - 1 (001 bin);
3. Attr-data keys - 2 (010 bin);
4. Xentry keys - 3 (011 bin);
5. Xattr-data keys - 4 (100 bin);
6. Acl keys - 5 (101 bin).
Since data objects may only contain whole RAM pages of data (excluding small files
and files' tails), offsets in keys are always RAM page size-aligned. Assuming the RAM page
is 4K (12 bits), 20 bits are enough to refer to up to 4GB of data. To put it differently, the
trivial key scheme limits files' length to 4GB provided the system's RAM page size is 4K.
It is also worth noting that several objects of the same type may have the same
key. For example, in case of a hash collision, two direntries may have equivalent keys. In
this case objects may be distinguished by means of reading the corresponding leaf node
headers.
Figure 11: The trivial key scheme.
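
A possible way to pack the trivial keys into a 64-bit integer is sketched below. The bit
ordering (inode number in the high bits, then the type, then the 20-bit field) is an
assumption made for illustration; only the field sizes come from the scheme above.

    #include <stdint.h>

    enum jffs3_key_type {
        KEY_DATA       = 0,     /* 000 */
        KEY_DIRENTRY   = 1,     /* 001 */
        KEY_ATTR_DATA  = 2,     /* 010 */
        KEY_XENTRY     = 3,     /* 011 */
        KEY_XATTR_DATA = 4,     /* 100 */
        KEY_ACL        = 5,     /* 101 */
    };

    /* inode # (or parent inode #, or xattr ID) | type | offset (or name hash) */
    static inline uint64_t make_key(uint32_t ino, enum jffs3_key_type type,
                                    uint32_t low20)
    {
        return ((uint64_t)ino << 23) | ((uint64_t)(type & 0x7) << 20) |
               (low20 & 0xFFFFF);
    }

    /* Example: the data object at offset 8K of inode 17 (offsets in keys
     * hold the RAM page index, i.e. offset >> 12 for 4K pages):
     *     uint64_t k = make_key(17, KEY_DATA, 8192 >> 12);
     */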
4.2.2 Keys comparison
The important topic is how keys are compared as it defines the relative order of objects
in the tree and is crucial for searching. Note, it only makes sense to compare keys of the
same type.
JFFS3 keys are usually comprised of one or more fields, i.e., keys K_1 and K_2 may be
represented as

    K_1 = {k_1^1, k_1^2, ..., k_1^p},    K_2 = {k_2^1, k_2^2, ..., k_2^p},

where p is the number of fields in keys of this type.
Keys K_1 and K_2 are considered to be equivalent if and only if all their fields are
equivalent, i.e., k_1^i = k_2^i for i = 1, 2, ..., p.
Keys are compared field by field starting from the first field. If on the i-th step
k_1^i > k_2^i, then K_1 is considered to be greater than K_2. Similarly, if on the i-th step
k_1^i < k_2^i, then K_1 is considered to be less than K_2.
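
A field-by-field comparison as described above may be sketched as follows, assuming
keys of one type are represented as arrays of p integer fields (an assumption made for
brevity).

    #include <stdint.h>

    /* Returns a negative value, zero or a positive value if k1 is less
     * than, equal to, or greater than k2. */
    int key_cmp(const uint32_t *k1, const uint32_t *k2, int p)
    {
        int i;

        for (i = 0; i < p; i++) {
            if (k1[i] < k2[i])
                return -1;
            if (k1[i] > k2[i])
                return 1;
        }
        return 0;       /* all fields are equivalent */
    }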
4.2.3 Key schemes
Key schemes define the layout of keys for all the 6 object types. Apparently, it would be too
inflexible to hardcode JFFS3 to support only one fixed key scheme. Indeed, there may
be a great deal of reasons why users may want to use different key schemes in different
situations - some examples follow below.
- The inode number is encoded by a 32-bit integer in the trivial key scheme, which
means that about 4 billion inodes and extended attributes may exist on the file
system simultaneously. But in some cases this may be insufficient or, conversely,
too much, and one may want to use fewer bits to encode inode numbers (say, only 24
bits), just for optimization.
- Similarly, offsets are encoded as 20-bit integers in the trivial key scheme, which may
be insufficient when huge files (larger than 4GB) should be supported. So one may
want to use more bits in certain cases.
- Depending on the concrete JFFS3 usage, different hash functions may be used in
direntry keys. The length of hash values may also vary depending on how many
directory entries are kept in directories. If there are huge directories with millions
of files, long hash values should be used to avoid massive hash collisions (say,
64-bit hash values). But if it is known in advance that there will be no very large
directories, 2 the length of hash values may be shorter.
- It is also possible that one may want to use some tricky key layouts to achieve different
kinds of optimization. For example, direntry keys may include the first 8 bytes
(64 bits) of the direntry name (see figure 12). In this case the getdents 3 Linux system
call will return direntries in "mostly" alphabetically sorted order and user-space
programs will not spend much time sorting them. In fact this technique is used in
the Reiser4 file system and it is claimed that slow sorting is a bottleneck in certain
real-life workloads. And the like.
- Different key compression methods may be used in different key schemes (see
section 4.2.4 below).
parent inode # (32) type (3) name (64) name hash (19)
Figure 12: Direntry key layout example.
So it is obvious why JFFS3 is not tied to one fixed key scheme but instead admits
many different key schemes (one at a time, of course), with the possibility to choose the
best-suited key scheme.
4.2.4 Keys compression
The index is the most frequently re-written part of JFFS3. Indeed, every single change
at the leaf level of the tree requires re-writing L - 1 indexing nodes. The number of index
updates is reduced by the write-behind cache and by the journal, but the index is still
changed very often. So, it is extremely important for JFFS3 to keep the tree as shallow
as possible.
This means that it makes sense to apply a sort of compression to keys in indexing
nodes. There are several ways to compress keys and the following are examples of possible
compression techniques.
Offsets coding. Offsets compression may be based on the observation that the overwhelming
majority of files in many file systems are small files. This means that it
might make sense to code smaller offsets with fewer bits.
2 Note, when talking about directories, the words "large" and "small" describe how many direntries are
kept in these directories. The more direntries a directory contains, the larger it is.
3 See the getdents(2) Linux manual page.
Table 1 contains an example of how offsets may be encoded. For offsets in the range
0KB-8KB only 3 bits are enough, so the bit sequence "000" will encode offset 0,
and the bit sequence "001" will encode offset 4K 4. Offsets in the range 8KB-64KB are
encoded by 6 bits and so on.
Offset range Bits in range Code prefix Code length
0KB-8KB 13 bits 00 3 bits
8KB-64KB 16 bits 01 6 bits
64KB-1MB 20 bits 10 10 bits
1MB-128MB 27 bits 110 18 bits
128MB-4GB 32 bits 111 23 bits
Table 1: An example of offset coding.
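
A sketch of an encoder following table 1 is given below; the data bits simply hold the
RAM page index, which fits into the listed code length for each range. The exact bit
layout is an assumption for illustration.

    #include <stdint.h>
    #include <stdio.h>

    struct code { uint32_t bits; int len; };   /* the 'len' low bits of 'bits' */

    static struct code encode_offset(uint32_t offset)
    {
        uint32_t page = offset >> 12;          /* offsets are 4K-aligned       */
        struct code c;

        if (offset < (8u << 10))        { c.bits = (0x0u << 1)  | page; c.len = 3;  }
        else if (offset < (64u << 10))  { c.bits = (0x1u << 4)  | page; c.len = 6;  }
        else if (offset < (1u << 20))   { c.bits = (0x2u << 8)  | page; c.len = 10; }
        else if (offset < (128u << 20)) { c.bits = (0x6u << 15) | page; c.len = 18; }
        else                            { c.bits = (0x7u << 20) | page; c.len = 23; }
        return c;
    }

    int main(void)
    {
        /* offset 4K encodes as "001", 3 bits, as in the example above */
        struct code c = encode_offset(4096);
        printf("code 0x%x, %d bits\n", c.bits, c.len);
        return 0;
    }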
Inode number coding. If the approximate number of inodes on the file system is
known in advance, a similar coding scheme may be exploited for inode numbers, provided
JFFS3 may reuse deleted files' inode numbers.
Common prefix compression. In case of the trivial key scheme the first field of any
key is the inode number. Any other key scheme will likely also contain the inode
number in keys. If the inode number is the first component of the key, all keys
belonging to the same inode will go sequentially in indexing nodes. To put it
differently, there will be sequences of keys prefixed by the same inode number in
the tree.
Figure 13: The common prefix compression idea illustration.
The evident compression method for these key sequences is to store the inode number
only once, as the common prefix for the entire key sequence, instead of duplicating
it in every key. Figure 13 illustrates how the prefix compression works.
Offsets sequence compression. In the trivial key scheme the last field of data keys
is the offset. The offset is a multiple of the RAM page size. Obviously, indexing nodes
will contain sequences of keys each of which describes data objects belonging to the
same file but with different offsets. Moreover, the keys will be ordered by increasing
offset.
4 Recall, offsets in keys are RAM page-aligned and by default, the RAM page size is assumed to be
4K in this document.
For sequences like this it is possible to only specify the starting offset, the ending
offset and the number of keys in the sequence, instead of wasting space storing the
offset in each key of the sequence.
Figure 14: The Offsets sequence compression idea illustration.
Figure 14 presents an example of the offsets sequence compression method. Four
consecutive keys which describe four data objects belonging to the same inode may
be represented as a sequence of four keys without the offset field, but prefixed by
the starting offset, the ending offset and the number of keys in the sequence.
Note, the above compression methods may be combined to achieve better compression.
Because of compression, JFFS3 keys have variable size, which means that it is impossible
to directly apply the binary search algorithm to the contents of indexing nodes.
In JFFS3, indexing nodes are decompressed when read and are cached in decompressed
form. And after the indexing node has been decompressed, the binary search algorithm
is applicable.
We believe that key compression will considerably reduce the amount of on-flash
indexing information and increase the overall performance just because the amount of
input/output will lessen. But only actual fixed-size keys vs. variable-size keys tests will
show whether there is a real performance gain.
4.3 Links
Links in JFFS3 have a fixed length and are not compressed. The link width depends on
the size of the JFFS3 partition - the larger the JFFS3 partition, the wider the links. Instead of
choosing a huge link width to suit the largest possible file systems (e.g., 64 bits), JFFS3
admits of a flexible link width, depending on the JFFS3 partition size.
As indexing nodes have a fixed size equivalent to one sector, the width of the links stored
in branch nodes and in the root node is

    w = log2(S / s).

Twig nodes refer to variable-size leaf nodes, so the width of the links stored in twig nodes is

    w = log2(S),

where S is the size of the JFFS3 partition and s is the size of the sector.
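
For example, on a hypothetical 1GB partition with 2K sectors, these formulas give 19-bit
links in branch and root nodes and 30-bit links in twig nodes; a small sketch of the
calculation:

    #include <math.h>
    #include <stdio.h>

    int main(void)
    {
        double S = 1024.0 * 1024 * 1024;    /* partition size: 1GB */
        double s = 2048.0;                  /* sector size: 2K     */

        printf("branch/root link width: %.0f bits\n", ceil(log2(S / s)));   /* 19 */
        printf("twig link width:        %.0f bits\n", ceil(log2(S)));       /* 30 */
        return 0;
    }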
5 Garbage Collection
Note! JFFS3 Garbage Collection is currently under development and this
chapter may be changed. Any suggestions and ideas are welcome.
6 The superblock
6.1 The superblock management algorithm
To implement the superblock management scheme, JFFS3 reserves the second and the
third good eraseblocks at the beginning of the flash partition (just next to the static
eraseblock). These two eraseblocks are called anchor eraseblocks, or the anchor area.
Anchor eraseblocks contain references to chain eraseblocks. Chain eraseblocks may
either refer to other chain eraseblocks or to the super eraseblock (see figure 15). The number
of chain eraseblocks varies depending on the size of the JFFS3 partition. If there are k
chain eraseblocks, the anchor area will refer to chain eraseblock 1, which will refer to chain
eraseblock 2, which will refer to chain eraseblock 3 and so forth. Chain eraseblock k will
refer to the super eraseblock.
Figure 15: Types of eraseblocks involved to the superblock management scheme.
The super eraseblock contains the superblock which takes one sector. The chain
eraseblocks contain references to the next chain eraseblock or to the super eraseblock.
The JFFS3 superblock management mechanisms work as follows. Suppose there are k
chain eraseblocks in the current superblock management scheme. The superblock updates
are written to consecutive sectors of the super eraseblock. When the super eraseblock has
no more empty sectors, a new super eraseblock is picked, the superblock update is written
to the new super eraseblock, and a new reference is written to chain eraseblock k.
Similarly, when there is no space in chain eraseblock k, a new chain eraseblock k is
picked and the corresponding reference is written to chain eraseblock k - 1, and so on.
When there are no free sectors in chain eraseblock 1, a new chain eraseblock 1 is picked
and the corresponding reference is written to the anchor area. A sketch of this propagation
is given below.
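
The propagation just described may be sketched as follows (k = 2, so four levels: the
anchor area, two chain eraseblocks, and the super eraseblock). The structures and helpers
are assumptions used only to show the control flow; real JFFS3 code would also handle
wear-levelling, versions, unclean reboots, and the fact that the anchor area is fixed (its
two eraseblocks are reused in turn rather than re-picked).

    #include <stdint.h>
    #include <stdio.h>

    #define LEVELS  4                  /* anchor, chain 1, chain 2, super   */
    #define SECTORS 64                 /* N, sectors per eraseblock         */

    struct level {
        uint32_t eraseblock;           /* current eraseblock of this level  */
        int      next_sector;          /* next free sector in it            */
    };

    /* chain[0] is the anchor area; initialisation (reading the current
     * state at mount) is omitted in this sketch. */
    static struct level chain[LEVELS];

    /* Stub: the real thing would use the common wear-levelling scheme. */
    static uint32_t pick_eraseblock(void)
    {
        static uint32_t next = 1000;
        return next++;
    }

    static void write_ref(uint32_t eb, int sec, uint32_t target)
    {
        printf("eb %u, sector %d: reference to eb %u\n", eb, sec, target);
    }

    static void write_sb(uint32_t eb, int sec)
    {
        printf("eb %u, sector %d: superblock\n", eb, sec);
    }

    /* Write a reference (or, at the last level, the superblock itself) at
     * 'level'; when an eraseblock overflows, a new one is picked and the
     * reference one level closer to the anchor area is re-written. */
    static void update_level(int level, uint32_t target)
    {
        if (chain[level].next_sector == SECTORS) {
            chain[level].eraseblock  = pick_eraseblock();
            chain[level].next_sector = 0;
            if (level > 0)
                update_level(level - 1, chain[level].eraseblock);
        }
        if (level == LEVELS - 1)
            write_sb(chain[level].eraseblock, chain[level].next_sector++);
        else
            write_ref(chain[level].eraseblock, chain[level].next_sector++, target);
    }

    /* Each superblock update is then simply: update_level(LEVELS - 1, 0); */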
Figure 16 presents the example of the superblock management scheme (k = 2).
1. Initially, there are 2 chain eraseblocks (numbers 5 and 419) and the super eraseblock
(number 501). There is a reference in the first sector of the anchor area which refers
to chain eraseblock 1. The first sector of chain eraseblock 1 refers to chain
eraseblock 2, and the first sector of chain eraseblock 2 refers to the super eraseblock.
The first sector of the super eraseblock contains the superblock.
2. After the superblock has been updated, the second sector of the super eraseblock
contains the valid copy of the superblock and the first sector contains garbage.
Figure 16: The superblock management example.
3. The superblock has been updated many times and the valid superblock is at the
last sector of the super eraseblock while the other sectors of the super eraseblock
contain garbage.
4. As there were no free sectors in the super eraseblock, a new super eraseblock was
chosen (eraseblock number 7) and the superblock update was written to the first
sector of the new super eraseblock. As the super eraseblock changed its position,
the corresponding reference in chain eraseblock 2 was updated. It was updated
out-of-place and now the first sector of chain eraseblock 2 is dirty while the
second sector contains the valid reference to the new super eraseblock.
5. The superblock has been updated many times and the super eraseblock changed
its position many times and it is currently at the eraseblock number 100. The
reference to the super eraseblock was also updated many times and at the moment
the last sector of the chain eraseblock 2 contains the valid reference while the other
sectors are obsolete. Similarly, the last sector of the super eraseblock contains the valid
superblock while the other sectors are obsolete.
6. When the next superblock update came, there were no free sectors in the super
eraseblock, so a new super eraseblock was picked (eraseblock number 398) and the
valid copy of the superblock is currently in the first sector of eraseblock number
398. Also, there were no free sectors in chain eraseblock 2, so a new chain
eraseblock 2 was picked (eraseblock number 77) and the first sector of eraseblock
77 contains the valid reference to the super eraseblock. Since chain eraseblock
2 changed its position, the corresponding reference in chain eraseblock 1
was updated, and at the moment the second sector of chain eraseblock 1 contains
the valid reference to chain eraseblock 2 while the first sector is dirty.
7. And analogously, after many superblock updates, chain eraseblock 1 was updated
many times and when it became full it changed its position. Of course, chain
eraseblock 2 and the super eraseblock changed their positions many times as well.
So, at the moment, chain eraseblock 1 is at eraseblock number 65,
chain eraseblock 2 is at eraseblock 44 and the super eraseblock is at eraseblock
120. When chain eraseblock 1 changed its position, the corresponding
reference in the anchor area was updated and currently the second sector of
anchor eraseblock 1 contains the valid reference to chain eraseblock 1 while the
first sector is dirty.
8. And even more superblock updates happened. The anchor area was updated many
times. When there were no free sectors at the anchor eraseblock 1, the anchor
eraseblock 2 was used. So, at the moment, the valid reference to the chain eraseblock
1 is at the first sector of the anchor eraseblock 2. From now on, the first
anchor eraseblock may be erased and may be used again when the second anchor
eraseblock is full.
The following are important notes about the JFFS3 superblock management.
- The superblock takes one sector, so the super eraseblock may be updated at most
N times (N is the number of sectors in the eraseblock).
- In case of NAND flash, the sector is the real minimal physical input/output unit, so
only N updates are possible in anchor eraseblocks and in chain eraseblocks. But if
the real input/output unit is smaller than the sector (i.e., if JFFS3 works on top of
NOR flash) this may be taken advantage of and more references may be packed
into one anchor or chain eraseblock.
- When JFFS3 picks a new chain/super eraseblock, the common JFFS3 wear-levelling
scheme is utilized.
- The anchor area has 2 eraseblocks in order to ensure tolerance to unclean reboots:
one anchor eraseblock may be safely erased while the other is being used.
- When a new reference is written to anchor/chain eraseblocks, the previous reference
becomes dirty and on mount JFFS3 should find the valid reference. To facilitate
this, each reference has a version number. Each subsequent reference has a higher
version than the previous one. Hence, JFFS3 may use the binary search algorithm to
quickly find the valid reference (see the sketch after this list).
- As an unclean reboot may happen at any time, no anchor/chain/super eraseblocks are
erased before the whole chain has been updated. This makes it possible to recover
from unclean reboots if they happen while the chain of the superblock-related
eraseblocks is being updated.
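
Because records are written to consecutive sectors and each new record has a higher
version, the used sectors of such an eraseblock form a prefix, and the valid (latest) record
is the last used sector. A sketch of the binary search is below; sector_is_used() is an
assumed helper which reads one sector and checks whether it holds a valid record (here
it is stubbed to simulate 37 written records).

    #include <stdbool.h>

    #define N 64                             /* sectors per eraseblock */

    /* Stub: pretend that 37 records have been written to the eraseblock. */
    static bool sector_is_used(int eb, int sec)
    {
        (void)eb;
        return sec < 37;
    }

    /* Returns the sector holding the latest record, or -1 if the
     * eraseblock is empty. */
    int find_valid_record(int eb)
    {
        int lo = 0, hi = N - 1, last = -1;

        while (lo <= hi) {
            int mid = (lo + hi) / 2;

            if (sector_is_used(eb, mid)) {
                last = mid;                  /* the latest record is here or later */
                lo = mid + 1;
            } else {
                hi = mid - 1;                /* erased sector, look earlier        */
            }
        }
        return last;
    }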
6.2 The length of the chain
The number of required eraseblocks in the superblock management scheme depends on
the size of the JFFS3 partition. The larger the partition, the more levels are needed.
This is determined by the need to ensure that the anchor area is not worn out earlier
than the rest of the JFFS3 partition.
Denote by m the number of required chain eraseblocks plus one (the super eraseblock),
and calculate m assuming the worst-case scenario: any file system data update requires
a superblock update. This would correspond to a synchronous JFFS3 operation mode
with a zero-length journal.
Obviously, what is wanted is to be sure that the anchor area is not worn out earlier
than the data area, i.e., the following inequality should be true:

    T_A / T_D > 1,    (1)
where T_A is the time it takes to completely wear out the anchor area and T_D is the time
it takes to completely wear out the data area. Note, the whole JFFS3 partition excluding
the static superblock and the anchor area is referred to as the data area.
If R_A is the average rate of anchor area updates (sectors per second), R_D is the
average rate of data area updates and N is the number of sectors per eraseblock,
then the anchor area will be written at a rate of R_A/N eraseblocks per second and the
data area will be written at a rate of R_D/N eraseblocks per second. So, JFFS3 will
need to erase R_A/N eraseblocks per second in the anchor area and R_D/N eraseblocks per
second in the data area. Therefore, T_A and T_D may be expressed as

    T_A = 2 · D · N / R_A,

    T_D = (M - 3) · D · N / R_D,
23
where D is the maximum number of flash eraseblock erase cycles, and M is the number of
non-bad eraseblocks on the JFFS3 partition. We subtracted 3 from M to get the number
of eraseblocks in the data area. Dividing T_A by T_D we obtain
    T_A / T_D = 2 · R_D / ((M - 3) · R_A).    (2)
If m = 0, i.e., there are no chain/super eraseblocks and the superblock is stored in
the anchor area, then taking into account (2) and that in this case R_A = R_D = R, we
have

    T_A / T_D = 2 / (M - 2).
Suppose m = 1, i.e., there are no chain eraseblocks and only the super eraseblock is
used. In this case each file system data update will require (a) the superblock update in
the data area and (b) the anchor area update. Therefore, the anchor area will be written
N times less frequently than when m = 0 and the data area will be written 2 times more
frequently than when m = 0. This means that R_A = R/N and R_D = 2R, and from (2)
we have

    T_A / T_D = 2 · 2N / (M - 3).
When m = 2, i.e., chain eraseblock 1 and the super eraseblock are used, the
anchor area will be written N^2 times less frequently, while the data area will be written
2 + 1/N times more frequently than when m = 0 (one superblock update on each file
system update and one chain eraseblock 1 update per N superblock updates). Therefore,
R_A = R/N^2 and R_D = (2 + 1/N) · R, and from (2) we have

    T_A / T_D = 2 · (2N^2 + N) / (M - 3).
For m = 3, analogously,

    T_A / T_D = 2 · (2N^3 + N^2 + N) / (M - 3),

and in general, for m = 1, 2, 3, ...

    T_A / T_D = 2 · (2N^m + N^(m-1) + ... + N) / (M - 3).
Consequently, from (1) we have the following inequality:

    2 · (2N^m + N^(m-1) + ... + N) / (M - 3) > 1,

or, neglecting the minor components,

    4N^m / (M - 3) > 1,

or

    m > log_N((M - 3) / 4).    (3)
Thus, from (3) it is evident that the JFFS3 superblock management scheme scales
logarithmically.
Table 2 shows the value of m for different types of existing NAND flashes (see [10],
[11], [12], and [13]).
Type Size Sect. size M N m
Toshiba TC58DVM92A1FT 64MB 16KB 4096 32 2
Toshiba TH58NVG1S3AFT05 512MB 128KB 4096 64 2
ST Micro NAND08G-B 1GB 128KB 8192 64 2
Samsung K9K1G08X0B 2GB 128KB 16384 64 2
Table 2: The length of the JFFS3 superblock management chain for different types of
existing NAND flashes.
Note, provided that N = 64, m = 3 is enough to guarantee acceptable anchor area
wear-levelling for flashes of up to 128GB, and m = 4 for up to 8TB (see inequality (3)).
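
Inequality (3) is easy to check numerically: after the simplification above, the required m
is the smallest integer with 4 · N^m > M - 3. The sketch below reproduces the values
discussed here (the 128GB figure assumes 128KB eraseblocks).

    #include <stdio.h>

    /* Smallest m such that 4 * N^m > M - 3 (inequality (3)). */
    static int min_m(long long M, long long N)
    {
        long long p = 1;
        int m = 0;

        while (4 * p <= M - 3) {
            p *= N;
            m++;
        }
        return m;
    }

    int main(void)
    {
        printf("64MB,  N=32: m = %d\n", min_m(4096, 32));       /* 2 */
        printf("2GB,   N=64: m = %d\n", min_m(16384, 64));      /* 2 */
        printf("128GB, N=64: m = %d\n", min_m(1048576, 64));    /* 3 */
        return 0;
    }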
6.3 The superblock search
To find the superblock during mount, JFFS3 finds the valid reference in the anchor
eraseblocks, then finds the valid reference in chain eraseblocks 1, 2, ..., m - 1, and
finally finds the valid superblock in the super eraseblock. Since JFFS3 assigns versions
to records in anchor/chain/super eraseblocks and the versions are increased by one on
every update, the binary search algorithm may be used to quickly find the valid sector.
The valid reference in the anchor area may be found after log2(2N) + 2 steps (one
step involves one sector read operation), and the reference in each chain/super eraseblock
after log2(N) + 2 steps. Thus, to find the superblock, JFFS3 must read

    S = 2(m + 1) + log2(2N) + m · log2(N)

sectors.
Table 3 contains the approximate superblock search time for different existing NAND flashes (*).

Type                     Size   N   m   Sect. read   S    SB find
Toshiba TC58DVM92A1FT    64MB   32  2   50µs         22   1.1ms
ST Micro NAND08G-B       1GB    64  2   130µs        25   3.3ms
Samsung K9K1G08X0B       2GB    64  2   70µs         25   1.6ms

Table 3: The superblock search time for different existing NAND flashes.

(*) The calculated superblock search time does not include the ECC/CRC checking overhead or any other CPU overhead.
For larger flash chips, which would utilize the superblock management scheme with m = 3 and N = 64 (no such flashes exist at the moment), the superblock search would take S = log_2(128) + 2 + 3·(log_2(64) + 2) = 33 sector reads, i.e., about 4.3ms, providing the flash characteristics are the same as ST Micro's (see table 3).
7 Issues/ideas/to be done
This section contains a temporary list of issues which should be solved, ideas which should be thought through and analyzed more deeply, and things which have been considered but are not yet described in this document.
The following is the list of things which should be thought about more.
1. Quota support. Will quota be supported? What will it look like: just generic Linux quota support or something better?
2. Transactions:
transaction_open() / do_many_fs_modifications() / transaction_close() semantics? Reiser4 claims to support this via the special sys_reiser4() syscall. Would be nice.
3. How can one select the compression mode on a per-inode basis? Xattrs with some reserved name?
4. Orphaned files.
5. Holes.
6. Direct I/O.
7. How to choose/add a key scheme?
8. Extents.
The following is the list of topics which should be highlighted in this document as
well.
1. Garbage collection.
2. Tree balancing.
3. Tree locking.
4. Caching, write-behind cache.
5. An assumed flash model and the model of interactions between JFFS3 and the flash
I/O subsystem.
6. How will eraseblocks be tracked? Space accounting, good/bad status, erase counts?
7. The wear-levelling algorithms.
8. The format of keys.
9. Branch nodes' links are sector numbers, while twig nodes' links are absolute flash offsets. So, the lengths of twig and branch links differ and branch nodes have a greater fanout.
10. Different optimizations may be achieved by means of changing the format of keys.
So, JFFS3 should be flexible in this respect and have a mechanism to change/select
the formats of keys.
11. The minimal amount of a file's data in a node is PAGE_SIZE. There is no way to create smaller nodes, as is possible in JFFS2.
12. Portability (e.g., moving the file system between machines with different RAM page sizes, etc.).
13. Error handling.
14. Bad block handling.
The following is the list of ideas which were thought about but are not yet in the
document.
1. If compression is disabled for an inode, then its nodes are (PAGE_SIZE + header size) in size, i.e., they do not fit into an integer number of flash sectors. For these nodes we may keep the header in the OOB area. In this case we should not mix compressed and uncompressed nodes in one eraseblock.
2. For large files which are mostly read-only, we may fit more than one page of data in one node. This will make compression more effective. When the file is read, all the uncompressed pages are propagated to the page cache, as in the zisofs file system.
3. If there is little data in the superblock, we may keep this data in the root node. In this case the root will have a smaller fanout than branch nodes.
The "to do" list.
1. Re-calculate the figures for the SB search time and m.
2. For now, only the idea of the key compression methods is provided. It would be nice to describe the algorithms more rigorously.
8 Definitions
1. Access Control Lists, ACL - a modern mechanism to control access to files which provides much more flexibility than the standard Unix mechanism of owner/group/others permissions, see [7] for more details.
2. Acl - an object in the tree containing an inode's ACL. Refer to section 4.1 for more information.
3. Acl key - the acl object's key.
4. Acl node - a leaf node containing an acl object.
5. Anchor eraseblock, anchor area - the second and the third good eraseblocks of the JFFS3 partition, which are reserved for the superblock management. See section 6.1 for more details.
6. Attr-data - an object in the tree where an inode's attributes are stored (standard Unix attributes like creation time, owner ID, etc., and other attributes like the type of compression, etc.). Refer to section 4.1 for more information.
7. Attr-data key - the attr-data object's key.
8. Attr-data node - a leaf node containing an attr-data object.
9. B-tree - a balanced search tree where each node has many children. See section 3.3.
10. B+-tree - a B-tree where no data is stored in non-leaf nodes; data is stored only in leaf nodes.
11. Branch node - any node that is not a leaf, twig or root node.
12. Branching factor - the branching factor of the B-tree is the number of children of a node.
13. Chain eraseblock - an eraseblock containing references to other chain eraseblocks or to the super eraseblock. Chain eraseblocks facilitate quick SB searching and are part of the JFFS3 superblock management scheme (see section 6.1). The main reason why chain eraseblocks are needed is the need to provide good flash wear-levelling.
14. Clean eraseblock - an eraseblock which contains no garbage, only valid information.
15. Data area - the whole JFFS3 partition excluding the static superblock and anchor eraseblocks.
16. Data key - the data object's key.
17. Data node - a leaf node with a file's data.
18. Directory entry, direntry - basically an association between a name and an inode number.
19. Direntry key - the direntry object's key.
20. Direntry node - a leaf node containing a direntry object.
21. Dirt, dirty space - information on flash which is not valid due to out-of-place updates or object deletion. It is the aim of the Garbage Collector to reclaim the space occupied by dirt.
22. Dirty eraseblock - an eraseblock which contains some dirt along with valid nodes.
23. Dirty sector - a sector which contains dirt.
24. Erasable block, eraseblock - the minimal erasable unit of the flash chip from JFFS3's viewpoint.
25. Extended attributes, xattr - an association between names and data for files and directories. See the attr(5) Linux manual page for more information.
26. Fanout - the same as branching factor.
27. Free eraseblock - an erased eraseblock (contains only 0xFF words).
28. Garbage - the same as dirt.
29. Garbage Collector - a part of any Flash File System which is responsible for recycling dirty space and producing free eraseblocks.
30. Indexing information, index - data structures which do not contain any file system data (files, directories, extended attributes, etc.) but instead keep track of this data. For example, the indexing information allows one to quickly find all the directory entries for any specified directory. In the case of the FAT file system, the File Allocation Table may be treated as the index; in the case of the ext2 file system, the inode table, the bitmap and the set of direct, indirect, doubly indirect and triply indirect pointers may be considered as the index. In JFFS3, the index is comprised of the indexing nodes. See section 3.4 for more information.
31. Indexing eraseblock - an eraseblock which contains indexing nodes.
32. Indexing node - a non-leaf node. Indexing nodes have a fixed size (one sector) and contain only keys and links.
33. In-place updates, in-place writes - a method of updating on-media data where the update is written to the physical position where the data resides (as opposed to out-of-place updates).
34. Journal - contains recent JFFS3 changes; all file system updates first go to the journal. The purpose of the journal is to accumulate a bunch of JFFS3 file system changes and to postpone updating the index. See section 3.6 for more information.
35. Journal commit - the process of re-building the indexing information for the data which is in the journal. After the journal has been committed, the journal eraseblocks become just leaf eraseblocks.
36. Journal eraseblock - an eraseblock containing journal data.
37. Journal tree - an in-memory tree referring to journal nodes which have not been committed so far. When JFFS3 reads, it first looks up the journal tree to find out whether the searched information is there. See section 3.6 for more details.
38. Key - an identifier of objects in the tree.
39. Key type - a unique identifier of the key type. There are 6 key types in JFFS3 (see section 4.3).
40. Leaf eraseblock - an eraseblock containing leaf nodes.
41. Leaf node - any node from the leaf level of the tree (level 0). Leaf nodes contain only data and do not refer to any further nodes. For more information see section 3.4.
42. NAND page - the basic input/output unit of NAND flash chips. ECC is calculated on a per-NAND-page basis. See any NAND flash manual for more details, e.g., [10].
43. Node - a building block of the tree (the tree consists of nodes) as well as the container for file system data. There are different types of nodes in JFFS3. For more information see section 3.4.
44. Obsolete nodes/data/sectors - the same as dirty nodes, data or sectors.
45. Out-of-place updates, out-of-place writes - a sort of data update where the update is not written to the same physical position, but instead is written to some other place and the previous contents are treated as garbage afterwards. The opposite of in-place updates.
46. RAM page - a unit of memory management in the Virtual Memory Management subsystem of most modern operating systems (including Linux). See [9] for more information.
47. Sector - the smallest writable unit of the flash chip from JFFS3's viewpoint. May be equivalent to the minimal physical input/output unit (as in the case of NAND flashes) or larger (as in the case of NOR flashes).
48. Shared acl - an acl object which is shared by many inodes for optimization purposes. Refer to section 4.1 for more information.
49. Static eraseblock - the first good erasable block of the JFFS3 partition, where the file system's static data is stored. JFFS3 may only read it; it is created/changed by external formatting tools.
50. Superblock - a data structure which describes the whole JFFS3 file system. Only dynamic data is stored in the superblock; all the static data is kept in the static superblock. There is a comprehensive superblock management scheme in JFFS3, see section 6.1.
51. Super eraseblock - an eraseblock where the superblock is kept. See section 6.1 for details.
52. Target eraseblock - the eraseblock which is currently being processed by the Garbage Collector, i.e., nodes are moved from the target eraseblock and it is erased afterwards.
53. Quota - a mechanism which allows assigning different limits on the file system (e.g., restricting users in the number of files they may create or in the amount of space they may consume, etc.). See [8] for more details about quota support in Linux.
54. Tree - the main entity the JFFS3 design revolves around. The JFFS3 tree is a wandering B+-tree where all the file system objects (files, directories, extended attributes, etc.) are stored.
55. Twig nodes - nodes which reside one level above the leaf nodes (level 1).
56. Valid nodes - nodes which contain actual information, i.e., non-obsolete nodes.
57. Wandering tree - a method of updating trees when there is no possibility to perform in-place updates. The JFFS3 tree is a wandering B+-tree. See section 3.2 for more information.
58. Xattr - a widely used contracted form of "extended attributes".
59. Xattr-data - an object in the tree containing the data of an extended attribute.
60. Xattr-data key - the xattr-data object's key.
61. Xattr-data node - a leaf node containing an xattr-data object.
62. Xattr ID - a unique identifier of an extended attribute. Refer to section 4.1 for more information.
63. Xentry - an object in the tree which stores the association between the name of an extended attribute and its xattr ID. Refer to section 4.1 for more information.
64. Xentry key - the xentry object's key.
65. Xentry node - a leaf node containing an xentry object.
9 Symbols
The following is the list of symbols which are used to denote different things throughout this document.
- D - the number of guaranteed erase cycles of flash eraseblocks (typically about 10^5 for NAND flashes).
- H() - the hash function JFFS3 uses to calculate the hash of names for keys.
- I - inode number.
- K, Kx - the tree's keys.
- k, kx - the keys' fields.
- L - the number of levels in the tree.
- m - the number of eraseblocks used in the superblock management scheme, not counting the anchor eraseblocks, i.e., the number of chain eraseblocks plus one (the super eraseblock).
- M - the total number of non-bad eraseblocks on the JFFS3 partition.
- n - the branching factor (fanout) of the tree.
- N - the number of sectors per eraseblock.
- S - the size of the JFFS3 flash partition (assuming there are no bad blocks).
- s - the size of a sector.
- w - the bit-width of links.
10 Abbreviations
1. ACL - Access Control List
2. ECC - Error Correction Code
3. CRC - Cyclic Redundancy Check
4. JFFS2 - Journalling Flash File System version 2
5. JFFS3 - Journalling Flash File System version 3
6. MTD - Memory Technology Devices
7. RAM - Random Access Memory
8. VFS - Virtual File System
11 Credits
The following are the people I am very grateful to for their help (in alphabetical order):
- David Woodhouse <[email protected]> - the author of JFFS2; answered a great deal of my questions about MTD and JFFS2 and suggested some interesting ideas for JFFS3.
- Joern Engel <[email protected]> - discussed some aspects of a new scalable flash file system with me. Joern is developing his own flash file system, LogFS.
- Nikita Danilov <[email protected]> - used to work at Namesys and implemented the ReiserFS and Reiser4 file systems. Nikita answered my questions about Reiser4 FS internals.
- Thomas Gleixner <[email protected]> - helped me with MTD-related things, especially concerning flash hardware and low-level flash software.
- Victor V. Vengerov <[email protected]> - my colleague from OKTET Labs who discussed some JFFS3 design approaches with me and suggested several interesting ideas.
12 References
1. JFFS : The Journalling Flash File System,
http://sources.redhat.com/jffs2/jffs2-html/
2. The Design and Implementation of a Log-Structured File System,
http://www.cs.berkeley.edu/~brewer/cs262/LFS.pdf
3. Who wants another filesystem?,
http://cgi.cse.unsw.edu.au/~neilb/conf/lca2003/paper.pdf
4. Samsung Flash memory products,
http://www.samsung.com/Products/Semiconductor/Flash/index.htm
5. Reiser4 File System, http://www.namesys.com/
6. B-Trees, http://www.bluerwhite.org/btree/
7. POSIX Access Control Lists on Linux,
http://www.suse.de/~agruen/acl/linux-acls/
8. Quota mini-HOWTO, http://www.tldp.org/HOWTO/Quota.html
9. Wikipedia: Virtual Memory, http://en.wikipedia.org/wiki/Virtual_memory
10. Toshiba TC58DVM92A1FT NAND flash chip,
http://www.toshiba.com/taec/components/Datasheet/TC58DVM92A1FT_030110.pdf
11. Toshiba TH58NVG1S3AFT05 NAND flash chip,
http://www.toshiba.com/taec/components/Datasheet/TH58NVG1S3AFT05_030519A.pdf
12. ST-micro NAND08G-B NAND flash chip,
http://www.st.com/stonline/products/literature/ds/11241.pdf
13. Samsung K9K1G08X0B NAND flash chip,
http://www.samsung.com/Products/Semiconductor/NANDFlash/SLC_SmallBlock/1Gbit/K9K1G08U0B/ds_k9k1g08x0b_rev02.pdf