mdadm

Commit Graph

Author	SHA1	Message	Date
Song Liu	69a481166b	Assemble array with write journal Example output: ./mdadm --assemble /dev/md0 /dev/sd[c-f] /dev/sdb1 mdadm: /dev/md0 has been started with 4 drives and 1 journal. mdadm checks superblock for journal devices. If the journal device is missing or faulty, mdadm will show warning ./mdadm --assemble /dev/md0 /dev/sd[c-q] /dev/sdb1 mdadm: Not safe to assemble with missing or stale journal device, consider --force. User can insist to start the array (read only) with --force ./mdadm --assemble /dev/md0 /dev/sd[c-q] /dev/sdb1 --force mdadm: Journal is missing or stale, starting array read only. mdadm: /dev/md0 has been started with 15 drives. Signed-off-by: Song Liu <songliubraving@fb.com> Signed-off-by: Shaohua Li <shli@fb.com> Signed-off-by: NeilBrown <neilb@suse.com>	2015-10-19 13:06:15 +11:00
Song Liu	cc1799c3dd	Enable create array with write journal (--write-journal DEVICE). Specify the write journal device with --write-journal DEVICE ./mdadm --create -f /dev/md0 --assume-clean -c 32 --raid-devices=4 --level=5 /dev/sd[c-f] --write-journal /dev/sdb1 mdadm: Defaulting to version 1.2 metadata mdadm: array /dev/md0 started. Only one journal device is allowed. If multiple --write-journal are given, mdadm will use the first and ignore others ./mdadm --create -f /dev/md0 --assume-clean -c 32 --raid-devices=4 --level=5 /dev/sd[c-f] --write-journal /dev/sdb1 --write-journal /dev/sdx mdadm: Please specify only one journal device for the array. mdadm: Ignoring --write-journal /dev/sdx... mdadm: Defaulting to version 1.2 metadata mdadm: array /dev/md0 started. Signed-off-by: Song Liu <songliubraving@fb.com> Signed-off-by: Shaohua Li <shli@fb.com> Signed-off-by: NeilBrown <neilb@suse.com>	2015-10-19 13:06:12 +11:00
Song Liu	ed94976d84	Show device as journal in --detail --examine Example output: ./mdadm --detail /dev/md127 /dev/md127: Version : 1.2 Creation Time : Wed May 13 17:01:12 2015 Raid Level : raid5 Array Size : 11720662464 (11177.69 GiB 12001.96 GB) Used Dev Size : 3906887488 (3725.90 GiB 4000.65 GB) Raid Devices : 4 Total Devices : 5 Persistence : Superblock is persistent Intent Bitmap : Internal Update Time : Wed May 13 17:01:12 2015 State : clean Active Devices : 4 Working Devices : 5 Failed Devices : 0 Spare Devices : 1 Layout : left-symmetric Chunk Size : 32K Name : 0 UUID : 8fb9ee05:3831d52f:e5c23825:28cd6881 Events : 0 Number Major Minor RaidDevice State 0 8 32 0 active sync /dev/sdc 1 8 48 1 active sync /dev/sdd 2 8 64 2 active sync /dev/sde 3 8 80 3 active sync /dev/sdf 4 8 17 - journal /dev/sdb1 ./mdadm -E /dev/sdb2 /dev/sdb2: Magic : a92b4efc Version : 1.2 Feature Map : 0x201 Array UUID : 562b2334:35b9bcc1:add50892:1f30c4bd Name : 0 Creation Time : Thu Aug 27 12:55:26 2015 Raid Level : raid5 Raid Devices : 15 Avail Dev Size : 249796608 (119.11 GiB 127.90 GB) Array Size : 54696423936 (52162.57 GiB 56009.14 GB) Used Dev Size : 7813774848 (3725.90 GiB 4000.65 GB) Data Offset : 262144 sectors Super Offset : 8 sectors Unused Space : before=262056 sectors, after=0 sectors State : active Device UUID : 5015e522:d39ba566:5909cf3c:9c51f2ff Internal Bitmap : 8 sectors from superblock Update Time : Thu Aug 27 13:16:55 2015 Bad Block Log : 512 entries available at offset 72 sectors Checksum : 4e6fd76d - correct Events : 262 Layout : left-symmetric Chunk Size : 256K Device Role : Journal Array State : AAAAAAAAAAAAAAA ('A' == active, '.' == missing, 'R' == replacing) Signed-off-by: Song Liu <songliubraving@fb.com> Signed-off-by: Shaohua Li <shli@fb.com> Signed-off-by: NeilBrown <neilb@suse.com>	2015-10-19 13:06:07 +11:00
Song Liu	fa7574f6d4	add macros for MD_DISK_ROLE_(SPARE/FAULTY) Replace special disk roles (0xffff, 0xfffe) with macros: define MD_DISK_ROLE_SPARE 0xffff define MD_DISK_ROLE_FAULTY 0xfffe Will add macro for journal device in next patch: define MD_DISK_ROLE_JOURNAL 0xfffd Signed-off-by: Song Liu <songliubraving@fb.com> Signed-off-by: Shaohua Li <shli@fb.com> Signed-off-by: NeilBrown <neilb@suse.com>	2015-10-19 13:05:59 +11:00
NeilBrown	86a406c226	super1: Do not create bad block log for clustered devices. We currently have no synchronization techniques for the bad block log, so disable it for the cluster. Reported-by: Goldwyn Rodrigues <rgoldwyn@suse.com> Signed-off-by: NeilBrown <neilb@suse.com>	2015-09-28 12:27:37 +10:00
Goldwyn Rodrigues	6d9c7c2551	Increment version for clustered bitmaps Add BITMAP_MAJOR_CLUSTERED as 5, in order to prevent older kernels from assembling a clustered device. In order to maximize compatibility, the major version is set to BITMAP_MAJOR_CLUSTERED only if the bitmap is clustered. Also, add MD_FEATURE_CLUSTERED in order to return an error for older kernels which would assemble MD in case the bitmap is corrupted. Signed-off-by: Goldwyn Rodrigues <rgoldwyn@suse.com> Signed-off-by: NeilBrown <neilb@suse.com>	2015-09-28 11:47:04 +10:00
Guoqing Jiang	2cf42394f0	md-cluster: use %-64s to print cluster_name Left alignment is better for clusters with names shorter than 64 characters. Also align the cluster name output with the other fields. Signed-off-by: Guoqing Jiang <gqjiang@suse.com> Signed-off-by: NeilBrown <neilb@suse.com>	2015-07-29 17:26:12 +10:00
Guoqing Jiang	4a3d29edce	Reuse calc_bitmap_size to reduce code size We can use the newly added calc_bitmap_size function to remove some redundant lines. Signed-off-by: Guoqing Jiang <gqjiang@suse.com> Signed-off-by: NeilBrown <neilb@suse.de>	2015-06-17 09:44:38 +10:00
Guoqing Jiang	7e6e839a26	mdadm: change the num of cluster node This extends the nodes option to assemble mode, so that the number of cluster nodes can be changed by the user. Before that, it is necessary to ensure there is enough space for those nodes; calc_bitmap_size is introduced to calculate the bitmap size of each node. Signed-off-by: Guoqing Jiang <gqjiang@suse.com> Signed-off-by: NeilBrown <neilb@suse.de>	2015-06-17 09:43:31 +10:00
Guoqing Jiang	0aa2f15b20	mdadm: add the ability to change cluster name To support changing the cluster name, this commit does the following: 1. extend the original write_bitmap function for the new scenario. 2. add the scenario to handle modification of the cluster's name in write_bitmap1. 3. let the cluster name also show in examine_super1 and detail_super1 Signed-off-by: Guoqing Jiang <gqjiang@suse.com> Signed-off-by: NeilBrown <neilb@suse.de>	2015-06-17 09:33:39 +10:00
Guoqing Jiang	06bd679317	Skip clustered devices in incremental We want the clustered devices to be started exclusively by a cluster resource-agent. So, avoid starting using the incremental option. This also skips a clustered md from starting during boot in inactive mode. Signed-off-by: Goldwyn Rodrigues <rgoldwyn@suse.com> Signed-off-by: Guoqing Jiang <gqjiang@suse.com> Signed-off-by: NeilBrown <neilb@suse.de>	2015-06-17 09:33:18 +10:00
Guoqing Jiang	7716570e6d	Set home-cluster while creating an array The home-cluster is stored in the bitmap super block of the array. The device can be assembled on a cluster with the cluster name same as the one recorded in the bitmap. If home-cluster is not specified, this is auto-detected using dlopen corosync cmap library. neilb: allow code to compile when corosync-devel is not installed. Signed-off-by: Goldwyn Rodrigues <rgoldwyn@suse.com> Signed-off-by: Guoqing Jiang <gqjiang@suse.com> Signed-off-by: NeilBrown <neilb@suse.de>	2015-06-17 09:06:30 +10:00
Guoqing Jiang	529e2aa573	Add nodes option while creating md Specifies the maximum number of nodes in the cluster that may use this device simultaneously. This is equivalent to the number of bitmaps created in the internal superblock (patches to follow). Signed-off-by: Goldwyn Rodrigues <rgoldwyn@suse.com> Signed-off-by: Guoqing Jiang <gqjiang@suse.com> Signed-off-by: NeilBrown <neilb@suse.de>	2015-06-17 09:04:16 +10:00
Guoqing Jiang	95a05b37e8	Create n bitmaps for clustered mode For a clustered MD, create bitmaps equal to number of nodes so each node has an independent bitmap. Only the first bitmap has the bits set so that the first node that assembles the device also performs the sync. The bitmaps are aligned to 4k boundaries. On-disk format: 0 4k 8k 12k ------------------------------------------------------------------- \| idle \| md super \| bm super [0] + bits \| \| bm bits[0, contd] \| bm super[1] + bits \| bm bits[1, contd] \| \| bm super[2] + bits \| bm bits [2, contd] \| bm super[3] + bits \| \| bm bits [3, contd] \| \| \| Signed-off-by: Goldwyn Rodrigues <rgoldwyn@suse.com> Signed-off-by: Guoqing Jiang <gqjiang@suse.com> Signed-off-by: NeilBrown <neilb@suse.de>	2015-06-17 07:54:03 +10:00
NeilBrown	7a862a020f	Don't break long strings onto multiple lines. It is best to keep strings all together so that they are easier to search for in the source code. If a string is so long that it looks ugly on one line, then maybe it should be broken into multiple lines for display too. Only strings which contain a newline can be broken into multiple lines: "It is OK to\n" "break this string\n" Signed-off-by: NeilBrown <neilb@suse.de>	2015-02-12 13:46:53 +11:00
NeilBrown	1ade5cc15a	Consistently print program Name and __func__ in debug messages. Make dprintf() print program name and __func__, so that this messaging is consistent. Also remove all __func__ messages from pr_err(). We shouldn't leak that internal data in error messages. If we really want the function name there, a new pr_XXX might be wanted. Signed-off-by: NeilBrown <neilb@suse.de>	2015-02-12 13:21:17 +11:00
NeilBrown	21dc47172d	super1: remove some debugging printfs in update_super1 These should never have been there. Signed-off-by: NeilBrown <neilb@suse.de>	2014-11-03 12:56:37 +11:00
NeilBrown	6ac17e734b	super1: make sure 'room' includes 'bbl_size' when creating array. Because we then go ahead and subtract bbl_size from room. Signed-off-by: NeilBrown <neilb@suse.de>	2014-08-21 10:57:55 +10:00
NeilBrown	268cccac2e	super1: don't allow adding a bitmap if there is no space. If the data is too close to the superblock there may be no space for a bitmap. If that happens, fail the adding of the bitmap rather than corrupt data. Reported-by: Lars Wijtemans <rhelbugzilla@lars.wijtemans.nl> Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=922944	2014-08-15 15:45:54 +10:00
NeilBrown	e2efe9e7bc	config: new option to suppress adding bad block lists. CREATE bbl=no in mdadm.conf will cause any devices added to an array to not have a bad block list. By default they do for 1.x metadata. This is useful if you are suspicious of the bad-block-list implementation. Reported-by: Ethan Wilson <ethan.wilson@shiftmail.org> Signed-off-by: NeilBrown <neilb@suse.de>	2014-08-07 12:23:45 +10:00
NeilBrown	f4dc5e9b7f	super: make sure to ignore disk state flags that we don't understand. This make it easier to add new flags that some super-types don't understand. Signed-off-by: NeilBrown <neilb@suse.de>	2014-08-07 11:34:50 +10:00
Cristian Rodríguez	04f903b21a	mdadm: Do not reimplement offsetof Proper implementations have offsetof in stddef.h Signed-off-by: NeilBrown <neilb@suse.de>	2014-05-22 14:29:14 +10:00
NeilBrown	4c0ea7b0d9	super1: fix setting of data_offset for 1.0 metadata. commit `23bf42cc79` super1: simplify setting of array size. removed the setting for sb->data_offset for 1.0 metadata for some reason, and messed up the size calculation for 1.0 metadata too. Signed-off-by: NeilBrown <neilb@suse.de>	2013-08-14 17:16:35 +10:00
NeilBrown	23bf42cc79	super1: simplify setting of array size. Currently the extra space to leave before the data in the array is calculated in two separate places, and they can be inconsistent. Instead, do it all in validate_geometry. This records the 'data_offset' chosen which all other devices then use. 'write_init_super' now just uses the value rather than doing all the calculations again. This results in more consistent numbers. Also, load_super sets st->data_offset so that it is used by "--add", so the new device has a data offset matching a pre-existing device. Signed-off-by: NeilBrown <neilb@suse.de>	2013-07-30 17:05:47 +10:00
NeilBrown	641da74591	super1: separate two versions of _avail_space1(). _avail_space1() is called from both avail_space1() and validate_geometry1() and does slightly different things. The partial code sharing doesn't really help. In particular the responsibility for setting the size of the array is currently confused. So duplicate the code into the two locations - one where 'super' is always NULL (validate_geometry1) and one where it is never NULL (avail_space1), and simplify. No behaviour change - just code re-organisation. Signed-off-by: NeilBrown <neilb@suse.de>	2013-07-30 15:59:03 +10:00
NeilBrown	7ccc4cc4fc	Manage: remove call to validate_geometry. This call to validate_geometry is really rather gratuitous. It is purely about the fact that super0 cannot use more than 4TB. So just make it an explicit test - less confusing that way. With this, validate_geometry is only called from Create, which makes it easier to reason about. Also validate_geometry is now never passed NULL for the 'chunk' parameter, so we can remove those annoying tests for NULL. Signed-off-by: NeilBrown <neilb@suse.de>	2013-07-30 13:45:22 +10:00
NeilBrown	2bf62891c1	super0/1: fix typo in error messages. Signed-off-by: NeilBrown <neilb@suse.de>	2013-07-24 12:22:58 +10:00
NeilBrown	419e018284	super1: update data_size when performing "revert-reshape". The "data_size" is with respect to "data_offset". When the kernel changes "data_offset" it modifies "data_size" to match - see md_finish_reshape() in the kernel. So when mdadm switches the data_offset for the new data_offset, it must update data_size correspondingly. Signed-off-by: NeilBrown <neilb@suse.de>	2013-07-24 10:21:27 +10:00
NeilBrown	efb3994e48	revert-reshape: only impose reshape_position tests on raid[456] This test is irrelevant for RAID10, so restrict it to those levels in which it is meaningful. Signed-off-by: NeilBrown <neilb@suse.de>	2013-07-02 16:10:27 +10:00
NeilBrown	a2836f12c4	revert-reshape: make sure reshape_position is acceptable. We can only revert a reshape if the reshape_position aligns properly for the old geometry. If it doesn't we just fail for now. Also fix a +/- error with updating raid_disks for super1.c Signed-off-by: NeilBrown <neilb@suse.de>	2013-06-27 16:38:53 +10:00
NeilBrown	0ddc35beed	super1: fix space_{before,after} for RAID0 For RAID0 we need to use 'data_size', not 'size', as the latter is 0. Signed-off-by: NeilBrown <neilb@suse.de>	2013-06-24 16:24:08 +10:00
NeilBrown	ccec2685ab	Add test for --update=metadata and fix bug it found. We were not setting device size correctly for raid0. Signed-off-by: NeilBrown <neilb@suse.de>	2013-06-19 16:28:05 +10:00
NeilBrown	1011e8344a	Remove lots of unnecessary white space. Now that I am using white-space mode in Emacs I can see all of this, and I don't like it :-) Signed-off-by: NeilBrown <neilb@suse.de>	2013-06-19 12:31:45 +10:00
NeilBrown	26bf55874d	super1: set RESHAPE_NO_BACKUP based on new_offset. We need to check for a backup iff the data_offset has changed. Testing against level==10 was an effective but short-sighted approach. Signed-off-by: NeilBrown <neilb@suse.de>	2013-05-28 16:58:18 +10:00
NeilBrown	199f1a1fad	Assemble: allow --update=revert-reshape This will cause a reshape to start going backwards.	2013-05-28 16:44:23 +10:00
NeilBrown	afa368f49a	Assemble: --update=metadata converts v0.90 to v1.0 This allows the smooth conversion of legacy 0.90 arrays to 1.0 metadata. Old metadata is likely to remain but will be ignored. It can be removed with mdadm --zero-superblock --metadata=0.90 /dev/whatever Signed-off-by: NeilBrown <neilb@suse.de>	2013-05-28 16:44:22 +10:00
NeilBrown	d6e4b44fdb	super1: fix some casts of signed superblock fields. These need to be cast to uint32_t before being cast to 'long', else sign extension doesn't happen on 64bit hosts. And bitmap_offset is le32, not le64 !! Signed-off-by: NeilBrown <neilb@suse.de>	2013-05-28 16:43:03 +10:00
NeilBrown	5e1863d49d	Examine/super1 - report Unused space, before and after. Might be confusing, or might be useful when reshaping. Signed-off-by: NeilBrown <neilb@suse.de>	2013-05-22 16:37:19 +10:00
NeilBrown	f79bbf4f69	super1: don't put the bblog at the end of the free space. It seems like a nice location, but it means that we cannot decrease the data_offset during a reshape. So put it just after the bitmap, leaving 32K. Signed-off-by: NeilBrown <neilb@suse.de>	2013-05-22 16:00:21 +10:00
NeilBrown	c4b26c643d	Grow: allow metadata to indicate that changing data_offset not supported. If space_after and space_before are zero (the default) then assume that metadata doesn't support changing data_offset. Signed-off-by: NeilBrown <neilb@suse.de>	2013-05-22 12:26:19 +10:00
NeilBrown	cc3130a786	super1: improve calculation of space_before/space_after 1/ these must allow for bad-block-list 2/ they must match the kernel, which has a 32k buffer after the superblock. Signed-off-by: NeilBrown <neilb@suse.de>	2013-05-21 15:38:49 +10:00
NeilBrown	8772113ab2	Examine/super1: don't report "New Offset" when feature not set. The "new_offset" field may be non-zero, but if the feature flag is not set, it should be ignored. Signed-off-by: NeilBrown <neilb@suse.de>	2013-05-21 15:37:20 +10:00
NeilBrown	74db60b00a	Add --dump / --restore functionality. This allows the metadata on a device to be saved and later restored. This can be useful before experimenting on an array that is misbehaving. Suggested-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2013-05-16 15:07:16 +10:00
NeilBrown	0cf8322999	Always test return value of posix_memalign. FORTIFY_SOURCE likes this, and it is good practice. Signed-off-by: NeilBrown <neilb@suse.de>	2013-05-13 17:09:55 +10:00
NeilBrown	5a23a06ea4	mdassemble - fix new compile-time problems. Signed-off-by: NeilBrown <neilb@suse.de>	2013-05-13 17:05:16 +10:00
NeilBrown	4dd2df0966	Discard devnum in favour of devnm We widely use a "devnum" which is 0 or +ve for md%d devices and -ve for md_d%d devices. But I want to be able to use md_%s device names. So get rid of devnum (a number) and use devnm (a 32char string). eg. md0 md_d2 md_home Signed-off-by: NeilBrown <neilb@suse.de>	2013-02-21 17:05:23 +11:00
NeilBrown	def1133297	make --update=homehost work again Commit `1e2b276535` (Report error in --update string is not recognised) broke homehost updating functionality because it depended on each string comparison being done even after we already found a match. Make it work again by restructuring code. Reported-by: (and original version by) Justin Maggard <jmaggard10@gmail.com> Signed-off-by: NeilBrown <neilb@suse.de>	2013-02-10 15:40:42 +11:00
NeilBrown	9698df15d9	Avoid using BLKFLSBUF. Now that we use O_DIRECT for all device IO, BLKFLSBUF is not needed to ensure we get current data, and it can impose a cost if any flush-out is needed. So remove it. To be safe, add O_DIRECT to one place where it isn't currently used: when reading a bitmap. Signed-off-by: NeilBrown <neilb@suse.de>	2013-02-05 16:00:55 +11:00
NeilBrown	6d388a8816	MISC: Add --examine-badblocks option This will list the contents of the bad-blocks log, if one is present. Signed-off-by: NeilBrown <neilb@suse.de>	2012-12-05 12:56:31 +11:00
NeilBrown	72e7fb13f0	Incremental: support replacement devices. These need to be counted in the number of 'active' devices. Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-24 12:06:51 +11:00
NeilBrown	aacb2f816a	Assemble: add support for replacement devices. Need to possibly collect 2 devices for each slot, an original and a replacement. Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-24 09:48:18 +11:00
NeilBrown	24c7bc8432	Report replacement devices correctly with --detail and --examine --detail needs to be ready to report 2 devices in each slot, and --examine needs to report if the device is the original or the replacement. Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-23 17:16:16 +11:00
NeilBrown	5d5002289c	Replace a lot of leading spaces with tabs. Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-10 18:33:26 +11:00
NeilBrown	72ca9bcff3	Allow data-offset to be specified per-device for create mdadm --create /dev/md0 .... /dev/sda1:1024 /dev/sdb1:2048 ... The size is in K unless a suffix: K M G is given. The suffix 's' means sectors. Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-04 16:34:21 +10:00
NeilBrown	cb19a251a5	super1: reserve at least 2 chunks for reshape headroom. sometimes 0.1% isn't enough, though mostly only in testing. We need one chunk for a successful reshape, so reserve 2. Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-04 16:34:21 +10:00
NeilBrown	5e88ab2e2f	New RESHAPE_NO_BACKUP flag to track when backup action is needed. Some arrays (raid10) never need a backup file, so during assembly we can avoid the whole Grow_continue check in that case. Achieve this using a flag set by the metadata handler. Also get "mdadm -I" to fail if a backup process would be needed. It currently does fail as the kernel rejects things, but it is nicer to have this explicit. Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-04 16:34:21 +10:00
NeilBrown	80bf913592	Add space_before/space_after fields to mdinfo These will be needed to guide changes to data_offset during reshape. Only set them for super1 for now. Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-04 16:34:21 +10:00
NeilBrown	8fe1c44f82	super1: add new_offset field. The 'new_offset' is used for reshaping to avoid the need for a backup file. For now we only report the value when it is set. Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-04 16:34:21 +10:00
NeilBrown	83cd1e97cb	Add data_offset arg to ->init_super and use it in super1.c So if ->data_offset is already set, use that rather than computing one. Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-04 16:34:20 +10:00
NeilBrown	af4348ddd1	Add data_offset arg to ->validate_geometry. This is needed to return correct available size. It isn't really used yet. Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-04 16:34:20 +10:00
NeilBrown	387fcd593c	Add data_offset arg to ->avail_size This is currently only useful for 1.x metadata and will allow an explicit --data-offset request on command line. Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-04 16:34:20 +10:00
NeilBrown	7103b9b88d	Handles spaces in array names better. 1/ When printing the "name=" entry for --brief output, enclose name in quotes if it contains spaces etc. Quotes are already supported for reading mdadm.conf 2/ When a name is used as a device name, translate spaces and tabs to '_', as well as the current translation of '/' to '-'. Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-04 16:34:20 +10:00
NeilBrown	88af981fa5	super1: ensure bitmap doesn't overlap bad block log. If a bad block log already exists when adding a bitmap, make sure the bitmap stays before the log. Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-03 17:07:20 +10:00
NeilBrown	688e99a77d	Allow --update to add or remove space for a bad block list. --update=bbl will add a bad block list to each device. --update=no-bblk will remove the bad block list providing that it is empty. Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-03 17:07:13 +10:00
NeilBrown	bf95d0f38c	Bad block log	2012-10-03 17:07:11 +10:00
Maciej Naruszewicz	80730bae52	Add MD_ARRAY_SIZE for --examine --export An additional pair of key=value for --examine --export. Signed-off-by: Maciej Naruszewicz <maciej.naruszewicz@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-02 16:42:25 +10:00
NeilBrown	ba728be72f	Convert 'quiet' to 'not verbose' in various places. If we change some functions to accept 'verbose', where <0 means to be quiet, in place of 'quiet', then we will be able to merge 'quiet' and 'verbose' together for simplicity. Signed-off-by: NeilBrown <neilb@suse.de>	2012-07-09 17:18:09 +10:00
NeilBrown	503975b9d5	Remove scattered checks for malloc success. malloc should never fail, and if it does it is unlikely that anything else useful can be done. Best approach is to abort and let some super-daemon restart. So define xmalloc, xcalloc, xrealloc, xstrdup which don't fail but just print a message and exit. Then use those removing all the tests for failure. Also replace all "malloc;memset" sequences with 'xcalloc'. Signed-off-by: NeilBrown <neilb@suse.de>	2012-07-09 17:14:16 +10:00
NeilBrown	e7b84f9d50	Introduce pr_err for printing error messages. 'pr_err("' is a lot shorter than 'fprintf(stderr, Name ": ' cont_err() is also available. Signed-off-by: NeilBrown <neilb@suse.de>	2012-07-09 17:14:16 +10:00
majianpeng	4687f16027	mdadm: Fix Segmentation fault. In function write_init_super1(): If "rv = store_super1(st, di->fd)" return error and the di is the last. Then the di = NULL && rv > 0, so exec: if (rv) fprintf(stderr, Name ": Failed to write metadata to%s\n", di->devname); will be segmentation fault. Signed-off-by: majianpeng <majianpeng@gmail.com> Signed-off-by: NeilBrown <neilb@suse.de>	2012-05-29 09:21:51 +10:00
NeilBrown	d9751e06a6	super1: fix choice of data_offset. While it is nice to set a high data_offset to leave plenty of head room it is much more important to leave enough space to allow of the data of the array. So after we check that sb->size is still available, only reduce the 'reserved', don't increase it. This fixes a bug where --adding a spare fails because it does not have enough space in it. Reported-by: nowhere <nowhere@hakkenden.ath.cx> Signed-off-by: NeilBrown <neilb@suse.de>	2012-05-15 09:51:03 +10:00
Jes Sorensen	34a13953fa	Fix sign extension of bitmap_offset in super1.c `fbdef49811` incorrectly tried to fix sign extension of the bitmap offset. However mdinfo->bitmap_offset is a u32 and needs to be converted to a 32 bit signed integer before the sign extension. Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2012-04-30 09:56:22 +10:00
NeilBrown	508a7f16b2	super1: leave more space in front of data by default. The kernel is growing the ability to avoid the need for a backup file during reshape by being able to change the data offset. For this to be useful we need plenty of free space before the data so the data offset can be reduced. So for v1.1 and v1.2 metadata make the default data_offset much larger. Aim for 128Meg, but keep a power of 2 and don't use more than 0.1% of each device. Don't change v1.0 as that is used when the data_offset is required to be zero. Signed-off-by: NeilBrown <neilb@suse.de>	2012-04-04 14:04:28 +10:00
NeilBrown	fbdef49811	Bitmap_offset is a signed number As the bitmap can be before the superblock, bitmap_offset is signed. But some of the code didn't honour that :-( Signed-off-by: NeilBrown <neilb@suse.de>	2012-04-04 14:03:45 +10:00
NeilBrown	d4633e06df	Examine: fix array size calculation for RAID10. RAID10 arrays with an odd number of devices had the arraysize reported wrongly by --examine due to a rounding error. Reported-by: Chris Francy <zoredache@gmail.com> Signed-off-by: NeilBrown <neilb@suse.de>	2012-03-22 14:43:09 +11:00
Jes Sorensen	0a2f189415	super1.c: use ROUND_UP/ROUND_UP_PTR Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2012-03-21 08:06:35 +11:00
Jes Sorensen	308340aa58	Use struct align_fd to cache fd's block size for aligned reads/writes This uses a struct to cache the block size for aligned reads/writes, to avoid repeated ioctl(BLKSSZGET) calls. Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2012-03-21 08:01:20 +11:00
Jes Sorensen	3c0bcd4609	Use 4K buffer alignment for superblock allocations To better accommodate 4K sector drives, use 4K buffer alignment for superblock buffers. Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2012-03-21 08:01:04 +11:00
Jes Sorensen	2de0b8a2b4	match_metadata_desc1(): Use calloc instead of malloc+memset Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2012-03-21 08:00:50 +11:00
Jes Sorensen	1afa9308d2	init_super1() memset full buffer allocated for superblock Avoid possibly using stale data in bitmap and misc area of superblock. In addition, remove superfluous memsets already covered by memset of full superblock. Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2012-03-21 08:00:26 +11:00
Jes Sorensen	4122675629	Define and use SUPER1_SIZE for allocations Use a #define rather than calculate the size of the superblock buffer on every allocation. Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2012-03-21 08:00:07 +11:00
Jes Sorensen	b2bfdfa0fe	super1.c don't keep recalculating bitmap pointer We just calculated the pointer to the bitmap, so use it instead of recalculating. Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2012-03-21 07:42:48 +11:00
NeilBrown	911cead7f1	super1: support superblocks up to 4K. The current 1024 byte limit on 1.x superblocks limits us to 384 devices. Sometimes people want more. The kernel is already prepared for superblocks up to 4K, so enable that in mdadm allowing up to (4096-256)/2 == 1920 devices (active plus spare). Signed-off-by: NeilBrown <neilb@suse.de>	2012-03-08 15:40:52 +11:00
Jes Sorensen	4011421332	Print error message if failing to write super for 1.x metadata In addition remove attempt to print an error message if write_init_super() fails, as this is handled in the various write_init_super() functions. This avoids a segfault on error. Reported by Jim Meyering in https://bugzilla.redhat.com/show_bug.cgi?id=795461 Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2012-02-23 08:55:19 +11:00
Jes Sorensen	d669228f29	Use posix_memalign() for memory used to write bitmaps This makes super[01].c properly align buffers used for the bitmap using posix_memalign() to make sure the writes don't fail in case the bitmap is opened using O_DIRECT. This is based on https://bugzilla.redhat.com/show_bug.cgi?id=789898 and an initial patch by Alexander Murashkin. Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2012-02-16 14:16:03 +11:00
NeilBrown	6ef89052d8	super1: make aread/awrite always use an aligned buffer. A recent change to write_bitmap1 meant awrite would sometimes write from a non-aligned buffer, which of course breaks. So change awrite (and aread) to always use their own aligned buffer to ensure safety. Reported-by: Alexander Lyakas <alex.bolshoy@gmail.com> Signed-off-by: NeilBrown <neilb@suse.de>	2012-02-07 11:55:18 +11:00
Alexander Lyakas	d59770567c	getinfo_super1: Use MaxSector in place of sb->size when deciding whether the array is clean or dirty, compare sb->resync_offset against MaxSector and not against sb->size With RAID6 resyncing and subsequent drive failures, it is possible to reach the case in which sb->resync_offset==sb->size. This happens when resync is aborted due to drive failures, and immediately a rebuild of a spare starts. In this case, mdadm considered the array clean, while the kernel considered the array dirty. It is better for mdadm to also consider the array dirty in this case. Signed-off-by: NeilBrown <neilb@suse.de>	2012-02-07 10:15:20 +11:00
NeilBrown	c0c1acd691	Grow/bitmap: support adding bitmap via sysfs. Adding a bitmap via ioctl can only add it at a fixed location. That location is not suitable for 4K-block devices. So allow setting the bitmap location via sysfs if kernel supports it and aim to always use 4K alignments. Signed-off-by: NeilBrown <neilb@suse.de>	2011-12-23 14:10:41 +11:00
NeilBrown	b6db6fab11	super1: use awrite when writing a new bitmap. This ensures it will succeed on 4K block devices like DASD. Signed-off-by: NeilBrown <neilb@suse.de>	2011-12-23 14:09:56 +11:00
NeilBrown	adbb382b55	super1 - fix for bigendian machines. devflags is a single byte so endian conversions are not wanted. Signed-off-by: NeilBrown <neilb@suse.de>	2011-12-23 14:07:47 +11:00
NeilBrown	cb0997242c	super1: getinfo_super should set write-mostly flag. Otherwise it is not preserved when you re-add a device to an array. Signed-off-by: NeilBrown <neilb@suse.de>	2011-12-20 15:01:53 +11:00
Adam Kwolek	6e75048bc5	Add recovery blocked field to mdinfo. When a container is assembled while a reshape is active on one of its members, the whole container may need to be blocked from monitoring. For this purpose, the recovery_blocked field is added to the mdinfo structure. When a metadata handler finds an active reshape in a container, it should set the recovery_blocked field to disable monitoring of the whole container during the reshape. For arrays that don't use containers (e.g. super0/1), the recovery_blocked field has the same value as the reshape_active field; in fact, recovery is blocked during reshape for such arrays. For ddf, the metadata handler doesn't set the reshape_active field, so recovery_blocked is not set either. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-10-05 13:30:50 +11:00
Doug Ledford	16715c01f7	Fix readding of a readwrite drive into a writemostly array. If you create a two drive raid1 array with one device writemostly, then fail the readwrite drive, when you add a new device, it will get the writemostly bit copied out of the remaining device's superblock into its own. You can then remove the new drive and readd it as readwrite, which will work for the readd, but it leaves the stale WriteMostly1 bit in devflags, resulting in the device going back to writemostly on the next assembly. The fix is to make sure that A) when we readd a device and we might have filled the st->sb info from a running device instead of the device being readded, then clear/set the WriteMostly1 bit in the super1 struct in addition to setting the disk state (ditto for super0, but slightly different mechanism) and B) when adding a clean device to an array (when we most certainly did copy the superblock info from an existing device), then clear any writemostly bits. Signed-off-by: Doug Ledford <dledford@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-09-19 13:06:38 +10:00
NeilBrown	6218489119	super1: fix spacing for 'Flags' field in --examine. Signed-off-by: NeilBrown <neilb@suse.de>	2011-08-02 13:36:08 +10:00
Scott Schaefer	9a88e7b6d5	--add incorrectly sets writemostly Origin: vendor, http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=628667 Bug-Debian: http://bugs.debian.org/628667 Signed-off-by: NeilBrown <neilb@suse.de>	2011-08-02 13:27:32 +10:00
Luca Berra	3b7e9d0cbe	Fix some type-aliasing issues. Warnings for these are reported with -Wstrict-aliasing=2, and avoiding the cast is certainly an improvement. Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-17 14:38:14 +10:00
Luca Berra	e4c72d1dc6	Fix some compiler warnings. Original by Luca, with various changes by Neil Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-17 14:35:06 +10:00
NeilBrown	95eeceeb32	getinfo_super now clears the 'info' structure before filling it in. Some code currently clears 'info' before calling getinfo_super, some code doesn't. To be consistent, change it so no caller ever clears 'info', but every getinfo_super function must clear it. Note that ->raid_disk may be meaningful if that 'map' is passed non-NULL. In that case it is copied out before the structure is zeroed. Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-08 15:54:13 +10:00
NeilBrown	0f23aa88f8	config: restore the possibility of a NULL homehost. As homehost defaults to the system name it is not possible to specify a NULL homehost. This patch restores this ability with either --homehost="" or --homehost="<none>". This allows the creation of v1.x arrays without a "hostname:" prefix in the name. Signed-off-by: NeilBrown <neilb@suse.de>	2011-05-10 16:17:12 +10:00
NeilBrown	ca6529edf6	Merge branch 'master' into devel-3.2 Conflicts: Grow.c Manage.c managemon.c mdadm.8.in util.c	2011-03-10 17:37:04 +11:00
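
Note on d669228f29 and 6ef89052d8: both commits are about writing the bitmap from a suitably aligned buffer so that the write does not fail on O_DIRECT descriptors or 4K-block devices. A minimal sketch of that idea follows. The helper name aligned_write, its signature, and the zero-padding behaviour are assumptions made for illustration; this is not mdadm's actual awrite.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>
#include <unistd.h>

/*
 * Hypothetical helper (not mdadm's awrite): copy `len` bytes into a
 * freshly allocated buffer aligned to `align`, pad with zeros up to a
 * multiple of `align`, and write the padded buffer to `fd`.
 * `align` must be a power of two and a multiple of sizeof(void *).
 * Returns `len` on success, -1 on failure.
 */
static ssize_t aligned_write(int fd, const void *buf, size_t len, size_t align)
{
	void *bounce = NULL;
	size_t padded;
	ssize_t n;

	if (align == 0 || len == 0)
		return -1;
	/* Round the transfer size up to a whole number of aligned blocks. */
	padded = (len + align - 1) / align * align;

	/* posix_memalign() rejects invalid alignments with a non-zero return. */
	if (posix_memalign(&bounce, align, padded) != 0)
		return -1;
	assert((uintptr_t)bounce % align == 0);

	memset(bounce, 0, padded);
	memcpy(bounce, buf, len);
	n = write(fd, bounce, padded);
	free(bounce);

	return n == (ssize_t)padded ? (ssize_t)len : -1;
}
```

Copying through a private buffer costs an extra memcpy, but it means callers never have to care how their own buffer was allocated, which is the property 6ef89052d8 describes for aread/awrite.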

314 Commits
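
Note on d59770567c: the commit changes which value sb->resync_offset is compared against when deciding clean versus dirty. The sketch below contrasts the two comparisons for the case the message describes (resync_offset equal to size). The function names, the `>=` form of the old test, and the use of UINT64_MAX as a stand-in for MaxSector are assumptions for illustration, not code taken from super1.c.

```c
#include <assert.h>
#include <stdint.h>

/* Assumed stand-in for mdadm's MaxSector: "resync has fully completed". */
#define SKETCH_MAX_SECTOR UINT64_MAX

/* Old behaviour (assumed form): clean once the offset reaches the device size. */
static int is_clean_old(uint64_t resync_offset, uint64_t size)
{
	return resync_offset >= size;
}

/* New behaviour: clean only when the offset is the MaxSector sentinel. */
static int is_clean_new(uint64_t resync_offset)
{
	return resync_offset >= SKETCH_MAX_SECTOR;
}

/* The aborted-resync case from the commit message: offset == size. */
static void check_aborted_resync_case(void)
{
	uint64_t size = 1000;

	assert(is_clean_old(size, size));   /* old test reports clean */
	assert(!is_clean_new(size));        /* new test reports dirty */
}
```

With the sentinel comparison, an offset that merely happens to equal the device size is no longer mistaken for a finished resync, which matches the kernel's view described in the commit.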