mdadm

Commit Graph

Author	SHA1	Message	Date
NeilBrown	1011e8344a	Remove lots of unnecessary white space. Now that I am using white-space mode in Emacs I can see all of this, and I don't like it :-) Signed-off-by: NeilBrown <neilb@suse.de>	2013-06-19 12:31:45 +10:00
NeilBrown	8cde842b18	Assemble: when forcing a single-degraded RAID6 array, trigger a 'repair'. When an active/degraded RAID6 array is force-started we clear the 'active' flag, but it is still possible that some parity is not in sync. This is because there are two parity blocks. It would be nice to be able to tell the kernel "P is OK, Q maybe not". But that is not possible. So when we force-assemble such an array, trigger a 'repair' to fix up any errant Q blocks. This is not ideal as a restart during the repair will not be continued after the restart, but it is the best we can do without kernel help. Signed-off-by: NeilBrown <neilb@suse.de>	2013-06-19 11:09:33 +10:00
NeilBrown	f80057aec5	Assemble/Incr: Don't include spares with too-high event count. Some failure scenarios can leave a spare with a higher event count than an in-sync device. Assembling an array like this will confuse the kernel. So detect spares with event counts higher than the best non-spare event count and exclude them from the array. Reported-by: Alexander Lyakas <alex.bolshoy@gmail.com> Signed-off-by: NeilBrown <neilb@suse.de>	2013-06-17 16:55:31 +10:00
NeilBrown	a7dec3fd92	Make sure NOFILE resource limit is big enough. Some people want to create truly enormous arrays. As we sometimes need to hold one file descriptor for each device, this can hit the NOFILE limit. So raise the limit if it ever looks like it might be a problem. Signed-off-by: NeilBrown <neilb@suse.de>	2013-05-30 14:31:09 +10:00
NeilBrown	afa368f49a	Assemble: --update=metadata converts v0.90 to v1.0 This allows the smooth conversion of legacy 0.90 arrays to 1.0 metadata. Old metadata is likely to remain but will be ignored. It can be removed with mdadm --zero-superblock --metadata=0.90 /dev/whatever Signed-off-by: NeilBrown <neilb@suse.de>	2013-05-28 16:44:22 +10:00
NeilBrown	4dd2df0966	Discard devnum in favour of devnm We widely use a "devnum" which is 0 or +ve for md%d devices and -ve for md_d%d devices. But I want to be able to use md_%s device names. So get rid of devnum (a number) and use devnm (a 32char string). eg. md0 md_d2 md_home Signed-off-by: NeilBrown <neilb@suse.de>	2013-02-21 17:05:23 +11:00
NeilBrown	8cf2eb96b2	Assemble: fix spelling: report_missmatch -> report_mismatch Signed-off-by: NeilBrown <neilb@suse.de>	2012-12-05 11:40:28 +11:00
NeilBrown	1d04e27570	Assemble: Don't auto-assemble arrays which conflict with mdadm.conf When auto-assembling we might find an array which appears in mdadm.conf. This can happen if the array (based on UUID) doesn't match what is in mdadm.conf. For consistency we should avoid auto-assembling such an array just as we avoid regular-assembling of the array. Reported-by: Ross Boylan <ross@biostat.ucsf.edu> Signed-off-by: NeilBrown <neilb@suse.de>	2012-12-05 11:06:55 +11:00
NeilBrown	66eb2c93a6	Assemble: ensure that <ignore>d arrays are not auto-assembled. It isn't enough to simply not assemble arrays found to be called <ignore>, as the final stage of auto-assemble doesn't check for names in mdadm.conf. So add a check to Assemble, similar to the check in Incremental() Reported-by: Mike Frysinger <vapier@gentoo.org> Signed-off-by: NeilBrown <neilb@suse.de>	2012-11-22 17:04:20 +11:00
NeilBrown	b20c8a502d	Assemble: fix call to wait_for Recent patch closed 'mdfd' before calling wait_for, which means it doesn't work. Put the close back in the right place. Signed-off-by: NeilBrown <neilb@suse.de>	2012-11-20 12:08:56 +11:00
NeilBrown	5e9fd96f21	Assemble: Fix critical-section-recovery when assembling a growing array. commit `aacb2f816a` Assemble: add support for replacement devices. broke the restoring of the 'critical section' because it messed up the list of file descriptors passed to Grow_restart. Put it back the way it should be. Signed-off-by: NeilBrown <neilb@suse.de>	2012-11-20 12:08:36 +11:00
NeilBrown	cb8f6859d1	IMSM - allow assembling any imsm array even without OROM. It is important to check for compatibility with 'platform' or Option ROM when creating or changing an array. However there is no real need when simply assembling the array. On some systems there are situations where the platform information is not available. e.g. on some UEFI systems, UEFI is not available during 'kdump' handling. This makes it impossible to assemble an IMSM array to receive the dump. So remove the requirements that the platform be visible to assemble an IMSM array. Signed-off-by: NeilBrown <neilb@suse.de>	2012-11-20 12:07:30 +11:00
NeilBrown	aacb2f816a	Assemble: add support for replacement devices. Need to possibly collect 2 devices for each slot, an original and a replacement. Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-24 09:48:18 +11:00
NeilBrown	79f9f56da6	Assemble.c - re-indent file. Make sure spaces and indents are consistent. Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-22 17:25:19 +11:00
NeilBrown	6f4dbdc4e8	Assemble: remove support for assembling arrays with ancient kernel. Using "START_ARRAY" ioctl never really worked reliably, was removed a decade ago, and just clutters the code. So remove it. Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-22 17:23:25 +11:00
NeilBrown	ddc1b11fb5	Assemble: split out "start_array()" function. Apart from code movement, there is a small functional change here. If the array is not successfully started, it is stopped. Previously we would sometimes leave the array in a partially-assembled but inactive state. This just causes confusion. "--incremental" can be used to partially assemble arrays. Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-22 17:23:11 +11:00
NeilBrown	9f5470ce8d	Assemble: split out force_array() force_array() is called if --force was specified to update any metadata necessary to make the array assemble. Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-18 17:30:51 +11:00
NeilBrown	2c355c225e	Assemble: split out load_devices() functionality. Once we have found the devices we want, we need to load the metadata from them and store it. This new function extracts that functionality out of Assemble() Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-18 16:39:49 +11:00
NeilBrown	95425a89fc	Assemble: split out select_devices function. Assemble() is way too big. This patch starts cleaning it up by pulling the 'select_devices()' function. This examines the device to make sure they all belong to one array, or select those that do (depending on exact use case). Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-18 15:31:20 +11:00
NeilBrown	0431869cec	Fix up interactions between --assemble and --incremental If --incremental has partly assembled an array and --assemble is asked to assemble it, it just finds remaining devices and makes a new array. Not good. So: 1/ modify locking policy so that assemble can be sure that no --incremental is running once it locks the map file 2/ Assemble() checks the map file for a duplicate and adds to that array instead of creating a new one. Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-10 18:27:32 +11:00
NeilBrown	5e88ab2e2f	New RESHAPE_NO_BACKUP flag to track when backup action is needed. Some arrays (raid10) never need a backup file, so during assembly we can avoid the whole Grow_continue check in that case. Achieve this using a flag set by the metadata handler. Also get "mdadm -I" to fail if a backup process would be needed. It currently does fail as the kernel rejects things, but it is nicer to have this explicit. Signed-off-by: NeilBrown <neilb@suse.de>	2012-10-04 16:34:21 +10:00
NeilBrown	56dcaa6ba0	Assemble: don't leak memory with fdlist. We should free fdlist when finished with it. Signed-off-by: NeilBrown <neilb@suse.de>	2012-07-09 17:20:25 +10:00
NeilBrown	11b6d91dd0	Change Incremental and related functions to take struct context Signed-off-by: NeilBrown <neilb@suse.de>	2012-07-09 17:20:22 +10:00
NeilBrown	4977146a84	Convert Assemble() to take a context rather than a list of options. Signed-off-by: NeilBrown <neilb@suse.de>	2012-07-09 17:19:07 +10:00
NeilBrown	0ea8f5b167	Assemble: allow arrays to be assembled read-only. The option was there, but never used. Signed-off-by: NeilBrown <neilb@suse.de>	2012-07-09 17:14:16 +10:00
NeilBrown	503975b9d5	Remove scattered checks for malloc success. malloc should never fail, and if it does it is unlikely that anything else useful can be done. Best approach is to abort and let some super-daemon restart. So define xmalloc, xcalloc, xrealloc, xstrdup which don't fail but just print a message and exit. Then use those removing all the tests for failure. Also replace all "malloc;memset" sequences with 'xcalloc'. Signed-off-by: NeilBrown <neilb@suse.de>	2012-07-09 17:14:16 +10:00
NeilBrown	e7b84f9d50	Introduce pr_err for printing error messages. 'pr_err("' is a lot shorter than 'fprintf(stderr, Name ": ' cont_err() is also available. Signed-off-by: NeilBrown <neilb@suse.de>	2012-07-09 17:14:16 +10:00
Alexander Lyakas	135a31f5ed	Don't consider disks with a valid recovery offset as candidates for bumping up event count When we are looking for a candidate disk to bump up the event count, we consider only disks that have recovery_start==MaxSector. However, after we find one such disk, we agree to accept more disks having same event count, regardless of their recovery_start. Be consistent and don't accept disks with a valid recovery_start at all. Signed-off-by: NeilBrown <neilb@suse.de>	2012-05-15 14:20:42 +10:00
Adam Kwolek	4aecb54a21	FIX: Assembled second array is in read only state during reshape When arrays using external metadata are assembled, and one of the arrays in the container is under reshape, the second array will remain in read only state (not auto read only). It is caused by the fact that the array is frozen and mdmon doesn't have the opportunity to switch the array to r/w mode. Freezing the not reshaped array just after it is assembled allows mdmon to enable it for writing. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2012-04-17 12:33:38 +10:00
NeilBrown	e62b778573	Assemble: improve verbose logging when including old devices. Reporting: mdadm: added /dev/loop1 to /dev/md0 as 1 mdadm: added /dev/loop2 to /dev/md0 as 2 mdadm: added /dev/loop0 to /dev/md0 as 0 mdadm: /dev/md0 has been started with 2 drives (out of 3). is confusing - why only 2? Code now reports: mdadm: added /dev/loop1 to /dev/md0 as 1 mdadm: added /dev/loop2 to /dev/md0 as 2 (possibly out of date) mdadm: added /dev/loop0 to /dev/md0 as 0 mdadm: /dev/md0 has been started with 2 drives (out of 3). which is somewhat clearer. Signed-off-by: NeilBrown <neilb@suse.de>	2012-03-22 14:52:21 +11:00
NeilBrown	b720636a58	Assemble: support assembling of a RAID0 being reshaped. This is a bit of a hack and the code needs to be made more general. But this adds the special case of a RAID0 being reshaped which looks like a RAID4 but doesn't need as many devices. Signed-off-by: NeilBrown <neilb@suse.de>	2012-03-07 10:47:34 +11:00
NeilBrown	56d1885944	Assemble: don't use O_EXCL until we have checked device content. If we open with O_EXCL before checking that the device is one that we really want, then that could cause some other process to think the device is busy when it isn't really. This particularly affects running "mdadm -A devname" in parallel for different arrays. One might be looking at a device that it won't end up using while another tries and fails to look at a device that it needs. So delay the O_EXCL until after all identity checks. Multiple "mdadm -As" will still have races, but that is fundamentally racy anyway. Signed-off-by: NeilBrown <neilb@suse.de>	2012-03-07 10:41:24 +11:00
Adam Kwolek	111e9fdaa8	FIX: Array is not run when expansion disks are added When added disk is disk added by expansion and this is last disk added to array, assemble_container_content() will not even try to run such array. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2012-02-09 12:20:51 +11:00
NeilBrown	da8fe5aa9b	Assemble: fix --force assemble during reshape. If we have to --force assembly during reshape, we need to check both the 'before' and 'after' cases to make sure there are enough devices. Reported-by: Richard Herd <2001oddity@gmail.com> Signed-off-by: NeilBrown <neilb@suse.de>	2012-02-07 14:06:44 +11:00
NeilBrown	de5a472ea3	Remove avail_disks arg from 'enough'. It can easily be calculated from 'avail' and 'raid_disks', and we will soon have a case where we don't have it easily available to pass in. Signed-off-by: NeilBrown <neilb@suse.de>	2012-02-07 14:04:47 +11:00
NeilBrown	887162637f	Assemble: fix count in "assembled with .. but not started". We need to include the count of pre-existing devices here. Signed-off-by: NeilBrown <neilb@suse.de>	2011-12-23 10:49:07 +11:00
NeilBrown	576d028002	Assemble: make some plurals conditional. "1 devices" is ugly. Fix it. Signed-off-by: NeilBrown <neilb@suse.de>	2011-12-23 10:49:07 +11:00
NeilBrown	81a5b4f52f	Remove update_private This field doesn't work any more as ->getinfo_super clears the info structure at an awkward time. So get rid of it and do it differently. The issue is that the metadata handler cannot tell if the uuid it has was randomly generated or explicitly requested, except on the first call. And we don't want to accept explicit requests for IMSM. So when it was auto-generated, make it look distinctive by having the same int copied in all 4 positions. If someone requests a uuid like that, I guess they get away with it. Reported-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-12-20 10:30:34 +11:00
NeilBrown	a648241517	Resolve some more warnings unused variables when MDASSEMBLE is defined, and a typo in mdadm.8 Signed-off-by: NeilBrown <neilb@suse.de>	2011-12-13 13:24:52 +11:00
Lukasz Dorau	7728e1c635	fix: correct metadata's update communication The problem occurs when array under migration is assembled incrementally. st->update_tail is not initialized in function assemble_container_content() and during reshape the checkpoint information in metadata is not being updated. The value of st->update_tail is now initialized in function assemble_container_content() and during reshape the checkpoint information in metadata is being updated correctly on all disks. Signed-off-by: Lukasz Dorau <lukasz.dorau@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-11-21 16:17:56 +11:00
Jes Sorensen	518a60f385	Assemble(): don't dup_super() before we need it. Avoid resource leak in case we bail loop early Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-11-02 10:48:53 +11:00
Jes Sorensen	22472ee1d2	assemble_container_content(): fix memory leak Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-11-02 10:48:53 +11:00
Jes Sorensen	83366b3352	Fix memory leak Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-11-01 14:50:44 +11:00
Labun, Marcin	81219e70f2	kill-subarray: fix, IMSM cannot kill-subarray with unsupported metadata container_content retrieves volume information from disks in the container. For unsupported volumes the function was not returning mdinfo. When all volumes were unsupported the function was returning NULL pointer to block actions on the volumes. Therefore, such volumes were not activated in Incremental and Assembly. As side effect they also could not be deleted using kill-subarray since "kill" function requires to obtain a valid mdinfo from container_content. This patch fixes the kill-subarray problem by allowing to obtain mdinfo of all volumes types including unsupported and introducing new array.status flags. There are following changes: 1. Added MD_SB_BLOCK_VOLUME for blocking an array, other arrays in the container can be activated. 2. Added MD_SB_BLOCK_CONTAINER_RESHAPE block container wide reshapes (like changing disk numbers in arrays). 3. IMSM container_content handler is to load mdinfo for all volumes and set both blocking flags in array.state field in mdinfo of unsupported volumes. In case of some errors, all volumes can be affected. Only blocked array is not activated (also reshaped as result). The container wide reshapes are also blocked since by metadata definition they require modifications of both arrays. 4. Incremental_container and Assemble functions check array.state and do not activate volumes with blocking bits set. 5. assemble_container_content is changed to check container wide reshapes before activating reshapes of assembled containers. 6. Grow_reshape and Grow_continue_command checks blocking bits before starting reshapes or continueing (-G --continue) reshapes. 7. kill-subarray ignores array.state info and can remove requested array. Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-10-31 11:29:46 +11:00
Adam Kwolek	3bd58dc65f	Always run Grow_continue() for started array. So far there were 2 reshape continuation cases: 1. array is started /e.g. reshape was already invoked during initrd start-up stage using "--freeze-reshape" option/ 2. array is not started yet /"normal" assembling array under reshape case/ This patch narrows continuation cases in to single one. To do this array should be started /set readonly in to array_state/ before calling Grow_continue() function. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-10-07 09:46:07 +11:00
Adam Kwolek	a93ada3b7d	Monitor reshaped array Reshape can be run for monitored arrays only /external metadata case/. Before reshape can be executed, make sure that the just started array/container is monitored. If not, run mdmon for it. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-10-05 13:59:28 +11:00
Adam Kwolek	6e75048bc5	Add recovery blocked field to mdinfo When container is assembled while reshape is active on one of its member whole container can be required to be blocked from monitoring. For such purpose field recovery blocked is added to mdinfo structure. When metadata handler finds active reshape in container it should set recovery_blocked field to disable whole container monitoring during reshape. For arrays that doesn't use containers, recovery_blocked field has the same value as reshape_active field e.g. super0/1. In fact,recovery is blocked during reshape for such arrays. For ddf, metadata handler doesn't set reshape_active field, so recovery_blocked is not set also. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-10-05 13:30:50 +11:00
Adam Kwolek	b76b30e0f9	Do not continue reshape during initrd phase During initrd phase continuing reshape will cause file system context lost. This blocks ability to control reshape using checkpoints. To avoid this, during initrd phase assemble has to be executed with '--freeze-reshape' option. This causes that mdadm restores reshape critical section only. Reshape can be continued later after system full boot. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-10-03 09:15:22 +11:00
Adam Kwolek	3f54bd62dc	Move restore backup code to function Reshape backup should be able to be restored during reshape continuation also. To reuse already existing code it is moved to function. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-09-21 12:17:30 +10:00
Adam Kwolek	910e9fa7f9	FIX: Memory leak during Assembly For fdlist pointer allocated in assemble_container_content() function, free() is never called. This patch fixes this memory leak. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-09-21 11:55:15 +10:00


260 Commits