mdadm

Commit Graph

Author	SHA1	Message	Date
NeilBrown	352452c364	Handle incremental assembly of containers. mdadm -I /dev/part-of-container should add that to a container, creating if it needed, and then try to assemble any arrays in the container. Signed-off-by: NeilBrown <neilb@suse.de>	2008-09-18 16:01:57 +10:00
NeilBrown	f35f252592	Move calls to SET_ARRAY_INFO to common helper. When we assemble an array, there are three different approaches depending on whether metadata is internal or external, and on kernel version. Move all this to a common helper instead of duplicating in 3 places. Signed-off-by: NeilBrown <neilb@suse.de>	2008-09-18 16:01:55 +10:00
NeilBrown	7801ac2092	Factor out add-disk code The variety of approaches to 'add_disk' are factored out into a separate function, and Incremental mode benefits by being closer to supporting the assembly of containers. Also remove the adding-to-array-data-structure out of sysfs_add_disk and into add_disk. And add some tests for --incremental mode to make sure we don't break it. Signed-off-by: NeilBrown <neilb@suse.de>	2008-09-18 15:13:32 +10:00
NeilBrown	c69b251bc7	Teach --detail about containers and members there-of. Make --detail on a container more useful by suppressing irrelevant detail and adding useful detail like a list of member arrays. Ditto for members of a container: report the name of the container array. Signed-off-by: NeilBrown <neilb@suse.de>	2008-09-18 15:05:20 +10:00
Dan Williams	1770662bca	'mdadm --wait-clean' wait for array to be marked clean For use in distro shutdown scripts with a RAID root file system. Returns immediately if the array is 'readonly', or not an externally managed array. It is up to the distro's scripts to make sure no new writes hit the device after this returns 'true'. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:42 -07:00
Dan Williams	c94709e83f	Add ping_monitor() to mdadm --wait The action we are waiting for may not be complete until the monitor has had a chance to take action on the result. The following script can now remove the device on the first attempt, versus a few attempts with the original Wait(): #!/bin/bash #export MDADM_NO_MDMON=1 export IMSM_DEVNAME_AS_SERIAL=1 ./mdadm -Ss ./mdadm --zero-superblock /dev/loop[0-3] echo 2 > /proc/sys/dev/raid/speed_limit_max ./mdadm --create /dev/imsm /dev/loop[0-3] -n 4 -e imsm -a md ./mdadm --create /dev/md/r1 /dev/loop[0-3] -n 4 -l 5 --force -a mdp ./mdadm --fail /dev/md/r1 /dev/loop3 ./mdadm --wait /dev/md/r1 x=0 while ! ./mdadm --remove /dev/imsm /dev/loop3 > /dev/null 2>&1 do x=$((x+1)) done echo "removed after $x attempts" ./mdadm --add /dev/imsm /dev/loop3 Include 2 small cleanups: * remove the almost open coded fd2devnum() in Wait() by introducing a new utility routine stat2devnum() * teach connect_monitor() to parse the container device from a subarray string Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:42 -07:00
Dan Williams	8ed3e5e1bf	Honor safemode_delay at Create() and Incremental() time Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:42 -07:00
Dan Williams	a67dd8cc58	Allow metadata handlers to communicate desired safemode delay via mdinfo Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:42 -07:00
NeilBrown	e9dd159873	Allow an externally managed array to be marked readonly If the metadata_version is -mdXXX/whatever rather than /mdXXX/whatever then the array is readonly and should be left alone by mdmon. Signed-off-by: NeilBrown <neilb@suse.de>	2008-08-19 17:55:15 +10:00
NeilBrown	3c558363a1	Factor out test for subarray version string. We are about to change the syntax of the version string for 'subarray's. So factor out the test into a single function. Signed-off-by: NeilBrown <neilb@suse.de>	2008-08-19 17:55:15 +10:00
NeilBrown	01f157d74a	Extra option for set_array_state: you choose dirty or clean. When we first start an array, it might be good to start recovery straight away. That requires setting the array to 'dirty', but only the metadata handler can know if that is required or not. So have a third possible 'consistent' option to set_array_state. Either 'no' or 'yes' or 'you choose'. Return value indicates what was chosen. '1' (no) should be chosen unless there is a good reason. Signed-off-by: NeilBrown <neilb@suse.de>	2008-08-19 14:54:55 +10:00
Dan Williams	9296754385	mdmon: handle failures versus readauto arrays Transition readauto arrays to active before failing drives. Hmm... why do we keep reblocking / renotifying in the readonly case? Need to bottom out on this, but not right now. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-08-15 10:58:43 -07:00
Dan Williams	f1d267661d	mdmon: allow degraded arrays to be monitored manage_new is too strict in the face of failed devices. Teach it to monitor degraded arrays. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-08-15 10:58:43 -07:00
Dan Williams	755c99faf2	sysfs: deprecate sysfs_disk_to_sg The cmd_filter patch merged for 2.6.27 broke retrieving the serial number via an ioctl to /dev/sgN. In debugging this I found that other utilities like sdparm simply run the ioctl on /dev/sdX. So just convert to that for protection in numbers, but scream on the mailing list for the inconvenience grr... Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-07-24 17:26:24 -07:00
NeilBrown	8850ee3e1e	Factor common code into new "start_mdmon". Signed-off-by: Neil Brown <neilb@suse.de>	2008-07-18 16:37:11 +10:00
Dan Williams	5dcfcb715d	mdadm: add an environment variable to prevent auto-launching mdmon Useful for attaching gdb to mdmon before any action is taken on the array. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-07-14 14:59:32 -07:00
Neil Brown	77472ff8d0	Introduce devname2devnum and use it instead of opencoding.	2008-07-12 20:28:38 +10:00
Neil Brown	2c514b7120	Pass 'verbose' flag to validate_geometry That way it can be silent when we are just trying to figure out which metadata to use, and noisy when detecting a real problem.	2008-07-12 20:28:38 +10:00
Neil Brown	6416d5275d	Use O_DIRECT for all IO to devices. Using buffered IO risks non-atomic updates to parts of the device that we don't actually want to write to. This isn't in general safe. So switch to O_DIRECT for all that IO and make sure we have properly aligned buffers.	2008-07-12 20:28:33 +10:00
Neil Brown	edd8d13c02	Create arrays via metadata-update Support creating arrays inside an active ddf container by sending a metadata update over a pipe to mdmon.	2008-07-12 20:27:40 +10:00
Neil Brown	4d43913ce0	Remove mgr_pipe for communicating from manage to monitor. Data is being passed in shared memory, so the pipe is only being use as a wakeup. This can more easily be done with a thread-signal.	2008-07-12 20:27:40 +10:00
Neil Brown	2f64e61a50	Remove mon_pipe for communicating from monitor to manager The returned value was never used, and we don't really want this return path anyway as writing to a pipe could conceivably block, and the monitor must not block.	2008-07-12 20:27:40 +10:00
Neil Brown	f94d52f43e	Handle device removal from container This really should be done in mdadm, not mdmon. We ensure the device won't be suddenly commited as a hot-spare using O_EXCL, then check the 'holders' sysfs directory to make sure it is only in use once.	2008-07-12 20:27:40 +10:00
Neil Brown	78e449282e	Remove the multiple super_switchs for ddf. It is simpler if there is just one, and the methods make decisions as appropriate.	2008-07-12 20:27:39 +10:00
Neil Brown	d2ca644994	Remove getinfo_super_n and do some other cleaning up. Getting close to a sensible description of what some of the superswitch methods are supposed to do!	2008-07-12 20:27:39 +10:00
Neil Brown	f7e7067b47	Add subarray field to supertype. When loading the metadata for a subarray (super_by_fd), we set ->subarray to be the name read from md/metadata_version so that getinfo_super can return info about the correct array. With this we can differentiate between a container and an array within the container by looking at ->subarray[0].	2008-07-12 20:27:38 +10:00
Neil Brown	6adfd3affd	Add some comments to explain some of the bits of superswitch.	2008-07-12 20:27:38 +10:00
Neil Brown	0063ecba3d	Hide subordinate superswitch structures. Only one superswitch should be externally visible for each general type. Others which handle different flavours (e.g. container/data-array) should be internal only.	2008-07-12 20:27:38 +10:00
Neil Brown	b8ac196795	Remove 'major' from superswitch. It isn't generally meaningful.	2008-07-12 20:27:37 +10:00
Neil Brown	1522c538b1	Use text_version in map_file rather than major.minor.	2008-07-12 20:27:37 +10:00
Dan Williams	8b35327854	imsm: 'volume' is the proper name for imsm container members Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-06-13 17:42:09 -07:00
Dan Williams	f1665f7200	sysfs: helper routine to retrieve the scsi id imsm records this information in its metadata Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-06-13 17:27:30 -07:00
Dan Williams	90c8b70714	sysfs: provide a helper function for locating scsi_generic interfaces imsm records and validates this data in its metadata Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-06-13 17:27:30 -07:00
Neil Brown	6c3fb95c44	Support adding a spare to a degraded array. When signalled by the monitor, the manager will find spares and add them to the array and initiate a recovery.	2008-06-12 10:13:29 +10:00
Neil Brown	2e735d1982	Allow passing metadata update to the monitor. Code in manager can now just call queue_metadata_update with a (freeable) buf holding the update, and it will get passed to the monitor and written out.	2008-06-12 10:13:23 +10:00
Neil Brown	cba0191bad	Parse the 'instance' part of external:/mdXX/INST in metadata handler. This give more flexability.	2008-05-27 09:18:57 +10:00
Neil Brown	dd15dc4a4d	Discard st->container_member 'container_member' isn't really a well defined concept. Each metadata might enumerate members differently, so just let each format /mdX/YYYY as appropriate.	2008-05-27 09:18:56 +10:00
Neil Brown	159c3a1a77	Remove st->text_version in favour of info->text_version I want the metadata handler to have more control over the 'version', particularly for arrays which are members of containers. So discard st->text_version and instead use info->text_version which getinfo_super can initialise.	2008-05-27 09:18:55 +10:00
Neil Brown	ed9d66aade	Change mark_clean to set_array_state. DDF needs more fine grained understanding of the array state.	2008-05-27 09:18:54 +10:00
Neil Brown	a931db9ed7	auto-start mdmon on --create FIXME uses sill hardcoded path. Need --assemble too.	2008-05-27 09:18:42 +10:00
Neil Brown	e0d6609fe6	Exit when there are no more arrays to manage.	2008-05-27 09:18:41 +10:00
Neil Brown	5869a76c90	Remove supertype->devfd It is never used.	2008-05-27 09:18:40 +10:00
Neil Brown	1ed3f38758	Remove stopped arrays. When an array becomes inactive, clean up and forget it. This involves signalling the manager.	2008-05-27 09:18:39 +10:00
Neil Brown	7a7cc50430	Set status of devices in ddf. Might work a little bit....	2008-05-27 09:18:38 +10:00
Neil Brown	4e5528c6f7	Implement mark_clean for ddf and remove mark_dirty and mark_sync mark_dirty is just a special case of mark_clean - with sync_pos == 0. mark_sync is not required. We don't modify the metadata when sync finishes. Only when the array becomes non-writeable at which point we use mark_clean to record how far the resync progressed.	2008-05-27 09:18:38 +10:00
Neil Brown	2318b9f0dc	Remove 'fd' arg from sysfs_add_disk It it never used, and removing means there are several 'open's that can go.	2008-05-27 09:18:32 +10:00
Dan Williams	3e70c845e2	add infrastructure to receive higher order commands, like remove_device From: Dan Williams <dan.j.williams@intel.com> Each md_message encapsulates a single command. A command includes an 'action' member which describes what if any data comes after the action. Communication with the monitor involves updating the active_cmd pointer and then writing to mgr_pipe. Pass/fail status is returned via mon_pipe. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-05-15 16:48:54 +10:00
Dan Williams	8d45d1969b	handle disk failures From: Dan Williams <dan.j.williams@intel.com> Added curr_state as a parameter to set_disk. Handlers look at this to record components failures, and set global 'degraded' or 'failed' status. When reading the state as faulty: 1/ mark the disk failed in the metadata 2/ write '-blocked' to the rdev state to allow the kernel's failure mechanism to advance 3/ the kernel will take away the drive's role in remove_and_add_spares() 4/ once the disk no longer has a role writing 'remove' to the rdev state will get the disk out of array. There is a window after writing '-blocked' where the kernel will return -EBUSY to remove requests. We rely on the fact that the disk will continue to show faulty so we lazily wait until the kernel is ready to remove the disk. If the manager thread needs to get the disk out of the way it can ping the monitor and wait, just like the replace_array() case. [buglet fix: swap the parameters of attr_match in read_dev_state] Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-05-15 16:48:49 +10:00
Dan Williams	fd7cde1bf0	handle resync completion From: Dan Williams <dan.j.williams@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-05-15 16:48:42 +10:00
Neil Brown	549e9569c6	Merge mdmon	2008-05-15 16:48:37 +10:00

1 2 3 4

172 Commits