mdadm

Commit Graph

Author	SHA1	Message	Date
NeilBrown	a201e6803f	Release mdadm-3.2 - developer only release Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-01 16:11:13 +11:00
NeilBrown	71204a5029	Various compile fixes. Make "make everything" succeed. This fixed some real bugs. Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-01 15:48:03 +11:00
NeilBrown	87eb4fabe3	Various man page fixes. Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-01 15:06:44 +11:00
NeilBrown	152b223157	tests: add IMSM_NO_PLATFORM to some places that were missing it. Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-01 14:44:02 +11:00
NeilBrown	f54a6742b2	managemon: don't try to add spares when resync/recovery is happening. kernel should reject this anyway, and we really should not be trying as it can only lead to confusion. Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-01 14:44:02 +11:00
NeilBrown	a5d10dcec8	Allow explicitly listed spared to be included by default. When the metadata doesn't identify which array a spare belongs to we normally require an explicit domain match to connect a spare with an array. However when the spare is explicitly listed in argv, it should be safe to include as long as there is no domain conflict. Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-01 14:44:02 +11:00
NeilBrown	e5508b361d	Allow domain_test to report that no domains were found. Sometime we will need to know the difference between no domains found and domains didn't match. So allow domain_test to return different values and fix up all callers to maintain current behaviour. Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-01 14:44:02 +11:00
NeilBrown	d11128690b	test: remind where the log file is. Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-01 14:44:02 +11:00
NeilBrown	3cdcfda4b0	test: remove all the environment handling. Instead, just include the environ explicitly in the test file or, where shared, source the shared file. Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-01 14:43:59 +11:00
NeilBrown	e5e5d7cea3	Incr: don't exclude 'active' devices from auto inclusion in a container. For containers, it is always appropriate to include a device in the container. Whether it should then be included in an array is a separate question. Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-01 13:07:36 +11:00
NeilBrown	ac597b1c21	free_super after assembling a container Else the devices are held open. Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-01 13:07:24 +11:00
NeilBrown	d438679977	Assemble: ignore unknown devices not listed on command line. If we find a device that has not superblock, we currently fail unless in auto_assem mode. However we really should only fail if the device was explicitly listed in the arg list. So add a test for that. Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-01 13:07:07 +11:00
Czarnowska, Anna	3c7b4a2595	Assemble: allow to assemble container with uuid=0:0:0:0 When there are any arrays in config file the spares with domain not matching any array are not assembled because auto assembly is not attempted. Addition of ARRAY line with uuid=0:0:0:0 in config will work with modified condition for gathering spares. Signed-off-by: Anna Czarnowska <anna.czarnowska@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-01 10:40:56 +11:00
Czarnowska, Anna	bfd76b9309	Monitor: do not move partitions to external container Arrays on partitions are not supported for external metadata so do not take such spare from native array. Signed-off-by: Anna Czarnowska <anna.czarnowska@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-01 10:40:56 +11:00
Adam Kwolek	1dfaa38015	imsm: FIX: map coping causes mdmon crash Too big map was copied (outside allocated memory) and this causes mdmon crash for 2 raid0 arrays in container. Map of correct (smaller) size should be copied, to not overwrite any internal data. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-01 10:40:56 +11:00
Adam Kwolek	401d313b7f	imsm: FIX: mdmon crash during 2 raid0 arrays expansion When expansion is run on 2 raid0 arrays in container no update is sent to mdmon because mdmon is off (mdadm performs update) Memory size for first reshaped array is allocated to satisfy memory requirements for expanded maps. Memory for second device is allocated using old disks number, as in metadata there is no information about this array reshape. When mdmon initiates second array reshape it overwrites internal structures and crashes). There is no place to keep expanded maps. To avoid this situation during loading metadata, allocated memory should be performed using the maximum used disks number in particular container. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-01 10:31:27 +11:00
Adam Kwolek	820eb8dba7	imsm: Update metadata for second array When second array reshape is about to start external metadata should be updated by mdmon in imsm_set_array_state(). For this purposes imsm_progress_container_reshape() is reused. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-01 10:31:06 +11:00
Adam Kwolek	d098291aec	imsm:FIX: change arrays reshape order Reshape is started from second array, so it causes imsm incompatibility and problems during second array start. Reshape should be started in arrays metadata order. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-01 10:17:06 +11:00
NeilBrown	e4b1107355	Grow: make sure to break out of the backup loop when finished. If there is nothing more to backup, then break out of the loop. Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-01 10:08:24 +11:00
NeilBrown	b8b286a639	Make sure odisks is consistent between creating and using the fdlist reshape_prepare_fdlist and child_monitor currently have slightly different ideas of the 'old number of raid devices' which can cause major confusion. So settle on one value, and assign it to odisks early and always use it. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-31 17:09:20 +11:00
NeilBrown	7f913e9b21	Grow: round max_progress to old chunk size too. kernel requires sync_max to be a multiple of the current chunk size. This is not really 'correct', but we need to work with it. So round down. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-31 17:04:37 +11:00
NeilBrown	0d711ba4d3	Allow test to detect 'resync=DELAYED' state There is no space around the '=' when resync is delayed, so allow for that in pattern matching. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-31 16:59:40 +11:00
NeilBrown	ca4fe0bfd3	Initialise all of file when opening backup file for reshape. Due to a miscalculation we didn't initialise the whole file. There is 4K (8 sectors) for the metadata, then the data. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-31 16:57:40 +11:00
NeilBrown	1e971e7163	Grow: when restarting, do set new details if they are already set. When restarting a reshape with internal metadata, the new geometry is already set and the reshape has been start (but has not been allowed to continue yet). So in that case, don't set things and don't ask for a reshape. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-31 15:32:19 +11:00
NeilBrown	6ef421be17	Grow:make sure 'array' is up-to-date before SET_ARRAY_INFO The value of 'array' might not be current, so SET_ARRAY_INFO and fail. Just refresh it before setting raid_disks. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-31 15:30:15 +11:00
NeilBrown	cdc6068148	Grow: don't try setting new geometry when restarting a native reshape. md won't let us change raid_disks in this case, so don't even try. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-31 15:01:15 +11:00
NeilBrown	c3f26510c6	open_mddev: open RDONLY if RDWR doesn't work. If an array is read-only then "mdadm -S" cannot open it to stop it without this fix. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-31 14:49:39 +11:00
NeilBrown	562e70e4c4	Call free_super before attempting to add a new device Now that write_init_super doesn't close fds any more, we need to call free_super before the ADD_NEW_DISK ioctl. Also call free_super before some error returns, for cleanliness. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-31 13:53:35 +11:00
Przemyslaw Czarnowski	210597d11f	Man pages update for policy framework Includes description of POLICY line in /etc/mdadm.conf and of changes in Monitor and Incremental related to autorebuild. Signed-off-by: Anna Czarnowska <anna.czarnowska@intel.com> Signed-off-by: Przemyslaw Czarnowski <przemyslaw.hawrylewicz.czarnowski@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-31 11:41:11 +11:00
Labun, Marcin	c7cb34136a	11spare-migration: pass conditions for tests 9 and 12 should be reversed Test 9: We do not block spare migration between different metadatas. test 13: Migrated spare must belong the same domain as destination - there is no additional condition for action. Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-31 11:36:25 +11:00
Labun, Marcin	61a31627c6	env-11spare-migration: imsm requires IMSM_NO_PLATFORM set with loop devices By default IMSM checks if member device belongs to AHCI or ISCI controller. When using loop devices one must disable these checks by setting IMSM_NO_PLATFORM. Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-31 11:35:25 +11:00
NeilBrown	6946681db0	Call free_super earlier when creating an array. As free_super now closes fds for member devices, rather than write_init_super doing it, we need to call free_super earlier, so that the device (on which we hold an O_EXCL open) is closed before it is added to the array. So close at the end of pass-1 rather than after pass-2. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-31 11:34:42 +11:00
NeilBrown	e809000535	super1: fix regression in write_init_super. Now that a 'supertype' container more information, the simplistic copying of 'st' into 'refst' is incorrect and results in closing some fds when load_super1(refst) calls free_super(). So do it more correctly using dup_super. Reported-by: "Labun, Marcin" <Marcin.Labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-31 11:33:18 +11:00
Adam Kwolek	cb82edca14	imsm: FIX: not all disks are released in free_imsm_disks() Adding spare disks to imsm container fails due to problem with writing new_dev to sysfs. This problem was caused by not closed handle (opened exclusively) in Manage.c:803. Disk handle was not closed by free_imsm(). This is due to not released disk_mgmt_list in free_imsm_disks(). Proper release of imsm metadata allows for spare adding without problems. Memory leak was fixed also. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-31 11:06:42 +11:00
Czarnowska, Anna	a1e49d6956	Monitor: avoid adding too many spares to container Tests revealed that sometimes there are still more spares taken than needed. The reason for this is that after adding one spare to container with degraded subarray if between ioctl in main loop and load_container in try_spare_migration mdmon activates the spare we see active<raid but find no spares in parent container and so add an extra spare. To prevent such behaviour we count active disks in the list returned by getinfo_super_disks and compare it with subarray->active. If the number has increased it means new spare was added and activated so there is no need for more. Signed-off-by: Anna Czarnowska <anna.czarnowska@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-28 11:18:57 +10:00
Krzysztof Wojcik	24aebf3add	FIX: Meet SET_ARRAY_INFO ioctl requirements Problem has been observed when raid10<->raid0 takeover operation is executed. In code updating layout, raid_disks and chunk_size for non-restriping operations in reshape array functions SET_ARRAY_INFO ioctl call was not succeeded. Takeover process finish execution with error, mdadm shows message: "mdadm: failed to set disks" Cause is not meeting SET_ARRAY_INFO ioctl requirements: - only one parameter may be changed at one time - level of current array info and new info should be the same Patch introduces solution for this issue. At the beginning of discussed code we read current information about array and then compare them with new values should be set. If particular value is different (and should be set), we are overwrite only this one in array info and then call ioctl. Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-28 11:03:12 +10:00
Krzysztof Wojcik	d5ca4a23fc	FIX: Remove disks in mdmon for external metadata For raid10 -> raid0 takeover operation we should reject disks in mirror by marking them as 'failed' and then remove them from array by writing "remove" to disk state. For external metadata second action is executed by mdmon. According the description in monitor.c:175 when monitor detect "faulty" in device state, it blocks the device, mark it as failed in metadata, unblocks the device and finally writes "remove" to device state. For external case writing "remove" to device state in mdadm is not necessary and harmful. It may cause following issues: 1. "remove" operation for external case in mdadm is not finish with successful result because monitor may block the device or disk has been already removed by monitor. 2. If disk is removed by mdadm earlier than mdmon catch "failed" state, metadata is not properly updated- is not marked as failed. Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-28 11:03:11 +10:00
Adam Kwolek	10d0d365eb	WORKAROUND: mdadm hangs during reshape (PART #2 ) After loop can occurs that due to 0 value reported by kernel we have 0 in completed variable. This is wrong. we are interested in real completed point. 0 value means that we reached sync point set in md, so we can set completed variable to just reached point. this point value is stored in max_progress variable. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-28 11:03:11 +10:00
Adam Kwolek	fab32c9702	FIX: start_reshape status should be checked mdadm should verify if reshape is started before it goes in to check-pointing machine. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-28 10:40:42 +10:00
Adam Kwolek	a9c3e78fdd	FIX: Array after takeover has to be frozen Problem occurs when we want to expand single disk raid0 array. This is done via degraded 2 disks raid4 array. When new spare is added to array, md immediately initiates recovery before mdadm can configure and start reshape. This is due fact that 2 disk raid4/5 array is special md case. Mdmon does nothing here because container is blocked. Put array in to frozen state allows mdadm to finish configuration before reshape is executed in md. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-28 10:26:15 +10:00
Adam Kwolek	d7d205bd25	imsm: FIX: do not allow for container operation for the same disks number imsm_reshape_super() currently allows for expansion when requested raid_disks number is the same as current. This is wrong. Existing in code condition is too weak. We should allow for expansion when new disks_number is greater than current only. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-28 10:25:26 +10:00
Dan Williams	aa4cab513d	fix extended partition detection # mdadm --detail --export /dev/md127p1 Before: MD_LEVEL=raid5 MD_DEVICES=4 MD_METADATA=0.90 After: MD_LEVEL=raid5 MD_DEVICES=4 MD_CONTAINER=/dev/md0 MD_MEMBER=0 MD_UUID=55746a20:925d24a7:4f9bd7e2:9c9a411f We parse the symlink target with a format: ../../block/mdXXX/mdXXXpYY ...and need the second '/' from the end of the string to read detect a 'md' device. Reported-by: Krzysztof Wasilewski <krzysztof.wasilewski@intel.com> Cc: Przemyslaw Czarnowski <przemyslaw.hawrylewicz.czarnowski@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-27 12:56:51 +10:00
Labun, Marcin	20b60dcd6c	Dynamic hot-plug udev rules for policies Neil, Please consider this patch that once was discussed and I think agreed with in general direction. It was sent a while ago but somehow did not merged into your devel3-2. This patch enables hot-plug of so called bare devices (as understand by domain policies rules in mdadm.conf). Without this patch we do NOT serve hot-plug of bare devices at all. Thanks, Marcin Labun Subject was: FW: Autorebuild, new dynamic udev rules for hot-plugs >>From c0aecd4dd96691e8bfa6f2dc187261ec8bb2c5a2 Mon Sep 17 00:00:00 2001 From: Przemyslaw Czarnowski <przemyslaw.hawrylewicz.czarnowski@intel.com> Date: Thu, 23 Dec 2010 16:35:01 +0100 Subject: [PATCH] Dynamic hot-plug udev rules for policies Cc: linux-raid@vger.kernel.org, Williams, Dan J <dan.j.williams@intel.com>, Ciechanowski, Ed <ed.ciechanowski@intel.com> When introducing policies, new hot-plug rules were added to support bare disks. Mdadm was started for each hot plugged block device to determine if it could be used as spare or as a replacement member for degraded array. This patch introduces limitation of range of devices that are handled by mdadm. It limits them to the ones specified in domains associated with the actions: spare-same-port, spare and spare-force. In order to enable hot-plug for bare disks one must update udev rules with command mdadm --activate-domains[=filename] Above command writes udev rule configuration to stdout. If 'filename' is given output is written to the file provided as parameter. It is up to system administrator what should be done later. To make such rule permanent (i.e. remain after reboot) rule should be writen to /lib/udev/rules.d directory. Other cases will just need to write it to /dev/.udev/rules.d directory where temporary rules lies. One should be aware of the meaning of names/priorities of the udev rules. After mdadm.conf is changed one is obliged to re-run "mdadm --activate-domains" command in order to bring the system configuration up to date. All hot-plugged disks containing metadata are still handled by existing rules. Signed-off-by: Przemyslaw Czarnowski <przemyslaw.hawrylewicz.czarnowski@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-27 12:48:04 +10:00
NeilBrown	d6bd632c41	Ignore/don't set data_disks for level=1 When analyse_change sets level=1, data_disks is meaningless as is layout. So don't set them, and make sure we ignore them. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-27 12:47:58 +10:00
Krzysztof Wojcik	c8b06d8239	Mistake in raid1->raid5 migration 1. Mistake in target level comparison. 2. Initialize reshape->after.data_disks field to proper spares_needed calculation Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-27 12:47:53 +10:00
Krzysztof Wojcik	dfe77a9ed2	Add raid1->raid0 takeover support Add support for raid1 to raid0 takeover operation in user space. This patch includes support for native and imsm metadata. Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-27 12:47:15 +10:00
Adam Kwolek	26d6e1574a	WORKAROUND: mdadm hangs during reshape During reshape when reshape is finished in md, progress_reshape() hangs on select(). This is because 'sync_completed' is reset to zero before 'sync_action' becomes 'idle', and we don't look for notification on 'sync_action'. So if completed becomes zero after reshape_progress has made some progress, then deduce that reshape has finished. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-27 07:56:21 +11:00
Adam Kwolek	16d4d84e5d	FIX: monitor doesn't handshake with md when in container are present raid0 and raid5 arrays, and reshape order is: 1. raid0 array 2. raid5 array mdadm cannot set new raid_disks for raid0 array. For this action md has to have handshake with mdmon. We have the following conditions: 1. Raid0 is not monitored 2. raid0 has been just takeovered to raid4/5 (it has to be monitored 3. monitor has to start monitor new raid4/5 array 4. monitor is not started (it is started to second raid5 array) In such situation pig_monitor is required to let know to m monitor about new array (not in the starting monitor case only) Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-27 07:31:25 +11:00
Labun Marcin	96234762a6	imsm: support for Intel SAS controller in get_disk_controller_domain handler get_disk_controller_domain recognizes Intel (R) SAS controller (isci). The function returns three different strings that differentiate disk attached to AHCI, ISCI or unknown controller types to create separate domains for each case. Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-26 11:22:34 +10:00
Labun Marcin	155cbb4c2c	imsm: detail_platform_imsm supports Intel SAS controller (isci driver) Added support in detail_platform_imsm for Intel (R) SAS controller. Function supports AHCI and ISCI controllers. RAID properties are derived from common OROM for both types. Based on code From: Artur Wojcik <artur.wojcik@intel.com> Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-26 11:22:07 +10:00

1 2 3 4 5 ...

1675 Commits All Branches Search

1675 Commits

All Branches