Comments on: vSphere SCSI reservation conflict https://blog.michaelfmcnamara.com/2009/09/vsphere-scsi-reserv-co/ technology, networking, virtualization and IP telephony Sat, 30 Oct 2021 17:24:51 +0000 hourly 1 https://wordpress.org/?v=6.7.2 By: Eric K. Miller https://blog.michaelfmcnamara.com/2009/09/vsphere-scsi-reserv-co/comment-page-1/#comment-3466 Wed, 19 Jan 2011 05:18:45 +0000 http://blog.michaelfmcnamara.com/?p=942#comment-3466 Hi Michael,

I saw the post! :) Thanks for answering, though.

As a follow-up to my previous messages, there is apparently a firmware update that was released last year for the HP MSA1500cs that supposedly fixes the “table full” issue that caused locked LUNs on that device. Only 7 years after the release of the hardware. :)

Also, our NEC SAN works fantastic. We had a couple initial firmware issues, but they have been solved and it’s been running flawlessly for 6+ months.

Eric

]]>
By: Michael McNamara https://blog.michaelfmcnamara.com/2009/09/vsphere-scsi-reserv-co/comment-page-1/#comment-3465 Wed, 19 Jan 2011 05:03:43 +0000 http://blog.michaelfmcnamara.com/?p=942#comment-3465 In reply to Valerian Crasto.

Hi Valerian,

I’m not sure that Eric will see your post but here are your answers.

1) The backslash before the pipe tells egrep to look for ether term “Reserved” or “Pending”. It’s an OR operator for egrep and grep.

2) The -B tells egrep/grep to display X amount of lines before the line matching the search criteria.

Cheers!

]]>
By: Valerian Crasto https://blog.michaelfmcnamara.com/2009/09/vsphere-scsi-reserv-co/comment-page-1/#comment-3464 Wed, 19 Jan 2011 04:18:44 +0000 http://blog.michaelfmcnamara.com/?p=942#comment-3464 Hi Eric,

i have small question.
1.) what is the difference between # esxcfg-info -s | egrep -B16 “s Reserved|Pending” and esxcfg-info -s | grep -i -B 30 Pending(insert a backslash and then a space here)Reservations………………………….1

2.) what does ‘B 30 or B16’ means?

Note:- we have VMware ESX 4.0.
thnx in advance.

Valerian Crasto.

]]>
By: Eric K. Miller https://blog.michaelfmcnamara.com/2009/09/vsphere-scsi-reserv-co/comment-page-1/#comment-1674 Sat, 23 Jan 2010 01:10:52 +0000 http://blog.michaelfmcnamara.com/?p=942#comment-1674 Hi Mike,

That would be great if you could provide feedback. I suspect its price is tempting, and their marketing is definitely fantastic, but I’m sure there are practical limits. Unfortunately, you can’t easily figure out what these limits are until you actually use it. Thankfully, NEC is going to provide a trial unit for us to push to its limits to see what we might need.

I’ll keep you posted on my progress as well. Feel free to email me at emiller at genesishosting.com if you want to discuss.

Thanks!

Eric

]]>
By: Michael McNamara https://blog.michaelfmcnamara.com/2009/09/vsphere-scsi-reserv-co/comment-page-1/#comment-1673 Sat, 23 Jan 2010 01:04:50 +0000 http://blog.michaelfmcnamara.com/?p=942#comment-1673 In reply to Eric K. Miller.

Hi Eric,

I honestly wasn’t very involved in the IBM XIV discussions. I know it wasn’t the product of choice but because of price it somehow made it in the door. I know quite a few guys are skeptical that it’ll be able to handle the IO load of 32+ ESX 4 hosts, I believe the ESX 4 hosts will even be booting from SAN.

We’re suppose to be getting one soon so I’ll keep you posted if you’d like.

Mike

]]>
By: Eric K. Miller https://blog.michaelfmcnamara.com/2009/09/vsphere-scsi-reserv-co/comment-page-1/#comment-1672 Sat, 23 Jan 2010 00:16:10 +0000 http://blog.michaelfmcnamara.com/?p=942#comment-1672 Hi Michael,

Off topic, but do you know much about the XIV? I know it’s a product they purchased from some other company. The main bad point I’ve heard is the possibility that you can lose the entire array (100’s of disks) if more than 2 disks fail in the whole unit. I may be off on the number of disks, but it was something like that, where the concept sounds good, but there is some major issue that can cause a whole unit to go down.

We’re looking at NEC currently for our next SAN solution (D3 and D8).

Eric

]]>
By: Michael McNamara https://blog.michaelfmcnamara.com/2009/09/vsphere-scsi-reserv-co/comment-page-1/#comment-1671 Sat, 23 Jan 2010 00:12:42 +0000 http://blog.michaelfmcnamara.com/?p=942#comment-1671 In reply to David Gibbons.

Hi Dave,

I’m happy the information helped you out! I have Openfiler running here at home with ESXi 3.5 host and really like it. At work all our clusters are running ESX 4 on an HP EVA SAN which is soon to be upgraded to an IBM XIV.

Cheers!

]]>
By: David Gibbons https://blog.michaelfmcnamara.com/2009/09/vsphere-scsi-reserv-co/comment-page-1/#comment-1670 Fri, 22 Jan 2010 19:21:32 +0000 http://blog.michaelfmcnamara.com/?p=942#comment-1670 Michael,

Your post saved me! Thanks for the tip about vmkfstools -L.

Looks like these reservation issues are becoming a bigger and bigger deal with ESXi4. Apparently the software iSCSI stack is so much more robust now it challenges some of the target stacks. My issue in particular was (is!) with openfiler as the target. Here’s the link to aid others in googling for answers:
https://forums.openfiler.com/viewtopic.php?pid=19063#p19063

Thanks again
Dave

]]>
By: Eric K. Miller https://blog.michaelfmcnamara.com/2009/09/vsphere-scsi-reserv-co/comment-page-1/#comment-1251 Tue, 08 Sep 2009 23:49:13 +0000 http://blog.michaelfmcnamara.com/?p=942#comment-1251 In reply to Eric K. Miller.

Sorry, that last command got changed by the blog.

It should be:

esxcfg-info -s | grep -i -B 30 Pending(insert a backslash and then a space here)Reservations………………………….1

]]>
By: Eric K. Miller https://blog.michaelfmcnamara.com/2009/09/vsphere-scsi-reserv-co/comment-page-1/#comment-1250 Tue, 08 Sep 2009 23:43:03 +0000 http://blog.michaelfmcnamara.com/?p=942#comment-1250 In reply to Michael McNamara.

Thanks for the reply! I also suspected that it could be related to those numerous factors. In our case, we have a new unit that had a LOT of load on it, and never any problems, but then we started to see this happen more and more, even after moving VMs off of the SAN (Storage VMotion’ing). It’s barely used now and I’ve still seen the problem.

Every bit of equipment we have is on the HCL, thankfully, but I have seen other HCL-listed hardware have problems, so it doesn’t mean much to me anymore, other than getting support and having a “better” chance of getting a resolution to problems.

I’m “still” on hold waiting for VMware. :)

I’ll write back if I find a specific issue that caused our problem.

In case this helps, this is the command I modified from a VMware forum to work with ESX 3.5 to find the locked LUNs on a host:

esxcfg-info -s | grep -i -B 30 PendingReservations………………………….1

Eric

]]>
By: Michael McNamara https://blog.michaelfmcnamara.com/2009/09/vsphere-scsi-reserv-co/comment-page-1/#comment-1249 Tue, 08 Sep 2009 23:38:02 +0000 http://blog.michaelfmcnamara.com/?p=942#comment-1249 In reply to Eric K. Miller.

Thanks for the comment Eric!

While I’m not a SAN expert by any means I understand the problem can arise from a number of different reasons. The research I did pointed to numerous possible contributors. The most frequently mentioned were SAN performance, LUN size, and SAN switch and HBA compatibility. I would venture to guess that software/driver compatibility would also be a concern.

Good Luck on resolving your problem!

]]>
By: Eric K. Miller https://blog.michaelfmcnamara.com/2009/09/vsphere-scsi-reserv-co/comment-page-1/#comment-1248 Tue, 08 Sep 2009 22:11:13 +0000 http://blog.michaelfmcnamara.com/?p=942#comment-1248 We ran into a similar situation (LUN locking) but in a totally different way, with ESX 3.5 Update 4 in a 2-node cluster and a relatively new Infortrend SAN (S16F-R1840).

Quite often we’re seeing random LUNs stop responding on one of the hosts. It requires a vmkfstools -L lunreset to re-gain access to the LUN from the host that can’t gain access to the LUN.

What’s strange is that we don’t have this problem on any of our other SANs, only the Infortrend LUNs.

I’ve notified Infortrend of the issue and have been on hold waiting for VMware to discuss the issue to see if there is a resolution.

Eric

]]>