In October of 2003 we got a new Dell PowerVault 775 NAS server. This server was essentially a PowerEdge 2650 running Windows Storage Server 2003, which is a slightly tweaked version of Windows Server 2003. The server originally shipped with a pair of 18 GB drives in a mirror set for the OS and a trio of 147 GB drives in a RAID 5 for data storage. The first problem the server had was almost daily bluescreens. It was found that these were caused by the Dell OpenManage software. After we removed the software the server stabilized.
The next problem to show up shortly was a small amount of corruption on the data drive. A few files would get corrupted each week or so. I would run CHKDSK on the weekend and it would fix the corruption, usually with no data loss. The corruption was never large but it was consistent. I created many cases with Dell to attempt to resolve this problem. I tried to check consistency on RAID set, rebuilt the drive and tried many other similar fixes. Nothing worked long term however.
Eventually we added a PV220S to the PV775. When we did this we moved the data drives from the internal slots to the external enclosure. After doing this magically any data corruption on the data drive stopped. The news isn't all good however. The OS drives started having problems then. The server started having weird memory errors and would just generally stop responding.
I again called Dell to try and trouble shoot this new but related problem. The first thing they had me do was update drivers and firmware on RAID controllers and SCSI drives. After that they began taking dumps of drive issues using a modified version of CHKDSK. Eventually they determined there was a flaw in the RAID controller. Dell did not make the controller and thus were waiting on a fix from the ODM. As time dragged on they gave me other options. Most of the options involved removing or disabling the RAID controller. The problem with this solution was they I would need to rebuild the OS. Further I would no longer be able to use Storage Server since it was built assuming that RAID was there. I didn't think this was a good solution and decided to give them a bit longer to come up with a fix for hardware.
About a week ago Dell called and said they couldn't find a fix for the problem. As a result they would be swapping the hardware out. This brings up a whole new suite of problems. The main problem was that Dell no longer made an equivalent NAS server. We cannot get a new box running Windows Storage Server 2003 R2, which is what I really wanted. Instead we will be getting a PowerEdge 2850 running Windows Server 2003 R2. The base server will be considerably faster than the older server since it is three years newer. As a bonus we also get a new three year warranty for free.
It is still going to be a bit of a hassle for me since I will have to transition the data drives to new server as well as all the share definitions. Also I have to switch around some drives so I can send back the ones the server originally came with. Fun Fun. At least Dell followed through and resolved the issue.
Update: Woo-Hoo, well today was my lucky day it seems. Dell announced the PowerEdge 2950 with Windows Storage Server 2003 R2 today. This is the exact server I was hoping to get. Dell has cancelled the order for the 2850 that was previously discussed and will instead begin the order for a 2950. It isn't really out yet so we will have to wait a little while, but it will be worth it. This server has the latest dual core Xeon's and all the other newest technologies. Awesome.
Update 2: My Dell support person got me exactly what I wanted, it is an awesome server. Dell has a very happy customer now. Even if Dell did make me suffer for a while along the way, they made it right in the end.