Worked on Orac yesterday

Yesterday, I decided that I should poke around Orac in preparation for building that new 5x2TB RAID5 array. My backuppc pool keeps filling up, and deleting old backups isn't freeing up enough space anymore (it's on a 5x1.5TB RAID6 array).

At first it was just going to be a simple procedure: add two eSATA cables from the computer to the Sans Digital MS2T+B, then shut down and move the two 1TB drives in RAID 1 from the bottom slots of my TR5M-B (the upper 3 are unoccupied). But as I was pushing on the cables, the bracket popped in...so I had to open the case to put it back in place.

While I was in there, I decided to see what kind of fan is on the drive cage. A couple of weeks earlier, I had gotten a SMART alert for high temperature, and cacti showed that there was a rise for a few hours in the late evening. It had been warm that day, warm enough that I turned on the AC briefly in the earlier part of the evening. Guess I needed it to stay on the rest of the evening.... But there was no fan...though there were mount holes for one, and there's supposed to be a fan connector on the motherboard for it. Not all models had one, though, so evidently not the GT5635E. I do wonder if I should add one; maybe the addition of the second internal drive makes it desirable (so far I've had to replace the second drive twice....the current one is a newer generation than the original drives, and runs cooler).

Since there wasn't a fan, I didn't take the cage out....so I didn't measure what size fan might go in the spot. I do have a rough guess as to what size it is, so perhaps later on I'll pick up a fan to put there. I should probably also see about a new rear fan....perhaps a faster one might help.

But I put things back together, moved the drives, and started things back up.

Of course, the first problem is that now all my drive letters are different....which messed up things like cacti (hddtemp recording), gkrellm and rc.local (where I changed the I/O scheduler for the RAID6 array drives to noop). There was also the complication that the filesystem on the drives I just moved hadn't been fsck'd in over 6 months, so it needed an fsck. root also needed to be fsck'd. It took a long time to boot, but it eventually did, though the filesystem on the moved RAID1 array hadn't finished checking...I had set 'nobootwait' on that filesystem. I'll probably remove those now...since they have fixed Bug #563916. But after fixing rc.local (and before fixing cacti) I rebooted....so it still needed to do the fsck.
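
Device names like /dev/sdX can shuffle whenever drives move between ports, which is what broke the rc.local tweak here. One way around that is to refer to each drive by a stable symlink and resolve it at boot. The following is only a minimal sketch: it assumes the stable links live under /dev/disk/by-id, that the kernel still offers the noop scheduler, and the disk IDs shown are made up.

```python
import os


def scheduler_path(stable_link, sys_block="/sys/block"):
    """Resolve a stable symlink (e.g. under /dev/disk/by-id) to the sysfs
    scheduler file of whatever sdX name it points at right now."""
    real = os.path.realpath(stable_link)   # e.g. /dev/sdc
    name = os.path.basename(real)          # e.g. sdc
    return os.path.join(sys_block, name, "queue", "scheduler")


def set_scheduler(stable_link, scheduler="noop", sys_block="/sys/block"):
    """Write the scheduler name into sysfs (needs root on a real system)."""
    path = scheduler_path(stable_link, sys_block)
    with open(path, "w") as f:
        f.write(scheduler)
    return path


if __name__ == "__main__":
    # Hypothetical IDs; list /dev/disk/by-id to find the real ones.
    raid6_members = [
        "/dev/disk/by-id/ata-EXAMPLE_DISK_1",
        "/dev/disk/by-id/ata-EXAMPLE_DISK_2",
    ]
    for link in raid6_members:
        # Only prints the path that would be written; no sysfs changes here.
        print(scheduler_path(link))
```

Because the lookup happens each time the script runs, the same list keeps working no matter which sdX name a drive ends up with after a reshuffle.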

Later I still had to reboot again, because two services that reference the filesystem....weren't happy...well, one was really unhappy, and the other wasn't reporting valid information for it and needed a restart to make it work. (I could probably have fixed the issue without a reboot, but I wanted to make sure it rebooted right after another change I had made.) There's still something odd going on affecting a different RAID1 array....will look into that later (or I could opt to move the filesystem to the RAID6 array)....since it could use a grow.

5-bay RAID and backuppc

Ever since I learned about 5-bay port multiplier SATA enclosures, I've been wanting to get one to play with. When I acquired 'orac'...I sought out a port multiplier eSATA adapter to include in the build of the system, and then waited for support of the adapter in Ubuntu. I stayed on 8.04 LTS and waited for inclusion in a kernel.

I took a step closer when I finally got the 2-bay enclosure to mirror a pair of drives I had impulsively purchased. Since it worked, I then bought a 5-bay enclosure (it came up as a NewEgg ShellShocker).

But after some thought, I decided that what I wanted to do was build a 5 disk RAID5 using 1.5TB drives....so it would get me at least 5TB of real storage. And I decided that I would wait until 1.5TB drives were under $100.
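
The arithmetic behind that figure can be sketched as follows. This is a rough illustration that ignores filesystem and RAID metadata overhead, and the function names are made up for the example. RAID5 gives up one disk's worth of space to parity and RAID6 gives up two; drive vendors count decimal terabytes while most tools report binary ones.

```python
def usable_tb(disks, size_tb, level):
    """Usable capacity in (decimal) TB for a RAID5 or RAID6 array."""
    parity = {"raid5": 1, "raid6": 2}[level]
    if disks <= parity:
        raise ValueError("not enough disks for " + level)
    return (disks - parity) * size_tb


def tb_to_tib(tb):
    """Convert decimal terabytes (10**12 bytes) to tebibytes (2**40 bytes)."""
    return tb * 10**12 / 2**40


if __name__ == "__main__":
    planned = usable_tb(5, 1.5, "raid5")   # the 5 disk RAID5 of 1.5TB drives
    print(planned, "TB, or about", round(tb_to_tib(planned), 2), "TiB")
```

So five 1.5TB drives in RAID5 come out to 6TB as the vendors count it, which is a bit under 5.5TiB as the OS reports it, and still comfortably over the 5TB target.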

In the meantime, I collected a bunch of old SATA drives to see what trouble I could get into....it didn't last long, as one was definitely dead and two quickly failed. Of course, two of them had been kicked out of RAID1 setups previously, and the other was a failing drive that I replaced/upgraded. So, it was back to waiting.

Eventually, I got 3 Samsung 1.5TB drives through NewEgg ShellShocker, and I set up a 3 disk RAID5. Sadly, it didn't last very long....one of the drives failed while I was building the array. I exchanged it for a new drive with NewEgg, and waited for it before trying again. While I was waiting, I formatted it a bunch of times and learned about smartctl and various aspects of drive repair. One of the remaining two original drives was showing signs of slow death and one unrecoverable sector. When the new drive arrived...I did a long self test and a quick, but full, format....taking a couple of days before I created the 3 disk RAID 5.

Once that was done, I created a big new filesystem at /var/lib/backuppc to have backuppc on 'orac'. Initially it was a 1TB filesystem.

Then one day, the bad happened. A drive on 'gumby' failed. It was time to stop putting off figuring out how to set up backuppc and start putting stuff into it. It turned out to be a lot easier than I thought to get started, though I did have to rebuild smbclient with a longer timeout to get it to successfully complete a full backup of a given partition. I then learned some other stuff and fine-tuned the configs.... One big change was that I broke out the Windows hosts into individual partitions (i.e., gumby_c, gumby_e, gumby_f), and came up with a 'semaphore' solution so that it wouldn't back up more than one partition at a time from a Windows host. Eventually, I had all my systems in it (it fully backs up everything except itself, where it backs up only key parts....) I even got it backing up my work MacBook Pro.
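
The details of the 'semaphore' aren't spelled out here, so the following is only one way the idea could look, not the actual setup: a per-machine lock file that the first partition's backup grabs and the others find busy. It assumes a hypothetical lock directory, assumes names of the form machine_partition, and assumes something like BackupPC's DumpPreUserCmd and DumpPostUserCmd hooks would call acquire and release.

```python
import os
import tempfile

# Hypothetical location; any directory the backup user can write to would do.
LOCK_DIR = os.path.join(tempfile.gettempdir(), "backuppc-locks")


def physical_host(backup_name):
    """Map a per-partition name like 'gumby_c' to its machine, 'gumby'."""
    return backup_name.rsplit("_", 1)[0]


def lock_file(backup_name, lock_dir=LOCK_DIR):
    return os.path.join(lock_dir, physical_host(backup_name) + ".lock")


def acquire(backup_name, lock_dir=LOCK_DIR):
    """Try to take the machine's lock; return False if it is already held."""
    os.makedirs(lock_dir, exist_ok=True)
    try:
        # O_CREAT | O_EXCL fails if the file exists, so creation is atomic.
        fd = os.open(lock_file(backup_name, lock_dir),
                     os.O_CREAT | os.O_EXCL | os.O_WRONLY)
    except FileExistsError:
        return False
    with os.fdopen(fd, "w") as f:
        f.write(backup_name)  # remember which partition holds the lock
    return True


def release(backup_name, lock_dir=LOCK_DIR):
    """Drop the lock, but only if this partition is the one holding it."""
    path = lock_file(backup_name, lock_dir)
    try:
        with open(path) as f:
            owner = f.read()
    except FileNotFoundError:
        return False
    if owner != backup_name:
        return False
    os.remove(path)
    return True
```

A pre-dump hook would exit non-zero when acquire returns False so that the backup is skipped and retried later. One caveat with plain lock files is that a backup that crashes leaves a stale lock behind, so a real setup would want some kind of age check or cleanup.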

I later grew the filesystem out to 2TB.

