Picardy Beet
Feb 7, 2006

Singing in the summer.
Quick question: I've got quotes from the usual suspects (Dell / IBM / HP) for a new db server with two 400 GB SSDs (the db isn't that big, about 180 GB, but it's severely hammered).
It's easy to find information on HP and IBM SLC and eMLC disks, but not much on Dell - or at least nothing recent. Does anyone have real-world experience with Dell's 400 GB SAS value SSDs?

the spyder
Feb 18, 2011

"Bitter[HATE posted:

" post="412894656"]
Now that the Backblaze Pod 3.0 is out, anyone have any experience with using one of them?

http://blog.backblaze.com/2013/02/20/180tb-of-good-vibrations-storage-pod-3-0/

We are looking to back up about 100TB offsite for disaster recovery and this looks like a really good deal. Works out to around $19,000 for a completed unit where 45drives.com builds the unit and then you populate it with your own hard drives. They sell a version with redundant power supplies and OS drives. Plan is to seed it here with Crashplan ProE and send it off to the colo. Anyone seen a better deal for that much storage? We have shitloads of data but tiiiny budgets :(

We use mostly Supermicro hardware + OpenIndiana with Napp-IT. I can build ~180TB for $3k more than Backblaze using LSI controllers/Supermicro chassis. Same idea though.

BonoMan
Feb 20, 2002

Jade Ear Joe
Long post incoming. I have a small situation that has me stumped and has thrown me headfirst into the world of SAN storage (which I know nothing about).

Intro:

I work at a production house/ad agency and we used to have several other offices...times got tough and we've downsized into one office. Part of that had me driving around the country and collecting all of the equipment from other places. Most of which has remained in storage until now.

Our shared storage system for one of our Avids went down. And rather than pay to replace it or buy a standalone storage system, they want to see if we can repurpose older equipment we have. We have no official IT or sysadmin guys. Just me (an animator/director) and one of our coders trying to figure this all out.

Inventory:

So I went through and found what they used to use at our Tampa office. It consists of (excuse my lack of knowledge of the terminology):

A 12-drive BrightDrive RAID unit. The back of it has 2 HBAs (a term I just learned) with 2 fiber ports each (but only one of each is populated by the fiber “slider/tray/thingie”).

It's this model: The RS-1220-x

A BrightDrive server/controller. Has 3 80-gig drives in it plus a CD-ROM. A single fiber port at the back of it. And ethernet/VGA/keyboard/mouse of course.

A 16-port fiber SANbox 5600 switch.

A Boxx workstation with XP Pro on it.

First impressions:

Ultimately I'm trying to figure out how each device fits into the pipeline. Our goal is to connect one Avid to the RAID array via fiber so it can be worked off of as a media drive (not just slow mass storage). If other computers can connect to the array in the end, that's fine... but ultimately we just have one computer that definitely needs to use it.

It appears that the BrightDrive Server connects to the BrightDrive RAID Array via fiber. Then the RAID connects to the switch via fiber and then the PCs connect into the switch via fiber (but that's just my random guessing). I'm pretty sure the Boxx was just used as a render farm distributor. However I noticed two things:

1.) When booting up the BrightDrive Server, it mentions it can't find volume nl01 to load. I figure that's the RAID array. I have it connected via fiber (tried both HBAs) but it never detected it.

2.) Then when I booted up the Boxx I noticed that in My Computer there was an entry for a Samba share volume nl01. Did they just have the RAID array configured as a dumb volume?

Oh... also, booting up the BrightDrive server takes me to a login prompt, and I don't know the username or password. All the employees who knew all of this stuff and set it up are out of contact (although I have sent several emails out just in case).

Pictures:

Here are some pics just in case.

The overall equipment setup - laid out on a table as I try to figure everything out.



The back of the BDS (BrightDrive Server/Controller).



Back of the 12-bay RAID array and its HBAs.



And a screenshot of the Samba connection from the Boxx workstation.



In Conclusion:

So in summary, we just want to figure out how to set this up in a way where we can connect the Avid (which does have a fiber card) to this array via fiber so we can work off of it in a speedy fashion. Also while it would be nice to try to see what's on the drives now...if wiping and restarting ends up needing to be done that's not a big deal at all.


big edit: So one of the old employees finally got in touch with me and basically said the system won't work with the Avid anyway because it needed a major upgrade to do so and that cost too much money (none of which I knew). He said I can try hooking up the RAID array directly via fiber to the Avid tower and see if that works. Also, I tried booting up the controller again today and got an "8110 error severity: major", which seems to be a processor thing, so... basically scratch my whole post. :(

BonoMan fucked around with this message at 18:12 on Mar 6, 2013

madsushi
Apr 19, 2009

Baller.
#essereFerrari
So my company is doing a POC of a Nimble CS240 (DR) and a Nimble CS460 (Prod). We run about 100 VMs, ~5 SQL instances, a 1K-user Exchange 2010, and a 50-user Exchange 2010 environment. I just went out to SwitchNAP in Vegas last weekend to put it in. Thought I would share a few of my early findings.

1) The VMware plugin is awful. Really. First: when you register the plugin, either by NetBIOS, FQDN, or IP address, you can only log into your vCenter server using that same name/IP. So if you register the plugin using the IP address, but try logging into vCenter with your vSphere client using the name, the Nimble plugin will fail to load. Even if both are valid, you have to type in exactly the same thing every time. Want to log in to your vCenter server locally with "localhost"? Tough luck. The restore tool is also buggy/slow and we ended up having to manually remove the cloned datastores more than once.

In addition, the VMware plugin doesn't set your host iSCSI/MPIO settings, so you have to do that manually ahead of time.

The worst offense is that when it makes the new LUN/Volume, it doesn't do any of the masking for you, so the LUN is available for any iSCSI initiator. Which means that your Windows SQL or Exchange server could easily grab that LUN at some point and corrupt all your data (Windows will instantly corrupt a VMFS-formatted LUN if it tries to "online" it). So you have to go to the Nimble console and set the masking anyway.
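
Not Nimble-specific, but one defensive setting on the Windows side guards against exactly this failure mode: the disk SAN policy, which controls whether newly-seen LUNs get auto-onlined. A rough sketch (verify the policy value against your Windows Server version; this is a generic mitigation, not part of the Nimble tooling):

code:

# Hedged sketch: set the Windows SAN policy so newly-visible shared LUNs are
# NOT brought online automatically (it's the auto-online that trashes an
# unmasked VMFS LUN). Run elevated on any Windows host that can see the target.
import os
import subprocess
import tempfile

script = "san policy=OfflineShared\n"   # diskpart persists this setting
with tempfile.NamedTemporaryFile("w", suffix=".txt", delete=False) as f:
    f.write(script)
    script_path = f.name
try:
    subprocess.run(["diskpart", "/s", script_path], check=True)
finally:
    os.remove(script_path)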

2) The performance is good. Really. We can throw essentially an infinite number of 4K writes at it, and it is pushing nearly 2Gbps of large-block writes. Reads are also good, so far, but we still need to get everything migrated to see what our cache hit ratio really is.

3) The compression is OK. I was guessing around 25% compression total (because 2X is a pipe dream), and we're seeing just under 30% but most of our stuff still isn't moved yet. My guess is that 25% will be as high as it sits.

4) The replication/QoS tools are slick. I was really happy at how easy it was to set replication throttling schedules, especially when compared to what I had been doing with NetApp (which was "plink" and a scheduled task on a server). Replication seems to be pretty standard, although the inability to say "take a snapshot now and replicate it" at a moment's notice is unfortunate. If you want to take a snapshot and replicate it NOW (like before some dev work), you have to set up a one-off schedule for 5 minutes from now, let it run, then get started.
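
For contrast, a rough sketch of that plink-and-scheduled-task approach (option names assume Data ONTAP 7-mode's replication.throttle settings; the filer name, user, and kb/s values are placeholders):

code:

# Hedged sketch of the old "plink + scheduled task" throttling described above.
# Assumes a 7-mode filer reachable over SSH and the replication.throttle.*
# options; filer01.example.com, the user, and the values are placeholders.
import subprocess
import sys

FILER = "filer01.example.com"
USER = "root"

def filer_cmd(command):
    subprocess.run(["plink", "-batch", "-ssh", f"{USER}@{FILER}", command],
                   check=True)

def set_replication_throttle(kbs):
    filer_cmd("options replication.throttle.enable on")
    filer_cmd(f"options replication.throttle.incoming.max_kbs {kbs}")
    filer_cmd(f"options replication.throttle.outgoing.max_kbs {kbs}")

if __name__ == "__main__":
    # e.g. Task Scheduler runs "throttle.py 10240" at 8am and a much larger
    # value after hours.
    set_replication_throttle(sys.argv[1])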

5) The lack of app-specific software sucks. I am going to have to figure out how to configure Exchange and SQL to properly truncate their logs, etc, when before my SnapManager products just did everything for me. I never had to question whether my maintenance plan was set up correctly because SnapManager knew what it was doing.

madsushi fucked around with this message at 18:16 on Mar 6, 2013

three
Aug 9, 2007

i fantasize about ndamukong suh licking my doodoo hole
I don't feel companies like Nimble are sustainable. They were created to start doing something new from a technology standpoint that larger companies didn't seem to want to dive into as quickly, and it really seems like their business model rode on a bigger storage company buying them up.

How can Nimble, Tintri, etc. compete against EMC, NetApp, Hitachi, IBM, and Dell when their advantage (cool flash overlays) is now just a standard feature?

I'd be shocked if Nimble existed in 5 years. I think someone posted that they didn't care because what are the odds they go out of business in the next 2 years (or whatever their refresh cycle was); that's an interesting way to look at it and I can understand that argument.

I guess their main market now is that they're relatively cheap, but they're not significantly cheaper than an EMC VNXe or Equallogic (I don't know pricing of the other top SAN vendors' cheapest options).

YOLOsubmarine
Oct 19, 2004

When asked which Pokemon he evolved into, Kamara pauses.

"Motherfucking, what's that big dragon shit? That orange motherfucker. Charizard."

madsushi posted:

2) The performance is good. Really. We can throw essentially an infinite number of 4K writes at it, and it is pushing nearly 2Gbps of large-block writes. Reads are also good, so far, but we still need to get everything migrated to see what our cache hit ratio really is.

I'd be curious to know how you're testing write performance, and whether you have tools to see how front end IO is translated into IO on the SATA disk in back. I'd like to know what sort of disk utilization numbers you see as you drive writes up, and how the array handles it when the SATA disks in the back start to get overloaded. Nimble uses an extent based filesystem so I imagine they handle large block better than NetApp but they should still have fundamentally similar problems with significant write IO overloading the real world maximums of relatively slow SATA.

It's definitely going to be tough to test read performance since most synthetic IO tools probably aren't going to build working sets larger than the cache size. I would be curious to see what something like LoadGen or Orion would show if properly sized. Orion will crush pretty much any storage you throw at it if you have a large enough front end.

Edit: I've heard a few people express the opinion that, based on the size of their customer base, their aggressive pricing, their number of employees, and the amount of money they've raised through venture capital, they are probably not yet turning a profit and have a relatively small window of time in which to either gain a lot more customers or, more likely, get bought out. At the prices they are selling equipment at they would have a very hard time making enough money to pay back investors in a reasonable time, and I don't imagine an IPO could generate much given that they are a relatively small and unknown company.

YOLOsubmarine fucked around with this message at 00:54 on Mar 7, 2013

Dilbert As FUCK
Sep 8, 2007

by Cowcaster
Pillbug

three posted:

I don't feel companies like Nimble are sustainable. They were created to start doing something new from a technology standpoint that larger companies didn't seem to want to dive into as quickly, and it really seems like their business model rode on a bigger storage company buying them up.

How can Nimble, Tintri, etc. compete against EMC, NetApp, Hitachi, IBM, and Dell when their advantage (cool flash overlays) is now just a standard feature?

I'd be shocked if Nimble existed in 5 years. I think someone posted that they didn't care because what are the odds they go out of business in the next 2 years (or whatever their refresh cycle was); that's an interesting way to look at it and I can understand that argument.

I guess their main market now is that they're relatively cheap, but they're not significantly cheaper than an EMC VNXe or Equallogic (I don't know pricing of the other top SAN vendors' cheapest options).

I have a hunch we will be seeing them work 'very' closely with Cisco in the next year. It wouldn't surprise me one bit if Cisco bought into them and made them their "UCS: STORAGE" department.

Ninja Rope
Oct 22, 2005

Wee.
Anyone have any thoughts/horror stories about CleverSafe?

skipdogg
Nov 29, 2004
Resident SRT-4 Expert

Ninja Rope posted:

Anyone have any thoughts/horror stories about CleverSafe?

Never even heard of them until now.

sanchez
Feb 26, 2003
A client has an EMC VNXe 3150 with a couple of SAS shelves attached, and a total of about 22 300GB SAS drives installed. When he originally set it up, following EMC's advice, every drive (minus hot spares) was added to a single RAID 5 array. Since it's not in production yet, we advised they move to RAID 10 instead (space will still be fine), mainly because I'm worried about rebuild times on an array that large, along with redundancy in general. Is this rational?

Goon Matchmaker
Oct 23, 2003

I play too much EVE-Online
It's not rational to have an array of 22 300GB disks unless you have a VERY good reason. They should be broken up into a set of arrays, probably something like 5-disk RAID 5s.

Gravel
Mar 11, 2013
Gravel is a delicious food for people.

NippleFloss posted:

I'd be curious to know how you're testing write performance, and whether you have tools to see how front end IO is translated into IO on the SATA disk in back. I'd like to know what sort of disk utilization numbers you see as you drive writes up, and how the array handles it when the SATA disks in the back start to get overloaded. Nimble uses an extent based filesystem so I imagine they handle large block better than NetApp but they should still have fundamentally similar problems with significant write IO overloading the real world maximums of relatively slow SATA.

Seconding this, if you don't mind - how are you driving those writes, what does "essentially infinite" 4k writes mean?

paperchaseguy
Feb 21, 2002

THEY'RE GONNA SAY NO

sanchez posted:

A client has an EMC VNXe 3150 with a couple of SAS shelves attached, and a total of about 22 300GB SAS drives installed. When he originally set it up, following EMC's advice, every drive (minus hot spares) was added to a single RAID 5 array. Since it's not in production yet, we advised they move to RAID 10 instead (space will still be fine), mainly because I'm worried about rebuild times on an array that large, along with redundancy in general. Is this rational?

I would set up 1 hot spare and three RAID 5 (6+1) groups. Having 1-2 hot spares and the rest all in a single RAID 5 group is not a good idea. RAID 10 would be for when you need high performance or high redundancy.
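
Rough raw-capacity math for the layouts being discussed (a sketch that ignores drive right-sizing, vault/system drives, and formatting overhead):

code:

# Back-of-envelope capacity for 22 x 300GB drives under the layouts above.
# Ignores right-sizing and system/vault drives, so treat these as rough numbers.
DRIVES, SIZE_GB = 22, 300

# 1 hot spare + three RAID 5 (6+1) groups: 3 groups x 6 data drives
raid5_groups_usable = 3 * 6 * SIZE_GB            # 5400 GB
# Original config: one big RAID 5 group of everything minus a spare
# (assuming one spare): 20 data + 1 parity
single_raid5_usable = (DRIVES - 1 - 1) * SIZE_GB  # 6000 GB, but one huge fault domain
# RAID 10 leaving 2 spares (one possible layout): 20 drives -> 10 mirrored pairs
raid10_usable = ((DRIVES - 2) // 2) * SIZE_GB     # 3000 GB

print(raid5_groups_usable, single_raid5_usable, raid10_usable)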

bull3964
Nov 18, 2000

DO YOU HEAR THAT? THAT'S THE SOUND OF ME PATTING MYSELF ON THE BACK.


Just out of curiosity, why RAID 5 instead of RAID 6? I'm not sure I would be comfortable with single parity in a production setting, even if you did have a hot spare.

madsushi
Apr 19, 2009

Baller.
#essereFerrari

Gravel posted:

Seconding this, if you don't mind - how are you driving those writes, what does "essentially infinite" 4k writes mean?

Six HP blades, one Windows VM per blade running IOMeter, each VM running two workers, total of 12 workers.

During the "pure" write testing, I had 6 workers using 100% random 4K writes and 6 workers using 100% sequential 4K writes. With this test we were seeing ~50,000 IOPS, which, for a 3U box and our needs, is "essentially infinite" since we're never going to need anywhere near that. I will admit that was probably too enthusiastic. We ran the test for 15 minutes without issue.

With a 32K block size for pure writes (again 6x 100% random and 6x 100% sequential workers), the write IOPS went down (of course) but our bandwidth went up to nearly 2Gbps, which is our current cap since we only have 2 uplinks to the Nimble at the moment. Same with any larger block size: it immediately goes to 2Gbps.

Using a mixed workload test (given to us by Nimble) of 4K blocks - 62% write, 38% read, 48% sequential, 52% random - we were seeing 20K write IOPS and 12K read IOPS while latency stayed under 5ms.

When saturating the cache and just doing pure read IOPS, we hit 150K IOPS, which isn't really valuable since we all know that cache is fast.

NippleFloss: I don't get to see any of the back-end IO stats; the SNMP support is really, really bad. No disk utilization (or ANY per-disk stats), etc. They really want this thing to be a "black box" where all the decisions are made in advance and hidden from the user. There are no RAID groups or aggregates or system volumes or anything like that. There are no virtual interfaces or VLANs or configurable partner interfaces/etc. While it makes it easy for a new administrator to set up, it also means that if your needs don't fit into their fixed bucket size, it's not a match. Feels like they've just found a ratio/design that works for most small/medium businesses (i.e. 10% flash/90% storage, 12 SATA disks, etc.) and that's the only market that they make sense in right now.

Docjowles
Apr 9, 2009

bull3964 posted:

Just out of curiosity, why RAID 5 instead of RAID 6? I'm not sure I would be comfortable with single parity in a production setting, even if you did have a hot spare.

The VNXe GUI is kind of "storage for dummies" and only lets you configure certain types of drives into certain types of RAID configs. It may actually not be an option.

Disclaimer: I evaluated the VNXe at my old job like a year and a half ago, the software may have changed.

paperchaseguy
Feb 21, 2002

THEY'RE GONNA SAY NO

bull3964 posted:

Just out of curiosity, why RAID 5 instead of RAID 6? I'm not sure I would be comfortable with single parity in a production setting, even if you did have a hot spare.

RAID 6 is an option if you want to do two 8+2 groups and two hot spares. Slightly more capacity at somewhat lower performance. If you want more redundancy, sure, do RAID 6 or 10. I'd advise not using RAID 5 on 1TB and above drives, or SATA.

It's all about what your requirements are, though for 300GB drives RAID 5 is pretty standard.

Goon Matchmaker
Oct 23, 2003

I play too much EVE-Online

Docjowles posted:

The VNXe GUI is kind of "storage for dummies" and only lets you configure certain types of drives into certain types of RAID configs. It may actually not be an option.

Disclaimer: I evaluated the VNXe at my old job like a year and a half ago, the software may have changed.

It's still storage for dummies...

YOLOsubmarine
Oct 19, 2004

When asked which Pokemon he evolved into, Kamara pauses.

"Motherfucking, what's that big dragon shit? That orange motherfucker. Charizard."

madsushi posted:

Nimble stuff

I'd be curious to see if you see performance degrade over time as the filesystem becomes fragmented by snapshots and normal overwrite activity, or if their always-on reallocation does its job pretty well. Doing so would require a pretty good amount of time, as you would need to run a heavily write-intensive and overwrite-intensive workload against the box for hours at a time, taking snapshots frequently to ensure that data gets displaced, and do some general testing at the beginning and the end of the runs to see if performance stays constant. They could be short-stroking the disks when they initially lay out the data, so out-of-the-box performance is higher than steady-state performance.

For the mixed workload testing I would vary the block size as well. For Exchange 2010 alone you'll see a lot of small sequential IO for log writes, random reads and writes of 32K (possibly bundled into larger extents) for DB transactional activity and sequential read IO of 256K for database maintenance. A mix of something like 20% 4k seq write, 20% 32k read, 20% 32k write and 40% 256k would give you some idea of how Exchange would do during maintenance activities. I'd also be curious to know how block sizes above 32K do with heavy write activity and whether you hit a saturation point.
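
To make that mix concrete, here it is as a naive Python I/O generator (illustrative only: it uses buffered I/O and a hypothetical test file, so use IOMeter or similar with unbuffered I/O for real numbers; the 256K share is treated as sequential reads per the maintenance description):

code:

# The Exchange-like mix suggested above, sketched as a naive generator.
# Illustrative only; PATH and FILE_SIZE are placeholders.
import os
import random

PATH = "testfile.bin"            # hypothetical file on the volume under test
FILE_SIZE = 8 * 1024**3          # 8 GiB working set (assumption)
KiB = 1024

# (weight %, block size, is_write, is_sequential)
MIX = [
    (20, 4 * KiB,   True,  True),    # small sequential writes (log-style)
    (20, 32 * KiB,  False, False),   # 32K random reads (DB transactional)
    (20, 32 * KiB,  True,  False),   # 32K random writes (DB transactional)
    (40, 256 * KiB, False, True),    # 256K sequential reads (DB maintenance)
]

def run(ops=10_000):
    weights = [w for w, *_ in MIX]
    fd = os.open(PATH, os.O_RDWR | os.O_CREAT)
    os.ftruncate(fd, FILE_SIZE)
    seq_off = 0
    for _ in range(ops):
        _, bs, is_write, is_seq = random.choices(MIX, weights=weights)[0]
        if is_seq:
            off, seq_off = seq_off, (seq_off + bs) % FILE_SIZE
        else:
            off = random.randrange(0, FILE_SIZE - bs, bs)
        os.lseek(fd, off, os.SEEK_SET)
        if is_write:
            os.write(fd, b"\0" * bs)
        else:
            os.read(fd, bs)
    os.close(fd)

if __name__ == "__main__":
    run()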

madsushi
Apr 19, 2009

Baller.
#essereFerrari

NippleFloss posted:

I'd be curious to see if you see performance degrade over time as the filesystem becomes fragmented by snapshots and normal overwrite activity, or if their always-on reallocation does its job pretty well. Doing so would require a pretty good amount of time, as you would need to run a heavily write-intensive and overwrite-intensive workload against the box for hours at a time, taking snapshots frequently to ensure that data gets displaced, and do some general testing at the beginning and the end of the runs to see if performance stays constant. They could be short-stroking the disks when they initially lay out the data, so out-of-the-box performance is higher than steady-state performance.

For the mixed workload testing I would vary the block size as well. For Exchange 2010 alone you'll see a lot of small sequential IO for log writes, random reads and writes of 32K (possibly bundled into larger extents) for DB transactional activity and sequential read IO of 256K for database maintenance. A mix of something like 20% 4k seq write, 20% 32k read, 20% 32k write and 40% 256k would give you some idea of how Exchange would do during maintenance activities. I'd also be curious to know how block sizes above 32K do with heavy write activity and whether you hit a saturation point.

I am also very curious about the performance degradation piece, as you probably remember when I was asking about the comparisons to NetApp's similar feature in 8.1. That's one of the reasons that we brought the box in for a POC.

For mixed load testing, we figured that actually running our prod environment on the box is going to give us a more accurate representation than any synthetic test. We want to be able to run a lot of Windows Updates/patching all at once, which was one of our previous pain points.

We also did 15 minute tests of 256K/1M/2MB/8MB pure write tests with IOMeter, and it simply saturated the link (2Gbps) and then held steady. We weren't snapshotting or anything during the test, but it can definitely accept data as fast as we can throw at it.

Our major painpoints:
-High replication bandwidth (using SnapVault so no dedupe savings)
-Couldn't patch more than 4-5 servers at a time (with normal usage + some replication traffic)
-Lots of rack space/power (taking up about 12U prod and 12U DR)

The (hopeful) benefits:
-30-40% bandwidth reduction (via compression - only at 20% with about 50% of our stuff migrated)
-The ability to handle patching 20-30 servers at a time (we'll see on Thursday)
-Only 3U and 550W of power (good so far)

We were looking at a NetApp 2240 solution with Flash Pool (6x SSD and 18x SATA) but it wasn't going to help our replication issue and wasn't going to help our DR rack/power issues (currently with an older 3040). The Nimble solution (as a partner) came in priced right to replace both prod/DR, so we brought it in for a POC to see if it actually solves our pain points.

YOLOsubmarine
Oct 19, 2004

When asked which Pokemon he evolved into, Kamara pauses.

"Motherfucking, what's that big dragon shit? That orange motherfucker. Charizard."

madsushi posted:

We also did 15 minute tests of 256K/1M/2MB/8MB pure write tests with IOMeter, and it simply saturated the link (2Gbps) and then held steady. We weren't snapshotting or anything during the test, but it can definitely accept data as fast as we can throw at it.

Our major painpoints:
-High replication bandwidth (using SnapVault so no dedupe savings)
-Couldn't patch more than 4-5 servers at a time (with normal usage + some replication traffic)
-Lots of rack space/power (taking up about 12U prod and 12U DR)

The (hopeful) benefits:
-30-40% bandwidth reduction (via compression - only at 20% with about 50% of our stuff migrated)
-The ability to handle patching 20-30 servers at a time (we'll see on Thursday)
-Only 3U and 550W of power (good so far)

We were looking at a NetApp 2240 solution with Flash Pool (6x SSD and 18x SATA) but it wasn't going to help our replication issue and wasn't going to help our DR rack/power issues (currently with an older 3040). The Nimble solution (as a partner) came in priced right to replace both prod/DR, so we brought it in for a POC to see if it actually solves our pain points.

Yeah, if the write allocator is efficient enough, 250MB/s of sequential writes is certainly doable by a box with that much horsepower. I wish they provided more visibility into what's happening under the covers, as I'd really like to see how efficient CASL is compared to WAFL as far as disk utilization goes. As I've mentioned before, I *suspect* it is better with larger block sizes because I do not believe they are limited to fixed 4K blocks, so they can handle larger sizes as a single disk IO. But that's just a guess based on some of their whitepapers.

Regarding your pain points, I'll be curious to see what comes out of it. SnapMirror is changing in Clustered ONTAP and some of the changes will address your pain points, but that obviously won't help you right now or in the immediate future. Does Nimble allow a different number of snapshots on the source and destination, a la SnapVault? The inability to handle bulk patching is strange, but I guess you're hitting some bottlenecks somewhere. A perfstat would be interesting to look at.

Certainly looking forward to hearing what your opinion of the Nimble stuff is. Even if it is very good, it's still sort of unfortunate that the company is this late to the disk storage game. I expect spinning disk to turn into backup/archival-only storage over the next decade or so, and the storage companies that will still be around then will be the all-flash vendors that rise to the top of the heap over the next couple of years, plus the big players that have the resources to develop significant new technologies while still supporting their existing ones. I just don't think Nimble has the resources to focus on flash and their current model is only sustainable as long as SSD density is low and cost is high, both of which are changing quickly.

Gravel
Mar 11, 2013
Gravel is a delicious food for people.
I've read a bunch of this thread over the past couple days and don't think I've seen these guys mentioned (but it looks like they're a very new company anyway). Looks pretty neat, honestly - tenant/volume-level guaranteed QoS, always-on HA/replication/thin provisioning/dedup/compression sounds slick.

Gravel fucked around with this message at 16:54 on Mar 12, 2013

Beelzebubba9
Feb 24, 2004

NippleFloss posted:

I just don't think Nimble has the resources to focus on flash and their current model is only sustainable as long as SSD density is low and cost is high, both of which are changing quickly.

Is there anything specific about Nimble's approach that makes CASL and their filesystem poorly suited to an all-flash architecture, or is it just a matter of focus and resources? I'm curious because Nimble's CTO told me that CASL was originally intended to work on PCI-E based SSDs installed directly into servers, somewhat similarly to how the Fusion-io stack works, but late in the game they decided to build a stand-alone SAN. I assume that's very different from an all-flash array, but I'm just curious about your thoughts.

YOLOsubmarine
Oct 19, 2004

When asked which Pokemon he evolved into, Kamara pauses.

"Motherfucking, what's that big dragon shit? That orange motherfucker. Charizard."

Beelzebubba9 posted:

Is there anything specific about Nimble's approach that makes CASL and their filesystem poorly suited to an all-flash architecture, or is it just a matter of focus and resources? I'm curious because Nimble's CTO told me that CASL was originally intended to work on PCI-E based SSDs installed directly into servers, somewhat similarly to how the Fusion-io stack works, but late in the game they decided to build a stand-alone SAN. I assume that's very different from an all-flash array, but I'm just curious about your thoughts.

This stuff is generally above my head, but basically an all-flash array flips the traditional storage paradigm upside down. For 20 years now storage companies have been focused on building arrays that hide the fact that disks are pretty slow relative to CPUs. They have battery-backed write caches, and read caches, and automated storage tiering processes and short stroking, which are essentially all hacks to get around the fact that spinning platters are orders of magnitude slower than CPU operations.

So if you take CASL, or WAFL, or similarly designed filesystems, they are structured to accelerate writes. Snapshots sort of fall out for free, but their main purpose is to make writing data to disk more efficient by turning random writes into sequential writes that can be streamed to disk in long chunks. On a spinning-disk filesystem, random writes and over-writes are very taxing. You have to spin the disk to the sector you want to overwrite, perform the overwrite (you also have to potentially read a whole stripe and re-calculate and re-write parity if you're using a RAID 5/6 implementation, which adds a lot of latency to the operation), then seek to another sector, overwrite... etc. Every seek is expensive and random in-place over-writes generate a ton of seeks, so they are slow. Random reads are also slow, but you can put a cache in place to limit the amount of random reads you need to do. You can't really cache writes, since they all eventually have to hit disk, so at best you can just smooth things out some with a cache to absorb bursts.

CASL, WAFL, ZFS, etc., turn those random over-writes into new sequential writes, so that you don't spend a ton of time spinning disks and seeking. You have a write phase where you write that data out, you have a read phase where you spin the disk serving reads and, hopefully, fill your cache by performing readahead, and you flip between the two. To the client it all appears seamless because when it works well you're acknowledging writes as soon as they hit NVRAM and servicing reads from flash or SSD, so your client is largely disconnected from the spinning disk on the back end, and most of the engineering work goes into making sure that that continues to be true.
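
As a toy illustration of that idea (a sketch, not any vendor's actual code): a logical over-write becomes an append to the log plus a pointer update, so the backing store only ever sees sequential writes. Garbage-collecting the stale copies is the part that's omitted here, and it's where most of the real engineering lives.

code:

# Toy log-structured block store: random logical over-writes turn into
# sequential appends, with a map tracking each block's current location.
class LogStructuredStore:
    def __init__(self, block_size=4096):
        self.block_size = block_size
        self.log = bytearray()         # stands in for the sequential on-disk log
        self.map = {}                  # logical block number -> offset in log

    def write(self, lbn, data):
        assert len(data) == self.block_size
        self.map[lbn] = len(self.log)  # new location; old copy becomes garbage
        self.log += data               # always an append, never an in-place seek

    def read(self, lbn):
        off = self.map[lbn]
        return bytes(self.log[off:off + self.block_size])

store = LogStructuredStore()
store.write(7, b"a" * 4096)
store.write(7, b"b" * 4096)            # the "over-write" lands at the log's tail
assert store.read(7) == b"b" * 4096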

Now when you replace all of the back end disk with SSD you're doing a ton of wasted work. You've got all of these mechanisms in place to solve problems that don't exist anymore because disk access is orders of magnitude faster so you don't have to go through all of the trouble of collecting writes in RAM and writing them out in a big stream, or of making sure that your cache is always populated with appropriate data because you'd be better off just reading or writing straight from the disks. Your stress points move to other parts of the array because now your disk back end can give you maybe a million IOPS but your controller may not have the internal bandwidth to support that, or the bandwidth between controller and shelves, or the CPU headroom, or the RAM required to put all of that data on the network queue, or the OS may have some inefficiencies that get exposed well before you reach that point. Your bottlenecks move and you end up looking at a different set of problems than the ones you spent 20 years solving.

And then you've got the issues that come up with SSD wear that directly affect the way you might choose to do raid, or write allocation, or disk pre-failure and testing.

These are good problems to have for the storage industry in general because SSD and flash are worlds better than spinning disk in a lot of ways, but for a company like Nimble with limited resources they are likely going to have to make some hard choices regarding whether to devote time to improving their existing products that their customers are running right now, or whether to engineer an all flash array to compete in the market that will exist 5 years down the road. I don't think they are big enough to do both and either choice leaves them with some problems. Larger companies have enough capital to devote resources to both, or to buy a mature flash company. And newer startups that are already on the flash bandwagon don't have an existing customer base to support on what will end up being legacy technology. Nimble is just kind of stuck.

That's just my opinion though, and I could be wildly off regarding where the industry ends up in 10 years.

Amandyke
Nov 27, 2004

A wha?

Goon Matchmaker posted:

It's still storage for dummies...

Also keep in mind that VNX arrays don't wait for a hardware failure to fault out a drive. If the software determines that the health of the drive is compromised (soft errors, etc.) it will proactively copy the drive to the hot spare, fault the drive out, and send an email to the array contact (and EMC if so configured) informing you of the failure. So you only go into a state where you rely upon the parity on the rare occasions where the hot spare is being rebuilt from parity due to a catastrophic drive failure, or you've run out of hot spares and another drive has faulted out.

So sitting on a raid5 and relying on single parity isn't as terrible as you might initially imagine.

Crackbone
May 23, 2003

Vlaada is my co-pilot.

I'm trying to get access to NetApp's site and holy poo poo I have NEVER seen a process so hosed up. Initial registration says wait 24 hours for approval. Okay, I'll wait 24 hours - never mind that I can't even remember the last time I had to wait that long for account creation. Whoops, account creation failed; spent an hour on the phone getting it resolved, then the account is guest status - so gently caress YOU if you want to actually do anything on the site! Request authorization upgrade and hope it gets processed in 24-48 hours.

EDIT: Christ, googling shows people complaining about this process back in 2008.

Crackbone fucked around with this message at 19:43 on Mar 14, 2013

Beelzebubba9
Feb 24, 2004

NippleFloss posted:

Interesting stuff.

I could be wildly off base here, but I still see a future for spinning disks in many storage environments. From my understanding SSDs are amazing for random reads/writes (many orders of magnitude faster than spinning disk), decent at sequential IO (2-3 times faster than spinning disk), and pretty terrible at cost:capacity (about 1/10th of spinning disk). Because of the issues with write endurance that come with shrinking NAND dies, it's not safe to assume we'll continue to see the same linear effect of process advances that we have for the last three decades in the CPU and DRAM markets. Combine that with the uncertain future of our existing CMOS fabrication processes at nodes smaller than 10nm, and it's unlikely that NAND will ever come close to the price per capacity of magnetic rotating media. MRAM or some of the ferric memory technologies coming down the pipe might radically change this, but those are too far off to make predictions about.

So even if you assume that process shrinks (and other advances in flash-based storage) close the effective cost-per-capacity gap to half of what it is today, you're still looking at many times the cost per capacity by going all-flash versus a hybrid solution. You know more about common workloads than I do, but I assume the majority of the SAN market lies between super-fast all-flash arrays like XtremIO or Violin Memory and the big-boxes-of-SATA-disks like Data Domain's products. And it seems that most common use cases are best served either by a couple of products with specific strengths and weaknesses, or a hybrid array that can tier LUNs as most do today.

The main difference in our perspectives is that I think NAND won't be able to close the cost gap before we run out of the ability to shrink the chips any further. Unless something shifts the value proposition harder in favor of all-flash for uses where huge random IO isn't needed but capacity is, I think hybrid solutions are the best bet for the foreseeable future. If NAND can close the cost gap to some reasonable level then everything I've surmised will be thrown out the window.

And let's face it, if Nimble is still around in 5 years, they'll either have been successful in the market or been purchased (likely by Cisco), so they should be able to fund R&D by then. But that's a big 'if'. :)

Gravel
Mar 11, 2013
Gravel is a delicious food for people.

NippleFloss posted:

This stuff is generally above my head, but basically an all-flash array flips the traditional storage paradigm upside down. For 20 years now storage companies have been focused on building arrays that hide the fact that disks are pretty slow relative to CPUs. They have battery-backed write caches, and read caches, and automated storage tiering processes and short stroking, which are essentially all hacks to get around the fact that spinning platters are orders of magnitude slower than CPU operations.

SSDs are also orders of magnitude slower than RAM/CPU - they're just not as far behind as platters are. A solid SSD can get you a few hundred MB/s of writes, while DDR3-1333, which has been out for years, has a 10GB/s transfer rate. Disk I/O is just the slowest thing you can do in an application.

quote:

Now when you replace all of the back end disk with SSD you're doing a ton of wasted work. You've got all of these mechanisms in place to solve problems that don't exist anymore because disk access is orders of magnitude faster so you don't have to go through all of the trouble of collecting writes in RAM and writing them out in a big stream, or of making sure that your cache is always populated with appropriate data because you'd be better off just reading or writing straight from the disks.

I wouldn't say SSDs are generally that much faster than HDDs (you could pick a slow HDD and a fast SSD and it might be a single order of magnitude for some workflows, maybe more for random reads/writes, but practically speaking that's not likely going to be what you're comparing). You almost certainly want some sort of faster write cache even if you're running all-SSD. 1 million IOPS (with the caveat that we haven't said anything about what type of IOPS these are) just as a consequence of "having flash" is way too high.

quote:

for a company like Nimble with limited resources they are likely going to have to make some hard choices regarding whether to devote time to improving their existing products that their customers are running right now, or whether to engineer an all flash array to compete in the market that will exist 5 years down the road. I don't think they are big enough to do both and either choice leaves them with some problems. Larger companies have enough capital to devote resources to both, or to buy a mature flash company. And newer startups that are already on the flash bandwagon don't have an existing customer base to support on what will end up being legacy technology. Nimble is just kind of stuck.

On a philosophical level I agree with you, if only because any hybrid solution is going to be limited somehow by the presence of slower spinning disks in it. At the very least, that means you suddenly have "fast" storage and "slow" storage and you need to figure out what goes on which and when, which is an annoying problem to try to solve. And what you said earlier is totally correct: the problem of converting random writes to sequential writes doesn't really exist if you're using all-flash - there are reasons like disk wear to use a log-structured file system on flash drives, but they're pretty good at random access.

Practically speaking, I just don't see anything from Nimble that really impresses me. It'd have to be way cheaper, way faster, or way more efficient than it is.

Gravel fucked around with this message at 20:23 on Mar 14, 2013

evil_bunnY
Apr 2, 2003

Crackbone posted:

EDIT: Christ, googling shows people complaining about this process back in 2008.
Their website has been a loving disaster forever.

Crackbone
May 23, 2003

Vlaada is my co-pilot.

evil_bunnY posted:

Their website has been a loving disaster forever.

More than anything I'm shocked that a tech company in 2013 thinks waiting 3-4 days for access to vital software is acceptable. All the tech support people were just like "that's how it is, homie", and even my reseller contact was being a poo poo about it.

No, I'm not just excited to play with my new toy, I have a project timeline. I didn't expect to factor in 3 days for downloading software. :negative:

evil_bunnY
Apr 2, 2003

The account manager for NetApp Sweden is the biggest piece of work I've ever had the displeasure of working with, too.

It's a shame because their kit is nice enough.

YOLOsubmarine
Oct 19, 2004

When asked which Pokemon he evolved into, Kamara pauses.

"Motherfucking, what's that big dragon shit? That orange motherfucker. Charizard."


My beliefs are based firmly on the idea that the price/capacity ratio of SSD will reach a favorable number in the next decade or so. If that doesn't materialize due to manufacturing limitations or scarce resources, or whatnot, then I'll end up being off base. I will say that SSD doesn't need to reach the same price/GB ratio that HDD has to supersede it as the default method of data storage. NL-SAS and SATA have already reached the point where the iops/capacity ratio renders them all but unusable for all but the most latency-tolerant of workloads. It's very easy at this point to over-buy capacity simply to get the spindle count you need if you are buying 7.2k disk, and even 15k disk is reaching that point, though it definitely has a better price-iops-capacity triplet than a 4TB 7.2k disk.

I think HDD will still be prevalent for certain things. Highly sequential workloads where ingest rate and long-term retention are important will still be good candidates for spinning platters, because the difference in throughput between SSD and HDD is not that great. Backup and archival will still be big consumers of HDD, and will replace tape in that capacity, but that's going to be stuff like Data Domain, or Avamar, or something more specialized towards that purpose.

On the other hand, most shared storage workloads are random. Even if an application itself is largely sequential, the aggregate effect of running a bunch of sequential workloads on the same set of disks is a random workload. And for a random workload SSD is as good as it gets. The widespread acceptance of flash as a caching mechanism is a tacit admission that the iops/capacity ratios of HDDs are getting out of whack. Without that cache, storage vendors need to sell you a bunch more spindles just to get your performance where you want it to be, but that extra capacity is basically wasted. If it wasn't wasted then they wouldn't sell you cache to reduce the spindle count. If the price per GB gets to half of what current hard drives can provide then SSD will be the better choice a lot of the time, because you can buy an SSD stack that's sized appropriately for your application and will perform better, versus an HDD stack that requires many more spindles to provide the same performance and ends up costing a lot more and leaving you with a lot of wasted capacity.

Caching really only solves that problem part of the way because it's really hard to cache random data, since, you know, it's random. It can still be done fairly well, but it's far from perfect and is at best a bandaid to get around the problem of hard drive performance basically standing still while hard drive capacity goes way, way up. I'm not a huge fan of either sub-LUN tiering or flash-based block-level caches, as I think people are prone to see both as magic that precludes the need for appropriate sizing, when all they really do is make it incredibly obvious when things have been sized incorrectly because your latencies are completely erratic.


Gravel posted:

SSDs are also orders of magnitude slower than RAM/CPU - they're just not as far behind as platters are. A solid SSD can get you a few hundred MB/s of writes, while DDR3-1333, which has been out for years, has a 10GB/s transfer rate. Disk I/O is just the slowest thing you can do in an application.

I wouldn't say SSDs are generally that much faster than HDDs (you could pick a slow HDD and a fast SSD and it might be a single order of magnitude for some workflows, maybe more for random reads/writes, but practically speaking that's not likely going to be what you're comparing). You almost certainly want some sort of faster write cache even if you're running all-SSD. 1 million IOPS (with the caveat that we haven't said anything about what type of IOPS these are) just as a consequence of "having flash" is way too high.

If you're talking throughput SSDs aren't that much faster. If you're talking latency, even slow SSDs are significantly faster than HDDs, and latency is really the problem you're trying to fix if you're talking about SSD, or flash, or RAM, or anything solid state. An SSD returning a 4k operation in .05ms is a rounding error compared to what an HDD will provide.

Sure, you'll still want RAM in a storage system to act as a short-term cache for recently read/written data because RAM is even faster still due to its architecture and more direct access to/from the CPU. But you won't need to spend a ton of time and resources trying to build a dedicated cache layer to hide the fact that your back end store is really drat slow compared to everything else around it.

And a lot of this doesn't even touch on things like cooling and power consumption, which will definitely drive adoption of solid state and help offset some of the cost premium you might see with solid state solutions.

P.S. Crackbone: I'm sorry your NetApp experience has sucked so far. If you want to PM me your NetApp support site account information I can see if I can get things sped up. No promises, but I can at least see what I can do.

YOLOsubmarine fucked around with this message at 23:13 on Mar 14, 2013

Crackbone
May 23, 2003

Vlaada is my co-pilot.

NippleFloss posted:

P.S. Crackbone: I'm sorry your NetApp experience has sucked so far. If you want to PM me your NetApp support site account information I can see if I can get things sped up. No promises, but I can at least see what I can do.

Appreciate the offer, but it should be resolved shortly. About 2 hours on the phone today, hassling support/reseller, and they're claiming the account should be ready by tonight.

Gravel
Mar 11, 2013
Gravel is a delicious food for people.

NippleFloss posted:

If you're talking throughput SSDs aren't that much faster. If you're talking latency, even slow SSDs are significantly faster than HDDs, and latency is really the problem you're trying to fix if you're talking about SSD, or flash, or RAM, or anything solid state. An SSD returning a 4k operation in .05ms is a rounding error compared to what an HDD will provide.

I was totally not even considering access time, and I really don't know why. I was just like "I swear I've read throughput numbers that indicate that's not the case". So yeah, I'm dumb, disregard that bit.

quote:

Sure, you'll still want RAM in a storage system to act as a short-term cache for recently read/written data because RAM is even faster still due to its architecture and more direct access to/from the CPU. But you won't need to spend a ton of time and resources trying to build a dedicated cache layer to hide the fact that your back end store is really drat slow compared to everything else around it.

I was talking about something like NVRAM, to be clear here - you may be as well - but SSDs aren't even necessarily fast enough to keep up with certain write demands*.

e: *In a SAN. You can throw tons of data at SSDs blithely but once you start talking about deduplication, compression, mirroring, yadda yadda.

Gravel fucked around with this message at 23:49 on Mar 14, 2013

YOLOsubmarine
Oct 19, 2004

When asked which Pokemon he evolved into, Kamara pauses.

"Motherfucking, what's that big dragon shit? That orange motherfucker. Charizard."

Gravel posted:

I was talking about something like NVRAM, to be clear here - you may be as well - but SSDs aren't even necessarily fast enough to keep up with certain write demands*.

e: *In a SAN. You can throw tons of data at SSDs blithely but once you start talking about deduplication, compression, mirroring, yadda yadda.

I agree. I think NVRAM will still be used as a way to make sure writes are acknowledged as quickly as possible because for writes it still makes sense to divorce the acknowledgement to the client from the actual act of writing it to disk since, as you say, you might want to compress or deduplicate it inline, or you might be a vendor that runs a traditional raid 5/6 implementation and NVRAM is a way to mask the raid hole and the performance implications of additional parity reads and writes. Lots of very good reasons to still use NVRAM, but it will still be a very small amount relative to the total size of the disk store, and for vendors like NetApp or Nimble it doesn't make sense to also use a lot of system memory and CPU time figuring out how to sequentialize those writes when it just doesn't matter at all on SSD.

I don't think SSD is a panacea. It's still possible to undersize for a workload with SSD and I know of specific instances where it has happened for things like Oracle transaction logging which is much more throughput bound than random IO bound. You've still got to make sure you have enough spindles to support your workload, it just gets MUCH easier to do that for a lot of workloads when a single spindle can do 20,000 IOPs versus a 15k drive doing maybe 250 at the upper end.
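
Quick arithmetic behind that point, using the rough per-drive figures quoted above (ballpark numbers, not measurements):

code:

# Spindle-count sizing with the rough per-drive IOPS figures quoted above.
import math

target_iops = 20_000
iops_per_15k_hdd = 250      # upper end for a 15k drive, per the post
iops_per_ssd = 20_000       # the single-SSD figure, per the post

print(math.ceil(target_iops / iops_per_15k_hdd))  # 80 x 15k drives
print(math.ceil(target_iops / iops_per_ssd))      # 1 SSD (before RAID/spares)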

I like this discussion though. I'm definitely interested in hearing what other people think about the future of the industry. I think that SSD may end up being a disruptive technology for storage vendors (and maybe for system and application designers in general). When every vendor can generate 1 million SPC-1 IOPs just by throwing a few dozen SSDs at it, without requiring massive scale out, or customized ASICs or massive global caches then you get to focus a lot more on the interesting features that a particular vendor brings to the table and traditional FUD along the lines of "sure, it sounds nice, but will it actually run your applications?" doesn't work anymore.

Gravel
Mar 11, 2013
Gravel is a delicious food for people.

NippleFloss posted:

Lots of very good reasons to still use NVRAM, but it will still be a very small amount relative to the total size of the disk store, and for vendors like NetApp or Nimble it doesn't make sense to also use a lot of system memory and CPU time figuring out how to sequentialize those writes when it just doesn't matter at all on SSD.

Yeah, the architectures I'm most familiar with use a <10GB NVRAM as the first destination for writes, then flush that as needed.

quote:

I don't think SSD is a panacea.

If it was you'd figure we'd see more of 'em around already :).

quote:

I like this discussion though. I'm definitely interested in hearing what other people think about the future of the industry. I think that SSD may end up being a disruptive technology for storage vendors (and maybe for system and application designers in general). When every vendor can generate 1 million SPC-1 IOPs just by throwing a few dozen SSDs at it, without requiring massive scale out, or customized ASICs or massive global caches then you get to focus a lot more on the interesting features that a particular vendor brings to the table and traditional FUD along the lines of "sure, it sounds nice, but will it actually run your applications?" doesn't work anymore.

I might offer the thought that a lot of service providers haven't spent a ton of time considering how to best take advantage of a much faster backend. I tend to think of most big storage applications as not requiring a high amount of performance, or a lot of on-demand performance. If you're Carbonite, and your primary use case is "customers select folders to back up and it gets uploaded to our storage", you're probably concerned much more about $/GB than anything, because as long as you can keep up with the slow plod of residential internet service upload bandwidth, you're fine, and I think a lot of service providers are in exactly that type of position (though I'm sure there are other examples of companies that aren't). As the gap between SSD and HDD price/capacity narrows and providers come up with better ways to take advantage of the increased speed, I think we'll see enterprise storage utilizing flash, and in particular all-flash arrays, take off. Doing more reading: the company I linked earlier in the thread claims to offer competitive $/(effective)GB WITH all the other advantages of flash, I know Pure Storage has an all-flash array, and XtremIO has one at least in the pipeline as well. So maybe the price is there now, and it's just a matter of time before we see a lot more SSDs on the market.

Crackbone
May 23, 2003

Vlaada is my co-pilot.

Silly question - several of the license keys on my unit that were preinstalled (NFS, CIFS, iSCSI) are not the same as the ones listed in my license documents. Not only that, under the CLI they display with the word site in front of the key.

What's up with that? Should I be entering in the keys listed on our documentation?

YOLOsubmarine
Oct 19, 2004

When asked which Pokemon he evolved into, Kamara pauses.

"Motherfucking, what's that big dragon shit? That orange motherfucker. Charizard."

Crackbone posted:

Silly question - several of the license keys on my unit that were preinstalled (NFS, CIFS, iSCSI) are not the same as the ones listed in my license documents. Not only that, under the CLI they display with the word site in front of the key.

What's up with that? Should I be entering in the keys listed on our documentation?

Use the licenses provided with the device. The only valid site licenses are for things like FlashCache and Open Systems Snapvault.

Crackbone
May 23, 2003

Vlaada is my co-pilot.

Looking to do our aggregate setup on a 2220 (12 600GB disks, dual controllers). Looking to maximize space without taking a total dump on best practices, but it's a small budget/environment, so something has to give. We've got 4-hour onsite support, but the unit is going to be a SPOF, so I need to make sure I'm not hanging our dick out in the wind or creating something NetApp won't touch in the event of a problem.

Current idea is active/active: controller A with an aggregate of 8 (maybe more) disks, 1 hot spare, and both the root volume and storage volumes on it. This would host our VMware datastores. Controller B would be active but relegated to a small amount of non-critical storage (ISOs, etc.), and primarily there for failover.

With that in mind, is it sane to convert the controller 2 aggregate to RAID4 with no hot spare? Even in that config it would be a huge step up in our protection; we'll have storage on raid, with hot spare, with failover controller, with 4 hour response time on part replacement.

j3rkstore
Jan 28, 2009

L'esprit d'escalier
I believe that if the second controller doesn't have a spare disk allocated it will most likely freak out and shut down.

I just recently deployed a 2220 with the same active/active 12x600 disk config and similar budget/environmental concerns. I ended up migrating aggr0 on both controllers to RAID4, which allowed for an 8 drive aggregate on Controller A while still maintaining one spare for each.
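
For what it's worth, one way the 12 disks could break down under that scheme (an inference from the description above, not a documented config):

code:

# One possible accounting of the 12 x 600GB disks under the layout described
# above (an inference from the post, not a documented NetApp config).
layout = {
    "ctrl A: data aggregate, RAID4 (7 data + 1 parity)": 8,
    "ctrl A: hot spare": 1,
    "ctrl B: root aggr0, RAID4 (1 data + 1 parity)": 2,
    "ctrl B: hot spare": 1,
}
assert sum(layout.values()) == 12
print(7 * 600, "GB of raw data capacity on controller A (before right-sizing)")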
