

RE: Data deduplication


I’ve used dedupe on both NFS and CIFS (CIFS currently) and seen varying results: up to 60% savings on engineering data, and as low as 20% on CIFS art data (i.e. JPEGs).  While I like dedupe (as it is free…), it isn’t an ideal disk space saver because it runs on a daily schedule (the frequency could be increased, but that would affect CPU).
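A rough way to see why savings vary so much by data type is to count how many fixed-size blocks in a sample repeat. The sketch below is only an illustration: the 4 KB block size and the name `block_dedupe_savings` are assumptions, and it is not how any particular filer implements dedupe.

```python
import hashlib


def block_dedupe_savings(data: bytes, block_size: int = 4096) -> float:
    """Return the fraction of fixed-size blocks that duplicate another block."""
    if not data:
        return 0.0
    seen = set()   # fingerprints of the unique blocks found so far
    total = 0      # total number of blocks examined
    for offset in range(0, len(data), block_size):
        block = data[offset:offset + block_size]
        seen.add(hashlib.sha256(block).digest())
        total += 1
    return 1.0 - len(seen) / total


# Three identical blocks plus one different block: 4 blocks, 2 unique.
sample = b"A" * 4096 * 3 + b"B" * 4096
print(f"estimated savings: {block_dedupe_savings(sample):.0%}")
```

Already-compressed files such as JPEGs rarely share identical blocks, which fits the lower savings on art data.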

Had some pretty good success with a product called Storwize, which does on-the-fly compression (saving around 50%).  The benefit is that it compresses before writes occur, so there are fewer writes and you see the savings right away, before the data hits the disk.  It saves on backup tapes and time as well, as the data doesn’t have to be decompressed on read/write to tape.  Once it supports iSCSI as well, I’ll definitely take a closer look at it for the environment I’m currently in.
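For a quick feel of what inline compression might save on a given data set, a sample can be run through a general-purpose compressor. This is a minimal sketch using Python's `zlib`, not Storwize's algorithm; the name `compression_savings` and the sample data are made up for illustration.

```python
import os
import zlib


def compression_savings(data: bytes, level: int = 6) -> float:
    """Fraction of space saved by zlib-compressing data (can be slightly negative)."""
    if not data:
        return 0.0
    return 1.0 - len(zlib.compress(data, level)) / len(data)


# Highly repetitive, log-like text compresses very well.
text_like = b"render_job frame=0001 status=ok\n" * 2000
# Random bytes stand in for already-compressed data such as JPEGs.
random_like = os.urandom(64_000)

print(f"text-like:   {compression_savings(text_like):.0%}")
print(f"random-like: {compression_savings(random_like):.0%}")
```

Real numbers depend entirely on the data, so it is worth sampling actual project files rather than trusting a headline figure.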

 

Again, I like dedupe, but I don’t think it’s the be-all and end-all.  I also had issues with various incremental ndmpcopy runs when a dedupe ran in between them…

 

Cheers,

-Adrian

 

From: studiosysadmins-discuss-bounces@mailman.studiosysadmins.com [mailto:studiosysadmins-discuss-bounces@mailman.studiosysadmins.com] On Behalf Of Stefan Maletych
Sent: Wednesday, February 24, 2010 6:21 AM
To: Stefan Maletych
Cc: discuss@studiosysadmins.com
Subject: Re: Data deduplication

 

Sorry, I should specify that a reboot of the server seems to fix it... although only temporarily...



Sent from my iPhone


On 2010-02-24, at 9:18 AM, Stefan Maletych <stefan.maletych@rogers.com> wrote:

Yeah, we've got it running here on 2008 (just recently) 

 

Lately our server has been quite slow with Vista/Win7 and OS X machines (almost overnight, which makes me suspect SIS)

 

The server is fine with Linux, XP32 and XP64, but is abysmal on the Vista/7 platform along with Leopard and Snow Leopard (which rules out remote differential compression and autotuning... although I did turn them off and tried all sorts of combinations)

 

When we copy to or from the server, we'll randomly get errors like "file or folder not found", "cannot copy, file was not found", etc. when the file and folder do indeed exist!

 

And, these are even files or folders that have been deduplicated - they're the only copy on the server

 

All updates are installed on the workstations and server

 

Also, a reboot seems to fix it....it's so strange...

Sent from my iPhone


On 2010-02-24, at 8:57 AM, Barry Zubel <barry@annix.com> wrote:

We use SIS on Storage Server 2008.

There is a service that runs periodically (called the Groveler) which searches for identical files, moves a single copy into the SIS Common Store, and replaces each duplicate with a link to it (a reparse point, loosely the Windows equivalent of a symlink).  This is handled at the filesystem level and is transparent to machines accessing the files across a share.

If either of the files is changed independently, a new copy of the file is created (using copy-on-write) and the link is removed.
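To get a feel for how much a file-level scan like this could reclaim before turning it on, something along these lines works. It is only a read-only sketch of the "find identical files" step; the function names are hypothetical, it is not the Groveler's actual algorithm, and it never modifies or links anything.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path: str, chunk_size: int = 1 << 20) -> str:
    """SHA-256 of a file's contents, read in chunks to keep memory use low."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(root: str) -> dict:
    """Map (size, digest) -> list of paths, for groups of identical files."""
    by_size = defaultdict(list)
    for dirpath, _dirs, files in os.walk(root):
        for name in files:
            path = os.path.join(dirpath, name)
            if os.path.isfile(path) and not os.path.islink(path):
                by_size[os.path.getsize(path)].append(path)

    groups = defaultdict(list)
    for size, paths in by_size.items():
        # Only hash files that share a size with at least one other file.
        if size == 0 or len(paths) < 2:
            continue
        for path in paths:
            groups[(size, file_digest(path))].append(path)
    return {key: paths for key, paths in groups.items() if len(paths) > 1}


def reclaimable_bytes(groups: dict) -> int:
    """Bytes saved if each group of identical files kept a single copy."""
    return sum(size * (len(paths) - 1) for (size, _), paths in groups.items())
```

Calling `reclaimable_bytes(find_duplicates(r"D:\projects"))` (the path is just an example) gives an upper bound on what whole-file dedupe could save on that tree.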

From my experience, it saved us around 10-15% on our primary storage, and the load from the Groveler service is negligible (it runs as a background priority thread).

I've not found a GUI to control it - all configuration is done via the SISADMIN command line tool.  It also only works at the drive level - you can't independently turn it on for folders.

HTH

B.


On 24/02/2010 06:17, Stefan Maletych wrote:

Yes, I've read a bit about the NetApp stuff - but what I'm more interested in are all of your experiences and recommendations.... your trials and your errors....

 

;)

Sent from my iPhone


On 2010-02-24, at 12:21 AM, James Bourne - DRD <james.bourne@drdstudios.com> wrote:

Check this out:

http://www.netapp.com/us/products/platform-os/dedupe.html

We see about a 30% space saving.

j.

On Wed, Feb 24, 2010 at 4:03 PM, Stefan Maletych <stefan.maletych@rogers.com> wrote:

Hi everyone,

Can any of you give me a quick rundown of deduplication?

I.e. what works best in your experience - ZFS, Windows SIS.... what problems, incompatibilities and issues with certain software I can expect, rendering issues, etc.?

It doesn't need to be in depth - generalizations are fine...

Thanks!!!
Stefan

Sent from my iPhone
_
StudioSysAdmins-Discuss mailing list
StudioSysAdmins-Discuss@mailman.studiosysadmins.com
http://mailman.studiosysadmins.com/mailman/listinfo/studiosysadmins-discuss




--
Regards,

James

Dr D Studios
30 Orwell Street
Potts Point NSW 2011
Australia

w: http://www.drdstudios.com
p: +61 2 9357 2322
m: +61 405 803 712
f: + 61 2 9356 3162

 
