git.sur5r.net Git - bacula/bacula/blob - bacula/projects

   1
   2 Projects:
   3                      Bacula Projects Roadmap
   4                     Status updated 26 August 2008
   5
   6 Items Completed:
   7
   8 Summary:
   9 Item  1:  Accurate restoration of renamed/deleted files
  10 Item  2:  Allow FD to initiate a backup
  11 Item  3:  Merge multiple backups (Synthetic Backup or Consolidation)
  12 Item  4:  Implement Catalog directive for Pool resource in Director
  13 Item  5:  Add an item to the restore option where you can select a Pool
  14 Item  6:  Deletion of disk Volumes when pruned
  15 Item  7:  Implement Base jobs
  16 Item  8:  Implement Copy pools
  17 Item  9:  Scheduling syntax that permits more flexibility and options
  18 Item 10:  Message mailing based on backup types
  19 Item 11:  Cause daemons to use a specific IP address to source communications
  20 Item 12:  Add Plug-ins to the FileSet Include statements.
  21 Item 13:  Restore only file attributes (permissions, ACL, owner, group...)
  22 Item 14:  Add an override in Schedule for Pools based on backup types
  23 Item 15:  Implement more Python events and functions
  24 Item 16:  Allow inclusion/exclusion of files in a fileset by creation/mod times
  25 Item 17:  Automatic promotion of backup levels based on backup size
  26 Item 18:  Better control over Job execution
  27 Item 19:  Automatic disabling of devices
  28 Item 20:  An option to operate on all pools with update vol parameters
  29 Item 21:  Include timestamp of job launch in "stat clients" output
  30 Item 22:  Implement Storage daemon compression
  31 Item 23:  Improve Bacula's tape and drive usage and cleaning management
  32 Item 24:  Multiple threads in file daemon for the same job
  33 Item 25:  Archival (removal) of User Files to Tape
  34
  35
  36 Item  1:  Accurate restoration of renamed/deleted files
  37   Date:   28 November 2005
  38   Origin: Martin Simmons (martin at lispworks dot com)
  39   Status:
  40
  41   What:   When restoring a fileset for a specified date (including "most
  42           recent"), Bacula should give you exactly the files and directories
  43           that existed at the time of the last backup prior to that date.
  44
  45           Currently this only works if the last backup was a Full backup.
  46           When the last backup was Incremental/Differential, files and
  47           directories that have been renamed or deleted since the last Full
  48           backup are not currently restored correctly.  Ditto for files with
  49           extra/fewer hard links than at the time of the last Full backup.
  50
  51   Why:    Incremental/Differential would be much more useful if this worked.
  52
  53   Notes:  Merging of multiple backups into a single one seems to
  54           rely on this working, otherwise the merged backups will not be
  55           truly equivalent to a Full backup.
  56
  57   Note:   Kern: notes shortened. This can be done without the need for
  58           inodes. It is essentially the same as the current Verify job,
  59           but one additional database record must be written, which does
  60           not need any database change.
  61
  62   Notes:  Kern: see if we can correct restoration of directories if
  63           replace=ifnewer is set.  Currently, if the directory does not
  64           exist, a "dummy" directory is created, then when all the files
  65           are updated, the dummy directory is newer so the real values
  66           are not updated.
  67
  68 Item  2:  Allow FD to initiate a backup
  69   Origin: Frank Volf (frank at deze dot org)
  70   Date:   17 November 2005
  71   Status:
  72
  73    What:  Provide some means, possibly by a restricted console that
  74           allows a FD to initiate a backup, and that uses the connection
  75           established by the FD to the Director for the backup so that
  76           a Director that is firewalled can do the backup.
  77
  78    Why:   Makes backup of laptops much easier.
  79
  80
  81 Item  3:  Merge multiple backups (Synthetic Backup or Consolidation)
  82   Origin: Marc Cousin and Eric Bollengier
  83   Date:   15 November 2005
  84   Status:
  85
  86   What:   A merged backup is a backup made without connecting to the Client.
  87           It would be a Merge of existing backups into a single backup.
  88           In effect, it is like a restore but to the backup medium.
  89
  90           For instance, say that last Sunday we made a full backup.  Then
  91           all week long, we created incremental backups, in order to do
  92           them fast.  Now comes Sunday again, and we need another full.
  93           The merged backup makes it possible to do instead an incremental
  94           backup (during the night for instance), and then create a merged
  95           backup during the day, by using the full and incrementals from
  96           the week.  The merged backup will be exactly like a full made
  97           Sunday night on the tape, but the production interruption on the
  98           Client will be minimal, as the Client will only have to send
  99           incrementals.
 100
 101           In fact, if it's done correctly, you could merge all the
 102           Incrementals into single Incremental, or all the Incrementals
 103           and the last Differential into a new Differential, or the Full,
 104           last differential and all the Incrementals into a new Full
 105           backup.  And there is no need to involve the Client.
 106
 107   Why:    The benefit is that :
 108           - the Client just does an incremental ;
 109           - the merged backup on tape is just as a single full backup,
 110             and can be restored very fast.
 111
 112           This is also a way of reducing the backup data since the old
 113           data can then be pruned (or not) from the catalog, possibly
 114           allowing older volumes to be recycled
 115
 116 Item  4:  Implement Catalog directive for Pool resource in Director
 117   Origin: Alan Davis adavis@ruckus.com
 118   Date:   6 March 2007
 119   Status: Submitted
 120
 121   What:   The current behavior is for the director to create all pools
 122           found in the configuration file in all catalogs.  Add a
 123           Catalog directive to the Pool resource to specify which
 124           catalog to use for each pool definition.
 125
 126   Why:    This allows different catalogs to have different pool
 127           attributes and eliminates the side-effect of adding
 128           pools to catalogs that don't need/use them.
 129
 130   Notes:  Kern: I think this is relatively easy to do, and it is really
 131           a pre-requisite to a number of the Copy pool, ... projects
 132           that are listed here.
 133
 134 Item  5:  Add an item to the restore option where you can select a Pool
 135   Origin: kshatriyak at gmail dot com
 136     Date: 1/1/2006
 137   Status:
 138
 139     What: In the restore option (Select the most recent backup for a
 140           client) it would be useful to add an option where you can limit
 141           the selection to a certain pool.
 142
 143      Why: When using cloned jobs, most of the time you have 2 pools - a
 144           disk pool and a tape pool.  People who have 2 pools would like to
 145           select the most recent backup from disk, not from tape (tape
 146           would be only needed in emergency).  However, the most recent
 147           backup (which may just differ a second from the disk backup) may
 148           be on tape and would be selected.  The problem becomes bigger if
 149           you have a full and differential - the most "recent" full backup
 150           may be on disk, while the most recent differential may be on tape
 151           (though the differential on disk may differ even only a second or
 152           so).  Bacula will complain that the backups reside on different
 153           media then.  For now the only solution now when restoring things
 154           when you have 2 pools is to manually search for the right
 155           job-id's and enter them by hand, which is a bit fault tolerant.
 156
 157   Notes:  Kern: This is a nice idea. It could also be the way to support
 158           Jobs that have been Copied (similar to migration, but not yet
 159           implemented).
 160
 161
 162
 163 Item  6:  Deletion of disk Volumes when pruned
 164   Date:   Nov 25, 2005
 165   Origin: Ross Boylan <RossBoylan at stanfordalumni dot org> (edited
 166           by Kern)
 167   Status:
 168
 169    What:  Provide a way for Bacula to automatically remove Volumes
 170           from the filesystem, or optionally to truncate them.
 171           Obviously, the Volume must be pruned prior removal.
 172
 173   Why:    This would allow users more control over their Volumes and
 174           prevent disk based volumes from consuming too much space.
 175
 176   Notes:  The following two directives might do the trick:
 177
 178           Volume Data Retention = <time period>
 179           Remove Volume After = <time period>
 180
 181           The migration project should also remove a Volume that is
 182           migrated. This might also work for tape Volumes.
 183
 184 Item  7:  Implement Base jobs
 185   Date:   28 October 2005
 186   Origin: Kern
 187   Status:
 188
 189   What:   A base job is sort of like a Full save except that you
 190           will want the FileSet to contain only files that are
 191           unlikely to change in the future (i.e.  a snapshot of
 192           most of your system after installing it).  After the
 193           base job has been run, when you are doing a Full save,
 194           you specify one or more Base jobs to be used.  All
 195           files that have been backed up in the Base job/jobs but
 196           not modified will then be excluded from the backup.
 197           During a restore, the Base jobs will be automatically
 198           pulled in where necessary.
 199
 200   Why:    This is something none of the competition does, as far as
 201           we know (except perhaps BackupPC, which is a Perl program that
 202           saves to disk only).  It is big win for the user, it
 203           makes Bacula stand out as offering a unique
 204           optimization that immediately saves time and money.
 205           Basically, imagine that you have 100 nearly identical
 206           Windows or Linux machine containing the OS and user
 207           files.  Now for the OS part, a Base job will be backed
 208           up once, and rather than making 100 copies of the OS,
 209           there will be only one.  If one or more of the systems
 210           have some files updated, no problem, they will be
 211           automatically restored.
 212
 213   Notes:  Huge savings in tape usage even for a single machine.
 214           Will require more resources because the DIR must send
 215           FD a list of files/attribs, and the FD must search the
 216           list and compare it for each file to be saved.
 217
 218
 219 Item  8:  Implement Copy pools
 220   Date:   27 November 2005
 221   Origin: David Boyes (dboyes at sinenomine dot net)
 222   Status:
 223
 224   What:   I would like Bacula to have the capability to write copies
 225           of backed-up data on multiple physical volumes selected
 226           from different pools without transferring the data
 227           multiple times, and to accept any of the copy volumes
 228           as valid for restore.
 229
 230   Why:    In many cases, businesses are required to keep offsite
 231           copies of backup volumes, or just wish for simple
 232           protection against a human operator dropping a storage
 233           volume and damaging it. The ability to generate multiple
 234           volumes in the course of a single backup job allows
 235           customers to simple check out one copy and send it
 236           offsite, marking it as out of changer or otherwise
 237           unavailable. Currently, the library and magazine
 238           management capability in Bacula does not make this process
 239           simple.
 240
 241           Restores would use the copy of the data on the first
 242           available volume, in order of Copy pool chain definition.
 243
 244           This is also a major scalability issue -- as the number of
 245           clients increases beyond several thousand, and the volume
 246           of data increases, transferring the data multiple times to
 247           produce additional copies of the backups will become
 248           physically impossible due to transfer speed
 249           issues. Generating multiple copies at server side will
 250           become the only practical option.
 251
 252   How:    I suspect that this will require adding a multiplexing
 253           SD that appears to be a SD to a specific FD, but 1-n FDs
 254           to the specific back end SDs managing the primary and copy
 255           pools.  Storage pools will also need to acquire parameters
 256           to define the pools to be used for copies.
 257
 258   Notes:  I would commit some of my developers' time if we can agree
 259           on the design and behavior.
 260
 261   Notes:  Additional notes from David:
 262           I think there's two areas where new configuration would be needed.
 263
 264           1) Identify a "SD mux" SD (specify it in the config just like a
 265           normal SD. The SD configuration would need something like a
 266           "Daemon Type = Normal/Mux" keyword to identify it as a
 267           multiplexor.  (The director code would need modification to add
 268           the ability to do the multiple session setup, but the impact of
 269           the change would be new code that was invoked only when a SDmux is
 270           needed).
 271
 272           2) Additional keywords in the Pool definition to identify the need
 273           to create copies.  Each pool would acquire a Copypool= attribute
 274           (may be repeated to generate more than one copy.  3 is about the
 275           practical limit, but no point in hardcoding that).
 276
 277           Example:
 278           Pool {
 279             Name = Primary
 280             Pool Type = Backup
 281             Copypool = Copy1
 282             Copypool = OffsiteCopy2
 283           }
 284
 285           where Copy1 and OffsiteCopy2 are valid pools.
 286
 287           In terms of function (shorthand): Backup job X is defined
 288           normally, specifying pool Primary as the pool to use.  Job gets
 289           scheduled, and Bacula starts scheduling resources.  Scheduler
 290           looks at pool definition for Primary, sees that there are a
 291           non-zero number of copypool keywords.  The director then connects
 292           to an available SDmux, passes it the pool ids for Primary, Copy1,
 293           and OffsiteCopy2 and waits.  SDmux then goes out and reserves
 294           devices and volumes in the normal SDs that serve Primary, Copy1
 295           and OffsiteCopy2.  When all are ready, the SDmux signals ready
 296           back to the director, and the FD is given the address of the SDmux
 297           as the SD to communicate with.  Backup proceeds normally, with the
 298           SDmux duplicating blocks to each connected normal SD, and
 299           returning ready when all defined copies have been written.  At
 300           EOJ, FD shuts down connection with SDmux, which closes down the
 301           normal SD connections and goes back to an idle state.  SDmux does
 302           not update database; normal SDs do (noting that file is present on
 303           each volume it has been written to).
 304
 305           On restore, director looks for the volume containing the file in
 306           pool Primary first, then Copy1, then OffsiteCopy2.  If the volume
 307           holding the file in pool Primary is missing or busy (being written
 308           in another job, etc), or one of the volumes from the copypool list
 309           that have the file in question is already mounted and ready for
 310           some reason, use it to do the restore, else mount one of the
 311           copypool volumes and proceed.
 312
 313
 314 Item  9:  Scheduling syntax that permits more flexibility and options
 315    Date:  15 December 2006
 316   Origin: Gregory Brauer (greg at wildbrain dot com) and
 317           Florian Schnabel <florian.schnabel at docufy dot de>
 318   Status:
 319
 320    What:  Currently, Bacula only understands how to deal with weeks of the
 321           month or weeks of the year in schedules.  This makes it impossible
 322           to do a true weekly rotation of tapes.  There will always be a
 323           discontinuity that will require disruptive manual intervention at
 324           least monthly or yearly because week boundaries never align with
 325           month or year boundaries.
 326
 327           A solution would be to add a new syntax that defines (at least)
 328           a start timestamp, and repetition period.
 329
 330           An easy option to skip a certain job  on a certain date.
 331
 332
 333      Why: Rotated backups done at weekly intervals are useful, and Bacula
 334           cannot currently do them without extensive hacking.
 335
 336           You could then easily skip tape backups on holidays.  Especially
 337           if you got no autochanger and can only fit one backup on a tape
 338           that would be really handy, other jobs could proceed normally
 339           and you won't get errors that way.
 340
 341
 342    Notes: Here is an example syntax showing a 3-week rotation where full
 343           Backups would be performed every week on Saturday, and an
 344           incremental would be performed every week on Tuesday.  Each
 345           set of tapes could be removed from the loader for the following
 346           two cycles before coming back and being reused on the third
 347           week.  Since the execution times are determined by intervals
 348           from a given point in time, there will never be any issues with
 349           having to adjust to any sort of arbitrary time boundary.  In
 350           the example provided, I even define the starting schedule
 351           as crossing both a year and a month boundary, but the run times
 352           would be based on the "Repeat" value and would therefore happen
 353           weekly as desired.
 354
 355
 356           Schedule {
 357               Name = "Week 1 Rotation"
 358               #Saturday.  Would run Dec 30, Jan 20, Feb 10, etc.
 359               Run {
 360                   Options {
 361                       Type   = Full
 362                       Start  = 2006-12-30 01:00
 363                       Repeat = 3w
 364                   }
 365               }
 366               #Tuesday.  Would run Jan 2, Jan 23, Feb 13, etc.
 367               Run {
 368                   Options {
 369                       Type   = Incremental
 370                       Start  = 2007-01-02 01:00
 371                       Repeat = 3w
 372                   }
 373               }
 374           }
 375
 376           Schedule {
 377               Name = "Week 2 Rotation"
 378               #Saturday.  Would run Jan 6, Jan 27, Feb 17, etc.
 379               Run {
 380                   Options {
 381                       Type   = Full
 382                       Start  = 2007-01-06 01:00
 383                       Repeat = 3w
 384                   }
 385               }
 386               #Tuesday.  Would run Jan 9, Jan 30, Feb 20, etc.
 387               Run {
 388                   Options {
 389                       Type   = Incremental
 390                       Start  = 2007-01-09 01:00
 391                       Repeat = 3w
 392                   }
 393               }
 394           }
 395
 396           Schedule {
 397               Name = "Week 3 Rotation"
 398               #Saturday.  Would run Jan 13, Feb 3, Feb 24, etc.
 399               Run {
 400                   Options {
 401                       Type   = Full
 402                       Start  = 2007-01-13 01:00
 403                       Repeat = 3w
 404                   }
 405               }
 406               #Tuesday.  Would run Jan 16, Feb 6, Feb 27, etc.
 407               Run {
 408                   Options {
 409                       Type   = Incremental
 410                       Start  = 2007-01-16 01:00
 411                       Repeat = 3w
 412                   }
 413               }
 414           }
 415
 416    Notes: Kern: I have merged the previously separate project of skipping
 417           jobs (via Schedule syntax) into this.
 418
 419
 420 Item 10:  Message mailing based on backup types
 421  Origin:  Evan Kaufman <evan.kaufman@gmail.com>
 422    Date:  January 6, 2006
 423  Status:
 424
 425    What:  In the "Messages" resource definitions, allowing messages
 426           to be mailed based on the type (backup, restore, etc.) and level
 427           (full, differential, etc) of job that created the originating
 428           message(s).
 429
 430  Why:     It would, for example, allow someone's boss to be emailed
 431           automatically only when a Full Backup job runs, so he can
 432           retrieve the tapes for offsite storage, even if the IT dept.
 433           doesn't (or can't) explicitly notify him.  At the same time, his
 434           mailbox wouldnt be filled by notifications of Verifies, Restores,
 435           or Incremental/Differential Backups (which would likely be kept
 436           onsite).
 437
 438  Notes:   One way this could be done is through additional message types, for example:
 439
 440    Messages {
 441      # email the boss only on full system backups
 442      Mail = boss@mycompany.com = full, !incremental, !differential, !restore,
 443             !verify, !admin
 444      # email us only when something breaks
 445      MailOnError = itdept@mycompany.com = all
 446    }
 447
 448    Notes: Kern: This should be rather trivial to implement.
 449
 450
 451 Item 11:  Cause daemons to use a specific IP address to source communications
 452  Origin:  Bill Moran <wmoran@collaborativefusion.com>
 453  Date:    18 Dec 2006
 454  Status:
 455  What:    Cause Bacula daemons (dir, fd, sd) to always use the ip address
 456           specified in the [DIR|DF|SD]Addr directive as the source IP
 457           for initiating communication.
 458  Why:     On complex networks, as well as extremely secure networks, it's
 459           not unusual to have multiple possible routes through the network.
 460           Often, each of these routes is secured by different policies
 461           (effectively, firewalls allow or deny different traffic depending
 462           on the source address)
 463           Unfortunately, it can sometimes be difficult or impossible to
 464           represent this in a system routing table, as the result is
 465           excessive subnetting that quickly exhausts available IP space.
 466           The best available workaround is to provide multiple IPs to
 467           a single machine that are all on the same subnet.  In order
 468           for this to work properly, applications must support the ability
 469           to bind outgoing connections to a specified address, otherwise
 470           the operating system will always choose the first IP that
 471           matches the required route.
 472  Notes:   Many other programs support this.  For example, the following
 473           can be configured in BIND:
 474           query-source address 10.0.0.1;
 475           transfer-source 10.0.0.2;
 476           Which means queries from this server will always come from
 477           10.0.0.1 and zone transfers will always originate from
 478           10.0.0.2.
 479
 480
 481 Item 12:  Add Plug-ins to the FileSet Include statements.
 482   Date:   28 October 2005
 483   Origin: Kern
 484   Status: Partially coded in 1.37 -- much more to do.
 485
 486   What:   Allow users to specify wild-card and/or regular
 487           expressions to be matched in both the Include and
 488           Exclude directives in a FileSet.  At the same time,
 489           allow users to define plug-ins to be called (based on
 490           regular expression/wild-card matching).
 491
 492   Why:    This would give the users the ultimate ability to control
 493           how files are backed up/restored.  A user could write a
 494           plug-in knows how to backup his Oracle database without
 495           stopping/starting it, for example.
 496
 497
 498 Item 13:  Restore only file attributes (permissions, ACL, owner, group...)
 499   Origin: Eric Bollengier
 500   Date:   30/12/2006
 501   Status: Implemented by Eric, see project-restore-attributes-only.patch
 502
 503   What:   The goal of this project is to be able to restore only rights
 504           and attributes of files without crushing them.
 505
 506   Why:    Who have never had to repair a chmod -R 777, or a wild update
 507           of recursive right under Windows? At this time, you must have
 508           enough space to restore data, dump attributes (easy with acl,
 509           more complex with unix/windows rights) and apply them to your
 510           broken tree. With this options, it will be very easy to compare
 511           right or ACL over the time.
 512
 513   Notes:  If the file is here, we skip restore and we change rights.
 514           If the file isn't here, we can create an empty one and apply
 515           rights or do nothing.
 516
 517           This will not work with win32 stream, because it seems that we
 518           can't split the WriteBackup stream to get only ACL and ownerchip.
 519
 520 Item 14:  Add an override in Schedule for Pools based on backup types
 521 Date:     19 Jan 2005
 522 Origin:   Chad Slater <chad.slater@clickfox.com>
 523 Status:
 524
 525   What:   Adding a FullStorage=BigTapeLibrary in the Schedule resource
 526           would help those of us who use different storage devices for different
 527           backup levels cope with the "auto-upgrade" of a backup.
 528
 529   Why:    Assume I add several new devices to be backed up, i.e. several
 530           hosts with 1TB RAID.  To avoid tape switching hassles, incrementals are
 531           stored in a disk set on a 2TB RAID.  If you add these devices in the
 532           middle of the month, the incrementals are upgraded to "full" backups,
 533           but they try to use the same storage device as requested in the
 534           incremental job, filling up the RAID holding the differentials.  If we
 535           could override the Storage parameter for full and/or differential
 536           backups, then the Full job would use the proper Storage device, which
 537           has more capacity (i.e. a 8TB tape library.
 538
 539
 540 Item 15:  Implement more Python events and functions
 541   Date:   28 October 2005
 542   Origin: Kern
 543   Status:
 544
 545   What:   Allow Python scripts to be called at more places
 546           within Bacula and provide additional access to Bacula
 547           internal variables.
 548
 549           Implement an interface for Python scripts to access the
 550           catalog through Bacula.
 551
 552   Why:    This will permit users to customize Bacula through
 553           Python scripts.
 554
 555   Notes:  Recycle event
 556           Scratch pool event
 557           NeedVolume event
 558           MediaFull event
 559
 560           Also add a way to get a listing of currently running
 561           jobs (possibly also scheduled jobs).
 562
 563
 564           to start the appropriate job.
 565
 566
 567 Item 16:  Allow inclusion/exclusion of files in a fileset by creation/mod times
 568   Origin: Evan Kaufman <evan.kaufman@gmail.com>
 569   Date:   January 11, 2006
 570   Status:
 571
 572   What:   In the vein of the Wild and Regex directives in a Fileset's
 573           Options, it would be helpful to allow a user to include or exclude
 574           files and directories by creation or modification times.
 575
 576           You could factor the Exclude=yes|no option in much the same way it
 577           affects the Wild and Regex directives.  For example, you could exclude
 578           all files modified before a certain date:
 579
 580    Options {
 581      Exclude = yes
 582      Modified Before = ####
 583    }
 584
 585            Or you could exclude all files created/modified since a certain date:
 586
 587    Options {
 588       Exclude = yes
 589      Created Modified Since = ####
 590    }
 591
 592            The format of the time/date could be done several ways, say the number
 593            of seconds since the epoch:
 594            1137008553 = Jan 11 2006, 1:42:33PM   # result of `date +%s`
 595
 596            Or a human readable date in a cryptic form:
 597            20060111134233 = Jan 11 2006, 1:42:33PM   # YYYYMMDDhhmmss
 598
 599   Why:    I imagine a feature like this could have many uses. It would
 600           allow a user to do a full backup while excluding the base operating
 601           system files, so if I installed a Linux snapshot from a CD yesterday,
 602           I'll *exclude* all files modified *before* today.  If I need to
 603           recover the system, I use the CD I already have, plus the tape backup.
 604           Or if, say, a Windows client is hit by a particularly corrosive
 605           virus, and I need to *exclude* any files created/modified *since* the
 606           time of infection.
 607
 608   Notes:  Of course, this feature would work in concert with other
 609           in/exclude rules, and wouldnt override them (or each other).
 610
 611   Notes:  The directives I'd imagine would be along the lines of
 612           "[Created] [Modified] [Before|Since] = <date>".
 613           So one could compare against 'ctime' and/or 'mtime', but ONLY 'before'
 614            or 'since'.
 615
 616
 617 Item 17:  Automatic promotion of backup levels based on backup size
 618    Date:  19 January 2006
 619   Origin: Adam Thornton <athornton@sinenomine.net>
 620   Status:
 621
 622     What: Amanda has a feature whereby it estimates the space that a
 623           differential, incremental, and full backup would take.  If the
 624           difference in space required between the scheduled level and the next
 625           level up is beneath some user-defined critical threshold, the backup
 626           level is bumped to the next type.  Doing this minimizes the number of
 627           volumes necessary during a restore, with a fairly minimal cost in
 628           backup media space.
 629
 630     Why:  I know at least one (quite sophisticated and smart) user
 631           for whom the absence of this feature is a deal-breaker in terms of
 632           using Bacula; if we had it it would eliminate the one cool thing
 633           Amanda can do and we can't (at least, the one cool thing I know of).
 634
 635
 636 Item 18:  Better control over Job execution
 637    Date:  18 August 2007
 638   Origin: Kern
 639   Status:
 640
 641     What: Bacula needs a few extra features for better Job execution:
 642           1. A way to prevent multiple Jobs of the same name from
 643              being scheduled at the same time (ususally happens when
 644              a job is missed because a client is down).
 645           2. Directives that permit easier upgrading of Job types
 646              based on a period of time. I.e. "do a Full at least
 647              once every 2 weeks", or "do a differential at least
 648              once a week". If a lower level job is scheduled when
 649              it begins to run it will be upgraded depending on
 650              the specified criteria.
 651
 652     Why:  Obvious.
 653
 654
 655 Item 19:  Automatic disabling of devices
 656    Date:  2005-11-11
 657   Origin: Peter Eriksson <peter at ifm.liu dot se>
 658   Status:
 659
 660    What:  After a configurable amount of fatal errors with a tape drive
 661           Bacula should automatically disable further use of a certain
 662           tape drive. There should also be "disable"/"enable" commands in
 663           the "bconsole" tool.
 664
 665    Why:   On a multi-drive jukebox there is a possibility of tape drives
 666           going bad during large backups (needing a cleaning tape run,
 667           tapes getting stuck). It would be advantageous if Bacula would
 668           automatically disable further use of a problematic tape drive
 669           after a configurable amount of errors has occurred.
 670
 671           An example: I have a multi-drive jukebox (6 drives, 380+ slots)
 672           where tapes occasionally get stuck inside the drive. Bacula will
 673           notice that the "mtx-changer" command will fail and then fail
 674           any backup jobs trying to use that drive. However, it will still
 675           keep on trying to run new jobs using that drive and fail -
 676           forever, and thus failing lots and lots of jobs... Since we have
 677           many drives Bacula could have just automatically disabled
 678           further use of that drive and used one of the other ones
 679           instead.
 680
 681 Item 20:  An option to operate on all pools with update vol parameters
 682   Origin: Dmitriy Pinchukov <absh@bossdev.kiev.ua>
 683    Date:  16 August 2006
 684   Status: Patch made by  Nigel Stepp
 685
 686    What:  When I do update -> Volume parameters -> All Volumes
 687           from Pool, then I have to select pools one by one.  I'd like
 688           console to have an option like "0: All Pools" in the list of
 689           defined pools.
 690
 691    Why:   I have many pools and therefore unhappy with manually
 692           updating each of them using update -> Volume parameters -> All
 693           Volumes from Pool -> pool #.
 694
 695
 696 Item 21:  Include timestamp of job launch in "stat clients" output
 697   Origin: Mark Bergman <mark.bergman@uphs.upenn.edu>
 698   Date:   Tue Aug 22 17:13:39 EDT 2006
 699   Status:
 700
 701   What:   The "stat clients" command doesn't include any detail on when
 702           the active backup jobs were launched.
 703
 704   Why:    Including the timestamp would make it much easier to decide whether
 705           a job is running properly.
 706
 707   Notes:  It may be helpful to have the output from "stat clients" formatted
 708           more like that from "stat dir" (and other commands), in a column
 709           format. The per-client information that's currently shown (level,
 710           client name, JobId, Volume, pool, device, Files, etc.) is good, but
 711           somewhat hard to parse (both programmatically and visually),
 712           particularly when there are many active clients.
 713
 714
 715
 716 Item 22:  Implement Storage daemon compression
 717   Date:   18 December 2006
 718   Origin: Vadim A. Umanski , e-mail umanski@ext.ru
 719   Status:
 720   What:   The ability to compress backup data on the SD receiving data
 721           instead of doing that on client sending data.
 722   Why:    The need is practical. I've got some machines that can send
 723           data to the network 4 or 5 times faster than compressing
 724           them (I've measured that). They're using fast enough SCSI/FC
 725           disk subsystems but rather slow CPUs (ex. UltraSPARC II).
 726           And the backup server has got a quite fast CPUs (ex. Dual P4
 727           Xeons) and quite a low load. When you have 20, 50 or 100 GB
 728           of raw data - running a job 4 to 5 times faster - that
 729           really matters. On the other hand, the data can be
 730           compressed 50% or better - so losing twice more space for
 731           disk backup is not good at all. And the network is all mine
 732           (I have a dedicated management/provisioning network) and I
 733           can get as high bandwidth as I need - 100Mbps, 1000Mbps...
 734           That's why the server-side compression feature is needed!
 735   Notes:
 736
 737 Item 23:  Improve Bacula's tape and drive usage and cleaning management
 738   Date:   8 November 2005, November 11, 2005
 739   Origin: Adam Thornton <athornton at sinenomine dot net>,
 740           Arno Lehmann <al at its-lehmann dot de>
 741   Status:
 742
 743   What:   Make Bacula manage tape life cycle information, tape reuse
 744           times and drive cleaning cycles.
 745
 746   Why:    All three parts of this project are important when operating
 747           backups.
 748           We need to know which tapes need replacement, and we need to
 749           make sure the drives are cleaned when necessary.  While many
 750           tape libraries and even autoloaders can handle all this
 751           automatically, support by Bacula can be helpful for smaller
 752           (older) libraries and single drives.  Limiting the number of
 753           times a tape is used might prevent tape errors when using
 754           tapes until the drives can't read it any more.  Also, checking
 755           drive status during operation can prevent some failures (as I
 756           [Arno] had to learn the hard way...)
 757
 758   Notes:  First, Bacula could (and even does, to some limited extent)
 759           record tape and drive usage.  For tapes, the number of mounts,
 760           the amount of data, and the time the tape has actually been
 761           running could be recorded.  Data fields for Read and Write
 762           time and Number of mounts already exist in the catalog (I'm
 763           not sure if VolBytes is the sum of all bytes ever written to
 764           that volume by Bacula).  This information can be important
 765           when determining which media to replace.  The ability to mark
 766           Volumes as "used up" after a given number of write cycles
 767           should also be implemented so that a tape is never actually
 768           worn out.  For the tape drives known to Bacula, similar
 769           information is interesting to determine the device status and
 770           expected life time: Time it's been Reading and Writing, number
 771           of tape Loads / Unloads / Errors.  This information is not yet
 772           recorded as far as I [Arno] know.  A new volume status would
 773           be necessary for the new state, like "Used up" or "Worn out".
 774           Volumes with this state could be used for restores, but not
 775           for writing. These volumes should be migrated first (assuming
 776           migration is implemented) and, once they are no longer needed,
 777           could be moved to a Trash pool.
 778
 779           The next step would be to implement a drive cleaning setup.
 780           Bacula already has knowledge about cleaning tapes.  Once it
 781           has some information about cleaning cycles (measured in drive
 782           run time, number of tapes used, or calender days, for example)
 783           it can automatically execute tape cleaning (with an
 784           autochanger, obviously) or ask for operator assistance loading
 785           a cleaning tape.
 786
 787           The final step would be to implement TAPEALERT checks not only
 788           when changing tapes and only sending the information to the
 789           administrator, but rather checking after each tape error,
 790           checking on a regular basis (for example after each tape
 791           file), and also before unloading and after loading a new tape.
 792           Then, depending on the drives TAPEALERT state and the known
 793           drive cleaning state Bacula could automatically schedule later
 794           cleaning, clean immediately, or inform the operator.
 795
 796           Implementing this would perhaps require another catalog change
 797           and perhaps major changes in SD code and the DIR-SD protocol,
 798           so I'd only consider this worth implementing if it would
 799           actually be used or even needed by many people.
 800
 801           Implementation of these projects could happen in three distinct
 802           sub-projects: Measuring Tape and Drive usage, retiring
 803           volumes, and handling drive cleaning and TAPEALERTs.
 804
 805 Item 24:  Multiple threads in file daemon for the same job
 806   Date:   27 November 2005
 807   Origin: Ove Risberg (Ove.Risberg at octocode dot com)
 808   Status:
 809
 810   What:   I want the file daemon to start multiple threads for a backup
 811           job so the fastest possible backup can be made.
 812
 813           The file daemon could parse the FileSet information and start
 814           one thread for each File entry located on a separate
 815           filesystem.
 816
 817           A confiuration option in the job section should be used to
 818           enable or disable this feature. The confgutration option could
 819           specify the maximum number of threads in the file daemon.
 820
 821           If the theads could spool the data to separate spool files
 822           the restore process will not be much slower.
 823
 824   Why:    Multiple concurrent backups of a large fileserver with many
 825           disks and controllers will be much faster.
 826
 827 Item 25:  Archival (removal) of User Files to Tape
 828   Date:   Nov. 24/2005
 829   Origin: Ray Pengelly [ray at biomed dot queensu dot ca
 830   Status:
 831
 832   What:   The ability to archive data to storage based on certain parameters
 833           such as age, size, or location.  Once the data has been written to
 834           storage and logged it is then pruned from the originating
 835           filesystem. Note! We are talking about user's files and not
 836           Bacula Volumes.
 837
 838   Why:    This would allow fully automatic storage management which becomes
 839           useful for large datastores.  It would also allow for auto-staging
 840           from one media type to another.
 841
 842           Example 1) Medical imaging needs to store large amounts of data.
 843           They decide to keep data on their servers for 6 months and then put
 844           it away for long term storage.  The server then finds all files
 845           older than 6 months writes them to tape.  The files are then removed
 846           from the server.
 847
 848           Example 2) All data that hasn't been accessed in 2 months could be
 849           moved from high-cost, fibre-channel disk storage to a low-cost
 850           large-capacity SATA disk storage pool which doesn't have as quick of
 851           access time.  Then after another 6 months (or possibly as one
 852           storage pool gets full) data is migrated to Tape.
 853
 854
 855
 856
 857
 858 ========= New Items since the last vote =================
 859
 860 Item 26: Add a new directive to bacula-dir.conf which permits inclusion of all subconfiguration files in a given directory
 861 Date:   18 October 2008
 862 Origin: Database, Lda. Maputo, Mozambique
 863 Contact:Cameron Smith / cameron.ord@database.co.mz
 864 Status: New request
 865
 866 What: A directive something like "IncludeConf = /etc/bacula/subconfs" Every
 867       time Bacula Director restarts or reloads, it will walk the given
 868       directory (non-recursively) and include the contents of any files
 869       therein, as though they were appended to bacula-dir.conf
 870
 871 Why: Permits simplified and safer configuration for larger installations with
 872       many client PCs.  Currently, through judicious use of JobDefs and
 873       similar directives, it is possible to reduce the client-specific part of
 874       a configuration to a minimum.  The client-specific directives can be
 875       prepared according to a standard template and dropped into a known
 876       directory.  However it is still necessary to add a line to the "master"
 877       (bacula-dir.conf) referencing each new file.  This exposes the master to
 878       unnecessary risk of accidental mistakes and makes automation of adding
 879       new client-confs, more difficult (it is easier to automate dropping a
 880       file into a dir, than rewriting an existing file).  Ken has previously
 881       made a convincing argument for NOT including Bacula's core configuration
 882       in an RDBMS, but I believe that the present request is a reasonable
 883       extension to the current "flat-file-based" configuration philosophy.
 884
 885 Notes: There is NO need for any special syntax to these files.  They should
 886        contain standard directives which are simply "inlined" to the parent
 887        file as already happens when you explicitly reference an external file.
 888
 889  Item n: List inChanger flag when doing restore.
 890  Origin: Jesper Krogh<jesper@krogh.cc>
 891    Date:   17 oct. 2008
 892  Status:
 893
 894    What: When doing a restore the restore selection dialog ends by telling stuff
 895       like this:
 896   The job will require the following
 897    Volume(s)                 Storage(s)                SD Device(s)
 898    ===========================================================================
 899     000741L3                  LTO-4                     LTO3
 900     000866L3                  LTO-4                     LTO3
 901     000765L3                  LTO-4                     LTO3
 902     000764L3                  LTO-4                     LTO3
 903     000756L3                  LTO-4                     LTO3
 904     001759L3                  LTO-4                     LTO3
 905     001763L3                  LTO-4                     LTO3
 906     001762L3                  LTO-4                     LTO3
 907     001767L3                  LTO-4                     LTO3
 908
 909    When having an autochanger, it would be really nice with an inChanger
 910    column so the operator knew if this restore job would stop waiting for
 911    operator intervention. This is done just by selecting the inChanger flag
 912    from the catalog and printing it in a seperate column.
 913
 914
 915    Why:    This would help getting large restores through minimizing the
 916       time spent waiting for operator to drop by and change tapes in the library.
 917
 918   Notes: [Kern] I think it would also be good to have the Slot as well,
 919       or some indication that Bacula thinks the volume is in the autochanger
 920       because it depends on both the InChanger flag and the Slot being
 921       valid.
 922
 923
 924 Item 1:   Implement an interface between Bacula and Amazon's S3.
 925   Date:   25 August 2008
 926   Origin: Soren Hansen <soren@ubuntu.com>
 927   Status: Not started.
 928   What:   Enable the storage daemon to store backup data on Amazon's
 929           S3 service.
 930
 931   Why:    Amazon's S3 is a cheap way to store data off-site. Current
 932           ways to integrate Bacula and S3 involve storing all the data
 933           locally and syncing them to S3, and manually fetching them
 934           again when they're needed. This is very cumbersome.
 935
 936 Item: Store and restore extended attributes, especially selinux file contexts
 937    Date: 28 December 2007
 938    Origin: Frank Sweetser <fs@wpi.edu>
 939    What: The ability to store and restore extended attributes on
 940          filesystems that support them, such as ext3.
 941
 942    Why: Security Enhanced Linux (SELinux) enabled systems make extensive
 943         use of extended attributes.  In addition to the standard user,
 944         group, and permission, each file has an associated SELinux context
 945         stored as an extended attribute.  This context is used to define
 946         which operations a given program is permitted to perform on that
 947         file.  Storing contexts on an SELinux system is as critical as
 948         storing ownership and permissions.  In the case of a full system
 949         restore, the system will not even be able to boot until all
 950         critical system files have been properly relabeled.
 951
 952  Notes: Fedora ships with a version of tar that has been patched to handle
 953         extended attributes.  The patch has not been integrated upstream
 954         yet, so could serve as a good starting point.
 955
 956         http://linux.die.net/man/2/getxattr
 957         http://linux.die.net/man/2/setxattr
 958         http://linux.die.net/man/2/listxattr
 959         ===
 960         http://linux.die.net/man/3/getfilecon
 961         http://linux.die.net/man/3/setfilecon
 962
 963 Item 1:   enable/disable compression depending on storage device (disk/tape)
 964   Origin: Ralf Gross ralf-lists@ralfgross.de
 965   Date:   2008-01-11
 966   Status: Initial Request
 967
 968   What: Add a new option to the storage resource of the director.  Depending
 969           on this option, compression will be enabled/disabled for a device.
 970
 971   Why: If different devices (disks/tapes) are used for full/diff/incr
 972           backups, software compression will be enabled for all backups
 973           because of the FileSet compression option.  For backup to tapes
 974           wich are able to do hardware compression this is not desired.
 975
 976
 977   Notes:
 978          http://news.gmane.org/gmane.comp.sysutils.backup.bacula.devel/cutoff=11124
 979          It must be clear to the user, that the FileSet compression option
 980          must still be enabled use compression for a backup job at all.
 981          Thus a name for the new option in the director must be
 982          well-defined.
 983
 984   Notes: KES I think the Storage definition should probably override what
 985          is in the Job definition or vice-versa, but in any case, it must
 986          be well defined.
 987
 988
 989 Item 1: Backup and Restore of Windows Encrypted Files through raw encryption
 990         functions
 991
 992   Origin: Michael Mohr, SAG  Mohr.External@infineon.com
 993
 994   Date:   22 February 2008
 995
 996   Status:
 997
 998   What: Make it possible to backup and restore Encypted Files from and to
 999           Windows systems without the need to decrypt it by using the raw
1000           encryption functions API (see:
1001           http://msdn2.microsoft.com/en-us/library/aa363783.aspx)
1002
1003           that is provided for that reason by Microsoft.
1004           If a file ist encrypted could be examined by evaluating the
1005           FILE_ATTRIBUTE_ENCRYTED flag of the GetFileAttributes
1006           function.
1007
1008   Why:    Without the usage of this interface the fd-daemon running
1009           under the system account can't read encypted Files because
1010           the key needed for the decrytion is missed by them. As a result
1011           actually encrypted files are not backed up
1012           by bacula and also no error is shown while missing these files.
1013
1014   Notes:  ./.
1015
1016    Item 1: Possibilty to schedule Jobs on last Friday of the month
1017    Origin: Carsten Menke <bootsy52 at gmx dot net>
1018    Date:   02 March 2008
1019    Status:
1020
1021    What: Currently if you want to run your monthly Backups on the last
1022            Friday of each month this is only possible with workarounds (e.g
1023            scripting) (As some months got 4 Fridays and some got 5 Fridays)
1024            The same is true if you plan to run your yearly Backups on the
1025            last Friday of the year.  It would be nice to have the ability to
1026            use the builtin scheduler for this.
1027
1028    Why:    In many companies the last working day of the week is Friday (or
1029            Saturday), so to get the most data of the month onto the monthly
1030            tape, the employees are advised to insert the tape for the
1031            monthly backups on the last friday of the month.
1032
1033    Notes: To give this a complete functionality it would be nice if the
1034            "first" and "last" Keywords could be implemented in the
1035            scheduler, so it is also possible to run monthy backups at the
1036            first friday of the month and many things more.  So if the syntax
1037            would expand to this {first|last} {Month|Week|Day|Mo-Fri} of the
1038            {Year|Month|Week} you would be able to run really flexible jobs.
1039
1040            To got a certain Job run on the last Friday of the Month for example one could
1041            then write
1042
1043               Run = pool=Monthly last Fri of the Month at 23:50
1044
1045               ## Yearly Backup
1046
1047               Run = pool=Yearly last Fri of the Year at 23:50
1048
1049               ## Certain Jobs the last Week of a Month
1050
1051               Run = pool=LastWeek last Week of the Month at 23:50
1052
1053               ## Monthly Backup on the last day of the month
1054
1055               Run = pool=Monthly last Day of the Month at 23:50
1056
1057    Date: 20 March 2008
1058
1059    Origin: Frank Sweetser <fs@wpi.edu>
1060
1061    What: Add a new SD directive, "minimum spool size" (or similar).  This
1062          directive would specify a minimum level of free space available for
1063          spooling.  If the unused spool space is less than this level, any
1064          new spooling requests would be blocked as if the "maximum spool
1065          size" threshold had bee reached.  Already spooling jobs would be
1066          unaffected by this directive.
1067
1068    Why: I've been bitten by this scenario a couple of times:
1069
1070         Assume a maximum spool size of 100M. Two concurrent jobs, A and B,
1071         are both running.  Due to timing quirks and previously running jobs,
1072         job A has used 99.9M of space in the spool directory.  While A is
1073         busy despooling to disk, B is happily using the remaining 0.1M of
1074         spool space.  This ends up in a spool/despool sequence every 0.1M of
1075         data.  In addition to fragmenting the data on the volume far more
1076         than was necessary, in larger data sets (ie, tens or hundreds of
1077         gigabytes) it can easily produce multi-megabyte report emails!
1078
1079    Item n?: Expand the Verify Job capability to verify Jobs older than the
1080    last one. For VolumeToCatalog Jobs
1081    Date: 17 Januar 2008
1082    Origin: portrix.net Hamburg, Germany.
1083    Contact: Christian Sabelmann
1084    Status: 70% of the required Code is part of the Verify function since v. 2.x
1085
1086    What:
1087    The ability to tell Bacula which Job should verify instead of
1088    automatically verify just the last one.
1089
1090    Why:
1091    It is sad that such a powerfull feature like Verify Jobs
1092    (VolumeToCatalog) is restricted to be used only with the last backup Job
1093    of a client.  Actual users who have to do daily Backups are forced to
1094    also do daily Verify Jobs in order to take advantage of this useful
1095    feature.  This Daily Verify after Backup conduct is not always desired
1096    and Verify Jobs have to be sometimes scheduled.  (Not necessarily
1097    scheduled in Bacula).  With this feature Admins can verify Jobs once a
1098    Week or less per month, selecting the Jobs they want to verify.  This
1099    feature is also not to difficult to implement taking in account older bug
1100    reports about this feature and the selection of the Job to be verified.
1101
1102    Notes: For the verify Job, the user could select the Job to be verified
1103    from a List of the latest Jobs of a client. It would also be possible to
1104    verify a certain volume.  All of these would naturaly apply only for
1105    Jobs whose file information are still in the catalog.
1106
1107 Item X:   Add EFS support on Windows
1108    Origin: Alex Ehrlich (Alex.Ehrlich-at-mail.ee)
1109    Date:   05 August 2008
1110    Status:
1111
1112    What:   For each file backed up or restored by FD on Windows, check if
1113            the file is encrypted; if so then use OpenEncryptedFileRaw,
1114            ReadEncryptedFileRaw, WriteEncryptedFileRaw,
1115            CloseEncryptedFileRaw instead of BackupRead and BackupWrite
1116            API calls.
1117
1118    Why:    Many laptop users utilize the EFS functionality today; so do.
1119            some non-laptop ones, too.
1120            Currently files encrypted by means of EFS cannot be backed up.
1121            It means a Windows boutique cannot rely on Bacula as its
1122            backup solution, at least when using Windows 2K, XPP,
1123            "better" Vista etc on workstations, unless EFS is
1124            forbidden by policies.
1125            The current situation might result into "false sense of
1126            security" among the end-users.
1127
1128    Notes:  Using xxxEncryptedFileRaw API would allow to backup and
1129            restore EFS-encrypted files without decrypting their data.
1130            Note that such files cannot be restored "portably" (at least,
1131            easily) but they would be restoreable to a different (or
1132            reinstalled) Win32 machine; the restore would require setup
1133            of a EFS recovery agent in advance, of course, and this shall
1134            be clearly reflected in the documentation, but this is the
1135            normal Windows SysAdmin's business.
1136            When "portable" backup is requested the EFS-encrypted files
1137            shall be clearly reported as errors.
1138            See MSDN on the "Backup and Restore of Encrypted Files" topic:
1139            http://msdn.microsoft.com/en-us/library/aa363783.aspx
1140            Maybe the EFS support requires a new flag in the database for
1141            each file, too?
1142            Unfortunately, the implementation is not as straightforward as
1143            1-to-1 replacement of BackupRead with ReadEncryptedFileRaw,
1144            requiring some FD code rewrite to work with
1145            encrypted-file-related callback functions.
1146
1147                                                                                                                                                                                               encrypted-file-related callback functions.
1148 ========== Already implemented ================================
1149
1150 Item  n:  make changing "spooldata=yes|no" possible for
1151           manual/interactive jobs
1152   Origin: Marc Schiffbauer <marc@schiffbauer.net>
1153   Date:   12 April 2007)
1154   Status:
1155
1156   What:   Make it possible to modify the spooldata option
1157           for a job when being run from within the console.
1158           Currently it is possible to modify the backup level
1159           and the spooldata setting in a Schedule resource.
1160           It is also possible to modify the backup level when using
1161           the "run" command in the console.
1162           But it is currently not possible to to the same
1163           with "spooldata=yes|no" like:
1164
1165           run job=MyJob level=incremental spooldata=yes
1166
1167   Why:    In some situations it would be handy to be able to switch
1168           spooldata on or off for interactive/manual jobs based on
1169           which data the admin expects or how fast the LAN/WAN
1170           connection currently is.
1171
1172   Notes:  ./.
1173
1174 Item 1:   Implement an option to modify the last written date for volumes
1175 Date:     16 September 2008
1176 Origin:   Franck (xeoslaenor at gmail dot com)
1177 Status:   Proposing
1178 What:     The ability to modify the last written date for a volume
1179 Why:      It's sometime necessary to jump a volume when you have a pool of volume
1180           which recycles the oldest volume at each backup.
1181           Sometime, it needs to cancel a set of backup (one day
1182           backup, completely) and we want to avoid that bacula
1183           choose the volume (which is not written at all) from
1184           the cancelled backup (It has to jump to next volume).
1185           in this case, we just need to update the written date
1186           manually to avoir the "oldest volume" purge.
1187 Notes:    An option can be add to "update volume" command (like 'written date'
1188           choice for example)
1189
1190 ============= Empty Feature Request form ===========
1191 Item  n:  One line summary ...
1192   Date:   Date submitted
1193   Origin: Name and email of originator.
1194   Status:
1195
1196   What:   More detailed explanation ...
1197
1198   Why:    Why it is important ...
1199
1200   Notes:  Additional notes or features (omit if not used)
1201 ============== End Feature Request form ==============
1202
1203 ========== Items on put hold by Kern ============================
1204
1205 Item h1:  Split documentation
1206   Origin: Maxx <maxxatworkat gmail dot com>
1207   Date:   27th July 2006
1208   Status: Approved, awaiting implementation
1209
1210   What:   Split documentation in several books
1211
1212   Why:    Bacula manual has now more than 600 pages, and looking for
1213           implementation details is getting complicated.  I think
1214           it would be good to split the single volume in two or
1215           maybe three parts:
1216
1217           1) Introduction, requirements and tutorial, typically
1218              are useful only until first installation time
1219
1220           2) Basic installation and configuration, with all the
1221              gory details about the directives supported 3)
1222              Advanced Bacula: testing, troubleshooting, GUI and
1223              ancillary programs, security managements, scripting,
1224              etc.
1225
1226   Notes:  This is a project that needs to be done, and will be implemented,
1227           but it is really a developer issue of timing, and does not
1228           needed to be included in the voting.
1229
1230
1231 Item h2:  Implement support for stacking arbitrary stream filters, sinks.
1232 Date:     23 November 2006
1233 Origin:   Landon Fuller <landonf@threerings.net>
1234 Status:   Planning. Assigned to landonf.
1235
1236   What:   Implement support for the following:
1237           - Stacking arbitrary stream filters (eg, encryption, compression,
1238             sparse data handling))
1239           - Attaching file sinks to terminate stream filters (ie, write out
1240             the resultant data to a file)
1241           - Refactor the restoration state machine accordingly
1242
1243    Why: The existing stream implementation suffers from the following: - All
1244            state (compression, encryption, stream restoration), is
1245              global across the entire restore process, for all streams. There are
1246              multiple entry and exit points in the restoration state machine, and
1247              thus multiple places where state must be allocated, deallocated,
1248              initialized, or reinitialized. This results in exceptional complexity
1249              for the author of a stream filter.
1250            - The developer must enumerate all possible combinations of filters
1251              and stream types (ie, win32 data with encryption, without encryption,
1252              with encryption AND compression, etc).
1253
1254   Notes: This feature request only covers implementing the stream filters/
1255           sinks, and refactoring the file daemon's restoration
1256           implementation accordingly.  If I have extra time, I will also
1257           rewrite the backup implementation.  My intent in implementing the
1258           restoration first is to solve pressing bugs in the restoration
1259           handling, and to ensure that the new restore implementation
1260           handles existing backups correctly.
1261
1262           I do not plan on changing the network or tape data structures to
1263           support defining arbitrary stream filters, but supporting that
1264           functionality is the ultimate goal.
1265
1266           Assistance with either code or testing would be fantastic.
1267
1268   Notes:  Kern: this project has a lot of merit, and we need to do it, but
1269           it is really an issue for developers rather than a new feature
1270           for users, so I have removed it from the voting list, but kept it
1271           here, but at some point, it will be implemented.
1272
1273 Item h3:  Filesystem watch triggered backup.
1274   Date:   31 August 2006
1275   Origin: Jesper Krogh <jesper@krogh.cc>
1276   Status:
1277
1278   What:   With inotify and similar filesystem triggeret notification
1279           systems is it possible to have the file-daemon to monitor
1280           filesystem changes and initiate backup.
1281
1282   Why:    There are 2 situations where this is nice to have.
1283           1) It is possible to get a much finer-grained backup than
1284              the fixed schedules used now.. A file created and deleted
1285              a few hours later, can automatically be caught.
1286
1287           2) The introduced load on the system will probably be
1288              distributed more even on the system.
1289
1290   Notes:  This can be combined with configration that specifies
1291           something like: "at most every 15 minutes or when changes
1292           consumed XX MB".
1293
1294 Kern Notes: I would rather see this implemented by an external program
1295           that monitors the Filesystem changes, then uses the console
1296
1297
1298 Item h4:  Directive/mode to backup only file changes, not entire file
1299   Date:   11 November 2005
1300   Origin: Joshua Kugler <joshua dot kugler at uaf dot edu>
1301           Marek Bajon <mbajon at bimsplus dot com dot pl>
1302   Status:
1303
1304   What:   Currently when a file changes, the entire file will be backed up in
1305           the next incremental or full backup.  To save space on the tapes
1306           it would be nice to have a mode whereby only the changes to the
1307           file would be backed up when it is changed.
1308
1309   Why:    This would save lots of space when backing up large files such as
1310           logs, mbox files, Outlook PST files and the like.
1311
1312   Notes:  This would require the usage of disk-based volumes as comparing
1313           files would not be feasible using a tape drive.
1314
1315   Notes:  Kern: I don't know how to implement this. Put on hold until someone
1316           provides a detailed implementation plan.
1317
1318
1319 Item h5:  Implement multiple numeric backup levels as supported by dump
1320 Date:     3 April 2006
1321 Origin:   Daniel Rich <drich@employees.org>
1322 Status:
1323 What: Dump allows specification of backup levels numerically instead of just
1324           "full", "incr", and "diff".  In this system, at any given level,
1325           all files are backed up that were were modified since the last
1326           backup of a higher level (with 0 being the highest and 9 being the
1327           lowest).  A level 0 is therefore equivalent to a full, level 9 an
1328           incremental, and the levels 1 through 8 are varying levels of
1329           differentials.  For bacula's sake, these could be represented as
1330           "full", "incr", and "diff1", "diff2", etc.
1331
1332 Why: Support of multiple backup levels would provide for more advanced
1333           backup rotation schemes such as "Towers of Hanoi".  This would
1334           allow better flexibility in performing backups, and can lead to
1335           shorter recover times.
1336
1337 Notes:    Legato Networker supports a similar system with full, incr, and 1-9 as
1338           levels.
1339
1340 Notes:    Kern: I don't see the utility of this, and it would be a *huge*
1341           modification to existing code.
1342
1343 Item h6:  Implement NDMP protocol support
1344   Origin: Alan Davis
1345   Date:   06 March 2007
1346   Status:
1347
1348   What:   Network Data Management Protocol is implemented by a number of
1349           NAS filer vendors to enable backups using third-party
1350           software.
1351
1352   Why:    This would allow NAS filer backups in Bacula without incurring
1353           the overhead of NFS or SBM/CIFS.
1354
1355   Notes:  Further information is available:
1356           http://www.ndmp.org
1357           http://www.ndmp.org/wp/wp.shtml
1358           http://www.traakan.com/ndmjob/index.html
1359
1360           There are currently no viable open-source NDMP
1361           implementations.  There is a reference SDK and example
1362           app available from ndmp.org but it has problems
1363           compiling on recent Linux and Solaris OS'.  The ndmjob
1364           reference implementation from Traakan is known to
1365           compile on Solaris 10.
1366
1367   Notes:  Kern: I am not at all in favor of this until NDMP becomes
1368           an Open Standard or until there are Open Source libraries
1369           that interface to it.
1370
1371 Item h7:  Commercial database support
1372   Origin: Russell Howe <russell_howe dot wreckage dot org>
1373   Date:   26 July 2006
1374   Status:
1375
1376   What: It would be nice for the database backend to support more databases.
1377           I'm thinking of SQL Server at the moment, but I guess Oracle, DB2,
1378           MaxDB, etc are all candidates.  SQL Server would presumably be
1379           implemented using FreeTDS or maybe an ODBC library?
1380
1381   Why: We only really have one database server, which is MS SQL Server 2000.
1382           Maintaining a second one for the backup software (we grew out of
1383           SQLite, which I liked, but which didn't work so well with our
1384           database size).  We don't really have a machine with the resources
1385           to run postgres, and would rather only maintain a single DBMS.
1386           We're stuck with SQL Server because pretty much all the company's
1387           custom applications (written by consultants) are locked into SQL
1388           Server 2000.  I can imagine this scenario is fairly common, and it
1389           would be nice to use the existing properly specced database server
1390           for storing Bacula's catalog, rather than having to run a second
1391           DBMS.
1392
1393   Notes: This might be nice, but someone other than me will probably need to
1394           implement it, and at the moment, proprietary code cannot legally
1395           be mixed with Bacula GPLed code.  This would be possible only
1396           providing the vendors provide GPLed (or OpenSource) interface
1397           code.
1398
1399 Item h8:  Incorporation of XACML2/SAML2 parsing
1400    Date:   19 January 2006
1401    Origin: Adam Thornton <athornton@sinenomine.net>
1402    Status: Blue sky
1403
1404    What: XACML is "eXtensible Access Control Markup Language" and "SAML is
1405           the "Security Assertion Markup Language"--an XML standard for
1406           making statements about identity and authorization.  Having these
1407           would give us a framework to approach ACLs in a generic manner,
1408           and in a way flexible enough to support the four major sorts of
1409           ACLs I see as a concern to Bacula at this point, as well as
1410           (probably) to deal with new sorts of ACLs that may appear in the
1411           future.
1412
1413    Why: Bacula is beginning to need to back up systems with ACLs that do not
1414           map cleanly onto traditional Unix permissions.  I see four sets of
1415           ACLs--in general, mutually incompatible with one another--that
1416           we're going to need to deal with.  These are: NTFS ACLs, POSIX
1417           ACLs, NFSv4 ACLS, and AFS ACLS. (Some may question the relevance
1418           of AFS; AFS is one of Sine Nomine's core consulting businesses,
1419           and having a reputable file-level backup and restore technology
1420           for it (as Tivoli is probably going to drop AFS support soon since
1421           IBM no longer supports AFS) would be of huge benefit to our
1422           customers; we'd most likely create the AFS support at Sine Nomine
1423           for inclusion into the Bacula (and perhaps some changes to the
1424           OpenAFS volserver) core code.)
1425
1426           Now, obviously, Bacula already handles NTFS just fine.  However, I
1427           think there's a lot of value in implementing a generic ACL model,
1428           so that it's easy to support whatever particular instances of ACLs
1429           come down the pike: POSIX ACLS (think SELinux) and NFSv4 are the
1430           obvious things arriving in the Linux world in a big way in the
1431           near future.  XACML, although overcomplicated for our needs,
1432           provides this framework, and we should be able to leverage other
1433           people's implementations to minimize the amount of work *we* have
1434           to do to get a generic ACL framework.  Basically, the costs of
1435           implementation are high, but they're largely both external to
1436           Bacula and already sunk.
1437
1438    Notes: As you indicate this is a bit of "blue sky" or in other words,
1439           at the moment, it is a bit esoteric to consider for Bacula.
1440
1441 Item h9:  Archive data
1442   Date:   15/5/2006
1443   Origin: calvin streeting calvin at absentdream dot com
1444   Status:
1445
1446   What:   The abilty to archive to media (dvd/cd) in a uncompressed format
1447           for dead filing (archiving not backing up)
1448
1449     Why: At work when jobs are finished and moved off of the main
1450           file servers (raid based systems) onto a simple Linux
1451           file server (ide based system) so users can find old
1452           information without contacting the IT dept.
1453
1454           So this data dosn't realy change it only gets added to,
1455           But it also needs backing up.  At the moment it takes
1456           about 8 hours to back up our servers (working data) so
1457           rather than add more time to existing backups i am trying
1458           to implement a system where we backup the acrhive data to
1459           cd/dvd these disks would only need to be appended to
1460           (burn only new/changed files to new disks for off site
1461           storage).  basialy understand the differnce between
1462           achive data and live data.
1463
1464   Notes:  Scan the data and email me when it needs burning divide
1465           into predefined chunks keep a recored of what is on what
1466           disk make me a label (simple php->mysql=>pdf stuff) i
1467           could do this bit ability to save data uncompresed so
1468           it can be read in any other system (future proof data)
1469           save the catalog with the disk as some kind of menu
1470           system
1471
1472    Notes: Kern: I don't understand this item, and in any case, if it
1473           is specific to DVD/CDs, which we do not recommend using,
1474           it is unlikely to be implemented except as a user
1475           submitted patch.
1476
1477
1478 Item h10: Clustered file-daemons
1479   Origin: Alan Brown ajb2 at mssl dot ucl dot ac dot uk
1480   Date:   24 July 2006
1481   Status:
1482   What:   A "virtual" filedaemon, which is actually a cluster of real ones.
1483
1484   Why:    In the case of clustered filesystems (SAN setups, GFS, or OCFS2, etc)
1485           multiple machines may have access to the same set of filesystems
1486
1487           For performance reasons, one may wish to initate backups from
1488           several of these machines simultaneously, instead of just using
1489           one backup source for the common clustered filesystem.
1490
1491           For obvious reasons, normally backups of $A-FD/$PATH and
1492           B-FD/$PATH are treated as different backup sets. In this case
1493           they are the same communal set.
1494
1495           Likewise when restoring, it would be easier to just specify
1496           one of the cluster machines and let bacula decide which to use.
1497
1498           This can be faked to some extent using DNS round robin entries
1499           and a virtual IP address, however it means "status client" will
1500           always give bogus answers. Additionally there is no way of
1501           spreading the load evenly among the servers.
1502
1503           What is required is something similar to the storage daemon
1504           autochanger directives, so that Bacula can keep track of
1505           operating backups/restores and direct new jobs to a "free"
1506           client.
1507
1508    Notes: Kern: I don't understand the request enough to be able to
1509           implement it. A lot more design detail should be presented
1510           before voting on this project.