3 Bacula Projects Roadmap
4 Status updated 25 February 2010
9 Item 1: Ability to restart failed jobs
10 Item 2: Scheduling syntax that permits more flexibility and options
11 Item 3: Data encryption on storage daemon
12 Item 4: Add ability to Verify any specified Job.
13 Item 5: Improve Bacula's tape and drive usage and cleaning management
14 Item 6: Allow FD to initiate a backup
15 Item 7: Implement Storage daemon compression
16 Item 8: Reduction of communications bandwidth for a backup
17 Item 9: Ability to reconnect a disconnected comm line
18 Item 10: Start spooling even when waiting on tape
19 Item 11: Include all conf files in specified directory
20 Item 12: Multiple threads in file daemon for the same job
21 Item 13: Possibility to schedule Jobs on last Friday of the month
22 Item 14: Include timestamp of job launch in "stat clients" output
23 Item 15: Message mailing based on backup types
24 Item 16: Ability to import/export Bacula database entities
25 Item 17: Implementation of running Job speed limit.
26 Item 18: Add an override in Schedule for Pools based on backup types
27 Item 19: Automatic promotion of backup levels based on backup size
28 Item 20: Allow FileSet inclusion/exclusion by creation/mod times
29 Item 21: Archival (removal) of User Files to Tape
30 Item 22: An option to operate on all pools with update vol parameters
31 Item 23: Automatic disabling of devices
32 Item 24: Ability to defer Batch Insert to a later time
33 Item 25: Add MaxVolumeSize/MaxVolumeBytes to Storage resource
34 Item 26: Enable persistent naming/number of SQL queries
35 Item 27: Bacula Dir, FD and SD to support proxies
36 Item 28: Add Minimum Spool Size directive
37 Item 29: Handle Windows Encrypted Files using Win raw encryption
38 Item 30: Implement a Storage device like Amazon's S3.
39 Item 31: Convert tray monitor on Windows to a stand alone program
40 Item 32: Relabel disk volume after recycling
41 Item 33: Command that releases all drives in an autochanger
42 Item 34: Run bscan on a remote storage daemon from within bconsole.
43 Item 35: Implement a Migration job type that will create a reverse incremental
44 Item 36: Job migration between different SDs
45 Item 37: Concurrent spooling and despooling within a single job.
46 Item 39: Extend the verify code to make it possible to verify older jobs
47 Item 40: Separate "Storage" and "Device" in the bacula-dir.conf
50 Item 1: Ability to restart failed jobs
55 What: Often jobs fail because of a communications line drop or max run time,
56 cancel, or some other non-critical problem. Currently any data
57 saved is lost. This implementation should modify the Storage daemon
58 so that it saves all the files that it knows are completely backed up to the Volume.
61 The jobs should then be marked as incomplete and a subsequent
62 Incremental Accurate backup will then take into account all the previously saved files.
65 Why: Avoids re-backing up data that has already been saved.
67 Notes: Requires Accurate mode to restart correctly. A minimum volume of data
68 or number of files must have been stored on the Volume before enabling.
71 Item 2: Scheduling syntax that permits more flexibility and options
72 Date: 15 December 2006
73 Origin: Gregory Brauer (greg at wildbrain dot com) and
74 Florian Schnabel <florian.schnabel at docufy dot de>
77 What: Currently, Bacula only understands how to deal with weeks of the
78 month or weeks of the year in schedules. This makes it impossible
79 to do a true weekly rotation of tapes. There will always be a
80 discontinuity that will require disruptive manual intervention at
81 least monthly or yearly because week boundaries never align with
82 month or year boundaries.
84 A solution would be to add a new syntax that defines (at least)
85 a start timestamp and a repetition period.
87 Also requested: an easy option to skip a certain job on a certain date.
90 Why: Rotated backups done at weekly intervals are useful, and Bacula
91 cannot currently do them without extensive hacking.
93 You could then easily skip tape backups on holidays. This would be
94 especially handy if you have no autochanger and can only fit one backup
95 on a tape; other jobs could proceed normally and you would not get
96 errors that way.
99 Notes: Here is an example syntax showing a 3-week rotation where full
100 Backups would be performed every week on Saturday, and an
101 incremental would be performed every week on Tuesday. Each
102 set of tapes could be removed from the loader for the following
103 two cycles before coming back and being reused on the third
104 week. Since the execution times are determined by intervals
105 from a given point in time, there will never be any issues with
106 having to adjust to any sort of arbitrary time boundary. In
107 the example provided, I even define the starting schedule
108 as crossing both a year and a month boundary, but the run times
109 would be based on the "Repeat" value and would therefore happen on the desired days.
114 Name = "Week 1 Rotation"
115 #Saturday. Would run Dec 30, Jan 20, Feb 10, etc.
119 Start = 2006-12-30 01:00
123 #Tuesday. Would run Jan 2, Jan 23, Feb 13, etc.
127 Start = 2007-01-02 01:00
134 Name = "Week 2 Rotation"
135 #Saturday. Would run Jan 6, Jan 27, Feb 17, etc.
139 Start = 2007-01-06 01:00
143 #Tuesday. Would run Jan 9, Jan 30, Feb 20, etc.
147 Start = 2007-01-09 01:00
154 Name = "Week 3 Rotation"
155 #Saturday. Would run Jan 13, Feb 3, Feb 24, etc.
159 Start = 2007-01-13 01:00
163 #Tuesday. Would run Jan 16, Feb 6, Feb 27, etc.
167 Start = 2007-01-16 01:00
173 Notes: Kern: I have merged the previously separate project of skipping
174 jobs (via Schedule syntax) into this.
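For illustration, the first of the three rotations above might be written with the proposed syntax roughly as follows; the Run/Options layout and the "Repeat" keyword are hypothetical and do not exist in current Bacula:
    Schedule {
      Name = "Week 1 Rotation"
      # Saturday full. Would run Dec 30, Jan 20, Feb 10, etc.
      Run {
        Options {
          Type   = Full
          Start  = 2006-12-30 01:00
          Repeat = 3w
        }
      }
      # Tuesday incremental. Would run Jan 2, Jan 23, Feb 13, etc.
      Run {
        Options {
          Type   = Incremental
          Start  = 2007-01-02 01:00
          Repeat = 3w
        }
      }
    }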
177 Item 3: Data encryption on storage daemon
178 Origin: Tobias Barth <tobias.barth at web-arts.com>
179 Date: 04 February 2009
182 What: The storage daemon should be able to do the data encryption that can
183 currently be done by the file daemon.
185 Why: This would have 2 advantages:
186 1) one could encrypt the data of unencrypted tapes by doing a migration job,
188 2) the storage daemon would be the only machine that would have
189 to keep the encryption keys.
192 As an addendum to the feature request, here are some crypto
193 implementation details I wrote up regarding SD-encryption back in Jan
195 http://www.mail-archive.com/bacula-users@lists.sourceforge.net/msg28860.html
198 Item 4: Add ability to Verify any specified Job.
199 Date: 17 January 2008
200 Origin: portrix.net Hamburg, Germany.
201 Contact: Christian Sabelmann
202 Status: 70% of the required code has been part of the Verify function since v. 2.x
205 What: The ability to tell Bacula which Job it should verify instead of
206 automatically verifying just the last one.
209 Why: It is sad that such a powerful feature as Verify Jobs
210 (VolumeToCatalog) is restricted to being used only with the last backup Job
211 of a client. Users who have to do daily Backups are forced to
212 also do daily Verify Jobs in order to take advantage of this useful
213 feature. This daily verify-after-backup pattern is not always desired,
214 and Verify Jobs sometimes have to be scheduled separately (not necessarily
215 scheduled in Bacula). With this feature Admins could verify Jobs once a
216 week or once a month, selecting the Jobs they want to verify. This
217 feature is also not too difficult to implement, taking into account older bug
218 reports about this feature and the selection of the Job to be verified.
220 Notes: For the verify Job, the user could select the Job to be verified
221 from a List of the latest Jobs of a client. It would also be possible to
222 verify a certain volume. All of these would naturally apply only to
223 Jobs whose file information is still in the catalog.
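For illustration only, a console invocation for this feature might look roughly like the line below; the jobid= selection on a verify run is the proposal and does not exist in current Bacula:
    *run job=VerifyServerXXX level=VolumeToCatalog jobid=1234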
226 Item 5: Improve Bacula's tape and drive usage and cleaning management
227 Date: 8 November 2005, November 11, 2005
228 Origin: Adam Thornton <athornton at sinenomine dot net>,
229 Arno Lehmann <al at its-lehmann dot de>
232 What: Make Bacula manage tape life cycle information, tape reuse
233 times and drive cleaning cycles.
235 Why: All three parts of this project are important when operating backups.
237 We need to know which tapes need replacement, and we need to
238 make sure the drives are cleaned when necessary. While many
239 tape libraries and even autoloaders can handle all this
240 automatically, support by Bacula can be helpful for smaller
241 (older) libraries and single drives. Limiting the number of
242 times a tape is used might prevent tape errors when using
243 tapes until the drives can't read it any more. Also, checking
244 drive status during operation can prevent some failures (as I
245 [Arno] had to learn the hard way...)
247 Notes: First, Bacula could (and even does, to some limited extent)
248 record tape and drive usage. For tapes, the number of mounts,
249 the amount of data, and the time the tape has actually been
250 running could be recorded. Data fields for Read and Write
251 time and Number of mounts already exist in the catalog (I'm
252 not sure if VolBytes is the sum of all bytes ever written to
253 that volume by Bacula). This information can be important
254 when determining which media to replace. The ability to mark
255 Volumes as "used up" after a given number of write cycles
256 should also be implemented so that a tape is never actually
257 worn out. For the tape drives known to Bacula, similar
258 information is interesting to determine the device status and
259 expected life time: Time it's been Reading and Writing, number
260 of tape Loads / Unloads / Errors. This information is not yet
261 recorded as far as I [Arno] know. A new volume status would
262 be necessary for the new state, like "Used up" or "Worn out".
263 Volumes with this state could be used for restores, but not
264 for writing. These volumes should be migrated first (assuming
265 migration is implemented) and, once they are no longer needed,
266 could be moved to a Trash pool.
268 The next step would be to implement a drive cleaning setup.
269 Bacula already has knowledge about cleaning tapes. Once it
270 has some information about cleaning cycles (measured in drive
271 run time, number of tapes used, or calendar days, for example)
272 it can automatically execute tape cleaning (with an
273 autochanger, obviously) or ask for operator assistance to load a cleaning tape.
276 The final step would be to implement TAPEALERT checks not only
277 when changing tapes and only sending the information to the
278 administrator, but rather checking after each tape error,
279 checking on a regular basis (for example after each tape
280 file), and also before unloading and after loading a new tape.
281 Then, depending on the drive's TAPEALERT state and the known
282 drive cleaning state, Bacula could automatically schedule later
283 cleaning, clean immediately, or inform the operator.
285 Implementing this would perhaps require another catalog change
286 and perhaps major changes in SD code and the DIR-SD protocol,
287 so I'd only consider this worth implementing if it would
288 actually be used or even needed by many people.
290 Implementation of these projects could happen in three distinct
291 sub-projects: Measuring Tape and Drive usage, retiring
292 volumes, and handling drive cleaning and TAPEALERTs.
295 Item 6: Allow FD to initiate a backup
296 Origin: Frank Volf (frank at deze dot org)
297 Date: 17 November 2005
300 What: Provide some means, possibly by a restricted console that
301 allows a FD to initiate a backup, and that uses the connection
302 established by the FD to the Director for the backup so that
303 a Director that is firewalled can do the backup.
304 Why: Makes backup of laptops much easier.
305 Notes: This project consists of the following points
306 1. Modify the FD to have a "mini-console" interface that
307 permits it to connect to the Director and start a
308 backup job of itself.
309 2. The list of jobs that can be started by the FD is
310 defined in the Director (possibly via a restricted console).
312 3. Modify the existing tray monitor code in the Win32 FD
313 so that it is a separate program from the FD.
314 4. The tray monitor program should be extended to permit initiating a backup.
316 5. No new Director directives should be added without
317 prior consultation with the Bacula developers.
318 6. The comm line used by the FD to connect to the Director
319 should be re-used by the Director to do the backup.
320 This feature is partially implemented in the Director.
321 7. The FD may have a new directive that allows it to start
322 a backup when the FD starts.
323 8. The console interface to the FD should be extended to
324 permit a properly authorized console to initiate a backup.
328 Item 7: Implement Storage daemon compression
329 Date: 18 December 2006
330 Origin: Vadim A. Umanski , e-mail umanski@ext.ru
332 What: The ability to compress backup data on the SD receiving the data
333 instead of doing it on the client sending the data.
334 Why: The need is practical. I've got some machines that can send
335 data to the network 4 or 5 times faster than they can compress
336 it (I've measured that). They're using fast enough SCSI/FC
337 disk subsystems but rather slow CPUs (e.g. UltraSPARC II).
338 And the backup server has quite fast CPUs (e.g. dual P4
339 Xeons) and quite a low load. When you have 20, 50 or 100 GB
340 of raw data - running a job 4 to 5 times faster - that
341 really matters. On the other hand, the data can be
342 compressed 50% or better - so using twice the space for
343 disk backup is not good at all. And the network is all mine
344 (I have a dedicated management/provisioning network) and I
345 can get as much bandwidth as I need - 100Mbps, 1000Mbps...
346 That's why the server-side compression feature is needed!
350 Item 8: Reduction of communications bandwidth for a backup
351 Date: 14 October 2008
352 Origin: Robin O'Leary (Equiinet)
355 What: Using rdiff techniques, Bacula could significantly reduce
356 the network data transfer volume to do a backup.
358 Why: Faster backup across the Internet
360 Notes: This requires retaining certain data on the client during a Full
361 backup that will speed up subsequent backups.
364 Item 9: Ability to reconnect a disconnected comm line
369 What: Often jobs fail because of a communications line drop. In that
370 case, Bacula should be able to reconnect to the other daemon and resume the job.
373 Why: Avoids re-backing up data that has already been saved.
375 Notes: *Very* complicated from a design point of view because of authentication.
377 Item 10: Start spooling even when waiting on tape
378 Origin: Tobias Barth <tobias.barth@web-arts.com>
382 What: If a job can be spooled to disk before writing it to tape, it should
383 be spooled immediately. Currently, bacula waits until the correct
384 tape is inserted into the drive.
386 Why: It could save hours. When bacula waits on the operator who must insert
387 the correct tape (e.g. a new tape or a tape from another media
388 pool), bacula could already prepare the spooled data in the spooling
389 directory and immediately start despooling when the tape was
390 inserted by the operator.
392 2nd step: Use 2 or more spooling directories. When one directory is
393 currently despooling, the next (on different disk drives) could
394 already be spooling the next data.
396 Notes: I am using bacula 2.2.8, which has none of those features
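For reference, data spooling itself is already enabled per Job with the existing Spool Data directive, e.g.:
    Job {
      Name = "nightly-to-tape"
      Spool Data = yes    # spool to the SD spool directory before writing to tape
      # (other required Job directives omitted)
    }
The request above is about when the SD starts filling the spool, not about new syntax.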
400 Item 11: Include all conf files in specified directory
401 Date: 18 October 2008
402 Origin: Database, Lda. Maputo, Mozambique
403 Contact: Cameron Smith / cameron.ord@database.co.mz
406 What: A directive something like "IncludeConf = /etc/bacula/subconfs". Every
407 time Bacula Director restarts or reloads, it will walk the given
408 directory (non-recursively) and include the contents of any files
409 therein, as though they were appended to bacula-dir.conf
411 Why: Permits simplified and safer configuration for larger installations with
412 many client PCs. Currently, through judicious use of JobDefs and
413 similar directives, it is possible to reduce the client-specific part of
414 a configuration to a minimum. The client-specific directives can be
415 prepared according to a standard template and dropped into a known
416 directory. However it is still necessary to add a line to the "master"
417 (bacula-dir.conf) referencing each new file. This exposes the master to
418 unnecessary risk of accidental mistakes and makes automation of adding
419 new client-confs more difficult (it is easier to automate dropping a
420 file into a dir than rewriting an existing file). Kern has previously
421 made a convincing argument for NOT including Bacula's core configuration
422 in an RDBMS, but I believe that the present request is a reasonable
423 extension to the current "flat-file-based" configuration philosophy.
425 Notes: There is NO need for any special syntax to these files. They should
426 contain standard directives which are simply "inlined" to the parent
427 file as already happens when you explicitly reference an external file.
429 Notes: (kes) this can already be done with scripting
430 From: John Jorgensen <jorgnsn@lcd.uregina.ca>
431 The bacula-dir.conf at our site contains these lines:
434 # Include subfiles associated with configuration of clients.
435 # They define the bulk of the Clients, Jobs, and FileSets.
437 @|"sh -c 'for f in /etc/bacula/clientdefs/*.conf ; do echo @${f} ; done'"
439 and when we get a new client, we just put its configuration into
440 a new file called something like:
442 /etc/bacula/clientdefs/clientname.conf
445 Item 12: Multiple threads in file daemon for the same job
446 Date: 27 November 2005
447 Origin: Ove Risberg (Ove.Risberg at octocode dot com)
450 What: I want the file daemon to start multiple threads for a backup
451 job so the fastest possible backup can be made.
453 The file daemon could parse the FileSet information and start
454 one thread for each File entry located on a separate filesystem.
457 A configuration option in the job section should be used to
458 enable or disable this feature. The configuration option could
459 specify the maximum number of threads in the file daemon.
461 If the threads could spool the data to separate spool files,
462 the restore process would not be much slower.
464 Why: Multiple concurrent backups of a large fileserver with many
465 disks and controllers will be much faster.
467 Notes: (KES) This is not necessary and could be accomplished
468 by having two jobs. In addition, the current VSS code is single-threaded.
472 Item 13: Possibility to schedule Jobs on last Friday of the month
473 Origin: Carsten Menke <bootsy52 at gmx dot net>
477 What: Currently if you want to run your monthly Backups on the last
478 Friday of each month this is only possible with workarounds (e.g.
479 scripting), as some months have 4 Fridays and some have 5.
480 The same is true if you plan to run your yearly Backups on the
481 last Friday of the year. It would be nice to have the ability to
482 use the builtin scheduler for this.
484 Why: In many companies the last working day of the week is Friday (or
485 Saturday), so to get the most data of the month onto the monthly
486 tape, the employees are advised to insert the tape for the
487 monthly backups on the last Friday of the month.
489 Notes: To make this fully functional it would be nice if the
490 "first" and "last" keywords could be implemented in the
491 scheduler, so it is also possible to run monthly backups on the
492 first Friday of the month, and much more. If the syntax
493 were expanded to {first|last} {Month|Week|Day|Mo-Fri} of the
494 {Year|Month|Week}, you would be able to run really flexible jobs.
496 To get a certain Job to run on the last Friday of the month, for example:
499 Run = pool=Monthly last Fri of the Month at 23:50
503 Run = pool=Yearly last Fri of the Year at 23:50
505 ## Certain Jobs the last Week of a Month
507 Run = pool=LastWeek last Week of the Month at 23:50
509 ## Monthly Backup on the last day of the month
511 Run = pool=Monthly last Day of the Month at 23:50
514 Item 14: Include timestamp of job launch in "stat clients" output
515 Origin: Mark Bergman <mark.bergman@uphs.upenn.edu>
516 Date: Tue Aug 22 17:13:39 EDT 2006
519 What: The "stat clients" command doesn't include any detail on when
520 the active backup jobs were launched.
522 Why: Including the timestamp would make it much easier to decide whether
523 a job is running properly.
525 Notes: It may be helpful to have the output from "stat clients" formatted
526 more like that from "stat dir" (and other commands), in a column
527 format. The per-client information that's currently shown (level,
528 client name, JobId, Volume, pool, device, Files, etc.) is good, but
529 somewhat hard to parse (both programmatically and visually),
530 particularly when there are many active clients.
533 Item 15: Message mailing based on backup types
534 Origin: Evan Kaufman <evan.kaufman@gmail.com>
535 Date: January 6, 2006
538 What: In the "Messages" resource definitions, allow messages
539 to be mailed based on the type (backup, restore, etc.) and level
540 (full, differential, etc.) of the job that created the originating message.
543 Why: It would, for example, allow someone's boss to be emailed
544 automatically only when a Full Backup job runs, so he can
545 retrieve the tapes for offsite storage, even if the IT dept.
546 doesn't (or can't) explicitly notify him. At the same time, his
547 mailbox wouldn't be filled by notifications of Verifies, Restores,
548 or Incremental/Differential Backups (which would likely be kept onsite).
551 Notes: One way this could be done is through additional message types, for example:
555 # email the boss only on full system backups
556 Mail = boss@mycompany.com = full, !incremental, !differential, !restore, !verify, !admin
558 # email us only when something breaks
559 MailOnError = itdept@mycompany.com = all
562 Notes: Kern: This should be rather trivial to implement.
565 Item 16: Ability to import/export Bacula database entities
570 What: Create a Bacula ASCII SQL database independent format that permits
571 importing and exporting database catalog Job entities.
573 Why: For archival, database clustering, transfer to other databases
576 Notes: Job selection should be by Job, time, Volume, Client, Pool and possibly
580 Item 17: Implementation of running Job speed limit.
581 Origin: Alex F, alexxzell at yahoo dot com
582 Date: 29 January 2009
584 What: I noticed the need for an integrated bandwidth limiter for
585 running jobs. It would be very useful just to specify another
586 field in bacula-dir.conf, like speed = how much speed you wish
587 for that specific job to run at
589 Why: For a couple of reasons. First, it's very hard to implement a
590 traffic shaping utility and also make it reliable. Second, it is very
591 inconvenient to have to deploy such tools on, let's say, 50 clients
592 (including desktops and servers). This would also be unreliable because you
593 have to make sure that the apps are working properly when needed; users
594 could also disable them (accidentally or not). It would be very useful
595 to give Bacula this ability. All information would be centralized:
596 you would not have to go to 50 different clients in 10 different
597 locations for configuration, and eliminating 3rd party additions helps
598 efficiency. It would also avoid bandwidth congestion,
599 especially where there is little available.
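A purely hypothetical sketch of the requested directive in bacula-dir.conf; the name "Speed Limit" and its units are invented here for illustration and no such directive exists:
    Job {
      Name = "remote-office-backup"
      Speed Limit = 2 MB/s    # proposed: cap this job's network transfer rate
      # (other required Job directives omitted)
    }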
602 Item 18: Add an override in Schedule for Pools based on backup types
604 Origin: Chad Slater <chad.slater@clickfox.com>
607 What: Adding a FullStorage=BigTapeLibrary in the Schedule resource
608 would help those of us who use different storage devices for different
609 backup levels cope with the "auto-upgrade" of a backup.
611 Why: Assume I add several new devices to be backed up, i.e. several
612 hosts with 1TB RAID. To avoid tape switching hassles, incrementals are
613 stored in a disk set on a 2TB RAID. If you add these devices in the
614 middle of the month, the incrementals are upgraded to "full" backups,
615 but they try to use the same storage device as requested in the
616 incremental job, filling up the RAID holding the differentials. If we
617 could override the Storage parameter for full and/or differential
618 backups, then the Full job would use the proper Storage device, which
619 has more capacity (i.e. an 8TB tape library).
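A hypothetical Schedule fragment showing the requested override; Storage= is an existing per-run override, while FullStorage= is the proposed keyword, modeled on the existing FullPool/IncrementalPool overrides:
    Schedule {
      Name = "MixedMedia"
      Run = Level=Full Storage=BigTapeLibrary 1st sun at 23:05
      # proposed: if this incremental is auto-upgraded to Full, use the tape library instead
      Run = Level=Incremental Storage=DiskRAID FullStorage=BigTapeLibrary mon-sat at 23:05
    }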
622 Item 19: Automatic promotion of backup levels based on backup size
623 Date: 19 January 2006
624 Origin: Adam Thornton <athornton@sinenomine.net>
627 What: Other backup programs have a feature whereby they estimate the space
628 that a differential, incremental, and full backup would take. If
629 the difference in space required between the scheduled level and the
630 next level up is beneath some user-defined critical threshold, the
631 backup level is bumped to the next type. Doing this minimizes the
632 number of volumes necessary during a restore, with a fairly minimal
633 cost in backup media space.
635 Why: I know at least one (quite sophisticated and smart) user for whom the
636 absence of this feature is a deal-breaker in terms of using Bacula;
637 if we had it it would eliminate the one cool thing other backup
638 programs can do and we can't (at least, the one cool thing I know of).
642 Item 20: Allow FileSet inclusion/exclusion by creation/mod times
643 Origin: Evan Kaufman <evan.kaufman@gmail.com>
644 Date: January 11, 2006
647 What: In the vein of the Wild and Regex directives in a Fileset's
648 Options, it would be helpful to allow a user to include or exclude
649 files and directories by creation or modification times.
651 You could factor the Exclude=yes|no option in much the same way it
652 affects the Wild and Regex directives. For example, you could exclude
653 all files modified before a certain date:
657 Modified Before = ####
660 Or you could exclude all files created/modified since a certain date:
664 Created Modified Since = ####
667 The format of the time/date could be done several ways, say the number
668 of seconds since the epoch:
669 1137008553 = Jan 11 2006, 1:42:33PM # result of `date +%s`
671 Or a human readable date in a cryptic form:
672 20060111134233 = Jan 11 2006, 1:42:33PM # YYYYMMDDhhmmss
674 Why: I imagine a feature like this could have many uses. It would
675 allow a user to do a full backup while excluding the base operating
676 system files, so if I installed a Linux snapshot from a CD yesterday,
677 I'll *exclude* all files modified *before* today. If I need to
678 recover the system, I use the CD I already have, plus the tape backup.
679 Or if, say, a Windows client is hit by a particularly corrosive
680 virus, and I need to *exclude* any files created/modified *since* the infection.
683 Notes: Of course, this feature would work in concert with other
684 in/exclude rules, and wouldn't override them (or each other).
686 Notes: The directives I'd imagine would be along the lines of
687 "[Created] [Modified] [Before|Since] = <date>".
688 So one could compare against 'ctime' and/or 'mtime', but ONLY 'before' or 'since'.
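A hypothetical FileSet fragment showing where such a directive could sit; "Modified Before" is the proposed keyword, not an existing one:
    FileSet {
      Name = "Since-Snapshot"
      Include {
        Options {
          Modified Before = 1137008553   # proposed: matches files older than Jan 11 2006
          Exclude = yes                  # exclude the matched (old) files
        }
        File = /
      }
    }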
692 Item 21: Archival (removal) of User Files to Tape
694 Origin: Ray Pengelly [ray at biomed dot queensu dot ca]
697 What: The ability to archive data to storage based on certain parameters
698 such as age, size, or location. Once the data has been written to
699 storage and logged it is then pruned from the originating
700 filesystem. Note! We are talking about users' files and not Bacula Volumes.
703 Why: This would allow fully automatic storage management which becomes
704 useful for large datastores. It would also allow for auto-staging
705 from one media type to another.
707 Example 1) Medical imaging needs to store large amounts of data.
708 They decide to keep data on their servers for 6 months and then put
709 it away for long term storage. The server then finds all files
710 older than 6 months and writes them to tape. The files are then removed from disk.
713 Example 2) All data that hasn't been accessed in 2 months could be
714 moved from high-cost, fibre-channel disk storage to a low-cost
715 large-capacity SATA disk storage pool which doesn't have as quick an
716 access time. Then after another 6 months (or possibly as one
717 storage pool gets full) data is migrated to Tape.
720 Item 22: An option to operate on all pools with update vol parameters
721 Origin: Dmitriy Pinchukov <absh@bossdev.kiev.ua>
723 Status: Patch made by Nigel Stepp
725 What: When I do update -> Volume parameters -> All Volumes
726 from Pool, then I have to select pools one by one. I'd like
727 the console to have an option like "0: All Pools" in the list of defined pools.
730 Why: I have many pools and am therefore unhappy with manually
731 updating each of them using update -> Volume parameters -> All
732 Volumes from Pool -> pool #.
735 Item 23: Automatic disabling of devices
737 Origin: Peter Eriksson <peter at ifm.liu dot se>
740 What: After a configurable amount of fatal errors with a tape drive
741 Bacula should automatically disable further use of a certain
742 tape drive. There should also be "disable"/"enable" commands in the console.
745 Why: On a multi-drive jukebox there is a possibility of tape drives
746 going bad during large backups (needing a cleaning tape run,
747 tapes getting stuck). It would be advantageous if Bacula would
748 automatically disable further use of a problematic tape drive
749 after a configurable amount of errors has occurred.
751 An example: I have a multi-drive jukebox (6 drives, 380+ slots)
752 where tapes occasionally get stuck inside the drive. Bacula will
753 notice that the "mtx-changer" command will fail and then fail
754 any backup jobs trying to use that drive. However, it will still
755 keep on trying to run new jobs using that drive and fail -
756 forever, and thus failing lots and lots of jobs... Since we have
757 many drives Bacula could have just automatically disabled
758 further use of that drive and used one of the other ones instead.
762 Item 24: Ability to defer Batch Insert to a later time
767 What: Instead of doing a Job Batch Insert at the end of the Job
768 which might create resource contention with lots of Jobs,
769 defer the insert to a later time.
771 Why: Permits focusing on getting the data onto the Volume and
772 putting the metadata into the Catalog outside the backup window.
775 Notes: Will use the proposed Bacula ASCII database import/export
776 format (i.e. dependent on the import/export entities project).
779 Item 25: Add MaxVolumeSize/MaxVolumeBytes to Storage resource
780 Origin: Bastian Friedrich <bastian.friedrich@collax.com>
784 What: SD has a "Maximum Volume Size" statement, which is deprecated and
785 superseded by the Pool resource statement "Maximum Volume Bytes".
786 It would be good if either statement could be used in Storage resources as well.
789 Why: Pools do not have to be restricted to a single storage type/device;
790 thus, it may be impossible to define Maximum Volume Bytes in the
791 Pool resource. The old MaxVolSize statement is deprecated, as it
792 is SD side only. I am using the same pool for different devices.
794 Notes: State of idea currently unknown. Storage resources in the dir
795 config currently translate to very slim catalog entries; these
796 entries would require extensions to implement what is described
797 here. Quite possibly, numerous other statements that are currently
798 available in Pool resources could be used in Storage resources too
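A sketch of the requested usage in bacula-dir.conf; today "Maximum Volume Bytes" is accepted only in Pool resources, so its appearance in a Storage resource below is the proposal:
    Storage {
      Name = File1
      Address = sd.example.com
      Password = "secret"
      Device = FileStorage1
      Media Type = File
      Maximum Volume Bytes = 10 GB   # proposed, not currently accepted here
    }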
802 Item 26: Enable persistent naming/number of SQL queries
808 What: Change the parsing of the query.sql file and the query command so that
809 queries are named/numbered by a fixed value, not their order in the file.
814 Why: One of the real strengths of bacula is the ability to query the
815 database, and the fact that complex queries can be saved and
816 referenced from a file is very powerful. However, the choice
817 of query (both for interactive use, and by scripting input
818 to the bconsole command) is completely dependent on the order
819 within the query.sql file. The descriptive labels are helpful for
820 interactive use, but users become used to calling a particular
821 query "by number", or may use scripts to execute queries. This
822 presents a problem if the number or order of queries in the file changes.
825 If the query.sql file used the numeric tags as a real value (rather
826 than a comment), then users could have a higher confidence that they
827 are executing the intended query, and that their local changes wouldn't
828 conflict with future bacula upgrades.
830 For scripting, it's very important that the intended query is
831 what's actually executed. The current method of parsing the
832 query.sql file discourages scripting because the addition or
833 deletion of queries within the file will require corresponding
834 changes to scripts. It may not be obvious to users that deleting
835 query "17" in the query.sql file will require changing all
836 references to higher numbered queries. Similarly, when new
837 bacula distributions change the number of "official" queries,
838 user-developed queries cannot simply be appended to the file
839 without also changing any references to those queries in scripts
840 or procedural documentation, etc.
842 In addition, using fixed numbers for queries would encourage more
843 user-initiated development of queries, by supporting conventions such as:
846 queries numbered 1-50 are supported/developed/distributed
847 with official bacula releases
849 queries numbered 100-200 are community contributed, and are
850 related to media management
852 queries numbered 201-300 are community contributed, and are
853 related to checksums, finding duplicated files across
854 different backups, etc.
856 queries numbered 301-400 are community contributed, and are
857 related to backup statistics (average file size, size per
858 client per backup level, time for all clients by backup level,
859 storage capacity by media type, etc.)
861 queries numbered 500-999 are locally created
864 Alternatively, queries could be called by keyword (tag), rather than by number.
868 Item 27: Bacula Dir, FD and SD to support proxies
869 Origin: Karl Grindley @ MIT Lincoln Laboratory <kgrindley at ll dot mit dot edu>
873 What: Support alternate methods for nailing up a TCP session such
874 as SOCKS5, SOCKS4 and HTTP (CONNECT) proxies. Such a feature
875 would allow tunneling of bacula traffic in and out of proxied networks.
878 Why: Currently, bacula is architected to only function on a flat network, with
879 no barriers or limitations. Due to the large configuration states of
880 any network and the infinite configuration where file daemons and
881 storage daemons may sit in relation to one another, bacula often is
882 not usable on a network where filtered or air-gapped networks exist.
883 While often solutions such as ACL modifications to firewalls or port
884 redirection via SNAT or DNAT will solve the issue, often however,
885 these solutions are not adequate or not allowed by hard policy.
887 In an air-gapped network where only highly locked-down proxy services
888 are provided (SOCKS4/5 and/or HTTP and/or SSH outbound), ACLs or
889 iptables rules will not work.
891 Notes: Director resource tunneling: this configuration option to utilize a
892 proxy to connect to a client should be specified in the client
893 resource. Client resource tunneling: should this be configured in the client
894 resource in the director config file, or in the bacula-fd
895 configuration file on the fd host itself? If the latter, this would
896 allow only certain clients to use a proxy, where others do not, when
897 establishing the TCP connection to the storage server.
899 Also worth noting, there are other 3rd party, lightweight apps that
900 could be utilized to bootstrap this. Instead of socksifying bacula
901 itself, use an external program to broker proxy authentication, and
902 connection to the remote host. OpenSSH does this by using the
903 "ProxyCommand" syntax in the client configuration and uses stdin and
904 stdout to the command. Connect.c is a very popular one.
905 (http://bent.latency.net/bent/darcs/goto-san-connect-1.85/src/connect.html).
906 One could also possibly use stunnel, netcat, etc.
909 Item 28: Add Minimum Spool Size directive
911 Origin: Frank Sweetser <fs@wpi.edu>
913 What: Add a new SD directive, "minimum spool size" (or similar). This
914 directive would specify a minimum level of free space available for
915 spooling. If the unused spool space is less than this level, any
916 new spooling requests would be blocked as if the "maximum spool
917 size" threshold had been reached. Already spooling jobs would be
918 unaffected by this directive.
920 Why: I've been bitten by this scenario a couple of times:
922 Assume a maximum spool size of 100M. Two concurrent jobs, A and B,
923 are both running. Due to timing quirks and previously running jobs,
924 job A has used 99.9M of space in the spool directory. While A is
925 busy despooling to disk, B is happily using the remaining 0.1M of
926 spool space. This ends up in a spool/despool sequence every 0.1M of
927 data. In addition to fragmenting the data on the volume far more
928 than was necessary, in larger data sets (ie, tens or hundreds of
929 gigabytes) it can easily produce multi-megabyte report emails!
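A hypothetical bacula-sd.conf fragment; "Maximum Spool Size" exists today, while "Minimum Spool Size" is the directive proposed above:
    Device {
      Name = "LTO-4"
      Spool Directory = /var/bacula/spool
      Maximum Spool Size = 100 GB
      Minimum Spool Size = 10 GB   # proposed: block new spooling when free spool space drops below this
      # (other required Device directives omitted)
    }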
932 Item 29: Handle Windows Encrypted Files using Win raw encryption
933 Origin: Michael Mohr, SAG Mohr.External@infineon.com
934 Date: 22 February 2008
935 Origin: Alex Ehrlich (Alex.Ehrlich-at-mail.ee)
939 What: Make it possible to back up and restore Encrypted Files from and to
940 Windows systems without the need to decrypt them, by using the raw
941 encryption functions API (see:
942 http://msdn2.microsoft.com/en-us/library/aa363783.aspx)
943 that is provided for that reason by Microsoft.
944 Whether a file is encrypted can be examined by evaluating the
945 FILE_ATTRIBUTE_ENCRYPTED flag returned by the GetFileAttributes function.
947 For each file backed up or restored by FD on Windows, check if
948 the file is encrypted; if so then use OpenEncryptedFileRaw,
949 ReadEncryptedFileRaw, WriteEncryptedFileRaw,
950 CloseEncryptedFileRaw instead of BackupRead and BackupWrite
953 Why: Without the usage of this interface the fd-daemon running
954 under the system account can't read encrypted Files because
955 it lacks the key needed for decryption. As a result,
956 encrypted files are currently not backed up
957 by bacula, and no error is shown for these skipped files.
959 Notes: Using the xxxEncryptedFileRaw API would allow backing up and
960 restoring EFS-encrypted files without decrypting their data.
961 Note that such files cannot be restored "portably" (at least,
962 easily) but they would be restorable to a different (or
963 reinstalled) Win32 machine; the restore would require setup
964 of an EFS recovery agent in advance, of course, and this shall
965 be clearly reflected in the documentation, but this is the
966 normal Windows SysAdmin's business.
967 When "portable" backup is requested the EFS-encrypted files
968 shall be clearly reported as errors.
969 See MSDN on the "Backup and Restore of Encrypted Files" topic:
970 http://msdn.microsoft.com/en-us/library/aa363783.aspx
971 Maybe the EFS support requires a new flag in the database for each file.
973 Unfortunately, the implementation is not as straightforward as
974 1-to-1 replacement of BackupRead with ReadEncryptedFileRaw,
975 requiring some FD code rewrite to work with
976 encrypted-file-related callback functions.
979 Item 30: Implement a Storage device like Amazon's S3.
981 Origin: Soren Hansen <soren@ubuntu.com>
983 What: Enable the storage daemon to store backup data on Amazon's S3.
986 Why: Amazon's S3 is a cheap way to store data off-site.
988 Notes: If we configure the Pool to put only one job per volume (they don't
989 support the append operation), and the volume size isn't too big (100MB?),
990 it should be easy to adapt the disk-changer script to add a get/put
991 procedure with curl. So, the data would be safely copied during the Copy job.
994 Cloud should be only used with Copy jobs, users should always have
995 a copy of their data on their site.
997 We should also think about having our own cache, trying always to keep the
998 cloud volume on the local disk. (I don't know if users want to store
999 100GB on the cloud, so it shouldn't be a disk size problem). For example,
1000 if bacula wants to recycle a volume, it will start by downloading the
1001 file only to truncate it a few seconds later; it would be good if we could avoid that.
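The one-job-per-volume layout mentioned above can already be expressed with existing Pool directives, for example:
    Pool {
      Name = S3Copy
      Pool Type = Backup
      Maximum Volume Jobs = 1        # one job per volume, so no append is ever needed
      Maximum Volume Bytes = 100 MB  # keep each volume small enough to transfer comfortably
      Label Format = "s3vol-"
    }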
1003 Item 31: Convert tray monitor on Windows to a stand alone program
1008 What: Split the Win32 tray monitor out into a separate, stand-alone program.
1010 Why: Vista does not allow SYSTEM services to interact with the
1011 desktop, so the current tray monitor does not work on Vista
1014 Notes: Requires communicating with the FD via the network (simulate
1015 a console connection).
1018 Item 32: Relabel disk volume after recycling
1019 Origin: Pasi Kärkkäinen <pasik@iki.fi>
1021 Status: Not implemented yet, no code written.
1023 What: The ability to relabel the disk volume (and thus rename the file on the
1024 disk) after it has been recycled. Useful when you have a single job
1025 per disk volume, and you use a custom Label format, for example:
1027 "${Client}-${Level}-${NumVols:p/4/0/r}-${Year}_${Month}_${Day}-${Hour}_${Minute}"
1029 Why: Disk volumes in Bacula get the label/filename when they are used for the
1030 first time. If you use recycling and custom label format like above,
1031 the disk volume name doesn't match the contents after it has been
1032 recycled. This feature makes it possible to keep the label/filename
1033 in sync with the content and thus makes it easy to check/monitor the
1034 backups from the shell and/or normal file management tools, because
1035 the filenames of the disk volumes match the content.
1037 Notes: The configuration option could be "Relabel after Recycling = Yes".
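A hypothetical Pool fragment combining the existing Label Format with the proposed directive:
    Pool {
      Name = ClientDisk
      Pool Type = Backup
      Label Format = "${Client}-${Level}-${NumVols:p/4/0/r}-${Year}_${Month}_${Day}-${Hour}_${Minute}"
      Relabel after Recycling = yes   # proposed, not an existing directive
    }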
1039 Item 33: Command that releases all drives in an autochanger
1040 Origin: Blake Dunlap (blake@nxs.net)
1044 What: It would be nice if there was a release command that
1045 would release all drives in an autochanger instead of having to
1046 do each one in turn.
1048 Why: It can take some time for a release to occur, and the
1049 commands must be given for each drive in turn, which can quickly
1050 add up if there are several drives in the library. (Having to
1051 watch the console to give each command can waste a good bit of
1052 time when you get into the 16-drive range, where the
1053 tapes can take up to 3 minutes each to eject.)
1055 Notes: Due to the way some autochangers/libraries work, you
1056 cannot assume that new tapes inserted will go into slots that are
1057 not currently believed to be in use by bacula (the tape from that
1058 slot is in a drive). This would make any changes in
1059 configuration quicker/easier, as all drives need to be released
1060 before any modifications to slots.
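Today each drive has to be released with its own console command, roughly as below; the "alldrives" form only illustrates the requested single command and is not existing syntax:
    *release storage=<storage> drive=0
    *release storage=<storage> drive=1
    (one command per remaining drive ...)
    *release storage=<storage> alldrives   # proposed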
1062 Item 34: Run bscan on a remote storage daemon from within bconsole.
1063 Date: 07 October 2009
1064 Origin: Graham Keeling <graham@equiinet.com>
1067 What: The ability to be able to run bscan on a remote storage daemon from
1068 within bconsole in order to populate your catalog.
1070 Why: Currently, it seems you have to:
1071 a) log in to a console on the remote machine
1072 b) figure out where the storage daemon config file is
1073 c) figure out the storage device from the config file
1074 d) figure out the catalog IP address
1075 e) figure out the catalog port
1076 f) open the port on the catalog firewall
1077 g) configure the catalog database to accept connections from the remote machine
1079 h) build a 'bscan' command from (b)-(e) above and run it
1080 It would be much nicer to be able to type something like this into bconsole:
1082 *bscan storage=<storage> device=<device> volume=<volume>
1084 *bscan storage=<storage> all
1085 It seems to me that the scan could also do a better job than the
1086 external bscan program currently does. It would possibly be able to
1087 deduce some extra details, such as the catalog StorageId for the volumes.
1090 Notes: (Kern). If you need to do a bscan, you have done something wrong,
1091 so this functionality should not need to be integrated into
1092 the Storage daemon. However, I am not opposed to someone implementing
1093 this feature providing that all the code is in a shared object (or dll)
1094 and does not add significantly to the size of the Storage daemon. In
1095 addition, the code should be written in a way such that the same source
1096 code is used in both the bscan program and the Storage daemon to avoid
1097 adding a lot of new code that must be maintained by the project.
1099 Item 35: Implement a Migration job type that will create a reverse
1100 incremental (or decremental) backup from two existing full backups.
1101 Date: 05 October 2009
1102 Origin: Griffith College Dublin. Some sponsorship available.
1103 Contact: Gavin McCullagh <gavin.mccullagh@gcd.ie>
1106 What: The ability to take two full backup jobs and derive a reverse
1107 incremental backup from them. The older full backup data may then be discarded.
1110 Why: Long-term backups based on keeping full backups can be expensive in
1111 media. In many cases (eg a NAS), as the client accumulates files
1112 over months and years, the same file will be duplicated unchanged,
1113 across many media and datasets. Eg, Less than 10% (and
1114 shrinking) of our monthly full mail server backup is new files,
1115 the other 90% is also in the previous full backup.
1116 Regularly converting the oldest full backup into a reverse
1117 incremental backup allows the admin to keep access to old backup
1118 jobs, but remove all of the duplicated files, freeing up media.
1120 Notes: This feature was previously discussed on the bacula-devel list
1121 here: http://www.mail-archive.com/bacula-devel@lists.sourceforge.net/msg04962.html
1123 Item 36: Job migration between different SDs
1124 Origin: Mariusz Czulada <manieq AT wp DOT eu>
1128 What: Allow a migration job to specify devices on a Storage Daemon other than
1129 the one used for the migrated jobs (possibly on a different/distant host).
1131 Why: Sometimes we have more than one system which requires backup
1132 implementation. Often, these systems are functionally unrelated and
1133 placed in different locations. Having a big backup device (a tape
1134 library) in each location is not cost-effective. It would be much
1135 better to have one powerful enough tape library which could handle
1136 backups from all systems, assuming relatively fast and reliable WAN
1137 connections. In such an architecture backups are done in service windows
1138 on local bacula servers, then migrated to central storage off peak hours.
1141 Notes: If migration to a different SD is working, migration to the same SD, as
1142 now, could be done the same way (I mean via 'localhost'), unifying the two cases.
1145 Item 37: Concurrent spooling and despooling within a single job.
1147 Origin: Jesper Krogh <jesper@krogh.cc>
1149 What: When a job has spooling enabled and the spool area size is
1150 less than the total volumes size the storage daemon will:
1151 1) Spool to spool-area
1152 2) Despool to tape
1153 3) Go to 1 if more data to be backed up.
1155 Typical disks will serve data with a speed of 100MB/s when
1156 dealing with large files; the network is typically capable of doing 115MB/s
1157 (GbitE). Tape drives will despool at 50-90MB/s (LTO3) or 70-120MB/s
1158 (LTO4) depending on compression and data.
1160 As bacula currently works it'll hold back data from the client until
1161 de-spooling is done, no matter whether the spool area could handle another
1162 block of data. Say, given a FileSet of 4TB, a spool-area of 100GB and
1163 a Maximum Job Spool Size set to 50GB, the above sequence could be
1164 changed to spool into the other 50GB while despooling the first
1165 50GB, without holding back the client while doing it. As the numbers below
1166 show, depending on tape-drive and disk-arrays this could potentially
1167 cut the backup time of the individual jobs by up to 50%.
1169 Real-world example: backing up 112.6GB (large files) to LTO4 tapes
1170 (despools with ~75MB/s; data is gzipped on the remote filesystem).
1171 Maximum Job Spool Size = 8GB
1175 Elapsed time (total time): 46m 15s => 2775s
1176 Despooling time: 25m 41s => 1541s (55%)
1177 Spooling time: 20m 34s => 1234s (45%)
1178 Reported speed: 40.58MB/s
1179 Spooling speed: 112.6GB/1234s => 91.25MB/s
1180 Despooling speed: 112.6GB/1541s => 73.07MB/s
1182 So disk + net can "keep up" with the LTO4 drive (in this test)
1184 The proposed change would effectively make the backup run in the "despooling
1185 time" of 1541s, a reduction to 55% of the total run time.
1187 In the situation where the individual job cannot keep up with the LTO drive,
1188 spooling enables efficient multiplexing of multiple concurrent jobs onto the same drive.
1191 Why: When dealing with larger volumes it is important to maximize the general
1192 utilization of the network/disk in order to be able to run a full
1193 backup over a weekend. The current work-around is to split the FileSet into
1194 smaller FileSets and Jobs, but that leads to more configuration management,
1195 is harder to review for completeness, and makes restores more complex.
1198 Item 39: Extend the verify code to make it possible to verify
1199 older jobs, not only the last one that has finished
1201 Origin: Ralf Gross (Ralf-Lists <at> ralfgross.de)
1202 Status: not implemented or documented
1204 What: At the moment a VolumeToCatalog job compares only the
1205 last job with the data in the catalog. It's not possible
1206 to compare the data (md5sums) of an older volume with the
1207 data in the catalog.
1209 Why: If a verify job fails, one has to immediately check the
1210 source of the problem, fix it and rerun the verify job.
1211 This has to happen before the next backup of the
1212 verified backup job starts.
1213 More important: It's not possible to check jobs that are
1214 kept for a long time (archive). If a jobid could be
1215 specified for a verify job, older backups/tapes could be
1216 checked on a regular basis.
1218 Notes: verify documentation:
1219 VolumeToCatalog: This level causes Bacula to read the file
1220 attribute data written to the Volume from the last Job [...]
1222 Verify Job = <Job-Resource-Name> If you run a verify job
1223 without this directive, the last job run will be compared
1224 with the catalog, which means that you must immediately
1225 follow a backup by a verify command. If you specify a Verify
1226 Job Bacula will find the last job with that name that ran [...]
1228 example bconsole verify dialog:
1231 JobName: VerifyServerXXX
1232 Level: VolumeToCatalog
1233 Client: ServerXXX-fd
1234 FileSet: ServerXXX-Vol1
1235 Pool: Full (From Job resource)
1236 Storage: Neo4100 (From Pool resource)
1237 Verify Job: ServerXXX-Vol1
1239 When: 2009-04-20 09:03:04
1241 OK to run? (yes/mod/no): m
1242 Parameters to modify:
1255 Item 40: Separate "Storage" and "Device" in the bacula-dir.conf
1257 Origin: "James Harper" <james.harper@bendigoit.com.au>
1258 Status: not implemented or documented
1260 What: Separate "Storage" and "Device" in the bacula-dir.conf
1261 The resulting config would look something like:
1264 Name = name_of_server
1265 Address = hostname/IP address
1267 Password = shh_its_a_secret
1268 Maximum Concurrent Jobs = 7
1272 Name = name_of_device
1273 Storage = name_of_server
1274 Device = name_of_device_on_sd
1275 Media Type = media_type
1276 Maximum Concurrent Jobs = 1
1279 Maximum Concurrent Jobs would be specified with a server and a device
1280 maximum, which would both be honoured by the director. Almost everything
1281 that mentions a 'Storage' would need to be changed to 'Device', although
1282 perhaps a 'Storage' would just be a synonym for 'Device' for backwards compatibility.
1285 Why: If you have multiple Storage definitions pointing to different
1286 Devices in the same Storage daemon, the "status storage" command
1287 prompts for each different device, but they all give the same output.
1292 ========= New items after last vote ====================
1295 ========= Add new items above this line =================
1298 ============= Empty Feature Request form ===========
1299 Item n: One line summary ...
1300 Date: Date submitted
1301 Origin: Name and email of originator.
1304 What: More detailed explanation ...
1306 Why: Why it is important ...
1308 Notes: Additional notes or features (omit if not used)
1309 ============== End Feature Request form ==============
1312 ========== Items put on hold by Kern ============================
1315 ========== Items completed in version 5.0.0 ====================
1316 *Item 2: 'restore' menu: enter a JobId, automatically select dependents
1317 *Item 5: Deletion of disk Volumes when pruned (partial -- truncate when pruned)
1318 *Item 6: Implement Base jobs
1319 *Item 10: Restore from volumes on multiple storage daemons
1320 *Item 15: Enable/disable compression depending on storage device (disk/tape)
1321 *Item 20: Cause daemons to use a specific IP address to source communications
1322 *Item 23: "Maximum Concurrent Jobs" for drives when used with changer device
1323 *Item 31: List InChanger flag when doing restore.
1324 *Item 35: Port bat to Win32