git.sur5r.net Git - bacula/docs/blob - docs/manual/catmaintenance.tex

   1 %%
   2 %%
   3
   4 \chapter{Catalog Maintenance}
   5 \label{CatMaintenanceChapter}
   6 \index[general]{Maintenance!Catalog }
   7 \index[general]{Catalog Maintenance }
   8
   9 Without proper setup and maintenance, your Catalog may continue to grow
  10 indefinitely as you run Jobs and backup Files, and/or it may become
  11 very inefficient and slow. How fast the size of your
  12 Catalog grows depends on the number of Jobs you run and how many files they
  13 backup. By deleting records within the database, you can make space available
  14 for the new records that will be added during the next Job. By constantly
  15 deleting old expired records (dates older than the Retention period), your
  16 database size will remain constant.
  17
  18 If you started with the default configuration files, they already contain
  19 reasonable defaults for a small number of machines (less than 5), so if you
  20 fall into that case, catalog maintenance will not be urgent if you have a few
  21 hundred megabytes of disk space free. Whatever the case may be, some knowledge
  22 of retention periods will be useful.
  23 \label{Retention}
  24
  25 \section{Setting Retention Periods}
  26 \index[general]{Setting Retention Periods }
  27 \index[general]{Periods!Setting Retention }
  28
  29 {\bf Bacula} uses three Retention periods: the {\bf File Retention} period,
  30 the {\bf Job Retention} period, and the {\bf Volume Retention} period. Of
  31 these three, the File Retention period is by far the most important in
  32 determining how large your database will become.
  33
  34 The {\bf File Retention} and the {\bf Job Retention} are specified in each
  35 Client resource as is shown below. The {\bf Volume Retention} period is
  36 specified in the Pool resource, and the details are given in the next chapter
  37 of this manual.
  38
  39 \begin{description}
  40
  41 \item [File Retention = \lt{}time-period-specification\gt{}]
  42    \index[dir]{File Retention  }
  43    The  File Retention record defines the length of time that  Bacula will keep
  44 File records in the Catalog database.  When this time period expires, and if
  45 {\bf AutoPrune} is set to {\bf yes}, Bacula will prune (remove) File records
  46 that  are older than the specified File Retention period. The pruning  will
  47 occur at the end of a backup Job for the given Client.  Note that the Client
  48 database record contains a copy of the  File and Job retention periods, but
  49 Bacula uses the  current values found in the Director's Client resource to  do
  50 the pruning.
  51
  52 Since File records in the database account for probably 80 percent of the
  53 size of the database, you should carefully determine exactly what File
  54 Retention period you need. Once the File records have been removed from
  55 the database, you will no longer be able to restore individual files
  56 in a Job. However, with Bacula version 1.37 and later, as long as the
  57 Job record still exists, you will be able to restore all files in the
  58 job.
  59
  60 Retention periods are specified in seconds, but as a convenience, there are
  61 a number of modifiers that permit easy specification in terms of minutes,
  62 hours, days, weeks, months, quarters, or years on the record.  See the
  63 \ilink{ Configuration chapter}{Time} of this manual for additional details
  64 of modifier specification.
  65
  66 The default File retention period is 60 days.
  67
  68 \item [Job Retention = \lt{}time-period-specification\gt{}]
  69    \index[dir]{Job Retention  }
  70    The Job Retention record defines the length of time that {\bf Bacula}
  71 will keep Job records in the Catalog database.  When this time period
  72 expires, and if {\bf AutoPrune} is set to {\bf yes} Bacula will prune
  73 (remove) Job records that are older than the specified Job Retention
  74 period.  Note, if a Job record is selected for pruning, all associated File
  75 and JobMedia records will also be pruned regardless of the File Retention
  76 period set.  As a consequence, you normally will set the File retention
  77 period to be less than the Job retention period.
  78
  79 As mentioned above, once the File records are removed from the database,
  80 you will no longer be able to restore individual files from the Job.
  81 However, as long as the Job record remains in the database, you will be
  82 able to restore all the files backuped for the Job (on version 1.37 and
  83 later). As a consequence, it is generally a good idea to retain the Job
  84 records much longer than the File records.
  85
  86 The retention period is specified in seconds, but as a convenience, there
  87 are a number of modifiers that permit easy specification in terms of
  88 minutes, hours, days, weeks, months, quarters, or years.  See the \ilink{
  89 Configuration chapter}{Time} of this manual for additional details of
  90 modifier specification.
  91
  92 The default Job Retention period is 180 days.
  93
  94 \item [AutoPrune = \lt{}yes/no\gt{}]
  95    \index[dir]{AutoPrune  }
  96    If AutoPrune is set to  {\bf yes} (default), Bacula will  automatically apply
  97 the File retention period and the Job  retention period for the Client at the
  98 end of the Job.
  99
 100 If you turn this off by setting it to {\bf no}, your  Catalog will grow each
 101 time you run a Job.
 102 \end{description}
 103
 104 \label{CompactingMySQL}
 105 \section{Compacting Your MySQL Database}
 106 \index[general]{Database!Compacting Your MySQL }
 107 \index[general]{Compacting Your MySQL Database }
 108
 109 Over time, as noted above, your database will tend to grow. I've noticed that
 110 even though Bacula regularly prunes files, {\bf MySQL} does not effectively
 111 use the space, and instead continues growing. To avoid this, from time to
 112 time, you must compact your database. Normally, large commercial database such
 113 as Oracle have commands that will compact a database to reclaim wasted file
 114 space. MySQL has the {\bf OPTIMIZE TABLE} command that you can use, and SQLite
 115 version 2.8.4 and greater has the {\bf VACUUM} command. We leave it to you to
 116 explore the utility of the {\bf OPTIMIZE TABLE} command in MySQL.
 117
 118 All database programs have some means of writing the database out in ASCII
 119 format and then reloading it. Doing so will re-create the database from
 120 scratch producing a compacted result, so below, we show you how you can do
 121 this for MySQL, PostgreSQL and SQLite.
 122
 123 For a {\bf MySQL} database, you could write the Bacula database as an ASCII
 124 file (bacula.sql) then reload it by doing the following:
 125
 126 \footnotesize
 127 \begin{verbatim}
 128 mysqldump -f --opt bacula > bacula.sql
 129 mysql bacula < bacula.sql
 130 rm -f bacula.sql
 131 \end{verbatim}
 132 \normalsize
 133
 134 Depending on the size of your database, this will take more or less time and a
 135 fair amount of disk space. For example, if I cd to the location of the MySQL
 136 Bacula database (typically /opt/mysql/var or something similar) and enter:
 137
 138 \footnotesize
 139 \begin{verbatim}
 140 du bacula
 141 \end{verbatim}
 142 \normalsize
 143
 144 I get {\bf 620,644} which means there are that many blocks containing 1024
 145 bytes each or approximately 635 MB of data. After doing the {\bf mysqldump}, I
 146 had a bacula.sql file that had {\bf 174,356} blocks, and after doing the {\bf
 147 mysql} command to recreate the database, I ended up with a total of {\bf
 148 210,464} blocks rather than the original {\bf 629,644}. In other words, the
 149 compressed version of the database took approximately one third of the space
 150 of the database that had been in use for about a year.
 151
 152 As a consequence, I suggest you monitor the size of your database and from
 153 time to time (once every six months or year), compress it.
 154
 155 \label{DatabaseRepair}
 156 \label{RepairingMySQL}
 157 \section{Repairing Your MySQL Database}
 158 \index[general]{Database!Repairing Your MySQL }
 159 \index[general]{Repairing Your MySQL Database }
 160
 161 If you find that you are getting errors writing to your MySQL database, or
 162 Bacula hangs each time it tries to access the database, you should consider
 163 running MySQL's database check and repair routines. The program you need to
 164 run depends on the type of database indexing you are using. If you are using
 165 the default, you will probably want to use {\bf myisamchk}. For more details
 166 on how to do this, please consult the MySQL document at:
 167 \elink{
 168 http://www.mysql.com/doc/en/Repair.html}
 169 {http://www.mysql.com/doc/en/Repair.html}.
 170
 171 If the errors you are getting are simply SQL warnings, then you might try
 172 running dbcheck before (or possibly after) using the MySQL database repair
 173 program. It can clean up many of the orphaned record problems, and certain
 174 other inconsistencies in the Bacula database.
 175
 176 \section{MySQL Table is Full}
 177 \index[general]{Database!MySQL Table is Full}
 178 \index[general]{MySQL Table is Full}
 179
 180
 181 If you are running into the error {\bf The table 'File' is full ...},
 182 it is probably because on version 4.x MySQL, the table is limited by
 183 default to a maximum size of 4 GB and you have probably run into
 184 the limit. The solution can be found at:
 185 \elink{http://dev.mysql.com/doc/refman/5.0/en/full-table.html}
 186 {http://dev.mysql.com/doc/refman/5.0/en/full-table.html}
 187
 188 You can display the maximum length of your table with:
 189
 190 \footnotesize
 191 \begin{verbatim}
 192 mysql bacula
 193 SHOW TABLE STATUS FROM bacula like "File";
 194 \end{verbatim}
 195 \normalsize
 196
 197 If the column labeled "Max\_data\_length" is around 4Gb, this is likely
 198 to be the source of your problem, and you can modify it with:
 199
 200 \footnotesize
 201 \begin{verbatim}
 202 mysql bacula
 203 ALTER TABLE File MAX_ROWS=281474976710656;
 204 \end{verbatim}
 205 \normalsize
 206
 207 Alternatively you can modify your /etc/my.conf file before creating the
 208 Bacula tables, and in the [mysqld] section set:
 209
 210 \footnotesize
 211 \begin{verbatim}
 212 set-variable = myisam_data_pointer_size=6
 213 \end{verbatim}
 214 \normalsize
 215
 216 The above myisam data pointer size must be made before you create your
 217 Bacula tables or it will have no effect.
 218
 219 The row and pointer size changes should already be the default on MySQL
 220 version 5.x, so making these changes should only be necessary on MySQL 4.x
 221 depending on the size of your catalog database.
 222
 223 \section{MySQL Server Has Gone Away}
 224 \index[general]{Database!MySQL Server Has Gone Away}
 225 \index[general]{MySQL Server Has Gone Away}
 226 If you are having problems with the MySQL server disconnecting or with
 227 messages saying that your MySQL server has gone away, then please read
 228 the MySQL documentation, which can be found at:
 229
 230 \elink{http://dev.mysql.com/doc/refman/5.0/en/gone-away.html}
 231 {http://dev.mysql.com/doc/refman/5.0/en/gone-away.html}
 232
 233
 234 \label{RepairingPSQL}
 235 \section{Repairing Your PostgreSQL Database}
 236 \index[general]{Database!Repairing Your PostgreSQL }
 237 \index[general]{Repairing Your PostgreSQL Database }
 238
 239 The same considerations apply that are indicated above for MySQL. That is,
 240 consult the PostgreSQL documents for how to repair the database, and also
 241 consider using Bacula's dbcheck program if the conditions are reasonable for
 242 using (see above).
 243
 244 \label{DatabasePerformance}
 245 \section{Database Performance Issues}
 246 \index[general]{Database Performance Issues}
 247 \index[general]{Performance!Database}
 248
 249 There are a considerable number of ways each of the databases can be
 250 tuned to improve the performance. Going from an untuned database to one
 251 that is properly tuned can make a difference of a factor of 100 or more
 252 in the time to insert or search for records.
 253
 254 For each of the databases, you may get significant improvements by adding
 255 additional indexes. The comments in the Bacula make\_xxx\_tables give some
 256 indications as to what indexes may be appropriate.  Please see below
 257 for specific instructions on checking indexes.
 258
 259 For MySQL, what is very important is to use the examine the
 260 my.cnf file (usually in /etc/my.cnf).
 261 You may obtain significant performances by switching to
 262 the my-large.cnf or my-huge.cnf files that come with the MySQL source
 263 code.
 264
 265 For SQLite3, one significant factor in improving the performance is
 266 to ensure that there is a "PRAGMA synchronous = NORMAL;" statement.
 267 This reduces the number of times that the database flushes the in memory
 268 cache to disk. There are other settings for this PRAGMA that can
 269 give even further performance improvements at the risk of a database
 270 corruption if your system crashes.
 271
 272 For PostgreSQL, you might want to consider turning fsync off.  Of course
 273 doing so can cause corrupted databases in the event of a machine crash.
 274 There are many different ways that you can tune PostgreSQL, the
 275 following document discusses a few of them:
 276 \elink{
 277 http://www.varlena.com/varlena/GeneralBits/Tidbits/perf.html}
 278 {http://www.varlena.com/varlena/GeneralBits/Tidbits/perf.html}.
 279
 280 There is also a PostgreSQL FAQ question number 3.3 that may
 281 answer some of your questions about how to improve performance
 282 of the PostgreSQL engine:
 283 \elink{
 284 http://www.postgresql.org/docs/faqs.FAQ.html\#3.3}
 285 {http://www.postgresql.org/docs/faqs.FAQ.html\#3.3}.
 286 % TODO: verify above is correct. is this okay for book?
 287
 288 Also for PostgreSQL, look at what "effective\_cache\_size". For a 2GB memory
 289 machine, you probably want to set it at 131072, but don't set it too high.
 290 In addition, for a 2GB system, work\_mem = 256000 and
 291 maintenance\_work\_mem = 256000 seem to be reasonable values.  Make
 292 sure your checkpoint\_segments is set to at least 8.
 293
 294
 295
 296 \section{Performance Issues Indexes}
 297 \index[general]{Database Performance Issues Indexes}
 298 \index[general]{Performance!Database}
 299 One of the most important considerations for improving performance on
 300 the Bacula database is to ensure that it has all the appropriate indexes.
 301 Several users have reported finding that their database did not have
 302 all the indexes in the default configuration.  In addition, you may
 303 find that because of your own usage patterns, you need additional indexes.
 304
 305 The most important indexes for performance are the three indexes on the
 306 {\bf File} table.  The first index is on {\bf FileId} and is automatically
 307 made because it is the unique key used to access the table.  The other
 308 two are the JobId index and the (Filename, PathId) index.  If these Indexes
 309 are not present, your performance may suffer a lot.
 310
 311 \subsection{PostgreSQL Indexes}
 312 On PostgreSQL, you can check to see if you have the proper indexes using
 313 the following commands:
 314
 315 \footnotesize
 316 \begin{verbatim}
 317 psql bacula
 318 select * from pg_indexes where tablename='file';
 319 \end{verbatim}
 320 \normalsize
 321
 322 If you do not see output that indicates that all three indexes
 323 are created, you can create the two additional indexes using:
 324
 325 \footnotesize
 326 \begin{verbatim}
 327 psql bacula
 328 CREATE INDEX file_jobid_idx on file (jobid);
 329 CREATE INDEX file_fp_idx on file (filenameid, pathid);
 330 \end{verbatim}
 331 \normalsize
 332
 333 \subsection{MySQL Indexes}
 334 On MySQL, you can check if you have the proper indexes by:
 335
 336 \footnotesize
 337 \begin{verbatim}
 338 mysql bacula
 339 show index from File;
 340 \end{verbatim}
 341 \normalsize
 342
 343 If the indexes are not present, especially the JobId index, you can
 344 create them with the following commands:
 345
 346 \footnotesize
 347 \begin{verbatim}
 348 mysql bacula
 349 CREATE INDEX file_jobid_idx on File (JobId);
 350 CREATE INDEX file_jpf_idx on File (Job, FilenameId, PathId);
 351 \end{verbatim}
 352 \normalsize
 353
 354 Though normally not a problem, you should ensure that the indexes
 355 defined for Filename and Path are both set to 255 characters. Some users
 356 reported performance problems when their indexes were set to 50 characters.
 357 To check, do:
 358
 359 \footnotesize
 360 \begin{verbatim}
 361 mysql bacula
 362 show index from Filename;
 363 show index from Path;
 364 \end{verbatim}
 365 \normalsize
 366
 367 and what is important is that for Filename, you have an index with
 368 Key\_name "Name" and Sub\_part "255". For Pth, you should have a Key\_name
 369 "Path" and Sub\_part "255".  If one or the other does not exist or the
 370 Sub\_part is less that 255, you can drop and recreate the appropriate
 371 index with:
 372
 373 \footnotesize
 374 \begin{verbatim}
 375 mysql bacula
 376 DROP INDEX Path on Path;
 377 CREATE INDEX Path on Path (Path(255);
 378
 379 DROP INDEX Name on Filename;
 380 CREATE INDEX Name on Filename (Name(255));
 381 \end{verbatim}
 382 \normalsize
 383
 384
 385 \subsection{SQLite Indexes}
 386 On SQLite, you can check if you have the proper indexes by:
 387
 388 \footnotesize
 389 \begin{verbatim}
 390 sqlite <path>bacula.db
 391 select * from sqlite_master where type='index' and tbl_name='File';
 392 \end{verbatim}
 393 \normalsize
 394
 395 If the indexes are not present, especially the JobId index, you can
 396 create them with the following commands:
 397
 398 \footnotesize
 399 \begin{verbatim}
 400 mysql bacula
 401 CREATE INDEX file_jobid_idx on File (JobId);
 402 CREATE INDEX file_jfp_idx on File (Job, FilenameId, PathId);
 403 \end{verbatim}
 404 \normalsize
 405
 406
 407
 408 \label{CompactingPostgres}
 409 \section{Compacting Your PostgreSQL Database}
 410 \index[general]{Database!Compacting Your PostgreSQL }
 411 \index[general]{Compacting Your PostgreSQL Database }
 412
 413 Over time, as noted above, your database will tend to grow. I've noticed that
 414 even though Bacula regularly prunes files, PostgreSQL has a {\bf VACUUM}
 415 command that will compact your database for you. Alternatively you may want to
 416 use the {\bf vacuumdb} command, which can be run from a cron job.
 417
 418 All database programs have some means of writing the database out in ASCII
 419 format and then reloading it. Doing so will re-create the database from
 420 scratch producing a compacted result, so below, we show you how you can do
 421 this for PostgreSQL.
 422
 423 For a {\bf PostgreSQL} database, you could write the Bacula database as an
 424 ASCII file (bacula.sql) then reload it by doing the following:
 425
 426 \footnotesize
 427 \begin{verbatim}
 428 pg_dump -c bacula > bacula.sql
 429 cat bacula.sql | psql bacula
 430 rm -f bacula.sql
 431 \end{verbatim}
 432 \normalsize
 433
 434 Depending on the size of your database, this will take more or less time and a
 435 fair amount of disk space. For example, you can {\bf cd} to the location of
 436 the Bacula database (typically /usr/local/pgsql/data or possible
 437 /var/lib/pgsql/data) and check the size.
 438
 439 There are certain PostgreSQL users who do not recommend the above
 440 procedure. They have the following to say:
 441 PostgreSQL does not
 442 need to be dumped/restored to keep the database efficient.  A normal
 443 process of vacuuming will prevent the database from every getting too
 444 large.  If you want to fine-tweak the database storage, commands such
 445 as VACUUM FULL, REINDEX, and CLUSTER exist specifically to keep you
 446 from having to do a dump/restore.
 447
 448 Finally, you might want to look at the PostgreSQL documentation on
 449 this subject at
 450 \elink{http://www.postgresql.org/docs/8.1/interactive/maintenance.html}
 451 {http://www.postgresql.org/docs/8.1/interactive/maintenance.html}.
 452
 453 \section{Compacting Your SQLite Database}
 454 \index[general]{Compacting Your SQLite Database }
 455 \index[general]{Database!Compacting Your SQLite }
 456
 457 First please read the previous section that explains why it is necessary to
 458 compress a database. SQLite version 2.8.4 and greater have the {\bf Vacuum}
 459 command for compacting the database.
 460
 461 \footnotesize
 462 \begin{verbatim}
 463 cd {\bf working-directory}
 464 echo 'vacuum;' | sqlite bacula.db
 465 \end{verbatim}
 466 \normalsize
 467
 468 As an alternative, you can use the following commands, adapted to your system:
 469
 470
 471 \footnotesize
 472 \begin{verbatim}
 473 cd {\bf working-directory}
 474 echo '.dump' | sqlite bacula.db > bacula.sql
 475 rm -f bacula.db
 476 sqlite bacula.db < bacula.sql
 477 rm -f bacula.sql
 478 \end{verbatim}
 479 \normalsize
 480
 481 Where {\bf working-directory} is the directory that you specified in the
 482 Director's configuration file. Note, in the case of SQLite, it is necessary to
 483 completely delete (rm) the old database before creating a new compressed
 484 version.
 485
 486 \section{Migrating from SQLite to MySQL}
 487 \index[general]{MySQL!Migrating from SQLite to }
 488 \index[general]{Migrating from SQLite to MySQL }
 489
 490 You may begin using Bacula with SQLite then later find that you want to switch
 491 to MySQL for any of a number of reasons: SQLite tends to use more disk than
 492 MySQL; when the database is corrupted it is often more catastrophic than
 493 with MySQL or PostgreSQL.
 494 Several users have succeeded in converting from SQLite to MySQL by
 495 exporting the MySQL data and then processing it with Perl scripts
 496 prior to putting it into MySQL. This is, however, not a simple
 497 process.
 498
 499 \label{BackingUpBacula}
 500 \section{Backing Up Your Bacula Database}
 501 \index[general]{Backing Up Your Bacula Database }
 502 \index[general]{Database!Backing Up Your Bacula }
 503
 504 If ever the machine on which your Bacula database crashes, and you need to
 505 restore from backup tapes, one of your first priorities will probably be to
 506 recover the database. Although Bacula will happily backup your catalog
 507 database if it is specified in the FileSet, this is not a very good way to do
 508 it, because the database will be saved while Bacula is modifying it. Thus the
 509 database may be in an instable state. Worse yet, you will backup the database
 510 before all the Bacula updates have been applied.
 511
 512 To resolve these problems, you need to backup the database after all the backup
 513 jobs have been run. In addition, you will want to make a copy while Bacula is
 514 not modifying it. To do so, you can use two scripts provided in the release
 515 {\bf make\_catalog\_backup} and {\bf delete\_catalog\_backup}. These files
 516 will be automatically generated along with all the other Bacula scripts. The
 517 first script will make an ASCII copy of your Bacula database into {\bf
 518 bacula.sql} in the working directory you specified in your configuration, and
 519 the second will delete the {\bf bacula.sql} file.
 520
 521 The basic sequence of events to make this work correctly is as follows:
 522
 523 \begin{itemize}
 524 \item Run all your nightly backups
 525 \item After running your nightly backups, run a Catalog backup Job
 526 \item The Catalog backup job must be scheduled after your last nightly backup
 527
 528 \item You use {\bf RunBeforeJob} to create the ASCII  backup file and {\bf
 529    RunAfterJob} to clean up
 530 \end{itemize}
 531
 532 Assuming that you start all your nightly backup jobs at 1:05 am (and that they
 533 run one after another), you can do the catalog backup with the following
 534 additional Director configuration statements:
 535
 536 \footnotesize
 537 \begin{verbatim}
 538 # Backup the catalog database (after the nightly save)
 539 Job {
 540   Name = "BackupCatalog"
 541   Type = Backup
 542   Client=rufus-fd
 543   FileSet="Catalog"
 544   Schedule = "WeeklyCycleAfterBackup"
 545   Storage = DLTDrive
 546   Messages = Standard
 547   Pool = Default
 548   RunBeforeJob = "/home/kern/bacula/bin/make_catalog_backup"
 549   RunAfterJob  = "/home/kern/bacula/bin/delete_catalog_backup"
 550   Write Bootstrap = "/home/kern/bacula/working/BackupCatalog.bsr"
 551 }
 552 # This schedule does the catalog. It starts after the WeeklyCycle
 553 Schedule {
 554   Name = "WeeklyCycleAfterBackup
 555   Run = Level=Full sun-sat at 1:10
 556 }
 557 # This is the backup of the catalog
 558 FileSet {
 559   Name = "Catalog"
 560   Include {
 561     Options {
 562       signature=MD5
 563     }
 564     File = \lt{}working_directory\gt{}/bacula.sql
 565   }
 566 }
 567 \end{verbatim}
 568 \normalsize
 569
 570 Be sure to write a bootstrap file as in the above example. However, it is preferable
 571 to write or copy the bootstrap file to another computer. It will allow
 572 you to quickly recover the database backup should that be necessary.  If
 573 you do not have a bootstrap file, it is still possible to recover your
 574 database backup, but it will be more work and take longer.
 575
 576 \label{BackingUPOtherDBs}
 577 \section{Backing Up Third Party Databases}
 578 \index[general]{Backing Up Third Party Databases }
 579 \index[general]{Databases!Backing Up Third Party }
 580
 581 If you are running a database in production mode on your machine, Bacula will
 582 happily backup the files, but if the database is in use while Bacula is
 583 reading it, you may back it up in an unstable state.
 584
 585 The best solution is to shutdown your database before backing it up, or use
 586 some tool specific to your database to make a valid live copy perhaps by
 587 dumping the database in ASCII format. I am not a database expert, so I cannot
 588 provide you advice on how to do this, but if you are unsure about how to
 589 backup your database, you might try visiting the Backup Central site, which
 590 has been renamed Storage Mountain (www.backupcentral.com). In particular,
 591 their
 592 \elink{ Free Backup and Recovery
 593 Software}{http://www.backupcentral.com/toc-free-backup-software.html} page has
 594 links to scripts that show you how to shutdown and backup most major
 595 databases.
 596 \label{Size}
 597
 598 \section{Database Size}
 599 \index[general]{Size!Database }
 600 \index[general]{Database Size }
 601
 602 As mentioned above, if you do not do automatic pruning, your Catalog will grow
 603 each time you run a Job. Normally, you should decide how long you want File
 604 records to be maintained in the Catalog and set the {\bf File Retention}
 605 period to that time. Then you can either wait and see how big your Catalog
 606 gets or make a calculation assuming approximately 154 bytes for each File
 607 saved and knowing the number of Files that are saved during each backup and
 608 the number of Clients you backup.
 609
 610 For example, suppose you do a backup of two systems, each with 100,000 files.
 611 Suppose further that you do a Full backup weekly and an Incremental every day,
 612 and that the Incremental backup typically saves 4,000 files. The size of your
 613 database after a month can roughly be calculated as:
 614
 615 \footnotesize
 616 \begin{verbatim}
 617    Size = 154 * No. Systems * (100,000 * 4 + 10,000 * 26)
 618 \end{verbatim}
 619 \normalsize
 620
 621 where we have assumed four weeks in a month and 26 incremental backups per month.
 622 This would give the following:
 623
 624 \footnotesize
 625 \begin{verbatim}
 626    Size = 154 * 2 * (100,000 * 4 + 10,000 * 26)
 627 or
 628    Size = 308 * (400,000 + 260,000)
 629 or
 630    Size = 203,280,000 bytes
 631 \end{verbatim}
 632 \normalsize
 633
 634 So for the above two systems, we should expect to have a database size of
 635 approximately 200 Megabytes. Of course, this will vary according to how many
 636 files are actually backed up.
 637
 638 Below are some statistics for a MySQL database containing Job records for five
 639 Clients beginning September 2001 through May 2002 (8.5 months) and File
 640 records for the last 80 days. (Older File records have been pruned). For these
 641 systems, only the user files and system files that change are backed up. The
 642 core part of the system is assumed to be easily reloaded from the Red Hat rpms.
 643
 644
 645 In the list below, the files (corresponding to Bacula Tables) with the
 646 extension .MYD contain the data records whereas files with the extension .MYI
 647 contain indexes.
 648
 649 You will note that the File records (containing the file attributes) make up
 650 the large bulk of the number of records as well as the space used (459 Mega
 651 Bytes including the indexes). As a consequence, the most important Retention
 652 period will be the {\bf File Retention} period. A quick calculation shows that
 653 for each File that is saved, the database grows by approximately 150 bytes.
 654
 655 \footnotesize
 656 \begin{verbatim}
 657       Size in
 658        Bytes   Records    File
 659  ============  =========  ===========
 660           168          5  Client.MYD
 661         3,072             Client.MYI
 662   344,394,684  3,080,191  File.MYD
 663   115,280,896             File.MYI
 664     2,590,316    106,902  Filename.MYD
 665     3,026,944             Filename.MYI
 666           184          4  FileSet.MYD
 667         2,048             FileSet.MYI
 668        49,062      1,326  JobMedia.MYD
 669        30,720             JobMedia.MYI
 670       141,752      1,378  Job.MYD
 671        13,312             Job.MYI
 672         1,004         11  Media.MYD
 673         3,072             Media.MYI
 674     1,299,512     22,233  Path.MYD
 675       581,632             Path.MYI
 676            36          1  Pool.MYD
 677         3,072             Pool.MYI
 678             5          1  Version.MYD
 679         1,024             Version.MYI
 680 \end{verbatim}
 681 \normalsize
 682
 683 This database has a total size of approximately 450 Megabytes.
 684
 685 If we were using SQLite, the determination of the total database size would be
 686 much easier since it is a single file, but we would have less insight to the
 687 size of the individual tables as we have in this case.
 688
 689 Note, SQLite databases may be as much as 50\% larger than MySQL databases due
 690 to the fact that all data is stored as ASCII strings. That is even binary
 691 integers are stored as ASCII strings, and this seems to increase the space
 692 needed.