git.sur5r.net Git - cc65/blob - doc/ca65.sgml

   1 <!doctype linuxdoc system>      <!-- -*- text-mode -*- -->
   2
   3 <article>
   4 <title>ca65 Users Guide
   5 <author>Ullrich von Bassewitz, <htmlurl url="mailto:uz@cc65.org" name="uz@cc65.org">
   6 <date>2000-07-19, 2000-11-29, 2001-10-02, 2005-09-08
   7
   8 <abstract>
   9 ca65 is a powerful macro assembler for the 6502, 65C02 and 65816 CPUs. It is
  10 used as a companion assembler for the cc65 crosscompiler, but it may also be
  11 used as a standalone product.
  12 </abstract>
  13
  14 <!-- Table of contents -->
  15 <toc>
  16
  17 <!-- Begin the document -->
  18
  19 <sect>Overview<p>
  20
  21 ca65 is a replacement for the ra65 assembler that was part of the cc65 C
  22 compiler, originally developed by John R. Dunning. I had some problems with
  23 ra65 and the copyright does not permit some things which I wanted to be
  24 possible, so I decided to write a completely new assembler/linker/archiver
  25 suite for the cc65 compiler. ca65 is part of this suite.
  26
  27 Some parts of the assembler (code generation and some routines for symbol
  28 table handling) are taken from an older crossassembler named a816 written
  29 by me a long time ago.
  30
  31
  32 <sect1>Design criteria<p>
  33
  34 Here's a list of the design criteria, that I considered important for the
  35 development:
  36
  37 <itemize>
  38
  39 <item>  The assembler must support macros. Macros are not essential, but they
  40         make some things easier, especially when you use the assembler in the
  41         backend of a compiler.
  42 <item>  The assembler must support the newer 65C02 and 65816 CPUs. I have been
  43         thinking about a 65816 backend for the C compiler, and even my old
  44         a816 assembler had support for these CPUs, so this wasn't really a
  45         problem.
  46 <item>  The assembler must produce relocatable code. This is necessary for the
  47         compiler support, and it is more convenient.
  48 <item>  Conditional assembly must be supported. This is a must for bigger
  49         projects written in assembler (like Elite128).
  50 <item>  The assembler must support segments, and it must support more than
  51         three segments (this is the count, most other assemblers support).
  52         Having more than one code segments helps developing code for systems
  53         with a divided ROM area (like the C64).
  54 <item>  The linker must be able to resolve arbitrary expressions. It should
  55         be able to get things like
  56 <tscreen><verb>
  57         .import S1, S2
  58         .export Special
  59         Special = 2*S1 + S2/7
  60 </verb></tscreen>
  61         right.
  62 <item>  True lexical nesting for symbols. This is very convenient for larger
  63         assembly projects.
  64 <item>  "Cheap" local symbols without lexical nesting for those quick, late
  65         night hacks.
  66 <item>  I liked the idea of "options" as Anre Fachats .o65 format has it, so I
  67         introduced the concept into the object file format use by the new cc65
  68         binutils.
  69 <item>  The assembler will be a one pass assembler. There was no real need for
  70         this decision, but I've written several multipass assemblers, and it
  71         started to get boring. A one pass assembler needs much more elaborated
  72         data structures, and because of that it's much more fun:-)
  73 <item>  Non-GPLed code that may be used in any project without restrictions or
  74         fear of "GPL infecting" other code.
  75 </itemize>
  76 <p>
  77
  78
  79 <sect>Usage<p>
  80
  81
  82 <sect1>Command line option overview<p>
  83
  84 The assembler accepts the following options:
  85
  86 <tscreen><verb>
  87 ---------------------------------------------------------------------------
  88 Usage: ca65 [options] file
  89 Short options:
  90   -D name[=value]               Define a symbol
  91   -I dir                        Set an include directory search path
  92   -U                            Mark unresolved symbols as import
  93   -V                            Print the assembler version
  94   -W n                          Set warning level n
  95   -g                            Add debug info to object file
  96   -h                            Help (this text)
  97   -i                            Ignore case of symbols
  98   -l name                       Create a listing file if assembly was ok
  99   -mm model                     Set the memory model
 100   -o name                       Name the output file
 101   -s                            Enable smart mode
 102   -t sys                        Set the target system
 103   -v                            Increase verbosity
 104
 105 Long options:
 106   --auto-import                 Mark unresolved symbols as import
 107   --bin-include-dir dir         Set a search path for binary includes
 108   --cpu type                    Set cpu type
 109   --create-dep name             Create a make dependency file
 110   --create-full-dep name        Create a full make dependency file
 111   --debug-info                  Add debug info to object file
 112   --feature name                Set an emulation feature
 113   --forget-inc-paths            Forget include search paths
 114   --help                        Help (this text)
 115   --ignore-case                 Ignore case of symbols
 116   --include-dir dir             Set an include directory search path
 117   --large-alignment             Don't warn about large alignments
 118   --listing name                Create a listing file if assembly was ok
 119   --list-bytes n                Maximum number of bytes per listing line
 120   --macpack-dir dir             Set a macro package directory
 121   --memory-model model          Set the memory model
 122   --pagelength n                Set the page length for the listing
 123   --smart                       Enable smart mode
 124   --target sys                  Set the target system
 125   --verbose                     Increase verbosity
 126   --version                     Print the assembler version
 127 ---------------------------------------------------------------------------
 128 </verb></tscreen>
 129
 130
 131 <sect1>Command line options in detail<p>
 132
 133 Here is a description of all the command line options:
 134
 135 <descrip>
 136
 137   <label id="option--bin-include-dir">
 138   <tag><tt>--bin-include-dir dir</tt></tag>
 139
 140   Name a directory which is searched for binary include files. The option
 141   may be used more than once to specify more than one directory to search. The
 142   current directory is always searched first before considering any
 143   additional directories. See also the section about <ref id="search-paths"
 144   name="search paths">.
 145
 146
 147   <label id="option--cpu">
 148   <tag><tt>--cpu type</tt></tag>
 149
 150   Set the default for the CPU type. The option takes a parameter, which
 151   may be one of
 152
 153         6502, 65SC02, 65C02, 65816, sunplus, sweet16, HuC6280
 154
 155   The sunplus cpu is not available in the freeware version, because the
 156   instruction set is "proprietary and confidential".
 157
 158
 159   <label id="option-create-dep">
 160   <tag><tt>--create-dep name</tt></tag>
 161
 162   Tells the assembler to generate a file containing the dependency list for
 163   the assembled module in makefile syntax. The output is written to a file
 164   with the given name. The output does not include files passed via debug
 165   information to the assembler.
 166
 167
 168   <label id="option-create-full-dep">
 169   <tag><tt>--create-full-dep name</tt></tag>
 170
 171   Tells the assembler to generate a file containing the dependency list for
 172   the assembled module in makefile syntax. The output is written to a file
 173   with the given name. The output does include files passed via debug
 174   information to the assembler.
 175
 176
 177   <label id="option--feature">
 178   <tag><tt>--feature name</tt></tag>
 179
 180   Enable an emulation feature. This is identical as using <tt/.FEATURE/
 181   in the source with two exceptions: Feature names must be lower case, and
 182   each feature must be specified by using an extra <tt/--feature/ option,
 183   comma separated lists are not allowed.
 184
 185   See the discussion of the <tt><ref id=".FEATURE" name=".FEATURE"></tt>
 186   command for a list of emulation features.
 187
 188
 189   <label id="option--forget-inc-paths">
 190   <tag><tt>--forget-inc-paths</tt></tag>
 191
 192   Forget the builtin include paths. This is most useful when building
 193   customized assembler modules, in which case the standard header files should
 194   be ignored.
 195
 196
 197   <label id="option-g">
 198   <tag><tt>-g, --debug-info</tt></tag>
 199
 200   When this option (or the equivalent control command <tt/.DEBUGINFO/) is
 201   used, the assembler will add a section to the object file that contains
 202   all symbols (including local ones) together with the symbol values and
 203   source file positions. The linker will put these additional symbols into
 204   the VICE label file, so even local symbols can be seen in the VICE
 205   monitor.
 206
 207
 208   <label id="option-h">
 209   <tag><tt>-h, --help</tt></tag>
 210
 211   Print the short option summary shown above.
 212
 213
 214   <label id="option-i">
 215   <tag><tt>-i, --ignore-case</tt></tag>
 216
 217   This option makes the assembler case insensitive on identifiers and labels.
 218   This option will override the default, but may itself be overridden by the
 219   <tt><ref id=".CASE" name=".CASE"></tt> control command.
 220
 221
 222   <label id="option-l">
 223   <tag><tt>-l name, --listing name</tt></tag>
 224
 225   Generate an assembler listing with the given name. A listing file will
 226   never be generated in case of assembly errors.
 227
 228
 229   <label id="option--large-alignment">
 230   <tag><tt>--large-alignment</tt></tag>
 231
 232   Disable warnings about a large combined alignment. See the discussion of the
 233   <tt><ref id=".ALIGN" name=".ALIGN"></tt> directive for futher information.
 234
 235
 236   <label id="option--list-bytes">
 237   <tag><tt>--list-bytes n</tt></tag>
 238
 239   Set the maximum number of bytes printed in the listing for one line of
 240   input. See the <tt><ref id=".LISTBYTES" name=".LISTBYTES"></tt> directive
 241   for more information. The value zero can be used to encode an unlimited
 242   number of printed bytes.
 243
 244
 245   <label id="option--macpack-dir">
 246   <tag><tt>--macpack-dir dir</tt></tag>
 247
 248   This options allows to specify a directory containing macro files that are
 249   used instead of the builtin images when a <tt><ref id=".MACPACK"
 250   name=".MACPACK"></tt> directive is encountered. If <tt>--macpack-dir</tt>
 251   was specified, a <tt>.mac</tt> extension is added to the package name and
 252   the resulting file is loaded from the given directory. This is most useful
 253   when debugging the builtin macro packages.
 254
 255
 256   <label id="option-mm">
 257   <tag><tt>-mm model, --memory-model model</tt></tag>
 258
 259   Define the default memory model. Possible model specifiers are near, far and
 260   huge.
 261
 262
 263   <label id="option-o">
 264   <tag><tt>-o name</tt></tag>
 265
 266   The default output name is the name of the input file with the extension
 267   replaced by ".o". If you don't like that, you may give another name with
 268   the -o option. The output file will be placed in the same directory as
 269   the source file, or, if -o is given, the full path in this name is used.
 270
 271
 272   <label id="option--pagelength">
 273   <tag><tt>--pagelength n</tt></tag>
 274
 275   sets the length of a listing page in lines. See the <tt><ref
 276   id=".PAGELENGTH" name=".PAGELENGTH"></tt> directive for more information.
 277
 278
 279   <label id="option-s">
 280   <tag><tt>-s, --smart-mode</tt></tag>
 281
 282   In smart mode (enabled by -s or the <tt><ref id=".SMART" name=".SMART"></tt>
 283   pseudo instruction) the assembler will track usage of the <tt/REP/ and
 284   <tt/SEP/ instructions in 65816 mode and update the operand sizes
 285   accordingly. If the operand of such an instruction cannot be evaluated by
 286   the assembler (for example, because the operand is an imported symbol), a
 287   warning is issued.
 288
 289   Beware: Since the assembler cannot trace the execution flow this may
 290   lead to false results in some cases. If in doubt, use the .ixx and .axx
 291   instructions to tell the assembler about the current settings. Smart
 292   mode is off by default.
 293
 294
 295   <label id="option-t">
 296   <tag><tt>-t sys, --target sys</tt></tag>
 297
 298   Set the target system. This will enable translation of character strings and
 299   character constants into the character set of the target platform. The
 300   default for the target system is "none", which means that no translation
 301   will take place. The assembler supports the same target systems as the
 302   compiler, see there for a list.
 303
 304   Depending on the target, the default CPU type is also set. This can be
 305   overriden by using the <tt/<ref id="option--cpu" name="--cpu">/ option.
 306
 307
 308   <label id="option-v">
 309   <tag><tt>-v, --verbose</tt></tag>
 310
 311   Increase the assembler verbosity. Usually only needed for debugging
 312   purposes. You may use this option more than one time for even more
 313   verbose output.
 314
 315
 316   <label id="option-D">
 317   <tag><tt>-D</tt></tag>
 318
 319   This option allows you to define symbols on the command line. Without a
 320   value, the symbol is defined with the value zero. When giving a value,
 321   you may use the '&dollar;' prefix for hexadecimal symbols. Please note
 322   that for some operating systems, '&dollar;' has a special meaning, so
 323   you may have to quote the expression.
 324
 325
 326   <label id="option-I">
 327   <tag><tt>-I dir, --include-dir dir</tt></tag>
 328
 329   Name a directory which is searched for include files. The option may be
 330   used more than once to specify more than one directory to search. The
 331   current directory is always searched first before considering any
 332   additional directories. See also the section about <ref id="search-paths"
 333   name="search paths">.
 334
 335
 336   <label id="option-U">
 337   <tag><tt>-U, --auto-import</tt></tag>
 338
 339   Mark symbols that are not defined in the sources as imported symbols. This
 340   should be used with care since it delays error messages about typos and such
 341   until the linker is run. The compiler uses the equivalent of this switch
 342   (<tt><ref id=".AUTOIMPORT" name=".AUTOIMPORT"></tt>) to enable auto imported
 343   symbols for the runtime library. However, the compiler is supposed to
 344   generate code that runs through the assembler without problems, something
 345   which is not always true for assembler programmers.
 346
 347
 348   <label id="option-V">
 349   <tag><tt>-V, --version</tt></tag>
 350
 351   Print the version number of the assembler. If you send any suggestions
 352   or bugfixes, please include the version number.
 353
 354
 355   <label id="option-W">
 356   <tag><tt>-Wn</tt></tag>
 357
 358   Set the warning level for the assembler. Using -W2 the assembler will
 359   even warn about such things like unused imported symbols. The default
 360   warning level is 1, and it would probably be silly to set it to
 361   something lower.
 362
 363 </descrip>
 364 <p>
 365
 366
 367
 368 <sect>Search paths<label id="search-paths"><p>
 369
 370 Normal include files are searched in the following places:
 371
 372 <enum>
 373 <item>The current directory.
 374 <item>A compiled-in directory, which is often <tt>/usr/lib/cc65/asminc</tt>
 375       on Linux systems.
 376 <item>The value of the environment variable <tt/CA65_INC/ if it is defined.
 377 <item>A subdirectory named <tt/asminc/ of the directory defined in the
 378       environment variable <tt/CC65_HOME/, if it is defined.
 379 <item>Any directory added with the <tt/<ref id="option-I" name="-I">/ option
 380 on the command line.
 381 </enum>
 382
 383 Binary include files are searched in the following places:
 384
 385 <enum>
 386 <item>The current directory.
 387 <item>Any directory added with the <tt/<ref id="option--bin-include-dir"
 388 name="--bin-include-dir">/ option on the command line.
 389 </enum>
 390
 391
 392
 393 <sect>Input format<p>
 394
 395 <sect1>Assembler syntax<p>
 396
 397 The assembler accepts the standard 6502/65816 assembler syntax. One line may
 398 contain a label (which is identified by a colon), and, in addition to the
 399 label, an assembler mnemonic, a macro, or a control command (see section <ref
 400 id="control-commands" name="Control Commands"> for supported control
 401 commands). Alternatively, the line may contain a symbol definition using
 402 the '=' token. Everything after a semicolon is handled as a comment (that is,
 403 it is ignored).
 404
 405 Here are some examples for valid input lines:
 406
 407 <tscreen><verb>
 408         Label:                          ; A label and a comment
 409                 lda     #$20            ; A 6502 instruction plus comment
 410         L1:     ldx     #$20            ; Same with label
 411         L2:     .byte   "Hello world"   ; Label plus control command
 412                 mymac   $20             ; Macro expansion
 413                 MySym = 3*L1            ; Symbol definition
 414         MaSym   = Label                 ; Another symbol
 415 </verb></tscreen>
 416
 417 The assembler accepts
 418
 419 <itemize>
 420 <item>all valid 6502 mnemonics when in 6502 mode (the default or after the
 421       <tt><ref id=".P02" name=".P02"></tt> command was given).
 422 <item>all valid 6502 mnemonics plus a set of illegal instructions when in
 423       <ref id="6502X-mode" name="6502X mode">.
 424 <item>all valid 65SC02 mnemonics when in 65SC02 mode (after the
 425       <tt><ref id=".PSC02" name=".PSC02"></tt> command was given).
 426 <item>all valid 65C02 mnemonics when in 65C02 mode (after the
 427       <tt><ref id=".PC02" name=".PC02"></tt> command was given).
 428 <item>all valid 65618 mnemonics when in 65816 mode (after the
 429       <tt><ref id=".P816" name=".P816"></tt> command was given).
 430 <item>all valid SunPlus mnemonics when in SunPlus mode (after the
 431       <tt><ref id=".SUNPLUS" name=".SUNPLUS"></tt> command was given).
 432 </itemize>
 433
 434
 435 <sect1>65816 mode<p>
 436
 437 In 65816 mode several aliases are accepted in addition to the official
 438 mnemonics:
 439
 440 <tscreen><verb>
 441         BGE is an alias for BCS
 442         BLT is an alias for BCC
 443         CPA is an alias for CMP
 444         DEA is an alias for DEC A
 445         INA is an alias for INC A
 446         SWA is an alias for XBA
 447         TAD is an alias for TCD
 448         TAS is an alias for TCS
 449         TDA is an alias for TDC
 450         TSA is an alias for TSC
 451 </verb></tscreen>
 452
 453
 454
 455 <sect1>6502X mode<label id="6502X-mode"><p>
 456
 457 6502X mode is an extension to the normal 6502 mode. In this mode, several
 458 mnemonics for illegal instructions of the NMOS 6502 CPUs are accepted. Since
 459 these instructions are illegal, there are no official mnemonics for them. The
 460 unofficial ones are taken from <htmlurl
 461 url="http://www.oxyron.de/html/opcodes02.html"
 462 name="http://www.oxyron.de/html/opcodes02.html">. Please note that only the
 463 ones marked as "stable" are supported. The following table uses information
 464 from the mentioned web page, for more information, see there.
 465
 466 <itemize>
 467 <item><tt>ALR: A:=(A and #{imm})/2;</tt>
 468 <item><tt>ANC: A:=A and #{imm};</tt> Generates opcode &dollar;0B.
 469 <item><tt>ARR: A:=(A and #{imm})/2;</tt>
 470 <item><tt>AXS: X:=A and X-#{imm};</tt>
 471 <item><tt>DCP: {adr}:={adr}-1; A-{adr};</tt>
 472 <item><tt>ISC: {adr}:={adr}+1; A:=A-{adr};</tt>
 473 <item><tt>LAS: A,X,S:={adr} and S;</tt>
 474 <item><tt>LAX: A,X:={adr};</tt>
 475 <item><tt>RLA: {adr}:={adr}rol; A:=A and {adr};</tt>
 476 <item><tt>RRA: {adr}:={adr}ror; A:=A adc {adr};</tt>
 477 <item><tt>SAX: {adr}:=A and X;</tt>
 478 <item><tt>SLO: {adr}:={adr}*2; A:=A or {adr};</tt>
 479 <item><tt>SRE: {adr}:={adr}/2; A:=A xor {adr};</tt>
 480 </itemize>
 481
 482
 483
 484 <sect1>sweet16 mode<label id="sweet16-mode"><p>
 485
 486 SWEET 16 is an interpreter for a pseudo 16 bit CPU written by Steve Wozniak
 487 for the Apple ][ machines. It is available in the Apple ][ ROM. ca65 can
 488 generate code for this pseudo CPU when switched into sweet16 mode. The
 489 following is special in sweet16 mode:
 490
 491 <itemize>
 492
 493 <item>The '@' character denotes indirect addressing and is no longer available
 494 for cheap local labels. If you need cheap local labels, you will have to
 495 switch to another lead character using the <tt/<ref id=".LOCALCHAR"
 496 name=".LOCALCHAR">/ command.
 497
 498 <item>Registers are specified using <tt/R0/ .. <tt/R15/. In sweet16 mode,
 499 these identifiers are reserved words.
 500
 501 </itemize>
 502
 503 Please note that the assembler does neither supply the interpreter needed for
 504 SWEET 16 code, nor the zero page locations needed for the SWEET 16 registers,
 505 nor does it call the interpreter. All this must be done by your program. Apple
 506 ][ programmers do probably know how to use sweet16 mode.
 507
 508 For more information about SWEET 16, see
 509 <htmlurl url="http://www.6502.org/source/interpreters/sweet16.htm"
 510 name="http://www.6502.org/source/interpreters/sweet16.htm">.
 511
 512
 513 <sect1>Number format<p>
 514
 515 For literal values, the assembler accepts the widely used number formats: A
 516 preceding '&dollar;' or a trailing 'h' denotes a hex value, a preceding '%'
 517 denotes a binary value, and a bare number is interpreted as a decimal. There
 518 are currently no octal values and no floats.
 519
 520
 521 <sect1>Conditional assembly<p>
 522
 523 Please note that when using the conditional directives (<tt/.IF/ and friends),
 524 the input must consist of valid assembler tokens, even in <tt/.IF/ branches
 525 that are not assembled. The reason for this behaviour is that the assembler
 526 must still be able to detect the ending tokens (like <tt/.ENDIF/), so
 527 conversion of the input stream into tokens still takes place. As a consequence
 528 conditional assembly directives may <bf/not/ be used to prevent normal text
 529 (used as a comment or similar) from being assembled. <p>
 530
 531
 532 <sect>Expressions<p>
 533
 534
 535 <sect1>Expression evaluation<p>
 536
 537 All expressions are evaluated with (at least) 32 bit precision. An
 538 expression may contain constant values and any combination of internal and
 539 external symbols. Expressions that cannot be evaluated at assembly time
 540 are stored inside the object file for evaluation by the linker.
 541 Expressions referencing imported symbols must always be evaluated by the
 542 linker.
 543
 544
 545 <sect1>Size of an expression result<p>
 546
 547 Sometimes, the assembler must know about the size of the value that is the
 548 result of an expression. This is usually the case, if a decision has to be
 549 made, to generate a zero page or an absolute memory references. In this
 550 case, the assembler has to make some assumptions about the result of an
 551 expression:
 552
 553 <itemize>
 554 <item>  If the result of an expression is constant, the actual value is
 555         checked to see if it's a byte sized expression or not.
 556 <item>  If the expression is explicitly casted to a byte sized expression by
 557         one of the '&gt;', '&lt;' or '^' operators, it is a byte expression.
 558 <item>  If this is not the case, and the expression contains a symbol,
 559         explicitly declared as zero page symbol (by one of the .importzp or
 560         .exportzp instructions), then the whole expression is assumed to be
 561         byte sized.
 562 <item>  If the expression contains symbols that are not defined, and these
 563         symbols are local symbols, the enclosing scopes are searched for a
 564         symbol with the same name. If one exists and this symbol is defined,
 565         its attributes are used to determine the result size.
 566 <item>  In all other cases the expression is assumed to be word sized.
 567 </itemize>
 568
 569 Note: If the assembler is not able to evaluate the expression at assembly
 570 time, the linker will evaluate it and check for range errors as soon as
 571 the result is known.
 572
 573
 574 <sect1>Boolean expressions<p>
 575
 576 In the context of a boolean expression, any non zero value is evaluated as
 577 true, any other value to false. The result of a boolean expression is 1 if
 578 it's true, and zero if it's false. There are boolean operators with extreme
 579 low precedence with version 2.x (where x &gt; 0). The <tt/.AND/ and <tt/.OR/
 580 operators are shortcut operators. That is, if the result of the expression is
 581 already known, after evaluating the left hand side, the right hand side is
 582 not evaluated.
 583
 584
 585 <sect1>Constant expressions<p>
 586
 587 Sometimes an expression must evaluate to a constant without looking at any
 588 further input. One such example is the <tt/<ref id=".IF" name=".IF">/ command
 589 that decides if parts of the code are assembled or not. An expression used in
 590 the <tt/.IF/ command cannot reference a symbol defined later, because the
 591 decision about the <tt/.IF/ must be made at the point when it is read. If the
 592 expression used in such a context contains only constant numerical values,
 593 there is no problem. When unresolvable symbols are involved it may get harder
 594 for the assembler to determine if the expression is actually constant, and it
 595 is even possible to create expressions that aren't recognized as constant.
 596 Simplifying the expressions will often help.
 597
 598 In cases where the result of the expression is not needed immediately, the
 599 assembler will delay evaluation until all input is read, at which point all
 600 symbols are known. So using arbitrary complex constant expressions is no
 601 problem in most cases.
 602
 603
 604
 605 <sect1>Available operators<label id="operators"><p>
 606
 607 <table>
 608 <tabular ca="clc">
 609 <bf/Operator/| <bf/Description/| <bf/Precedence/@<hline>
 610 | Built-in string functions| 0@
 611 ||~@
 612 | Built-in pseudo-variables| 1@
 613 | Built-in pseudo-functions| 1@
 614 +| Unary positive| 1@
 615 -| Unary negative| 1@
 616 &tilde;<newline>
 617 .BITNOT| Unary bitwise not| 1@
 618 &lt;<newline>
 619 .LOBYTE| Unary low-byte operator| 1@
 620 &gt;<newline>
 621 .HIBYTE| Unary high-byte operator| 1@
 622 ^<newline>
 623 .BANKBYTE| Unary bank-byte operator| 1@
 624 ||~@
 625 *| Multiplication| 2@
 626 /| Division| 2@
 627 .MOD| Modulo operator| 2@
 628 &amp;<newline>
 629 .BITAND| Bitwise and| 2@
 630 ^<newline>
 631 .BITXOR| Binary bitwise xor| 2@
 632 &lt;&lt;<newline>
 633 .SHL| Shift-left operator| 2@
 634 &gt;&gt;<newline>
 635 .SHR| Shift-right operator| 2@
 636 ||~@
 637 +| Binary addition| 3@
 638 -| Binary subtraction| 3@
 639 &verbar;<newline>
 640 .BITOR| Bitwise or| 3@
 641 ||~@
 642 = | Compare operator (equal)| 4@
 643 &lt;&gt;| Compare operator (not equal)| 4@
 644 &lt;| Compare operator (less)| 4@
 645 &gt;| Compare operator (greater)| 4@
 646 &lt;=| Compare operator (less or equal)| 4@
 647 &gt;=| Compare operator (greater or equal)| 4@
 648 ||~@
 649 &amp;&amp;<newline>
 650 .AND| Boolean and| 5@
 651 .XOR| Boolean xor| 5@
 652 ||~@
 653 &verbar;&verbar;<newline>
 654 .OR| Boolean or| 6@
 655 ||~@
 656 !<newline>
 657 .NOT| Boolean not| 7@<hline>
 658 </tabular>
 659 <caption>Available operators, sorted by precedence
 660 </table>
 661
 662 To force a specific order of evaluation, parentheses may be used, as usual.
 663
 664
 665
 666 <sect>Symbols and labels<p>
 667
 668 A symbol or label is an identifier that starts with a letter and is followed
 669 by letters and digits. Depending on some features enabled (see
 670 <tt><ref id="at_in_identifiers" name="at_in_identifiers"></tt>,
 671 <tt><ref id="dollar_in_identifiers" name="dollar_in_identifiers"></tt> and
 672 <tt><ref id="leading_dot_in_identifiers" name="leading_dot_in_identifiers"></tt>)
 673 other characters may be present. Use of identifiers consisting of a single
 674 character will not work in all cases, because some of these identifiers are
 675 reserved keywords (for example "A" is not a valid identifier for a label,
 676 because it is the keyword for the accumulator).
 677
 678 The assembler allows you to use symbols instead of naked values to make
 679 the source more readable. There are a lot of different ways to define and
 680 use symbols and labels, giving a lot of flexibility.
 681
 682 <sect1>Numeric constants<p>
 683
 684 Numeric constants are defined using the equal sign or the label assignment
 685 operator. After doing
 686
 687 <tscreen><verb>
 688         two = 2
 689 </verb></tscreen>
 690
 691 may use the symbol "two" in every place where a number is expected, and it is
 692 evaluated to the value 2 in this context. The label assignment operator is
 693 almost identical, but causes the symbol to be marked as a label, so it may be
 694 handled differently in a debugger:
 695
 696 <tscreen><verb>
 697         io := $d000
 698 </verb></tscreen>
 699
 700 The right side can of course be an expression:
 701
 702 <tscreen><verb>
 703         four = two * two
 704 </verb></tscreen>
 705
 706
 707 <label id="variables">
 708 <sect1>Numeric variables<p>
 709
 710 Within macros and other control structures (<tt><ref id=".REPEAT"
 711 name=".REPEAT"></tt>, ...) it is sometimes useful to have some sort of
 712 variable. This can be achieved by the <tt>.SET</tt> operator. It creates a
 713 symbol that may get assigned a different value later:
 714
 715 <tscreen><verb>
 716         four .set 4
 717         lda     #four           ; Loads 4 into A
 718         four .set 3
 719         lda     #four           ; Loads 3 into A
 720 </verb></tscreen>
 721
 722 Since the value of the symbol can change later, it must be possible to
 723 evaluate it when used (no delayed evaluation as with normal symbols). So the
 724 expression used as the value must be constant.
 725
 726 Following is an example for a macro that generates a different label each time
 727 it is used. It uses the <tt><ref id=".SPRINTF" name=".SPRINTF"></tt> function
 728 and a numeric variable named <tt>lcount</tt>.
 729
 730 <tscreen><verb>
 731         .lcount .set 0          ; Initialize the counter
 732
 733         .macro  genlab
 734                 .ident (.sprintf ("L%04X", lcount)):
 735                 lcount .set lcount + 1
 736         .endmacro
 737 </verb></tscreen>
 738
 739
 740 <sect1>Standard labels<p>
 741
 742 A label is defined by writing the name of the label at the start of the line
 743 (before any instruction mnemonic, macro or pseudo directive), followed by a
 744 colon. This will declare a symbol with the given name and the value of the
 745 current program counter.
 746
 747
 748 <sect1>Local labels and symbols<p>
 749
 750 Using the <tt><ref id=".PROC" name=".PROC"></tt> directive, it is possible to
 751 create regions of code where the names of labels and symbols are local to this
 752 region. They are not known outside of this region and cannot be accessed from
 753 there. Such regions may be nested like PROCEDUREs in Pascal.
 754
 755 See the description of the <tt><ref id=".PROC" name=".PROC"></tt>
 756 directive for more information.
 757
 758
 759 <sect1>Cheap local labels<p>
 760
 761 Cheap local labels are defined like standard labels, but the name of the
 762 label must begin with a special symbol (usually '@', but this can be
 763 changed by the <tt><ref id=".LOCALCHAR" name=".LOCALCHAR"></tt>
 764 directive).
 765
 766 Cheap local labels are visible only between two non cheap labels. As soon as a
 767 standard symbol is encountered (this may also be a local symbol if inside a
 768 region defined with the <tt><ref id=".PROC" name=".PROC"></tt> directive), the
 769 cheap local symbol goes out of scope.
 770
 771 You may use cheap local labels as an easy way to reuse common label
 772 names like "Loop". Here is an example:
 773
 774 <tscreen><verb>
 775         Clear:  lda    #$00             ; Global label
 776                 ldy    #$20
 777         @Loop:  sta    Mem,y            ; Local label
 778                 dey
 779                 bne    @Loop            ; Ok
 780                 rts
 781         Sub:    ...                     ; New global label
 782                 bne    @Loop            ; ERROR: Unknown identifier!
 783 </verb></tscreen>
 784
 785 <sect1>Unnamed labels<p>
 786
 787 If you really want to write messy code, there are also unnamed labels. These
 788 labels do not have a name (you guessed that already, didn't you?). A colon is
 789 used to mark the absence of the name.
 790
 791 Unnamed labels may be accessed by using the colon plus several minus or plus
 792 characters as a label designator. Using the '-' characters will create a back
 793 reference (use the n'th label backwards), using '+' will create a forward
 794 reference (use the n'th label in forward direction). An example will help to
 795 understand this:
 796
 797 <tscreen><verb>
 798         :       lda     (ptr1),y        ; #1
 799                 cmp     (ptr2),y
 800                 bne     :+              ; -> #2
 801                 tax
 802                 beq     :+++            ; -> #4
 803                 iny
 804                 bne     :-              ; -> #1
 805                 inc     ptr1+1
 806                 inc     ptr2+1
 807                 bne     :-              ; -> #1
 808
 809         :       bcs     :+              ; #2 -> #3
 810                 ldx     #$FF
 811                 rts
 812
 813         :       ldx     #$01            ; #3
 814         :       rts                     ; #4
 815 </verb></tscreen>
 816
 817 As you can see from the example, unnamed labels will make even short
 818 sections of code hard to understand, because you have to count labels
 819 to find branch targets (this is the reason why I for my part do
 820 prefer the "cheap" local labels). Nevertheless, unnamed labels are
 821 convenient in some situations, so it's your decision.
 822
 823
 824 <sect1>Using macros to define labels and constants<p>
 825
 826 While there are drawbacks with this approach, it may be handy in a few rare
 827 situations. Using <tt><ref id=".DEFINE" name=".DEFINE"></tt>, it is possible
 828 to define symbols or constants that may be used elsewhere. One of the
 829 advantages is that you can use it to define string constants (this is not
 830 possible with the other symbol types).
 831
 832 Please note: <tt/.DEFINE/ style macros do token replacements on a low level,
 833 so the names do not adhere to scoping, diagnostics may be misleading, there
 834 are no symbols to look up in the map file, and there is no debug info.
 835 Especially the first problem in the list can lead to very nasty programming
 836 errors. Because of these problems, the general advice is, <bf/NOT/ do use
 837 <tt/.DEFINE/ if you don't have to.
 838
 839 Example:
 840
 841 <tscreen><verb>
 842         .DEFINE two     2
 843         .DEFINE version "SOS V2.3"
 844
 845         four = two * two        ; Ok
 846         .byte   version         ; Ok
 847
 848         .PROC                   ; Start local scope
 849         two = 3                 ; Will give "2 = 3" - invalid!
 850         .ENDPROC
 851 </verb></tscreen>
 852
 853
 854 <sect1>Symbols and <tt>.DEBUGINFO</tt><p>
 855
 856 If <tt><ref id=".DEBUGINFO" name=".DEBUGINFO"></tt> is enabled (or <ref
 857 id="option-g" name="-g"> is given on the command line), global, local and
 858 cheap local labels are written to the object file and will be available in the
 859 symbol file via the linker. Unnamed labels are not written to the object file,
 860 because they don't have a name which would allow to access them.
 861
 862
 863
 864 <sect>Scopes<label id="scopes"><p>
 865
 866 ca65 implements several sorts of scopes for symbols.
 867
 868 <sect1>Global scope<p>
 869
 870 All (non cheap local) symbols that are declared outside of any nested scopes
 871 are in global scope.
 872
 873
 874 <sect1>Cheap locals<p>
 875
 876 A special scope is the scope for cheap local symbols. It lasts from one non
 877 local symbol to the next one, without any provisions made by the programmer.
 878 All other scopes differ in usage but use the same concept internally.
 879
 880
 881 <sect1>Generic nested scopes<p>
 882
 883 A nested scoped for generic use is started with <tt/<ref id=".SCOPE"
 884 name=".SCOPE">/ and closed with <tt/<ref id=".ENDSCOPE" name=".ENDSCOPE">/.
 885 The scope can have a name, in which case it is accessible from the outside by
 886 using <ref id="scopesyntax" name="explicit scopes">. If the scope does not
 887 have a name, all symbols created within the scope are local to the scope, and
 888 aren't accessible from the outside.
 889
 890 A nested scope can access symbols from the local or from enclosing scopes by
 891 name without using explicit scope names. In some cases there may be
 892 ambiguities, for example if there is a reference to a local symbol that is not
 893 yet defined, but a symbol with the same name exists in outer scopes:
 894
 895 <tscreen><verb>
 896         .scope  outer
 897                 foo     = 2
 898                 .scope  inner
 899                         lda     #foo
 900                         foo     = 3
 901                 .endscope
 902         .endscope
 903 </verb></tscreen>
 904
 905 In the example above, the <tt/lda/ instruction will load the value 3 into the
 906 accumulator, because <tt/foo/ is redefined in the scope. However:
 907
 908 <tscreen><verb>
 909         .scope  outer
 910                 foo     = $1234
 911                 .scope  inner
 912                         lda     foo,x
 913                         foo     = $12
 914                 .endscope
 915         .endscope
 916 </verb></tscreen>
 917
 918 Here, <tt/lda/ will still load from <tt/$12,x/, but since it is unknown to the
 919 assembler that <tt/foo/ is a zeropage symbol when translating the instruction,
 920 absolute mode is used instead. In fact, the assembler will not use absolute
 921 mode by default, but it will search through the enclosing scopes for a symbol
 922 with the given name. If one is found, the address size of this symbol is used.
 923 This may lead to errors:
 924
 925 <tscreen><verb>
 926         .scope  outer
 927                 foo     = $12
 928                 .scope  inner
 929                         lda     foo,x
 930                         foo     = $1234
 931                 .endscope
 932         .endscope
 933 </verb></tscreen>
 934
 935 In this case, when the assembler sees the symbol <tt/foo/ in the <tt/lda/
 936 instruction, it will search for an already defined symbol <tt/foo/. It will
 937 find <tt/foo/ in scope <tt/outer/, and a close look reveals that it is a
 938 zeropage symbol. So the assembler will use zeropage addressing mode. If
 939 <tt/foo/ is redefined later in scope <tt/inner/, the assembler tries to change
 940 the address in the <tt/lda/ instruction already translated, but since the new
 941 value needs absolute addressing mode, this fails, and an error message "Range
 942 error" is output.
 943
 944 Of course the most simple solution for the problem is to move the definition
 945 of <tt/foo/ in scope <tt/inner/ upwards, so it precedes its use. There may be
 946 rare cases when this cannot be done. In these cases, you can use one of the
 947 address size override operators:
 948
 949 <tscreen><verb>
 950         .scope  outer
 951                 foo     = $12
 952                 .scope  inner
 953                         lda     a:foo,x
 954                         foo     = $1234
 955                 .endscope
 956         .endscope
 957 </verb></tscreen>
 958
 959 This will cause the <tt/lda/ instruction to be translated using absolute
 960 addressing mode, which means changing the symbol reference later does not
 961 cause any errors.
 962
 963
 964 <sect1>Nested procedures<p>
 965
 966 A nested procedure is created by use of <tt/<ref id=".PROC" name=".PROC">/. It
 967 differs from a <tt/<ref id=".SCOPE" name=".SCOPE">/ in that it must have a
 968 name, and a it will introduce a symbol with this name in the enclosing scope.
 969 So
 970
 971 <tscreen><verb>
 972         .proc   foo
 973                 ...
 974         .endproc
 975 </verb></tscreen>
 976
 977 is actually the same as
 978
 979 <tscreen><verb>
 980         foo:
 981         .scope  foo
 982                 ...
 983         .endscope
 984 </verb></tscreen>
 985
 986 This is the reason why a procedure must have a name. If you want a scope
 987 without a name, use <tt/<ref id=".SCOPE" name=".SCOPE">/.
 988
 989 <bf/Note:/ As you can see from the example above, scopes and symbols live in
 990 different namespaces. There can be a symbol named <tt/foo/ and a scope named
 991 <tt/foo/ without any conflicts (but see the section titled <ref
 992 id="scopesearch" name="&quot;Scope search order&quot;">).
 993
 994
 995 <sect1>Structs, unions and enums<p>
 996
 997 Structs, unions and enums are explained in a <ref id="structs" name="separate
 998 section">, I do only cover them here, because if they are declared with a
 999 name, they open a nested scope, similar to <tt/<ref id=".SCOPE"
1000 name=".SCOPE">/. However, when no name is specified, the behaviour is
1001 different: In this case, no new scope will be opened, symbols declared within
1002 a struct, union, or enum declaration will then be added to the enclosing scope
1003 instead.
1004
1005
1006 <sect1>Explicit scope specification<label id="scopesyntax"><p>
1007
1008 Accessing symbols from other scopes is possible by using an explicit scope
1009 specification, provided that the scope where the symbol lives in has a name.
1010 The namespace token (<tt/::/) is used to access other scopes:
1011
1012 <tscreen><verb>
1013         .scope  foo
1014         bar:    .word   0
1015         .endscope
1016
1017                 ...
1018                 lda     foo::bar        ; Access foo in scope bar
1019 </verb></tscreen>
1020
1021 The only way to deny access to a scope from the outside is to declare a scope
1022 without a name (using the <tt/<ref id=".SCOPE" name=".SCOPE">/ command).
1023
1024 A special syntax is used to specify the global scope: If a symbol or scope is
1025 preceded by the namespace token, the global scope is searched:
1026
1027 <tscreen><verb>
1028         bar     = 3
1029
1030         .scope  foo
1031                 bar     = 2
1032                 lda     #::bar  ; Access the global bar (which is 3)
1033         .endscope
1034 </verb></tscreen>
1035
1036
1037 <sect1>Scope search order<label id="scopesearch"><p>
1038
1039 The assembler searches for a scope in a similar way as for a symbol. First, it
1040 looks in the current scope, and then it walks up the enclosing scopes until
1041 the scope is found.
1042
1043 However, one important thing to note when using explicit scope syntax is, that
1044 a symbol may be accessed before it is defined, but a scope may <bf/not/ be
1045 used without a preceding definition. This means that in the following
1046 example:
1047
1048 <tscreen><verb>
1049         .scope  foo
1050                 bar     = 3
1051         .endscope
1052
1053         .scope  outer
1054                 lda     #foo::bar  ; Will load 3, not 2!
1055                 .scope  foo
1056                         bar     = 2
1057                 .endscope
1058         .endscope
1059 </verb></tscreen>
1060
1061 the reference to the scope <tt/foo/ will use the global scope, and not the
1062 local one, because the local one is not visible at the point where it is
1063 referenced.
1064
1065 Things get more complex if a complete chain of scopes is specified:
1066
1067 <tscreen><verb>
1068         .scope  foo
1069                 .scope  outer
1070                         .scope  inner
1071                                 bar = 1
1072                         .endscope
1073                 .endscope
1074                 .scope  another
1075                         .scope  nested
1076                                 lda     #outer::inner::bar      ; 1
1077                         .endscope
1078                 .endscope
1079         .endscope
1080
1081         .scope  outer
1082                 .scope  inner
1083                         bar = 2
1084                 .endscope
1085         .endscope
1086 </verb></tscreen>
1087
1088 When <tt/outer::inner::bar/ is referenced in the <tt/lda/ instruction, the
1089 assembler will first search in the local scope for a scope named <tt/outer/.
1090 Since none is found, the enclosing scope (<tt/another/) is checked. There is
1091 still no scope named <tt/outer/, so scope <tt/foo/ is checked, and finally
1092 scope <tt/outer/ is found. Within this scope, <tt/inner/ is searched, and in
1093 this scope, the assembler looks for a symbol named <tt/bar/.
1094
1095 Please note that once the anchor scope is found, all following scopes
1096 (<tt/inner/ in this case) are expected to be found exactly in this scope. The
1097 assembler will search the scope tree only for the first scope (if it is not
1098 anchored in the root scope). Starting from there on, there is no flexibility,
1099 so if the scope named <tt/outer/ found by the assembler does not contain a
1100 scope named <tt/inner/, this would be an error, even if such a pair does exist
1101 (one level up in global scope).
1102
1103 Ambiguities that may be introduced by this search algorithm may be removed by
1104 anchoring the scope specification in the global scope. In the example above,
1105 if you want to access the "other" symbol <tt/bar/, you would have to write:
1106
1107 <tscreen><verb>
1108         .scope  foo
1109                 .scope  outer
1110                         .scope  inner
1111                                 bar = 1
1112                         .endscope
1113                 .endscope
1114                 .scope  another
1115                         .scope  nested
1116                                 lda     #::outer::inner::bar    ; 2
1117                         .endscope
1118                 .endscope
1119         .endscope
1120
1121         .scope  outer
1122                 .scope  inner
1123                         bar = 2
1124                 .endscope
1125         .endscope
1126 </verb></tscreen>
1127
1128
1129 <sect>Address sizes and memory models<label id="address-sizes"><p>
1130
1131 <sect1>Address sizes<p>
1132
1133 ca65 assigns each segment and each symbol an address size. This is true, even
1134 if the symbol is not used as an address. You may also think of a value range
1135 of the symbol instead of an address size.
1136
1137 Possible address sizes are:
1138
1139 <itemize>
1140 <item>Zeropage or direct (8 bits)
1141 <item>Absolute (16 bits)
1142 <item>Far (24 bits)
1143 <item>Long (32 bits)
1144 </itemize>
1145
1146 Since the assembler uses default address sizes for the segments and symbols,
1147 it is usually not necessary to override the default behaviour. In cases, where
1148 it is necessary, the following keywords may be used to specify address sizes:
1149
1150 <itemize>
1151 <item>DIRECT, ZEROPAGE or ZP for zeropage addressing (8 bits).
1152 <item>ABSOLUTE, ABS or NEAR for absolute addressing (16 bits).
1153 <item>FAR for far addressing (24 bits).
1154 <item>LONG or DWORD for long addressing (32 bits).
1155 </itemize>
1156
1157
1158 <sect1>Address sizes of segments<p>
1159
1160 The assembler assigns an address size to each segment. Since the
1161 representation of a label within this segment is "segment start + offset",
1162 labels will inherit the address size of the segment they are declared in.
1163
1164 The address size of a segment may be changed, by using an optional address
1165 size modifier. See the <tt/<ref id=".SEGMENT" name="segment directive">/ for
1166 an explanation on how this is done.
1167
1168
1169 <sect1>Address sizes of symbols<p>
1170
1171
1172
1173
1174 <sect1>Memory models<p>
1175
1176 The default address size of a segment depends on the memory model used. Since
1177 labels inherit the address size from the segment they are declared in,
1178 changing the memory model is an easy way to change the address size of many
1179 symbols at once.
1180
1181
1182
1183
1184 <sect>Pseudo variables<label id="pseudo-variables"><p>
1185
1186 Pseudo variables are readable in all cases, and in some special cases also
1187 writable.
1188
1189 <sect1><tt>*</tt><p>
1190
1191   Reading this pseudo variable will return the program counter at the start
1192   of the current input line.
1193
1194   Assignment to this variable is possible when <tt/<ref id=".FEATURE"
1195   name=".FEATURE pc_assignment">/ is used. Note: You should not use
1196   assignments to <tt/*/, use <tt/<ref id=".ORG" name=".ORG">/ instead.
1197
1198
1199 <sect1><tt>.CPU</tt><label id=".CPU"><p>
1200
1201   Reading this pseudo variable will give a constant integer value that
1202   tells which CPU is currently enabled. It can also tell which instruction
1203   set the CPU is able to translate. The value read from the pseudo variable
1204   should be further examined by using one of the constants defined by the
1205   "cpu" macro package (see <tt/<ref id=".MACPACK" name=".MACPACK">/).
1206
1207   It may be used to replace the .IFPxx pseudo instructions or to construct
1208   even more complex expressions.
1209
1210   Example:
1211
1212   <tscreen><verb>
1213         .macpack        cpu
1214         .if     (.cpu .bitand CPU_ISET_65816)
1215                 phx
1216                 phy
1217         .else
1218                 txa
1219                 pha
1220                 tya
1221                 pha
1222         .endif
1223   </verb></tscreen>
1224
1225
1226 <sect1><tt>.PARAMCOUNT</tt><label id=".PARAMCOUNT"><p>
1227
1228   This builtin pseudo variable is only available in macros. It is replaced by
1229   the actual number of parameters that were given in the macro invocation.
1230
1231   Example:
1232
1233   <tscreen><verb>
1234         .macro  foo     arg1, arg2, arg3
1235         .if     .paramcount <> 3
1236         .error  "Too few parameters for macro foo"
1237         .endif
1238         ...
1239         .endmacro
1240   </verb></tscreen>
1241
1242   See section <ref id="macros" name="Macros">.
1243
1244
1245 <sect1><tt>.TIME</tt><label id=".TIME"><p>
1246
1247   Reading this pseudo variable will give a constant integer value that
1248   represents the current time in POSIX standard (as seconds since the
1249   Epoch).
1250
1251   It may be used to encode the time of translation somewhere in the created
1252   code.
1253
1254   Example:
1255
1256   <tscreen><verb>
1257         .dword  .time   ; Place time here
1258   </verb></tscreen>
1259
1260
1261 <sect1><tt>.VERSION</tt><label id=".VERSION"><p>
1262
1263   Reading this pseudo variable will give the assembler version according to
1264   the following formula:
1265
1266         VER_MAJOR*$100 + VER_MINOR*$10 + VER_PATCH
1267
1268   It may be used to encode the assembler version or check the assembler for
1269   special features not available with older versions.
1270
1271   Example:
1272
1273   Version 2.11.1 of the assembler will return $2B1 as numerical constant when
1274   reading the pseudo variable <tt/.VERSION/.
1275
1276
1277
1278 <sect>Pseudo functions<label id="pseudo-functions"><p>
1279
1280 Pseudo functions expect their arguments in parenthesis, and they have a result,
1281 either a string or an expression.
1282
1283
1284 <sect1><tt>.BANK</tt><label id=".BANK"><p>
1285
1286   The <tt/.BANK/ function is used to support systems with banked memory. The
1287   argument is an expression with exactly one segment reference - usually a
1288   label. The function result is the value of the <tt/bank/ attribute assigned
1289   to the run memory area of the segment. Please see the linker documentation
1290   for more information about memory areas and their attributes.
1291
1292   The value of <tt/.BANK/ can be used to switch memory so that a memory bank
1293   containing specific data is available.
1294
1295   The <tt/bank/ attribute is a 32 bit integer and so is the result of the
1296   <tt/.BANK/ function. You will have to use <tt><ref id=".LOBYTE"
1297   name=".LOBYTE"></tt> or similar functions to address just part of it.
1298
1299   Please note that <tt/.BANK/ will always get evaluated in the link stage, so
1300   an expression containing <tt/.BANK/ can never be used where a constant known
1301   result is expected (for example with <tt/.RES/).
1302
1303   Example:
1304
1305   <tscreen><verb>
1306         .segment "BANK1"
1307         .proc   banked_func_1
1308                 ...
1309         .endproc
1310
1311         .segment "BANK2"
1312         .proc   banked_func_2
1313                 ...
1314         .endproc
1315
1316         .proc   bank_table
1317                 .addr   banked_func_1
1318                 .byte   <.BANK (banked_func_1)
1319
1320                 .addr   banked_func_2
1321                 .byte   <.BANK (banked_func_2)
1322         .endproc
1323   </verb></tscreen>
1324
1325
1326
1327 <sect1><tt>.BANKBYTE</tt><label id=".BANKBYTE"><p>
1328
1329   The function returns the bank byte (that is, bits 16-23) of its argument.
1330   It works identical to the '^' operator.
1331
1332   See: <tt><ref id=".HIBYTE" name=".HIBYTE"></tt>,
1333        <tt><ref id=".LOBYTE" name=".LOBYTE"></tt>
1334
1335
1336 <sect1><tt>.BLANK</tt><label id=".BLANK"><p>
1337
1338   Builtin function. The function evaluates its argument in braces and yields
1339   "false" if the argument is non blank (there is an argument), and "true" if
1340   there is no argument.  The token list that makes up the function argument
1341   may optionally be enclosed in curly braces. This allows the inclusion of
1342   tokens that would otherwise terminate the list (the closing right
1343   parenthesis). The curly braces are not considered part of the list, a list
1344   just consisting of curly braces is considered to be empty.
1345
1346   As an example, the <tt/.IFBLANK/ statement may be replaced by
1347
1348   <tscreen><verb>
1349         .if     .blank({arg})
1350   </verb></tscreen>
1351
1352
1353
1354 <sect1><tt>.CONCAT</tt><label id=".CONCAT"><p>
1355
1356   Builtin string function. The function allows to concatenate a list of string
1357   constants separated by commas. The result is a string constant that is the
1358   concatenation of all arguments. This function is most useful in macros and
1359   when used together with the <tt/.STRING/ builtin function. The function may
1360   be used in any case where a string constant is expected.
1361
1362   Example:
1363
1364   <tscreen><verb>
1365         .include        .concat ("myheader", ".", "inc")
1366   </verb></tscreen>
1367
1368   This is the same as the command
1369
1370   <tscreen><verb>
1371         .include        "myheader.inc"
1372   </verb></tscreen>
1373
1374
1375 <sect1><tt>.CONST</tt><label id=".CONST"><p>
1376
1377   Builtin function. The function evaluates its argument in braces and
1378   yields "true" if the argument is a constant expression (that is, an
1379   expression that yields a constant value at assembly time) and "false"
1380   otherwise. As an example, the .IFCONST statement may be replaced by
1381
1382   <tscreen><verb>
1383         .if     .const(a + 3)
1384   </verb></tscreen>
1385
1386
1387 <sect1><tt>.HIBYTE</tt><label id=".HIBYTE"><p>
1388
1389   The function returns the high byte (that is, bits 8-15) of its argument.
1390   It works identical to the '>' operator.
1391
1392   See: <tt><ref id=".LOBYTE" name=".LOBYTE"></tt>,
1393        <tt><ref id=".BANKBYTE" name=".BANKBYTE"></tt>
1394
1395
1396 <sect1><tt>.HIWORD</tt><label id=".HIWORD"><p>
1397
1398   The function returns the high word (that is, bits 16-31) of its argument.
1399
1400   See: <tt><ref id=".LOWORD" name=".LOWORD"></tt>
1401
1402
1403 <sect1><tt>.IDENT</tt><label id=".IDENT"><p>
1404
1405   The function expects a string as its argument, and converts this argument
1406   into an identifier. If the string starts with the current <tt/<ref
1407   id=".LOCALCHAR" name=".LOCALCHAR">/, it will be converted into a cheap local
1408   identifier, otherwise it will be converted into a normal identifier.
1409
1410   Example:
1411
1412   <tscreen><verb>
1413         .macro  makelabel       arg1, arg2
1414                 .ident (.concat (arg1, arg2)):
1415         .endmacro
1416
1417                 makelabel       "foo", "bar"
1418
1419                 .word           foobar          ; Valid label
1420   </verb></tscreen>
1421
1422
1423 <sect1><tt>.LEFT</tt><label id=".LEFT"><p>
1424
1425   Builtin function. Extracts the left part of a given token list.
1426
1427   Syntax:
1428
1429   <tscreen><verb>
1430         .LEFT (&lt;int expr&gt;, &lt;token list&gt;)
1431   </verb></tscreen>
1432
1433   The first integer expression gives the number of tokens to extract from
1434   the token list. The second argument is the token list itself. The token
1435   list may optionally be enclosed into curly braces. This allows the
1436   inclusion of tokens that would otherwise terminate the list (the closing
1437   right paren in the given case).
1438
1439   Example:
1440
1441   To check in a macro if the given argument has a '#' as first token
1442   (immediate addressing mode), use something like this:
1443
1444   <tscreen><verb>
1445         .macro  ldax    arg
1446                 ...
1447                 .if (.match (.left (1, {arg}), #))
1448
1449                 ; ldax called with immediate operand
1450                 ...
1451
1452                 .endif
1453                 ...
1454         .endmacro
1455   </verb></tscreen>
1456
1457   See also the <tt><ref id=".MID" name=".MID"></tt> and <tt><ref id=".RIGHT"
1458   name=".RIGHT"></tt> builtin functions.
1459
1460
1461 <sect1><tt>.LOBYTE</tt><label id=".LOBYTE"><p>
1462
1463   The function returns the low byte (that is, bits 0-7) of its argument.
1464   It works identical to the '<' operator.
1465
1466   See: <tt><ref id=".HIBYTE" name=".HIBYTE"></tt>,
1467        <tt><ref id=".BANKBYTE" name=".BANKBYTE"></tt>
1468
1469
1470 <sect1><tt>.LOWORD</tt><label id=".LOWORD"><p>
1471
1472   The function returns the low word (that is, bits 0-15) of its argument.
1473
1474   See: <tt><ref id=".HIWORD" name=".HIWORD"></tt>
1475
1476
1477 <sect1><tt>.MATCH</tt><label id=".MATCH"><p>
1478
1479   Builtin function. Matches two token lists against each other. This is
1480   most useful within macros, since macros are not stored as strings, but
1481   as lists of tokens.
1482
1483   The syntax is
1484
1485   <tscreen><verb>
1486         .MATCH(&lt;token list #1&gt;, &lt;token list #2&gt;)
1487   </verb></tscreen>
1488
1489   Both token list may contain arbitrary tokens with the exception of the
1490   terminator token (comma resp. right parenthesis) and
1491
1492   <itemize>
1493   <item>end-of-line
1494   <item>end-of-file
1495   </itemize>
1496
1497   The token lists may optionally be enclosed into curly braces. This allows
1498   the inclusion of tokens that would otherwise terminate the list (the closing
1499   right paren in the given case). Often a macro parameter is used for any of
1500   the token lists.
1501
1502   Please note that the function does only compare tokens, not token
1503   attributes. So any number is equal to any other number, regardless of the
1504   actual value. The same is true for strings. If you need to compare tokens
1505   <em/and/ token attributes, use the <tt><ref id=".XMATCH"
1506   name=".XMATCH"></tt> function.
1507
1508   Example:
1509
1510   Assume the macro <tt/ASR/, that will shift right the accumulator by one,
1511   while honoring the sign bit. The builtin processor instructions will allow
1512   an optional "A" for accu addressing for instructions like <tt/ROL/ and
1513   <tt/ROR/. We will use the <tt><ref id=".MATCH" name=".MATCH"></tt> function
1514   to check for this and print and error for invalid calls.
1515
1516   <tscreen><verb>
1517         .macro  asr     arg
1518
1519                 .if (.not .blank(arg)) .and (.not .match ({arg}, a))
1520                 .error "Syntax error"
1521                 .endif
1522
1523                 cmp     #$80            ; Bit 7 into carry
1524                 lsr     a               ; Shift carry into bit 7
1525
1526         .endmacro
1527   </verb></tscreen>
1528
1529   The macro will only accept no arguments, or one argument that must be the
1530   reserved keyword "A".
1531
1532   See: <tt><ref id=".XMATCH" name=".XMATCH"></tt>
1533
1534
1535 <sect1><tt>.MAX</tt><label id=".MAX"><p>
1536
1537   Builtin function. The result is the larger of two values.
1538
1539   The syntax is
1540
1541   <tscreen><verb>
1542         .MAX (&lt;value #1&gt;, &lt;value #2&gt;)
1543   </verb></tscreen>
1544
1545   Example:
1546
1547   <tscreen><verb>
1548         ; Reserve space for the larger of two data blocks
1549         savearea:       .max (.sizeof (foo), .sizeof (bar))
1550   </verb></tscreen>
1551
1552   See: <tt><ref id=".MIN" name=".MIN"></tt>
1553
1554
1555 <sect1><tt>.MID</tt><label id=".MID"><p>
1556
1557   Builtin function. Takes a starting index, a count and a token list as
1558   arguments. Will return part of the token list.
1559
1560   Syntax:
1561
1562   <tscreen><verb>
1563         .MID (&lt;int expr&gt;, &lt;int expr&gt;, &lt;token list&gt;)
1564   </verb></tscreen>
1565
1566   The first integer expression gives the starting token in the list (the first
1567   token has index 0). The second integer expression gives the number of tokens
1568   to extract from the token list. The third argument is the token list itself.
1569   The token list may optionally be enclosed into curly braces. This allows the
1570   inclusion of tokens that would otherwise terminate the list (the closing
1571   right paren in the given case).
1572
1573   Example:
1574
1575   To check in a macro if the given argument has a '<tt/#/' as first token
1576   (immediate addressing mode), use something like this:
1577
1578     <tscreen><verb>
1579         .macro  ldax    arg
1580                 ...
1581                 .if (.match (.mid (0, 1, {arg}), #))
1582
1583                 ; ldax called with immediate operand
1584                 ...
1585
1586                 .endif
1587                 ...
1588         .endmacro
1589   </verb></tscreen>
1590
1591   See also the <tt><ref id=".LEFT" name=".LEFT"></tt> and <tt><ref id=".RIGHT"
1592   name=".RIGHT"></tt> builtin functions.
1593
1594
1595 <sect1><tt>.MIN</tt><label id=".MIN"><p>
1596
1597   Builtin function. The result is the smaller of two values.
1598
1599   The syntax is
1600
1601   <tscreen><verb>
1602         .MIN (&lt;value #1&gt;, &lt;value #2&gt;)
1603   </verb></tscreen>
1604
1605   Example:
1606
1607   <tscreen><verb>
1608         ; Reserve space for some data, but 256 bytes minimum
1609         savearea:       .min (.sizeof (foo), 256)
1610   </verb></tscreen>
1611
1612   See: <tt><ref id=".MAX" name=".MAX"></tt>
1613
1614
1615 <sect1><tt>.REF, .REFERENCED</tt><label id=".REFERENCED"><p>
1616
1617   Builtin function. The function expects an identifier as argument in braces.
1618   The argument is evaluated, and the function yields "true" if the identifier
1619   is a symbol that has already been referenced somewhere in the source file up
1620   to the current position. Otherwise the function yields false. As an example,
1621   the <tt><ref id=".IFREF" name=".IFREF"></tt> statement may be replaced by
1622
1623   <tscreen><verb>
1624         .if     .referenced(a)
1625   </verb></tscreen>
1626
1627   See: <tt><ref id=".DEFINED" name=".DEFINED"></tt>
1628
1629
1630 <sect1><tt>.RIGHT</tt><label id=".RIGHT"><p>
1631
1632   Builtin function. Extracts the right part of a given token list.
1633
1634   Syntax:
1635
1636   <tscreen><verb>
1637         .RIGHT (&lt;int expr&gt;, &lt;token list&gt;)
1638   </verb></tscreen>
1639
1640   The first integer expression gives the number of tokens to extract from the
1641   token list. The second argument is the token list itself.  The token list
1642   may optionally be enclosed into curly braces. This allows the inclusion of
1643   tokens that would otherwise terminate the list (the closing right paren in
1644   the given case).
1645
1646   See also the <tt><ref id=".LEFT" name=".LEFT"></tt> and <tt><ref id=".MID"
1647   name=".MID"></tt> builtin functions.
1648
1649
1650 <sect1><tt>.SIZEOF</tt><label id=".SIZEOF"><p>
1651
1652   <tt/.SIZEOF/ is a pseudo function that returns the size of its argument. The
1653   argument can be a struct/union, a struct member, a procedure, or a label. In
1654   case of a procedure or label, its size is defined by the amount of data
1655   placed in the segment where the label is relative to. If a line of code
1656   switches segments (for example in a macro) data placed in other segments
1657   does not count for the size.
1658
1659   Please note that a symbol or scope must exist, before it is used together with
1660   <tt/.SIZEOF/ (this may get relaxed later, but will always be true for scopes).
1661   A scope has preference over a symbol with the same name, so if the last part
1662   of a name represents both, a scope and a symbol, the scope is chosen over the
1663   symbol.
1664
1665   After the following code:
1666
1667   <tscreen><verb>
1668         .struct Point                   ; Struct size = 4
1669                 xcoord  .word
1670                 xcoord  .word
1671         .endstruct
1672
1673         P:      .tag    Point           ; Declare a point
1674         @P:     .tag    Point           ; Declare another point
1675
1676         .code
1677         .proc   Code
1678                 nop
1679                 .proc   Inner
1680                         nop
1681                 .endproc
1682                 nop
1683         .endproc
1684
1685         .proc   Data
1686         .data                           ; Segment switch!!!
1687                 .res    4
1688         .endproc
1689   </verb></tscreen>
1690
1691   <descrip>
1692     <tag><tt/.sizeof(Point)/</tag>
1693     will have the value 4, because this is the size of struct <tt/Point/.
1694
1695     <tag><tt/.sizeof(Point::xcoord)/</tag>
1696     will have the value 2, because this is the size of the member <tt/xcoord/
1697     in struct <tt/Point/.
1698
1699     <tag><tt/.sizeof(P)/</tag>
1700     will have the value 4, this is the size of the data declared on the same
1701     source line as the label <tt/P/, which is in the same segment that <tt/P/
1702     is relative to.
1703
1704     <tag><tt/.sizeof(@P)/</tag>
1705     will have the value 4, see above. The example demonstrates that <tt/.SIZEOF/
1706     does also work for cheap local symbols.
1707
1708     <tag><tt/.sizeof(Code)/</tag>
1709     will have the value 3, since this is amount of data emitted into the code
1710     segment, the segment that was active when <tt/Code/ was entered. Note that
1711     this value includes the amount of data emitted in child scopes (in this
1712     case <tt/Code::Inner/).
1713
1714     <tag><tt/.sizeof(Code::Inner)/</tag>
1715     will have the value 1 as expected.
1716
1717     <tag><tt/.sizeof(Data)/</tag>
1718     will have the value 0. Data is emitted within the scope <tt/Data/, but since
1719     the segment is switched after entry, this data is emitted into another
1720     segment.
1721   </descrip>
1722
1723
1724 <sect1><tt>.STRAT</tt><label id=".STRAT"><p>
1725
1726   Builtin function. The function accepts a string and an index as
1727   arguments and returns the value of the character at the given position
1728   as an integer value. The index is zero based.
1729
1730   Example:
1731
1732   <tscreen><verb>
1733         .macro  M       Arg
1734                 ; Check if the argument string starts with '#'
1735                 .if (.strat (Arg, 0) = '#')
1736                 ...
1737                 .endif
1738         .endmacro
1739   </verb></tscreen>
1740
1741
1742 <sect1><tt>.SPRINTF</tt><label id=".SPRINTF"><p>
1743
1744   Builtin function. It expects a format string as first argument. The number
1745   and type of the following arguments depend on the format string. The format
1746   string is similar to the one of the C <tt/printf/ function. Missing things
1747   are: Length modifiers, variable width.
1748
1749   The result of the function is a string.
1750
1751   Example:
1752
1753   <tscreen><verb>
1754         num     = 3
1755
1756         ; Generate an identifier:
1757         .ident (.sprintf ("%s%03d", "label", num)):
1758   </verb></tscreen>
1759
1760
1761 <sect1><tt>.STRING</tt><label id=".STRING"><p>
1762
1763   Builtin function. The function accepts an argument in braces and converts
1764   this argument into a string constant. The argument may be an identifier, or
1765   a constant numeric value.
1766
1767   Since you can use a string in the first place, the use of the function may
1768   not be obvious. However, it is useful in macros, or more complex setups.
1769
1770   Example:
1771
1772   <tscreen><verb>
1773         ; Emulate other assemblers:
1774         .macro  section name
1775                 .segment        .string(name)
1776         .endmacro
1777   </verb></tscreen>
1778
1779
1780 <sect1><tt>.STRLEN</tt><label id=".STRLEN"><p>
1781
1782   Builtin function. The function accepts a string argument in braces and
1783   evaluates to the length of the string.
1784
1785   Example:
1786
1787   The following macro encodes a string as a pascal style string with
1788   a leading length byte.
1789
1790   <tscreen><verb>
1791         .macro  PString Arg
1792                 .byte   .strlen(Arg), Arg
1793         .endmacro
1794   </verb></tscreen>
1795
1796
1797 <sect1><tt>.TCOUNT</tt><label id=".TCOUNT"><p>
1798
1799   Builtin function. The function accepts a token list in braces. The function
1800   result is the number of tokens given as argument. The token list may
1801   optionally be enclosed into curly braces which are not considered part of
1802   the list and not counted. Enclosement in curly braces allows the inclusion
1803   of tokens that would otherwise terminate the list (the closing right paren
1804   in the given case).
1805
1806   Example:
1807
1808   The <tt/ldax/ macro accepts the '#' token to denote immediate addressing (as
1809   with the normal 6502 instructions). To translate it into two separate 8 bit
1810   load instructions, the '#' token has to get stripped from the argument:
1811
1812   <tscreen><verb>
1813         .macro  ldax    arg
1814                 .if (.match (.mid (0, 1, {arg}), #))
1815                 ; ldax called with immediate operand
1816                 lda     #<(.right (.tcount ({arg})-1, {arg}))
1817                 ldx     #>(.right (.tcount ({arg})-1, {arg}))
1818                 .else
1819                 ...
1820                 .endif
1821         .endmacro
1822   </verb></tscreen>
1823
1824
1825 <sect1><tt>.XMATCH</tt><label id=".XMATCH"><p>
1826
1827   Builtin function. Matches two token lists against each other. This is
1828   most useful within macros, since macros are not stored as strings, but
1829   as lists of tokens.
1830
1831   The syntax is
1832
1833   <tscreen><verb>
1834         .XMATCH(&lt;token list #1&gt;, &lt;token list #2&gt;)
1835   </verb></tscreen>
1836
1837   Both token list may contain arbitrary tokens with the exception of the
1838   terminator token (comma resp. right parenthesis) and
1839
1840   <itemize>
1841   <item>end-of-line
1842   <item>end-of-file
1843   </itemize>
1844
1845   The token lists may optionally be enclosed into curly braces. This allows
1846   the inclusion of tokens that would otherwise terminate the list (the closing
1847   right paren in the given case). Often a macro parameter is used for any of
1848   the token lists.
1849
1850   The function compares tokens <em/and/ token values. If you need a function
1851   that just compares the type of tokens, have a look at the <tt><ref
1852   id=".MATCH" name=".MATCH"></tt> function.
1853
1854   See: <tt><ref id=".MATCH" name=".MATCH"></tt>
1855
1856
1857
1858 <sect>Control commands<label id="control-commands"><p>
1859
1860 Here's a list of all control commands and a description, what they do:
1861
1862
1863 <sect1><tt>.A16</tt><label id=".A16"><p>
1864
1865   Valid only in 65816 mode. Switch the accumulator to 16 bit.
1866
1867   Note: This command will not emit any code, it will tell the assembler to
1868   create 16 bit operands for immediate accumulator addressing mode.
1869
1870   See also: <tt><ref id=".SMART" name=".SMART"></tt>
1871
1872
1873 <sect1><tt>.A8</tt><label id=".A8"><p>
1874
1875   Valid only in 65816 mode. Switch the accumulator to 8 bit.
1876
1877   Note: This command will not emit any code, it will tell the assembler to
1878   create 8 bit operands for immediate accu addressing mode.
1879
1880   See also: <tt><ref id=".SMART" name=".SMART"></tt>
1881
1882
1883 <sect1><tt>.ADDR</tt><label id=".ADDR"><p>
1884
1885   Define word sized data. In 6502 mode, this is an alias for <tt/.WORD/ and
1886   may be used for better readability if the data words are address values. In
1887   65816 mode, the address is forced to be 16 bit wide to fit into the current
1888   segment. See also <tt><ref id=".FARADDR" name=".FARADDR"></tt>. The command
1889   must be followed by a sequence of (not necessarily constant) expressions.
1890
1891   Example:
1892
1893   <tscreen><verb>
1894         .addr   $0D00, $AF13, _Clear
1895   </verb></tscreen>
1896
1897   See: <tt><ref id=".FARADDR" name=".FARADDR"></tt>, <tt><ref id=".WORD"
1898        name=".WORD"></tt>
1899
1900
1901 <sect1><tt>.ALIGN</tt><label id=".ALIGN"><p>
1902
1903   Align data to a given boundary. The command expects a constant integer
1904   argument in the range 1 ... 65536, plus an optional second argument
1905   in byte range. If there is a second argument, it is used as fill value,
1906   otherwise the value defined in the linker configuration file is used
1907   (the default for this value is zero).
1908
1909   <tt/.ALIGN/ will insert fill bytes, and the number of fill bytes depend of
1910   the final address of the segment. <tt/.ALIGN/ cannot insert a variable
1911   number of bytes, since that would break address calculations within the
1912   module. So each <tt/.ALIGN/ expects the segment to be aligned to a multiple
1913   of the alignment, because that allows the number of fill bytes to be
1914   calculated in advance by the assembler. You are therefore required to
1915   specify a matching alignment for the segment in the linker config. The
1916   linker will output a warning if the alignment of the segment is less than
1917   what is necessary to have a correct alignment in the object file.
1918
1919   Example:
1920
1921   <tscreen><verb>
1922         .align  256
1923   </verb></tscreen>
1924
1925   Some unexpected behaviour might occur if there are multiple <tt/.ALIGN/
1926   commands with different arguments. To allow the assembler to calculate the
1927   number of fill bytes in advance, the alignment of the segment must be a
1928   multiple of each of the alignment factors. This may result in unexpectedly
1929   large alignments for the segment within the module.
1930
1931   Example:
1932
1933   <tscreen><verb>
1934         .align  15
1935         .byte   15
1936         .align  18
1937         .byte   18
1938   </verb></tscreen>
1939
1940   For the assembler to be able to align correctly, the segment must be aligned
1941   to the least common multiple of 15 and 18 which is 90. The assembler will
1942   calculate this automatically and will mark the segment with this value.
1943
1944   Unfortunately, the combined alignment may get rather large without the user
1945   knowing about it, wasting space in the final executable. If we add another
1946   alignment to the example above
1947
1948   <tscreen><verb>
1949         .align  15
1950         .byte   15
1951         .align  18
1952         .byte   18
1953         .align  251
1954         .byte   0
1955   </verb></tscreen>
1956
1957   the assembler will force a segment alignment to the least common multiple of
1958   15, 18 and 251 - which is 22590. To protect the user against errors, the
1959   assembler will issue a warning when the combined alignment exceeds 256. The
1960   command line option <tt><ref id="option--large-alignment"
1961   name="--large-alignment"></tt> will disable this warning.
1962
1963   Please note that with alignments that are a power of two (which were the
1964   only alignments possible in older versions of the assembler), the problem is
1965   less severe, because the least common multiple of powers to the same base is
1966   always the larger one.
1967
1968
1969
1970 <sect1><tt>.ASCIIZ</tt><label id=".ASCIIZ"><p>
1971
1972   Define a string with a trailing zero.
1973
1974   Example:
1975
1976   <tscreen><verb>
1977         Msg:    .asciiz "Hello world"
1978   </verb></tscreen>
1979
1980   This will put the string "Hello world" followed by a binary zero into
1981   the current segment. There may be more strings separated by commas, but
1982   the binary zero is only appended once (after the last one).
1983
1984
1985 <sect1><tt>.ASSERT</tt><label id=".ASSERT"><p>
1986
1987   Add an assertion. The command is followed by an expression, an action
1988   specifier, and an optional message that is output in case the assertion
1989   fails. If no message was given, the string "Assertion failed" is used. The
1990   action specifier may be one of <tt/warning/, <tt/error/, <tt/ldwarning/ or
1991   <tt/lderror/. In the former two cases, the assertion is evaluated by the
1992   assembler if possible, and in any case, it's also passed to the linker in
1993   the object file (if one is generated). The linker will then evaluate the
1994   expression when segment placement has been done.
1995
1996   Example:
1997
1998   <tscreen><verb>
1999         .assert         * = $8000, error, "Code not at $8000"
2000   </verb></tscreen>
2001
2002   The example assertion will check that the current location is at $8000,
2003   when the output file is written, and abort with an error if this is not
2004   the case. More complex expressions are possible. The action specifier
2005   <tt/warning/ outputs a warning, while the <tt/error/ specifier outputs
2006   an error message. In the latter case, generation of the output file is
2007   suppressed in both the assembler and linker.
2008
2009
2010 <sect1><tt>.AUTOIMPORT</tt><label id=".AUTOIMPORT"><p>
2011
2012   Is followed by a plus or a minus character. When switched on (using a
2013   +), undefined symbols are automatically marked as import instead of
2014   giving errors. When switched off (which is the default so this does not
2015   make much sense), this does not happen and an error message is
2016   displayed. The state of the autoimport flag is evaluated when the
2017   complete source was translated, before outputting actual code, so it is
2018   <em/not/ possible to switch this feature on or off for separate sections
2019   of code. The last setting is used for all symbols.
2020
2021   You should probably not use this switch because it delays error
2022   messages about undefined symbols until the link stage. The cc65
2023   compiler (which is supposed to produce correct assembler code in all
2024   circumstances, something which is not true for most assembler
2025   programmers) will insert this command to avoid importing each and every
2026   routine from the runtime library.
2027
2028   Example:
2029
2030   <tscreen><verb>
2031         .autoimport     +       ; Switch on auto import
2032   </verb></tscreen>
2033
2034 <sect1><tt>.BANKBYTES</tt><label id=".BANKBYTES"><p>
2035
2036   Define byte sized data by extracting only the bank byte (that is, bits 16-23) from
2037   each expression.  This is equivalent to <tt><ref id=".BYTE" name=".BYTE"></tt> with
2038   the operator '^' prepended to each expression in its list.
2039
2040   Example:
2041
2042   <tscreen><verb>
2043         .define MyTable TableItem0, TableItem1, TableItem2, TableItem3
2044
2045         TableLookupLo:   .lobytes   MyTable
2046         TableLookupHi:   .hibytes   MyTable
2047         TableLookupBank: .bankbytes MyTable
2048   </verb></tscreen>
2049
2050   which is equivalent to
2051
2052   <tscreen><verb>
2053         TableLookupLo:   .byte &lt;TableItem0, &lt;TableItem1, &lt;TableItem2, &lt;TableItem3
2054         TableLookupHi:   .byte &gt;TableItem0, &gt;TableItem1, &gt;TableItem2, &gt;TableItem3
2055         TableLookupBank: .byte ^TableItem0, ^TableItem1, ^TableItem2, ^TableItem3
2056   </verb></tscreen>
2057
2058   See also: <tt><ref id=".BYTE" name=".BYTE"></tt>,
2059             <tt><ref id=".HIBYTES" name=".HIBYTES"></tt>,
2060             <tt><ref id=".LOBYTES" name=".LOBYTES"></tt>
2061
2062
2063 <sect1><tt>.BSS</tt><label id=".BSS"><p>
2064
2065   Switch to the BSS segment. The name of the BSS segment is always "BSS",
2066   so this is a shortcut for
2067
2068   <tscreen><verb>
2069         .segment  "BSS"
2070   </verb></tscreen>
2071
2072   See also the <tt><ref id=".SEGMENT" name=".SEGMENT"></tt> command.
2073
2074
2075 <sect1><tt>.BYT, .BYTE</tt><label id=".BYTE"><p>
2076
2077   Define byte sized data. Must be followed by a sequence of (byte ranged)
2078   expressions or strings.
2079
2080   Example:
2081
2082   <tscreen><verb>
2083         .byte   "Hello "
2084         .byt    "world", $0D, $00
2085   </verb></tscreen>
2086
2087
2088 <sect1><tt>.CASE</tt><label id=".CASE"><p>
2089
2090   Switch on or off case sensitivity on identifiers. The default is off
2091   (that is, identifiers are case sensitive), but may be changed by the
2092   -i switch on the command line.
2093   The command must be followed by a '+' or '-' character to switch the
2094   option on or off respectively.
2095
2096   Example:
2097
2098   <tscreen><verb>
2099         .case   -               ; Identifiers are not case sensitive
2100   </verb></tscreen>
2101
2102
2103 <sect1><tt>.CHARMAP</tt><label id=".CHARMAP"><p>
2104
2105   Apply a custom mapping for characters. The command is followed by two
2106   numbers in the range 1..255. The first one is the index of the source
2107   character, the second one is the mapping. The mapping applies to all
2108   character and string constants when they generate output, and overrides
2109   a mapping table specified with the <tt><ref id="option-t" name="-t"></tt>
2110   command line switch.
2111
2112   Example:
2113
2114   <tscreen><verb>
2115         .charmap        $41, $61        ; Map 'A' to 'a'
2116   </verb></tscreen>
2117
2118
2119 <sect1><tt>.CODE</tt><label id=".CODE"><p>
2120
2121   Switch to the CODE segment. The name of the CODE segment is always
2122   "CODE", so this is a shortcut for
2123
2124   <tscreen><verb>
2125         .segment  "CODE"
2126   </verb></tscreen>
2127
2128   See also the <tt><ref id=".SEGMENT" name=".SEGMENT"></tt> command.
2129
2130
2131 <sect1><tt>.CONDES</tt><label id=".CONDES"><p>
2132
2133   Export a symbol and mark it in a special way. The linker is able to build
2134   tables of all such symbols. This may be used to automatically create a list
2135   of functions needed to initialize linked library modules.
2136
2137   Note: The linker has a feature to build a table of marked routines, but it
2138   is your code that must call these routines, so just declaring a symbol with
2139   <tt/.CONDES/ does nothing by itself.
2140
2141   All symbols are exported as an absolute (16 bit) symbol. You don't need to
2142   use an additional <tt><ref id=".EXPORT" name=".EXPORT"></tt> statement, this
2143   is implied by <tt/.CONDES/.
2144
2145   <tt/.CONDES/ is followed by the type, which may be <tt/constructor/,
2146   <tt/destructor/ or a numeric value between 0 and 6 (where 0 is the same as
2147   specifying <tt/constructor/ and 1 is equal to specifying <tt/destructor/).
2148   The <tt><ref id=".CONSTRUCTOR" name=".CONSTRUCTOR"></tt>, <tt><ref
2149   id=".DESTRUCTOR" name=".DESTRUCTOR"></tt> and <tt><ref id=".INTERRUPTOR"
2150   name=".INTERRUPTOR"></tt> commands are actually shortcuts for <tt/.CONDES/
2151   with a type of <tt/constructor/ resp. <tt/destructor/ or <tt/interruptor/.
2152
2153   After the type, an optional priority may be specified. Higher numeric values
2154   mean higher priority. If no priority is given, the default priority of 7 is
2155   used. Be careful when assigning priorities to your own module constructors
2156   so they won't interfere with the ones in the cc65 library.
2157
2158   Example:
2159
2160   <tscreen><verb>
2161         .condes         ModuleInit, constructor
2162         .condes         ModInit, 0, 16
2163   </verb></tscreen>
2164
2165   See the <tt><ref id=".CONSTRUCTOR" name=".CONSTRUCTOR"></tt>, <tt><ref
2166   id=".DESTRUCTOR" name=".DESTRUCTOR"></tt> and <tt><ref id=".INTERRUPTOR"
2167   name=".INTERRUPTOR"></tt> commands and the separate section <ref id="condes"
2168   name="Module constructors/destructors"> explaining the feature in more
2169   detail.
2170
2171
2172 <sect1><tt>.CONSTRUCTOR</tt><label id=".CONSTRUCTOR"><p>
2173
2174   Export a symbol and mark it as a module constructor. This may be used
2175   together with the linker to build a table of constructor subroutines that
2176   are called by the startup code.
2177
2178   Note: The linker has a feature to build a table of marked routines, but it
2179   is your code that must call these routines, so just declaring a symbol as
2180   constructor does nothing by itself.
2181
2182   A constructor is always exported as an absolute (16 bit) symbol. You don't
2183   need to use an additional <tt/.export/ statement, this is implied by
2184   <tt/.constructor/. It may have an optional priority that is separated by a
2185   comma. Higher numeric values mean a higher priority. If no priority is
2186   given, the default priority of 7 is used. Be careful when assigning
2187   priorities to your own module constructors so they won't interfere with the
2188   ones in the cc65 library.
2189
2190   Example:
2191
2192   <tscreen><verb>
2193         .constructor    ModuleInit
2194         .constructor    ModInit, 16
2195   </verb></tscreen>
2196
2197   See the <tt><ref id=".CONDES" name=".CONDES"></tt> and <tt><ref
2198   id=".DESTRUCTOR" name=".DESTRUCTOR"></tt> commands and the separate section
2199   <ref id="condes" name="Module constructors/destructors"> explaining the
2200   feature in more detail.
2201
2202
2203 <sect1><tt>.DATA</tt><label id=".DATA"><p>
2204
2205   Switch to the DATA segment. The name of the DATA segment is always
2206   "DATA", so this is a shortcut for
2207
2208   <tscreen><verb>
2209         .segment  "DATA"
2210   </verb></tscreen>
2211
2212   See also the <tt><ref id=".SEGMENT" name=".SEGMENT"></tt> command.
2213
2214
2215 <sect1><tt>.DBYT</tt><label id=".DBYT"><p>
2216
2217   Define word sized data with the hi and lo bytes swapped (use <tt/.WORD/ to
2218   create word sized data in native 65XX format). Must be followed by a
2219   sequence of (word ranged) expressions.
2220
2221   Example:
2222
2223   <tscreen><verb>
2224         .dbyt   $1234, $4512
2225   </verb></tscreen>
2226
2227   This will emit the bytes
2228
2229   <tscreen><verb>
2230         $12 $34 $45 $12
2231   </verb></tscreen>
2232
2233   into the current segment in that order.
2234
2235
2236 <sect1><tt>.DEBUGINFO</tt><label id=".DEBUGINFO"><p>
2237
2238   Switch on or off debug info generation. The default is off (that is,
2239   the object file will not contain debug infos), but may be changed by the
2240   -g switch on the command line.
2241   The command must be followed by a '+' or '-' character to switch the
2242   option on or off respectively.
2243
2244   Example:
2245
2246   <tscreen><verb>
2247         .debuginfo      +       ; Generate debug info
2248   </verb></tscreen>
2249
2250
2251 <sect1><tt>.DEFINE</tt><label id=".DEFINE"><p>
2252
2253   Start a define style macro definition. The command is followed by an
2254   identifier (the macro name) and optionally by a list of formal arguments
2255   in braces.
2256
2257   Please note that <tt/.DEFINE/ shares most disadvantages with its C
2258   counterpart, so the general advice is, <bf/NOT/ do use <tt/.DEFINE/ if you
2259   don't have to.
2260
2261   See also the <tt><ref id=".UNDEFINE" name=".UNDEFINE"></tt> command and
2262   section <ref id="macros" name="Macros">.
2263
2264
2265 <sect1><tt>.DELMAC, .DELMACRO</tt><label id=".DELMACRO"><p>
2266
2267   Delete a classic macro (defined with <tt><ref id=".MACRO"
2268   name=".MACRO"></tt>) . The command is followed by the name of an
2269   existing macro. Its definition will be deleted together with the name.
2270   If necessary, another macro with this name may be defined later.
2271
2272   See: <tt><ref id=".ENDMACRO" name=".ENDMACRO"></tt>,
2273        <tt><ref id=".EXITMACRO" name=".EXITMACRO"></tt>,
2274        <tt><ref id=".MACRO" name=".MACRO"></tt>
2275
2276   See also section <ref id="macros" name="Macros">.
2277
2278
2279 <sect1><tt>.DEF, .DEFINED</tt><label id=".DEFINED"><p>
2280
2281   Builtin function. The function expects an identifier as argument in braces.
2282   The argument is evaluated, and the function yields "true" if the identifier
2283   is a symbol that is already defined somewhere in the source file up to the
2284   current position. Otherwise the function yields false. As an example, the
2285   <tt><ref id=".IFDEF" name=".IFDEF"></tt> statement may be replaced by
2286
2287   <tscreen><verb>
2288         .if     .defined(a)
2289   </verb></tscreen>
2290
2291
2292 <sect1><tt>.DESTRUCTOR</tt><label id=".DESTRUCTOR"><p>
2293
2294   Export a symbol and mark it as a module destructor. This may be used
2295   together with the linker to build a table of destructor subroutines that
2296   are called by the startup code.
2297
2298   Note: The linker has a feature to build a table of marked routines, but it
2299   is your code that must call these routines, so just declaring a symbol as
2300   constructor does nothing by itself.
2301
2302   A destructor is always exported as an absolute (16 bit) symbol. You don't
2303   need to use an additional <tt/.export/ statement, this is implied by
2304   <tt/.destructor/. It may have an optional priority that is separated by a
2305   comma. Higher numerical values mean a higher priority. If no priority is
2306   given, the default priority of 7 is used. Be careful when assigning
2307   priorities to your own module destructors so they won't interfere with the
2308   ones in the cc65 library.
2309
2310   Example:
2311
2312   <tscreen><verb>
2313         .destructor     ModuleDone
2314         .destructor     ModDone, 16
2315   </verb></tscreen>
2316
2317   See the <tt><ref id=".CONDES" name=".CONDES"></tt> and <tt><ref
2318   id=".CONSTRUCTOR" name=".CONSTRUCTOR"></tt> commands and the separate
2319   section <ref id="condes" name="Module constructors/destructors"> explaining
2320   the feature in more detail.
2321
2322
2323 <sect1><tt>.DWORD</tt><label id=".DWORD"><p>
2324
2325   Define dword sized data (4 bytes) Must be followed by a sequence of
2326   expressions.
2327
2328   Example:
2329
2330   <tscreen><verb>
2331         .dword  $12344512, $12FA489
2332   </verb></tscreen>
2333
2334
2335 <sect1><tt>.ELSE</tt><label id=".ELSE"><p>
2336
2337   Conditional assembly: Reverse the current condition.
2338
2339
2340 <sect1><tt>.ELSEIF</tt><label id=".ELSEIF"><p>
2341
2342   Conditional assembly: Reverse current condition and test a new one.
2343
2344
2345 <sect1><tt>.END</tt><label id=".END"><p>
2346
2347   Forced end of assembly. Assembly stops at this point, even if the command
2348   is read from an include file.
2349
2350
2351 <sect1><tt>.ENDENUM</tt><label id=".ENDENUM"><p>
2352
2353   End a <tt><ref id=".ENUM" name=".ENUM"></tt> declaration.
2354
2355
2356 <sect1><tt>.ENDIF</tt><label id=".ENDIF"><p>
2357
2358   Conditional assembly: Close a <tt><ref id=".IF" name=".IF..."></tt> or
2359   <tt><ref id=".ELSE" name=".ELSE"></tt> branch.
2360
2361
2362 <sect1><tt>.ENDMAC, .ENDMACRO</tt><label id=".ENDMACRO"><p>
2363
2364   Marks the end of a macro definition.
2365
2366   See: <tt><ref id=".DELMACRO" name=".DELMACRO"></tt>,
2367        <tt><ref id=".EXITMACRO" name=".EXITMACRO"></tt>,
2368        <tt><ref id=".MACRO" name=".MACRO"></tt>
2369
2370   See also section <ref id="macros" name="Macros">.
2371
2372
2373 <sect1><tt>.ENDPROC</tt><label id=".ENDPROC"><p>
2374
2375   End of local lexical level (see <tt><ref id=".PROC" name=".PROC"></tt>).
2376
2377
2378 <sect1><tt>.ENDREP, .ENDREPEAT</tt><label id=".ENDREPEAT"><p>
2379
2380   End a <tt><ref id=".REPEAT" name=".REPEAT"></tt> block.
2381
2382
2383 <sect1><tt>.ENDSCOPE</tt><label id=".ENDSCOPE"><p>
2384
2385   End of local lexical level (see <tt/<ref id=".SCOPE" name=".SCOPE">/).
2386
2387
2388 <sect1><tt>.ENDSTRUCT</tt><label id=".ENDSTRUCT"><p>
2389
2390   Ends a struct definition. See the <tt/<ref id=".STRUCT" name=".STRUCT">/
2391   command and the separate section named <ref id="structs" name="&quot;Structs
2392   and unions&quot;">.
2393
2394
2395 <sect1><tt>.ENDUNION</tt><label id=".ENDUNION"><p>
2396
2397   Ends a union definition. See the <tt/<ref id=".UNION" name=".UNION">/
2398   command and the separate section named <ref id="structs" name="&quot;Structs
2399   and unions&quot;">.
2400
2401
2402 <sect1><tt>.ENUM</tt><label id=".ENUM"><p>
2403
2404   Start an enumeration. This directive is very similar to the C <tt/enum/
2405   keyword. If a name is given, a new scope is created for the enumeration,
2406   otherwise the enumeration members are placed in the enclosing scope.
2407
2408   In the enumeration body, symbols are declared. The first symbol has a value
2409   of zero, and each following symbol will get the value of the preceding plus
2410   one. This behaviour may be overridden by an explicit assignment. Two symbols
2411   may have the same value.
2412
2413   Example:
2414
2415   <tscreen><verb>
2416         .enum   errorcodes
2417                 no_error
2418                 file_error
2419                 parse_error
2420         .endenum
2421   </verb></tscreen>
2422
2423   Above example will create a new scope named <tt/errorcodes/ with three
2424   symbols in it that get the values 0, 1 and 2 respectively. Another way
2425   to write this would have been:
2426
2427   <tscreen><verb>
2428         .scope  errorcodes
2429                 no_error        = 0
2430                 file_error      = 1
2431                 parse_error     = 2
2432         .endscope
2433   </verb></tscreen>
2434
2435   Please note that explicit scoping must be used to access the identifiers:
2436
2437   <tscreen><verb>
2438         .word   errorcodes::no_error
2439   </verb></tscreen>
2440
2441   A more complex example:
2442
2443   <tscreen><verb>
2444         .enum
2445                 EUNKNOWN        = -1
2446                 EOK
2447                 EFILE
2448                 EBUSY
2449                 EAGAIN
2450                 EWOULDBLOCK     = EAGAIN
2451         .endenum
2452   </verb></tscreen>
2453
2454   In this example, the enumeration does not have a name, which means that the
2455   members will be visible in the enclosing scope and can be used in this scope
2456   without explicit scoping. The first member (<tt/EUNKNOWN/) has the value -1.
2457   The value for the following members is incremented by one, so <tt/EOK/ would
2458   be zero and so on. <tt/EWOULDBLOCK/ is an alias for <tt/EGAIN/, so it has an
2459   override for the value using an already defined symbol.
2460
2461
2462 <sect1><tt>.ERROR</tt><label id=".ERROR"><p>
2463
2464   Force an assembly error. The assembler will output an error message
2465   preceded by "User error". Assembly is continued but no object file will
2466   generated.
2467
2468   This command may be used to check for initial conditions that must be
2469   set before assembling a source file.
2470
2471   Example:
2472
2473   <tscreen><verb>
2474         .if     foo = 1
2475         ...
2476         .elseif bar = 1
2477         ...
2478         .else
2479         .error  "Must define foo or bar!"
2480         .endif
2481   </verb></tscreen>
2482
2483   See also: <tt><ref id=".FATAL" name=".FATAL"></tt>,
2484             <tt><ref id=".OUT" name=".OUT"></tt>,
2485             <tt><ref id=".WARNING" name=".WARNING"></tt>
2486
2487
2488 <sect1><tt>.EXITMAC, .EXITMACRO</tt><label id=".EXITMACRO"><p>
2489
2490   Abort a macro expansion immediately. This command is often useful in
2491   recursive macros.
2492
2493   See: <tt><ref id=".DELMACRO" name=".DELMACRO"></tt>,
2494        <tt><ref id=".ENDMACRO" name=".ENDMACRO"></tt>,
2495        <tt><ref id=".MACRO" name=".MACRO"></tt>
2496
2497   See also section <ref id="macros" name="Macros">.
2498
2499
2500 <sect1><tt>.EXPORT</tt><label id=".EXPORT"><p>
2501
2502   Make symbols accessible from other modules. Must be followed by a comma
2503   separated list of symbols to export, with each one optionally followed by an
2504   address specification and (also optional) an assignment. Using an additional
2505   assignment in the export statement allows to define and export a symbol in
2506   one statement. The default is to export the symbol with the address size it
2507   actually has. The assembler will issue a warning, if the symbol is exported
2508   with an address size smaller than the actual address size.
2509
2510   Examples:
2511
2512   <tscreen><verb>
2513         .export foo
2514         .export bar: far
2515         .export foobar: far = foo * bar
2516         .export baz := foobar, zap: far = baz - bar
2517   </verb></tscreen>
2518
2519   As with constant definitions, using <tt/:=/ instead of <tt/=/ marks the
2520   symbols as a label.
2521
2522   See: <tt><ref id=".EXPORTZP" name=".EXPORTZP"></tt>
2523
2524
2525 <sect1><tt>.EXPORTZP</tt><label id=".EXPORTZP"><p>
2526
2527   Make symbols accessible from other modules. Must be followed by a comma
2528   separated list of symbols to export. The exported symbols are explicitly
2529   marked as zero page symbols. An assignment may be included in the
2530   <tt/.EXPORTZP/ statement. This allows to define and export a symbol in one
2531   statement.
2532
2533   Examples:
2534
2535   <tscreen><verb>
2536         .exportzp  foo, bar
2537         .exportzp  baz := &dollar;02
2538   </verb></tscreen>
2539
2540   See: <tt><ref id=".EXPORT" name=".EXPORT"></tt>
2541
2542
2543 <sect1><tt>.FARADDR</tt><label id=".FARADDR"><p>
2544
2545   Define far (24 bit) address data. The command must be followed by a
2546   sequence of (not necessarily constant) expressions.
2547
2548   Example:
2549
2550   <tscreen><verb>
2551         .faraddr        DrawCircle, DrawRectangle, DrawHexagon
2552   </verb></tscreen>
2553
2554   See: <tt><ref id=".ADDR" name=".ADDR"></tt>
2555
2556
2557 <sect1><tt>.FATAL</tt><label id=".FATAL"><p>
2558
2559   Force an assembly error and terminate assembly. The assembler will output an
2560   error message preceded by "User error" and will terminate assembly
2561   immediately.
2562
2563   This command may be used to check for initial conditions that must be
2564   set before assembling a source file.
2565
2566   Example:
2567
2568   <tscreen><verb>
2569         .if     foo = 1
2570         ...
2571         .elseif bar = 1
2572         ...
2573         .else
2574         .fatal  "Must define foo or bar!"
2575         .endif
2576   </verb></tscreen>
2577
2578   See also: <tt><ref id=".ERROR" name=".ERROR"></tt>,
2579             <tt><ref id=".OUT" name=".OUT"></tt>,
2580             <tt><ref id=".WARNING" name=".WARNING"></tt>
2581
2582
2583 <sect1><tt>.FEATURE</tt><label id=".FEATURE"><p>
2584
2585   This directive may be used to enable one or more compatibility features
2586   of the assembler. While the use of <tt/.FEATURE/ should be avoided when
2587   possible, it may be useful when porting sources written for other
2588   assemblers. There is no way to switch a feature off, once you have
2589   enabled it, so using
2590
2591   <tscreen><verb>
2592         .FEATURE        xxx
2593   </verb></tscreen>
2594
2595   will enable the feature until end of assembly is reached.
2596
2597   The following features are available:
2598
2599   <descrip>
2600
2601   <tag><tt>at_in_identifiers</tt><label id="at_in_identifiers"></tag>
2602
2603     Accept the at character (`@') as a valid character in identifiers. The
2604     at character is not allowed to start an identifier, even with this
2605     feature enabled.
2606
2607   <tag><tt>c_comments</tt><label id="c_comments"></tag>
2608
2609     Allow C like comments using <tt>/*</tt> and <tt>*/</tt> as left and right
2610     comment terminators. Note that C comments may not be nested. There's also a
2611     pitfall when using C like comments: All statements must be terminated by
2612     "end-of-line". Using C like comments, it is possible to hide the newline,
2613     which results in error messages. See the following non working example:
2614
2615     <tscreen><verb>
2616         lda     #$00  /* This comment hides the newline
2617 */      sta     $82
2618     </verb></tscreen>
2619
2620   <tag><tt>dollar_in_identifiers</tt><label id="dollar_in_identifiers"></tag>
2621
2622     Accept the dollar sign (`&dollar;') as a valid character in identifiers. The
2623     dollar character is not allowed to start an identifier, even with this
2624     feature enabled.
2625
2626   <tag><tt>dollar_is_pc</tt><label id="dollar_is_pc"></tag>
2627
2628     The dollar sign may be used as an alias for the star (`*'), which
2629     gives the value of the current PC in expressions.
2630     Note: Assignment to the pseudo variable is not allowed.
2631
2632   <tag><tt>force_range</tt><label id="force_range"></tag>
2633
2634     Force expressions into their valid range for immediate addressing and
2635     storage operators like <tt><ref id=".BYTE" name=".BYTE"></tt> and
2636     <tt><ref id=".WORD" name=".WORD"></tt>. Be very careful with this one,
2637     since it will completely disable error checks.
2638
2639   <tag><tt>labels_without_colons</tt><label id="labels_without_colons"></tag>
2640
2641     Allow labels without a trailing colon. These labels are only accepted,
2642     if they start at the beginning of a line (no leading white space).
2643
2644   <tag><tt>leading_dot_in_identifiers</tt><label id="leading_dot_in_identifiers"></tag>
2645
2646     Accept the dot (`.') as the first character of an identifier. This may be
2647     used for example to create macro names that start with a dot emulating
2648     control directives of other assemblers. Note however, that none of the
2649     reserved keywords built into the assembler, that starts with a dot, may be
2650     overridden. When using this feature, you may also get into trouble if
2651     later versions of the assembler define new keywords starting with a dot.
2652
2653   <tag><tt>loose_char_term</tt><label id="loose_char_term"></tag>
2654
2655     Accept single quotes as well as double quotes as terminators for char
2656     constants.
2657
2658   <tag><tt>loose_string_term</tt><label id="loose_string_term"></tag>
2659
2660     Accept single quotes as well as double quotes as terminators for string
2661     constants.
2662
2663   <tag><tt>missing_char_term</tt><label id="missing_char_term"></tag>
2664
2665     Accept single quoted character constants where the terminating quote is
2666     missing.
2667     <tscreen><verb>
2668         lda     #'a
2669     </verb></tscreen>
2670     <bf/Note:/ This does not work in conjunction with <tt/.FEATURE
2671     loose_string_term/, since in this case the input would be ambiguous.
2672
2673   <tag><tt>org_per_seg</tt><label id="org_per_seg"></tag>
2674
2675     This feature makes relocatable/absolute mode local to the current segment.
2676     Using <tt><ref id=".ORG" name=".ORG"></tt> when <tt/org_per_seg/ is in
2677     effect will only enable absolute mode for the current segment. Dito for
2678     <tt><ref id=".RELOC" name=".RELOC"></tt>.
2679
2680   <tag><tt>pc_assignment</tt><label id="pc_assignment"></tag>
2681
2682     Allow assignments to the PC symbol (`*' or `&dollar;' if <tt/dollar_is_pc/
2683     is enabled). Such an assignment is handled identical to the <tt><ref
2684     id=".ORG" name=".ORG"></tt> command (which is usually not needed, so just
2685     removing the lines with the assignments may also be an option when porting
2686     code written for older assemblers).
2687
2688   <tag><tt>ubiquitous_idents</tt><label id="ubiquitous_idents"></tag>
2689
2690     Allow the use of instructions names as names for macros and symbols. This
2691     makes it possible to "overload" instructions by defining a macro with the
2692     same name. This does also make it possible to introduce hard to find errors
2693     in your code, so be careful!
2694
2695   </descrip>
2696
2697   It is also possible to specify features on the command line using the
2698   <tt><ref id="option--feature" name="--feature"></tt> command line option.
2699   This is useful when translating sources written for older assemblers, when
2700   you don't want to change the source code.
2701
2702   As an example, to translate sources written for Andre Fachats xa65
2703   assembler, the features
2704
2705   <verb>
2706         labels_without_colons, pc_assignment, loose_char_term
2707   </verb>
2708
2709   may be helpful. They do not make ca65 completely compatible, so you may not
2710   be able to translate the sources without changes, even when enabling these
2711   features. However, I have found several sources that translate without
2712   problems when enabling these features on the command line.
2713
2714
2715 <sect1><tt>.FILEOPT, .FOPT</tt><label id=".FOPT"><p>
2716
2717   Insert an option string into the object file. There are two forms of
2718   this command, one specifies the option by a keyword, the second
2719   specifies it as a number. Since usage of the second one needs knowledge
2720   of the internal encoding, its use is not recommended and I will only
2721   describe the first form here.
2722
2723   The command is followed by one of the keywords
2724
2725   <tscreen><verb>
2726         author
2727         comment
2728         compiler
2729   </verb></tscreen>
2730
2731   a comma and a string. The option is written into the object file
2732   together with the string value. This is currently unidirectional and
2733   there is no way to actually use these options once they are in the
2734   object file.
2735
2736   Examples:
2737
2738   <tscreen><verb>
2739         .fileopt        comment, "Code stolen from my brother"
2740         .fileopt        compiler, "BASIC 2.0"
2741         .fopt           author, "J. R. User"
2742   </verb></tscreen>
2743
2744
2745 <sect1><tt>.FORCEIMPORT</tt><label id=".FORCEIMPORT"><p>
2746
2747   Import an absolute symbol from another module. The command is followed by a
2748   comma separated list of symbols to import. The command is similar to <tt>
2749   <ref id=".IMPORT" name=".IMPORT"></tt>, but the import reference is always
2750   written to the generated object file, even if the symbol is never referenced
2751   (<tt><ref id=".IMPORT" name=".IMPORT"></tt> will not generate import
2752   references for unused symbols).
2753
2754   Example:
2755
2756   <tscreen><verb>
2757         .forceimport    needthisone, needthistoo
2758   </verb></tscreen>
2759
2760   See: <tt><ref id=".IMPORT" name=".IMPORT"></tt>
2761
2762
2763 <sect1><tt>.GLOBAL</tt><label id=".GLOBAL"><p>
2764
2765   Declare symbols as global. Must be followed by a comma separated list of
2766   symbols to declare. Symbols from the list, that are defined somewhere in the
2767   source, are exported, all others are imported. Additional <tt><ref
2768   id=".IMPORT" name=".IMPORT"></tt> or <tt><ref id=".EXPORT"
2769   name=".EXPORT"></tt> commands for the same symbol are allowed.
2770
2771   Example:
2772
2773   <tscreen><verb>
2774         .global foo, bar
2775   </verb></tscreen>
2776
2777
2778 <sect1><tt>.GLOBALZP</tt><label id=".GLOBALZP"><p>
2779
2780   Declare symbols as global. Must be followed by a comma separated list of
2781   symbols to declare. Symbols from the list, that are defined somewhere in the
2782   source, are exported, all others are imported. Additional <tt><ref
2783   id=".IMPORTZP" name=".IMPORTZP"></tt> or <tt><ref id=".EXPORTZP"
2784   name=".EXPORTZP"></tt> commands for the same symbol are allowed. The symbols
2785   in the list are explicitly marked as zero page symbols.
2786
2787   Example:
2788
2789   <tscreen><verb>
2790         .globalzp foo, bar
2791   </verb></tscreen>
2792
2793 <sect1><tt>.HIBYTES</tt><label id=".HIBYTES"><p>
2794
2795   Define byte sized data by extracting only the high byte (that is, bits 8-15) from
2796   each expression.  This is equivalent to <tt><ref id=".BYTE" name=".BYTE"></tt> with
2797   the operator '>' prepended to each expression in its list.
2798
2799   Example:
2800
2801   <tscreen><verb>
2802         .lobytes         $1234, $2345, $3456, $4567
2803         .hibytes         $fedc, $edcb, $dcba, $cba9
2804   </verb></tscreen>
2805
2806   which is equivalent to
2807
2808   <tscreen><verb>
2809         .byte            $34, $45, $56, $67
2810         .byte            $fe, $ed, $dc, $cb
2811   </verb></tscreen>
2812
2813   Example:
2814
2815   <tscreen><verb>
2816         .define MyTable TableItem0, TableItem1, TableItem2, TableItem3
2817
2818         TableLookupLo:   .lobytes MyTable
2819         TableLookupHi:   .hibytes MyTable
2820   </verb></tscreen>
2821
2822   which is equivalent to
2823
2824   <tscreen><verb>
2825         TableLookupLo:   .byte &lt;TableItem0, &lt;TableItem1, &lt;TableItem2, &lt;TableItem3
2826         TableLookupHi:   .byte &gt;TableItem0, &gt;TableItem1, &gt;TableItem2, &gt;TableItem3
2827   </verb></tscreen>
2828
2829   See also: <tt><ref id=".BYTE" name=".BYTE"></tt>,
2830             <tt><ref id=".LOBYTES" name=".LOBYTES"></tt>,
2831             <tt><ref id=".BANKBYTES" name=".BANKBYTES"></tt>
2832
2833
2834 <sect1><tt>.I16</tt><label id=".I16"><p>
2835
2836   Valid only in 65816 mode. Switch the index registers to 16 bit.
2837
2838   Note: This command will not emit any code, it will tell the assembler to
2839   create 16 bit operands for immediate operands.
2840
2841   See also the <tt><ref id=".I8" name=".I8"></tt> and <tt><ref id=".SMART"
2842   name=".SMART"></tt> commands.
2843
2844
2845 <sect1><tt>.I8</tt><label id=".I8"><p>
2846
2847   Valid only in 65816 mode. Switch the index registers to 8 bit.
2848
2849   Note: This command will not emit any code, it will tell the assembler to
2850   create 8 bit operands for immediate operands.
2851
2852   See also the <tt><ref id=".I16" name=".I16"></tt> and <tt><ref id=".SMART"
2853   name=".SMART"></tt> commands.
2854
2855
2856 <sect1><tt>.IF</tt><label id=".IF"><p>
2857
2858   Conditional assembly: Evaluate an expression and switch assembler output
2859   on or off depending on the expression. The expression must be a constant
2860   expression, that is, all operands must be defined.
2861
2862   A expression value of zero evaluates to FALSE, any other value evaluates
2863   to TRUE.
2864
2865
2866 <sect1><tt>.IFBLANK</tt><label id=".IFBLANK"><p>
2867
2868   Conditional assembly: Check if there are any remaining tokens in this line,
2869   and evaluate to FALSE if this is the case, and to TRUE otherwise. If the
2870   condition is not true, further lines are not assembled until an <tt><ref
2871   id=".ELSE" name=".ESLE"></tt>, <tt><ref id=".ELSEIF" name=".ELSEIF"></tt> or
2872   <tt><ref id=".ENDIF" name=".ENDIF"></tt> directive.
2873
2874   This command is often used to check if a macro parameter was given. Since an
2875   empty macro parameter will evaluate to nothing, the condition will evaluate
2876   to FALSE if an empty parameter was given.
2877
2878   Example:
2879
2880   <tscreen><verb>
2881         .macro     arg1, arg2
2882         .ifblank   arg2
2883                    lda     #arg1
2884         .else
2885                    lda     #arg2
2886         .endif
2887         .endmacro
2888   </verb></tscreen>
2889
2890   See also: <tt><ref id=".BLANK" name=".BLANK"></tt>
2891
2892
2893 <sect1><tt>.IFCONST</tt><label id=".IFCONST"><p>
2894
2895   Conditional assembly: Evaluate an expression and switch assembler output
2896   on or off depending on the constness of the expression.
2897
2898   A const expression evaluates to to TRUE, a non const expression (one
2899   containing an imported or currently undefined symbol) evaluates to
2900   FALSE.
2901
2902   See also: <tt><ref id=".CONST" name=".CONST"></tt>
2903
2904
2905 <sect1><tt>.IFDEF</tt><label id=".IFDEF"><p>
2906
2907   Conditional assembly: Check if a symbol is defined. Must be followed by
2908   a symbol name. The condition is true if the the given symbol is already
2909   defined, and false otherwise.
2910
2911   See also: <tt><ref id=".DEFINED" name=".DEFINED"></tt>
2912
2913
2914 <sect1><tt>.IFNBLANK</tt><label id=".IFNBLANK"><p>
2915
2916   Conditional assembly: Check if there are any remaining tokens in this line,
2917   and evaluate to TRUE if this is the case, and to FALSE otherwise. If the
2918   condition is not true, further lines are not assembled until an <tt><ref
2919   id=".ELSE" name=".ELSE"></tt>, <tt><ref id=".ELSEIF" name=".ELSEIF"></tt> or
2920   <tt><ref id=".ENDIF" name=".ENDIF"></tt> directive.
2921
2922   This command is often used to check if a macro parameter was given.
2923   Since an empty macro parameter will evaluate to nothing, the condition
2924   will evaluate to FALSE if an empty parameter was given.
2925
2926   Example:
2927
2928   <tscreen><verb>
2929         .macro     arg1, arg2
2930                    lda     #arg1
2931         .ifnblank  arg2
2932                    lda     #arg2
2933         .endif
2934         .endmacro
2935   </verb></tscreen>
2936
2937   See also: <tt><ref id=".BLANK" name=".BLANK"></tt>
2938
2939
2940 <sect1><tt>.IFNDEF</tt><label id=".IFNDEF"><p>
2941
2942   Conditional assembly: Check if a symbol is defined. Must be followed by
2943   a symbol name. The condition is true if the the given symbol is not
2944   defined, and false otherwise.
2945
2946   See also: <tt><ref id=".DEFINED" name=".DEFINED"></tt>
2947
2948
2949 <sect1><tt>.IFNREF</tt><label id=".IFNREF"><p>
2950
2951   Conditional assembly: Check if a symbol is referenced. Must be followed
2952   by a symbol name. The condition is true if if the the given symbol was
2953   not referenced before, and false otherwise.
2954
2955   See also: <tt><ref id=".REFERENCED" name=".REFERENCED"></tt>
2956
2957
2958 <sect1><tt>.IFP02</tt><label id=".IFP02"><p>
2959
2960   Conditional assembly: Check if the assembler is currently in 6502 mode
2961   (see <tt><ref id=".P02" name=".P02"></tt> command).
2962
2963
2964 <sect1><tt>.IFP816</tt><label id=".IFP816"><p>
2965
2966   Conditional assembly: Check if the assembler is currently in 65816 mode
2967   (see <tt><ref id=".P816" name=".P816"></tt> command).
2968
2969
2970 <sect1><tt>.IFPC02</tt><label id=".IFPC02"><p>
2971
2972   Conditional assembly: Check if the assembler is currently in 65C02 mode
2973   (see <tt><ref id=".PC02" name=".PC02"></tt> command).
2974
2975
2976 <sect1><tt>.IFPSC02</tt><label id=".IFPSC02"><p>
2977
2978   Conditional assembly: Check if the assembler is currently in 65SC02 mode
2979   (see <tt><ref id=".PSC02" name=".PSC02"></tt> command).
2980
2981
2982 <sect1><tt>.IFREF</tt><label id=".IFREF"><p>
2983
2984   Conditional assembly: Check if a symbol is referenced. Must be followed
2985   by a symbol name. The condition is true if if the the given symbol was
2986   referenced before, and false otherwise.
2987
2988   This command may be used to build subroutine libraries in include files
2989   (you may use separate object modules for this purpose too).
2990
2991   Example:
2992
2993   <tscreen><verb>
2994         .ifref  ToHex                   ; If someone used this subroutine
2995         ToHex:  tay                     ; Define subroutine
2996                 lda     HexTab,y
2997                 rts
2998         .endif
2999   </verb></tscreen>
3000
3001   See also: <tt><ref id=".REFERENCED" name=".REFERENCED"></tt>
3002
3003
3004 <sect1><tt>.IMPORT</tt><label id=".IMPORT"><p>
3005
3006   Import a symbol from another module. The command is followed by a comma
3007   separated list of symbols to import, with each one optionally followed by
3008   an address specification.
3009
3010   Example:
3011
3012   <tscreen><verb>
3013         .import foo
3014         .import bar: zeropage
3015   </verb></tscreen>
3016
3017   See: <tt><ref id=".IMPORTZP" name=".IMPORTZP"></tt>
3018
3019
3020 <sect1><tt>.IMPORTZP</tt><label id=".IMPORTZP"><p>
3021
3022   Import a symbol from another module. The command is followed by a comma
3023   separated list of symbols to import. The symbols are explicitly imported
3024   as zero page symbols (that is, symbols with values in byte range).
3025
3026   Example:
3027
3028   <tscreen><verb>
3029         .importzp       foo, bar
3030   </verb></tscreen>
3031
3032   See: <tt><ref id=".IMPORT" name=".IMPORT"></tt>
3033
3034
3035 <sect1><tt>.INCBIN</tt><label id=".INCBIN"><p>
3036
3037   Include a file as binary data. The command expects a string argument
3038   that is the name of a file to include literally in the current segment.
3039   In addition to that, a start offset and a size value may be specified,
3040   separated by commas. If no size is specified, all of the file from the
3041   start offset to end-of-file is used. If no start position is specified
3042   either, zero is assumed (which means that the whole file is inserted).
3043
3044   Example:
3045
3046   <tscreen><verb>
3047         ; Include whole file
3048         .incbin         "sprites.dat"
3049
3050         ; Include file starting at offset 256
3051         .incbin         "music.dat", $100
3052
3053         ; Read 100 bytes starting at offset 200
3054         .incbin         "graphics.dat", 200, 100
3055   </verb></tscreen>
3056
3057
3058 <sect1><tt>.INCLUDE</tt><label id=".INCLUDE"><p>
3059
3060   Include another file. Include files may be nested up to a depth of 16.
3061
3062   Example:
3063
3064   <tscreen><verb>
3065         .include        "subs.inc"
3066   </verb></tscreen>
3067
3068
3069 <sect1><tt>.INTERRUPTOR</tt><label id=".INTERRUPTOR"><p>
3070
3071   Export a symbol and mark it as an interruptor. This may be used together
3072   with the linker to build a table of interruptor subroutines that are called
3073   in an interrupt.
3074
3075   Note: The linker has a feature to build a table of marked routines, but it
3076   is your code that must call these routines, so just declaring a symbol as
3077   interruptor does nothing by itself.
3078
3079   An interruptor is always exported as an absolute (16 bit) symbol. You don't
3080   need to use an additional <tt/.export/ statement, this is implied by
3081   <tt/.interruptor/. It may have an optional priority that is separated by a
3082   comma. Higher numeric values mean a higher priority. If no priority is
3083   given, the default priority of 7 is used. Be careful when assigning
3084   priorities to your own module constructors so they won't interfere with the
3085   ones in the cc65 library.
3086
3087   Example:
3088
3089   <tscreen><verb>
3090         .interruptor    IrqHandler
3091         .interruptor    Handler, 16
3092   </verb></tscreen>
3093
3094   See the <tt><ref id=".CONDES" name=".CONDES"></tt> command and the separate
3095   section <ref id="condes" name="Module constructors/destructors"> explaining
3096   the feature in more detail.
3097
3098
3099 <sect1><tt>.LINECONT</tt><label id=".LINECONT"><p>
3100
3101   Switch on or off line continuations using the backslash character
3102   before a newline. The option is off by default.
3103   Note: Line continuations do not work in a comment. A backslash at the
3104   end of a comment is treated as part of the comment and does not trigger
3105   line continuation.
3106   The command must be followed by a '+' or '-' character to switch the
3107   option on or off respectively.
3108
3109   Example:
3110
3111   <tscreen><verb>
3112         .linecont       +               ; Allow line continuations
3113
3114         lda     \
3115                 #$20                    ; This is legal now
3116   </verb></tscreen>
3117
3118
3119 <sect1><tt>.LIST</tt><label id=".LIST"><p>
3120
3121   Enable output to the listing. The command must be followed by a boolean
3122   switch ("on", "off", "+" or "-") and will enable or disable listing
3123   output.
3124   The option has no effect if the listing is not enabled by the command line
3125   switch -l. If -l is used, an internal counter is set to 1. Lines are output
3126   to the listing file, if the counter is greater than zero, and suppressed if
3127   the counter is zero. Each use of <tt/.LIST/ will increment or decrement the
3128   counter.
3129
3130   Example:
3131
3132   <tscreen><verb>
3133         .list   on              ; Enable listing output
3134   </verb></tscreen>
3135
3136
3137 <sect1><tt>.LISTBYTES</tt><label id=".LISTBYTES"><p>
3138
3139   Set, how many bytes are shown in the listing for one source line. The
3140   default is 12, so the listing will show only the first 12 bytes for any
3141   source line that generates more than 12 bytes of code or data.
3142   The directive needs an argument, which is either "unlimited", or an
3143   integer constant in the range 4..255.
3144
3145   Examples:
3146
3147   <tscreen><verb>
3148         .listbytes      unlimited       ; List all bytes
3149         .listbytes      12              ; List the first 12 bytes
3150         .incbin         "data.bin"      ; Include large binary file
3151   </verb></tscreen>
3152
3153
3154 <sect1><tt>.LOBYTES</tt><label id=".LOBYTES"><p>
3155
3156   Define byte sized data by extracting only the low byte (that is, bits 0-7) from
3157   each expression.  This is equivalent to <tt><ref id=".BYTE" name=".BYTE"></tt> with
3158   the operator '<' prepended to each expression in its list.
3159
3160   Example:
3161
3162   <tscreen><verb>
3163         .lobytes         $1234, $2345, $3456, $4567
3164         .hibytes         $fedc, $edcb, $dcba, $cba9
3165   </verb></tscreen>
3166
3167   which is equivalent to
3168
3169   <tscreen><verb>
3170         .byte            $34, $45, $56, $67
3171         .byte            $fe, $ed, $dc, $cb
3172   </verb></tscreen>
3173
3174   Example:
3175
3176   <tscreen><verb>
3177         .define MyTable TableItem0, TableItem1, TableItem2, TableItem3
3178
3179         TableLookupLo:   .lobytes MyTable
3180         TableLookupHi:   .hibytes MyTable
3181   </verb></tscreen>
3182
3183   which is equivalent to
3184
3185   <tscreen><verb>
3186         TableLookupLo:   .byte &lt;TableItem0, &lt;TableItem1, &lt;TableItem2, &lt;TableItem3
3187         TableLookupHi:   .byte &gt;TableItem0, &gt;TableItem1, &gt;TableItem2, &gt;TableItem3
3188   </verb></tscreen>
3189
3190   See also: <tt><ref id=".BYTE" name=".BYTE"></tt>,
3191             <tt><ref id=".HIBYTES" name=".HIBYTES"></tt>,
3192             <tt><ref id=".BANKBYTES" name=".BANKBYTES"></tt>
3193
3194
3195 <sect1><tt>.LOCAL</tt><label id=".LOCAL"><p>
3196
3197   This command may only be used inside a macro definition. It declares a
3198   list of identifiers as local to the macro expansion.
3199
3200   A problem when using macros are labels: Since they don't change their name,
3201   you get a "duplicate symbol" error if the macro is expanded the second time.
3202   Labels declared with <tt><ref id=".LOCAL" name=".LOCAL"></tt> have their
3203   name mapped to an internal unique name (<tt/___ABCD__/) with each macro
3204   invocation.
3205
3206   Some other assemblers start a new lexical block inside a macro expansion.
3207   This has some drawbacks however, since that will not allow <em/any/ symbol
3208   to be visible outside a macro, a feature that is sometimes useful. The
3209   <tt><ref id=".LOCAL" name=".LOCAL"></tt> command is in my eyes a better way
3210   to address the problem.
3211
3212   You get an error when using <tt><ref id=".LOCAL" name=".LOCAL"></tt> outside
3213   a macro.
3214
3215
3216 <sect1><tt>.LOCALCHAR</tt><label id=".LOCALCHAR"><p>
3217
3218   Defines the character that start "cheap" local labels. You may use one
3219   of '@' and '?' as start character. The default is '@'.
3220
3221   Cheap local labels are labels that are visible only between two non
3222   cheap labels. This way you can reuse identifiers like "<tt/loop/" without
3223   using explicit lexical nesting.
3224
3225   Example:
3226
3227   <tscreen><verb>
3228         .localchar      '?'
3229
3230         Clear:  lda     #$00            ; Global label
3231         ?Loop:  sta     Mem,y           ; Local label
3232                 dey
3233                 bne     ?Loop           ; Ok
3234                 rts
3235         Sub:    ...                     ; New global label
3236                 bne     ?Loop           ; ERROR: Unknown identifier!
3237   </verb></tscreen>
3238
3239
3240 <sect1><tt>.MACPACK</tt><label id=".MACPACK"><p>
3241
3242   Insert a predefined macro package. The command is followed by an
3243   identifier specifying the macro package to insert. Available macro
3244   packages are:
3245
3246   <tscreen><verb>
3247         atari           Defines the scrcode macro.
3248         cbm             Defines the scrcode macro.
3249         cpu             Defines constants for the .CPU variable.
3250         generic         Defines generic macros like add and sub.
3251         longbranch      Defines conditional long jump macros.
3252   </verb></tscreen>
3253
3254   Including a macro package twice, or including a macro package that
3255   redefines already existing macros will lead to an error.
3256
3257   Example:
3258
3259   <tscreen><verb>
3260         .macpack        longbranch      ; Include macro package
3261
3262                 cmp     #$20            ; Set condition codes
3263                 jne     Label           ; Jump long on condition
3264   </verb></tscreen>
3265
3266   Macro packages are explained in more detail in section <ref
3267   id="macropackages" name="Macro packages">.
3268
3269
3270 <sect1><tt>.MAC, .MACRO</tt><label id=".MACRO"><p>
3271
3272   Start a classic macro definition. The command is followed by an identifier
3273   (the macro name) and optionally by a comma separated list of identifiers
3274   that are macro parameters. A macro definition is terminated by <tt><ref
3275   id=".ENDMACRO" name=".ENDMACRO"></tt>.
3276
3277   Example:
3278
3279   <tscreen><verb>
3280         .macro  ldax    arg             ; Define macro ldax
3281                 lda     arg
3282                 ldx     arg+1
3283   </verb></tscreen>
3284
3285   See: <tt><ref id=".DELMACRO" name=".DELMACRO"></tt>,
3286        <tt><ref id=".ENDMACRO" name=".ENDMACRO"></tt>,
3287        <tt><ref id=".EXITMACRO" name=".EXITMACRO"></tt>
3288
3289   See also section <ref id="macros" name="Macros">.
3290
3291
3292 <sect1><tt>.ORG</tt><label id=".ORG"><p>
3293
3294   Start a section of absolute code. The command is followed by a constant
3295   expression that gives the new PC counter location for which the code is
3296   assembled. Use <tt><ref id=".RELOC" name=".RELOC"></tt> to switch back to
3297   relocatable code.
3298
3299   By default, absolute/relocatable mode is global (valid even when switching
3300   segments). Using <tt>.FEATURE <ref id="org_per_seg" name="org_per_seg"></tt>
3301   it can be made segment local.
3302
3303   Please note that you <em/do not need/ <tt/.ORG/ in most cases. Placing
3304   code at a specific address is the job of the linker, not the assembler, so
3305   there is usually no reason to assemble code to a specific address.
3306
3307   Example:
3308
3309   <tscreen><verb>
3310         .org    $7FF            ; Emit code starting at $7FF
3311   </verb></tscreen>
3312
3313
3314 <sect1><tt>.OUT</tt><label id=".OUT"><p>
3315
3316   Output a string to the console without producing an error. This command
3317   is similar to <tt/.ERROR/, however, it does not force an assembler error
3318   that prevents the creation of an object file.
3319
3320   Example:
3321
3322   <tscreen><verb>
3323         .out    "This code was written by the codebuster(tm)"
3324   </verb></tscreen>
3325
3326   See also: <tt><ref id=".ERROR" name=".ERROR"></tt>,
3327             <tt><ref id=".FATAL" name=".FATAL"></tt>,
3328             <tt><ref id=".WARNING" name=".WARNING"></tt>
3329
3330
3331 <sect1><tt>.P02</tt><label id=".P02"><p>
3332
3333   Enable the 6502 instruction set, disable 65SC02, 65C02 and 65816
3334   instructions. This is the default if not overridden by the
3335   <tt><ref id="option--cpu" name="--cpu"></tt> command line option.
3336
3337   See: <tt><ref id=".PC02" name=".PC02"></tt>, <tt><ref id=".PSC02"
3338   name=".PSC02"></tt> and <tt><ref id=".P816" name=".P816"></tt>
3339
3340
3341 <sect1><tt>.P816</tt><label id=".P816"><p>
3342
3343   Enable the 65816 instruction set. This is a superset of the 65SC02 and
3344   6502 instruction sets.
3345
3346   See: <tt><ref id=".P02" name=".P02"></tt>, <tt><ref id=".PSC02"
3347   name=".PSC02"></tt> and <tt><ref id=".PC02" name=".PC02"></tt>
3348
3349
3350 <sect1><tt>.PAGELEN, .PAGELENGTH</tt><label id=".PAGELENGTH"><p>
3351
3352   Set the page length for the listing. Must be followed by an integer
3353   constant. The value may be "unlimited", or in the range 32 to 127. The
3354   statement has no effect if no listing is generated. The default value is -1
3355   (unlimited) but may be overridden by the <tt/--pagelength/ command line
3356   option. Beware: Since ca65 is a one pass assembler, the listing is generated
3357   after assembly is complete, you cannot use multiple line lengths with one
3358   source. Instead, the value set with the last <tt/.PAGELENGTH/ is used.
3359
3360   Examples:
3361
3362   <tscreen><verb>
3363         .pagelength     66              ; Use 66 lines per listing page
3364
3365         .pagelength     unlimited       ; Unlimited page length
3366   </verb></tscreen>
3367
3368
3369 <sect1><tt>.PC02</tt><label id=".PC02"><p>
3370
3371   Enable the 65C02 instructions set. This instruction set includes all
3372   6502 and 65SC02 instructions.
3373
3374   See: <tt><ref id=".P02" name=".P02"></tt>, <tt><ref id=".PSC02"
3375   name=".PSC02"></tt> and <tt><ref id=".P816" name=".P816"></tt>
3376
3377
3378 <sect1><tt>.POPCPU</tt><label id=".POPCPU"><p>
3379
3380   Pop the last CPU setting from the stack, and activate it.
3381
3382   This command will switch back to the CPU that was last pushed onto the CPU
3383   stack using the <tt><ref id=".PUSHCPU" name=".PUSHCPU"></tt> command, and
3384   remove this entry from the stack.
3385
3386   The assembler will print an error message if the CPU stack is empty when
3387   this command is issued.
3388
3389   See: <tt><ref id=".CPU" name=".CPU"></tt>, <tt><ref id=".PUSHCPU"
3390   name=".PUSHCPU"></tt>, <tt><ref id=".SETCPU" name=".SETCPU"></tt>
3391
3392
3393 <sect1><tt>.POPSEG</tt><label id=".POPSEG"><p>
3394
3395   Pop the last pushed segment from the stack, and set it.
3396
3397   This command will switch back to the segment that was last pushed onto the
3398   segment stack using the <tt><ref id=".PUSHSEG" name=".PUSHSEG"></tt>
3399   command, and remove this entry from the stack.
3400
3401   The assembler will print an error message if the segment stack is empty
3402   when this command is issued.
3403
3404   See: <tt><ref id=".PUSHSEG" name=".PUSHSEG"></tt>
3405
3406
3407 <sect1><tt>.PROC</tt><label id=".PROC"><p>
3408
3409   Start a nested lexical level with the given name and adds a symbol with this
3410   name to the enclosing scope. All new symbols from now on are in the local
3411   lexical level and are accessible from outside only via <ref id="scopesyntax"
3412   name="explicit scope specification">. Symbols defined outside this local
3413   level may be accessed as long as their names are not used for new symbols
3414   inside the level. Symbols names in other lexical levels do not clash, so you
3415   may use the same names for identifiers. The lexical level ends when the
3416   <tt><ref id=".ENDPROC" name=".ENDPROC"></tt> command is read. Lexical levels
3417   may be nested up to a depth of 16 (this is an artificial limit to protect
3418   against errors in the source).
3419
3420   Note: Macro names are always in the global level and in a separate name
3421   space. There is no special reason for this, it's just that I've never
3422   had any need for local macro definitions.
3423
3424   Example:
3425
3426   <tscreen><verb>
3427         .proc   Clear           ; Define Clear subroutine, start new level
3428                 lda     #$00
3429         L1:     sta     Mem,y   ; L1 is local and does not cause a
3430                                 ; duplicate symbol error if used in other
3431                                 ; places
3432                 dey
3433                 bne     L1      ; Reference local symbol
3434                 rts
3435         .endproc                ; Leave lexical level
3436   </verb></tscreen>
3437
3438   See: <tt/<ref id=".ENDPROC" name=".ENDPROC">/ and <tt/<ref id=".SCOPE"
3439   name=".SCOPE">/
3440
3441
3442 <sect1><tt>.PSC02</tt><label id=".PSC02"><p>
3443
3444   Enable the 65SC02 instructions set. This instruction set includes all
3445   6502 instructions.
3446
3447   See: <tt><ref id=".P02" name=".P02"></tt>, <tt><ref id=".PC02"
3448   name=".PC02"></tt> and <tt><ref id=".P816" name=".P816"></tt>
3449
3450
3451 <sect1><tt>.PUSHCPU</tt><label id=".PUSHCPU"><p>
3452
3453   Push the currently active CPU onto a stack. The stack has a size of 8
3454   entries.
3455
3456   <tt/.PUSHCPU/ allows together with <tt><ref id=".POPCPU"
3457   name=".POPCPU"></tt> to switch to another CPU and to restore the old CPU
3458   later, without knowledge of the current CPU setting.
3459
3460   The assembler will print an error message if the CPU stack is already full,
3461   when this command is issued.
3462
3463   See: <tt><ref id=".CPU" name=".CPU"></tt>, <tt><ref id=".POPCPU"
3464   name=".POPCPU"></tt>, <tt><ref id=".SETCPU" name=".SETCPU"></tt>
3465
3466
3467 <sect1><tt>.PUSHSEG</tt><label id=".PUSHSEG"><p>
3468
3469   Push the currently active segment onto a stack. The entries on the stack
3470   include the name of the segment and the segment type. The stack has a size
3471   of 16 entries.
3472
3473   <tt/.PUSHSEG/ allows together with <tt><ref id=".POPSEG" name=".POPSEG"></tt>
3474   to switch to another segment and to restore the old segment later, without
3475   even knowing the name and type of the current segment.
3476
3477   The assembler will print an error message if the segment stack is already
3478   full, when this command is issued.
3479
3480   See: <tt><ref id=".POPSEG" name=".POPSEG"></tt>
3481
3482
3483 <sect1><tt>.RELOC</tt><label id=".RELOC"><p>
3484
3485   Switch back to relocatable mode. See the <tt><ref id=".ORG"
3486   name=".ORG"></tt> command.
3487
3488
3489 <sect1><tt>.REPEAT</tt><label id=".REPEAT"><p>
3490
3491   Repeat all commands between <tt/.REPEAT/ and <tt><ref id=".ENDREPEAT"
3492   name=".ENDREPEAT"></tt> constant number of times. The command is followed by
3493   a constant expression that tells how many times the commands in the body
3494   should get repeated. Optionally, a comma and an identifier may be specified.
3495   If this identifier is found in the body of the repeat statement, it is
3496   replaced by the current repeat count (starting with zero for the first time
3497   the body is repeated).
3498
3499   <tt/.REPEAT/ statements may be nested. If you use the same repeat count
3500   identifier for a nested <tt/.REPEAT/ statement, the one from the inner
3501   level will be used, not the one from the outer level.
3502
3503   Example:
3504
3505   The following macro will emit a string that is "encrypted" in that all
3506   characters of the string are XORed by the value $55.
3507
3508   <tscreen><verb>
3509         .macro  Crypt   Arg
3510                 .repeat .strlen(Arg), I
3511                 .byte   .strat(Arg, I) ^ $55
3512                 .endrep
3513         .endmacro
3514   </verb></tscreen>
3515
3516   See: <tt><ref id=".ENDREPEAT" name=".ENDREPEAT"></tt>
3517
3518
3519 <sect1><tt>.RES</tt><label id=".RES"><p>
3520
3521   Reserve storage. The command is followed by one or two constant
3522   expressions. The first one is mandatory and defines, how many bytes of
3523   storage should be defined. The second, optional expression must by a
3524   constant byte value that will be used as value of the data. If there
3525   is no fill value given, the linker will use the value defined in the
3526   linker configuration file (default: zero).
3527
3528   Example:
3529
3530   <tscreen><verb>
3531         ; Reserve 12 bytes of memory with value $AA
3532         .res    12, $AA
3533   </verb></tscreen>
3534
3535
3536 <sect1><tt>.RODATA</tt><label id=".RODATA"><p>
3537
3538   Switch to the RODATA segment. The name of the RODATA segment is always
3539   "RODATA", so this is a shortcut for
3540
3541   <tscreen><verb>
3542         .segment  "RODATA"
3543   </verb></tscreen>
3544
3545   The RODATA segment is a segment that is used by the compiler for
3546   readonly data like string constants.
3547
3548   See also the <tt><ref id=".SEGMENT" name=".SEGMENT"></tt> command.
3549
3550
3551 <sect1><tt>.SCOPE</tt><label id=".SCOPE"><p>
3552
3553   Start a nested lexical level with the given name. All new symbols from now
3554   on are in the local lexical level and are accessible from outside only via
3555   <ref id="scopesyntax" name="explicit scope specification">. Symbols defined
3556   outside this local level may be accessed as long as their names are not used
3557   for new symbols inside the level. Symbols names in other lexical levels do
3558   not clash, so you may use the same names for identifiers. The lexical level
3559   ends when the <tt><ref id=".ENDSCOPE" name=".ENDSCOPE"></tt> command is
3560   read. Lexical levels may be nested up to a depth of 16 (this is an
3561   artificial limit to protect against errors in the source).
3562
3563   Note: Macro names are always in the global level and in a separate name
3564   space. There is no special reason for this, it's just that I've never
3565   had any need for local macro definitions.
3566
3567   Example:
3568
3569   <tscreen><verb>
3570         .scope  Error                   ; Start new scope named Error
3571                 None = 0                ; No error
3572                 File = 1                ; File error
3573                 Parse = 2               ; Parse error
3574         .endscope                       ; Close lexical level
3575
3576                 ...
3577                 lda #Error::File        ; Use symbol from scope Error
3578   </verb></tscreen>
3579
3580   See: <tt/<ref id=".ENDSCOPE" name=".ENDSCOPE">/ and <tt/<ref id=".PROC"
3581   name=".PROC">/
3582
3583
3584 <sect1><tt>.SEGMENT</tt><label id=".SEGMENT"><p>
3585
3586   Switch to another segment. Code and data is always emitted into a
3587   segment, that is, a named section of data. The default segment is
3588   "CODE". There may be up to 254 different segments per object file
3589   (and up to 65534 per executable). There are shortcut commands for
3590   the most common segments ("CODE", "DATA" and "BSS").
3591
3592   The command is followed by a string containing the segment name (there are
3593   some constraints for the name - as a rule of thumb use only those segment
3594   names that would also be valid identifiers). There may also be an optional
3595   address size separated by a colon. See the section covering <tt/<ref
3596   id="address-sizes" name="address sizes">/ for more information.
3597
3598   The default address size for a segment depends on the memory model specified
3599   on the command line. The default is "absolute", which means that you don't
3600   have to use an address size modifier in most cases.
3601
3602   "absolute" means that the is a segment with 16 bit (absolute) addressing.
3603   That is, the segment will reside somewhere in core memory outside the zero
3604   page. "zeropage" (8 bit) means that the segment will be placed in the zero
3605   page and direct (short) addressing is possible for data in this segment.
3606
3607   Beware: Only labels in a segment with the zeropage attribute are marked
3608   as reachable by short addressing. The `*' (PC counter) operator will
3609   work as in other segments and will create absolute variable values.
3610
3611   Please note that a segment cannot have two different address sizes. A
3612   segment specified as zeropage cannot be declared as being absolute later.
3613
3614   Examples:
3615
3616   <tscreen><verb>
3617         .segment "ROM2"                 ; Switch to ROM2 segment
3618         .segment "ZP2": zeropage        ; New direct segment
3619         .segment "ZP2"                  ; Ok, will use last attribute
3620         .segment "ZP2": absolute        ; Error, redecl mismatch
3621   </verb></tscreen>
3622
3623   See: <tt><ref id=".BSS" name=".BSS"></tt>, <tt><ref id=".CODE"
3624   name=".CODE"></tt>, <tt><ref id=".DATA" name=".DATA"></tt> and <tt><ref
3625   id=".RODATA" name=".RODATA"></tt>
3626
3627
3628 <sect1><tt>.SET</tt><label id=".SET"><p>
3629
3630   <tt/.SET/ is used to assign a value to a variable. See <ref id="variables"
3631   name="Numeric variables"> for a full description.
3632
3633
3634 <sect1><tt>.SETCPU</tt><label id=".SETCPU"><p>
3635
3636   Switch the CPU instruction set. The command is followed by a string that
3637   specifies the CPU. Possible values are those that can also be supplied to
3638   the <tt><ref id="option--cpu" name="--cpu"></tt> command line option,
3639   namely: 6502, 6502X, 65SC02, 65C02, 65816, sunplus and HuC6280. Please
3640   note that support for the sunplus CPU is not available in the freeware
3641   version, because the instruction set of the sunplus CPU is "proprietary
3642   and confidential".
3643
3644   See: <tt><ref id=".CPU" name=".CPU"></tt>,
3645        <tt><ref id=".IFP02" name=".IFP02"></tt>,
3646        <tt><ref id=".IFP816" name=".IFP816"></tt>,
3647        <tt><ref id=".IFPC02" name=".IFPC02"></tt>,
3648        <tt><ref id=".IFPSC02" name=".IFPSC02"></tt>,
3649        <tt><ref id=".P02" name=".P02"></tt>,
3650        <tt><ref id=".P816" name=".P816"></tt>,
3651        <tt><ref id=".PC02" name=".PC02"></tt>,
3652        <tt><ref id=".PSC02" name=".PSC02"></tt>
3653
3654
3655 <sect1><tt>.SMART</tt><label id=".SMART"><p>
3656
3657   Switch on or off smart mode. The command must be followed by a '+' or '-'
3658   character to switch the option on or off respectively. The default is off
3659   (that is, the assembler doesn't try to be smart), but this default may be
3660   changed by the -s switch on the command line.
3661
3662   In smart mode the assembler will do the following:
3663
3664   <itemize>
3665   <item>Track usage of the <tt/REP/ and <tt/SEP/ instructions in 65816 mode
3666         and update the operand sizes accordingly. If the operand of such an
3667         instruction cannot be evaluated by the assembler (for example, because
3668         the operand is an imported symbol), a warning is issued. Beware: Since
3669         the assembler cannot trace the execution flow this may lead to false
3670         results in some cases. If in doubt, use the <tt/.Inn/ and <tt/.Ann/
3671         instructions to tell the assembler about the current settings.
3672   <item>In 65816 mode, replace a <tt/RTS/ instruction by <tt/RTL/ if it is
3673         used within a procedure declared as <tt/far/, or if the procedure has
3674         no explicit address specification, but it is <tt/far/ because of the
3675         memory model used.
3676   </itemize>
3677
3678   Example:
3679
3680   <tscreen><verb>
3681         .smart                          ; Be smart
3682         .smart  -                       ; Stop being smart
3683   </verb></tscreen>
3684
3685   See: <tt><ref id=".A16" name=".A16"></tt>,
3686        <tt><ref id=".A8" name=".A8"></tt>,
3687        <tt><ref id=".I16" name=".I16"></tt>,
3688        <tt><ref id=".I8" name=".I8"></tt>
3689
3690
3691 <sect1><tt>.STRUCT</tt><label id=".STRUCT"><p>
3692
3693   Starts a struct definition. Structs are covered in a separate section named
3694   <ref id="structs" name="&quot;Structs and unions&quot;">.
3695
3696   See also: <tt><ref id=".ENDSTRUCT" name=".ENDSTRUCT"></tt>,
3697             <tt><ref id=".ENDUNION" name=".ENDUNION"></tt>,
3698             <tt><ref id=".UNION" name=".UNION"></tt>
3699
3700
3701 <sect1><tt>.SUNPLUS</tt><label id=".SUNPLUS"><p>
3702
3703   Enable the SunPlus instructions set. This command will not work in the
3704   freeware version of the assembler, because the instruction set is
3705   "proprietary and confidential".
3706
3707   See: <tt><ref id=".P02" name=".P02"></tt>, <tt><ref id=".PSC02"
3708   name=".PSC02"></tt>, <tt><ref id=".PC02" name=".PC02"></tt>, and
3709   <tt><ref id=".P816" name=".P816"></tt>
3710
3711
3712 <sect1><tt>.TAG</tt><label id=".TAG"><p>
3713
3714   Allocate space for a struct or union.
3715
3716   Example:
3717
3718   <tscreen><verb>
3719         .struct Point
3720                 xcoord  .word
3721                 ycoord  .word
3722         .endstruct
3723
3724         .bss
3725                 .tag    Point           ; Allocate 4 bytes
3726   </verb></tscreen>
3727
3728
3729 <sect1><tt>.UNDEF, .UNDEFINE</tt><label id=".UNDEFINE"><p>
3730
3731   Delete a define style macro definition. The command is followed by an
3732   identifier which specifies the name of the macro to delete. Macro
3733   replacement is switched of when reading the token following the command
3734   (otherwise the macro name would be replaced by its replacement list).
3735
3736   See also the <tt><ref id=".DEFINE" name=".DEFINE"></tt> command and
3737   section <ref id="macros" name="Macros">.
3738
3739
3740 <sect1><tt>.UNION</tt><label id=".UNION"><p>
3741
3742   Starts a union definition. Unions are covered in a separate section named
3743   <ref id="structs" name="&quot;Structs and unions&quot;">.
3744
3745   See also: <tt><ref id=".ENDSTRUCT" name=".ENDSTRUCT"></tt>,
3746             <tt><ref id=".ENDUNION" name=".ENDUNION"></tt>,
3747             <tt><ref id=".STRUCT" name=".STRUCT"></tt>
3748
3749
3750 <sect1><tt>.WARNING</tt><label id=".WARNING"><p>
3751
3752   Force an assembly warning. The assembler will output a warning message
3753   preceded by "User warning". This warning will always be output, even if
3754   other warnings are disabled with the <tt><ref id="option-W" name="-W0"></tt>
3755   command line option.
3756
3757   This command may be used to output possible problems when assembling
3758   the source file.
3759
3760   Example:
3761
3762   <tscreen><verb>
3763         .macro  jne     target
3764                 .local L1
3765                 .ifndef target
3766                 .warning "Forward jump in jne, cannot optimize!"
3767                 beq     L1
3768                 jmp     target
3769         L1:
3770                 .else
3771                 ...
3772                 .endif
3773         .endmacro
3774   </verb></tscreen>
3775
3776   See also: <tt><ref id=".ERROR" name=".ERROR"></tt>,
3777             <tt><ref id=".FATAL" name=".FATAL"></tt>,
3778             <tt><ref id=".OUT" name=".OUT"></tt>
3779
3780
3781 <sect1><tt>.WORD</tt><label id=".WORD"><p>
3782
3783   Define word sized data. Must be followed by a sequence of (word ranged,
3784   but not necessarily constant) expressions.
3785
3786   Example:
3787
3788   <tscreen><verb>
3789         .word   $0D00, $AF13, _Clear
3790   </verb></tscreen>
3791
3792
3793 <sect1><tt>.ZEROPAGE</tt><label id=".ZEROPAGE"><p>
3794
3795   Switch to the ZEROPAGE segment and mark it as direct (zeropage) segment.
3796   The name of the ZEROPAGE segment is always "ZEROPAGE", so this is a
3797   shortcut for
3798
3799   <tscreen><verb>
3800         .segment  "ZEROPAGE", zeropage
3801   </verb></tscreen>
3802
3803   Because of the "zeropage" attribute, labels declared in this segment are
3804   addressed using direct addressing mode if possible. You <em/must/ instruct
3805   the linker to place this segment somewhere in the address range 0..$FF
3806   otherwise you will get errors.
3807
3808   See: <tt><ref id=".SEGMENT" name=".SEGMENT"></tt>
3809
3810
3811
3812 <sect>Macros<label id="macros"><p>
3813
3814
3815 <sect1>Introduction<p>
3816
3817 Macros may be thought of as "parametrized super instructions". Macros are
3818 sequences of tokens that have a name. If that name is used in the source
3819 file, the macro is "expanded", that is, it is replaced by the tokens that
3820 were specified when the macro was defined.
3821
3822
3823 <sect1>Macros without parameters<p>
3824
3825 In its simplest form, a macro does not have parameters. Here's an
3826 example:
3827
3828 <tscreen><verb>
3829         .macro  asr             ; Arithmetic shift right
3830                 cmp     #$80    ; Put bit 7 into carry
3831                 ror             ; Rotate right with carry
3832         .endmacro
3833 </verb></tscreen>
3834
3835 The macro above consists of two real instructions, that are inserted into
3836 the code, whenever the macro is expanded. Macro expansion is simply done
3837 by using the name, like this:
3838
3839 <tscreen><verb>
3840         lda     $2010
3841         asr
3842         sta     $2010
3843 </verb></tscreen>
3844
3845
3846 <sect1>Parametrized macros<p>
3847
3848 When using macro parameters, macros can be even more useful:
3849
3850 <tscreen><verb>
3851         .macro  inc16   addr
3852                 clc
3853                 lda     addr
3854                 adc     #$01
3855                 sta     addr
3856                 lda     addr+1
3857                 adc     #$00
3858                 sta     addr+1
3859         .endmacro
3860 </verb></tscreen>
3861
3862 When calling the macro, you may give a parameter, and each occurrence of
3863 the name "addr" in the macro definition will be replaced by the given
3864 parameter. So
3865
3866 <tscreen><verb>
3867         inc16   $1000
3868 </verb></tscreen>
3869
3870 will be expanded to
3871
3872 <tscreen><verb>
3873                 clc
3874                 lda     $1000
3875                 adc     #$01
3876                 sta     $1000
3877                 lda     $1000+1
3878                 adc     #$00
3879                 sta     $1000+1
3880 </verb></tscreen>
3881
3882 A macro may have more than one parameter, in this case, the parameters
3883 are separated by commas. You are free to give less parameters than the
3884 macro actually takes in the definition. You may also leave intermediate
3885 parameters empty. Empty parameters are replaced by empty space (that is,
3886 they are removed when the macro is expanded). If you have a look at our
3887 macro definition above, you will see, that replacing the "addr" parameter
3888 by nothing will lead to wrong code in most lines. To help you, writing
3889 macros with a variable parameter list, there are some control commands:
3890
3891 <tt><ref id=".IFBLANK" name=".IFBLANK"></tt> tests the rest of the line and
3892 returns true, if there are any tokens on the remainder of the line. Since
3893 empty parameters are replaced by nothing, this may be used to test if a given
3894 parameter is empty. <tt><ref id=".IFNBLANK" name=".IFNBLANK"></tt> tests the
3895 opposite.
3896
3897 Look at this example:
3898
3899 <tscreen><verb>
3900         .macro  ldaxy   a, x, y
3901         .ifnblank       a
3902                 lda     #a
3903         .endif
3904         .ifnblank       x
3905                 ldx     #x
3906         .endif
3907         .ifnblank       y
3908                 ldy     #y
3909         .endif
3910         .endmacro
3911 </verb></tscreen>
3912
3913 This macro may be called as follows:
3914
3915 <tscreen><verb>
3916         ldaxy   1, 2, 3         ; Load all three registers
3917
3918         ldaxy   1, , 3          ; Load only a and y
3919
3920         ldaxy   , , 3           ; Load y only
3921 </verb></tscreen>
3922
3923 There's another helper command for determining, which macro parameters are
3924 valid: <tt><ref id=".PARAMCOUNT" name=".PARAMCOUNT"></tt> This command is
3925 replaced by the parameter count given, <em/including/ intermediate empty macro
3926 parameters:
3927
3928 <tscreen><verb>
3929         ldaxy   1               ; .PARAMCOUNT = 1
3930         ldaxy   1,,3            ; .PARAMCOUNT = 3
3931         ldaxy   1,2             ; .PARAMCOUNT = 2
3932         ldaxy   1,              ; .PARAMCOUNT = 2
3933         ldaxy   1,2,3           ; .PARAMCOUNT = 3
3934 </verb></tscreen>
3935
3936 Macro parameters may optionally be enclosed into curly braces. This allows the
3937 inclusion of tokens that would otherwise terminate the parameter (the comma in
3938 case of a macro parameter).
3939
3940 <tscreen><verb>
3941         .macro  foo     arg1, arg2
3942                 ...
3943         .endmacro
3944
3945                 foo     ($00,x)         ; Two parameters passed
3946                 foo     {($00,x)}       ; One parameter passed
3947 </verb></tscreen>
3948
3949 In the first case, the macro is called with two parameters: '<tt/(&dollar;00/'
3950 and 'x)'. The comma is not passed to the macro, since it is part of the
3951 calling sequence, not the parameters.
3952
3953 In the second case, '(&dollar;00,x)' is passed to the macro, this time
3954 including the comma.
3955
3956
3957 <sect1>Detecting parameter types<p>
3958
3959 Sometimes it is nice to write a macro that acts differently depending on the
3960 type of the argument supplied. An example would be a macro that loads a 16 bit
3961 value from either an immediate operand, or from memory. The <tt/<ref
3962 id=".MATCH" name=".MATCH">/ and <tt/<ref id=".XMATCH" name=".XMATCH">/
3963 functions will allow you to do exactly this:
3964
3965 <tscreen><verb>
3966         .macro  ldax    arg
3967                 .if (.match (.left (1, {arg}), #))
3968                     ; immediate mode
3969                     lda     #<(.right (.tcount ({arg})-1, {arg}))
3970                     ldx     #>(.right (.tcount ({arg})-1, {arg}))
3971                 .else
3972                     ; assume absolute or zero page
3973                     lda     arg
3974                     ldx     1+(arg)
3975                 .endif
3976         .endmacro
3977 </verb></tscreen>
3978
3979 Using the <tt/<ref id=".MATCH" name=".MATCH">/ function, the macro is able to
3980 check if its argument begins with a hash mark. If so, two immediate loads are
3981 emitted, Otherwise a load from an absolute zero page memory location is
3982 assumed. Please note how the curly braces are used to enclose parameters to
3983 pseudo functions handling token lists. This is necessary, because the token
3984 lists may include commas or parens, which would be treated by the assembler
3985 as end-of-list.
3986
3987 The macro can be used as
3988
3989 <tscreen><verb>
3990         foo:    .word   $5678
3991         ...
3992                 ldax    #$1234          ; X=$12, A=$34
3993         ...
3994                 ldax    foo             ; X=$56, A=$78
3995 </verb></tscreen>
3996
3997
3998 <sect1>Recursive macros<p>
3999
4000 Macros may be used recursively:
4001
4002 <tscreen><verb>
4003         .macro  push    r1, r2, r3
4004                 lda     r1
4005                 pha
4006         .if     .paramcount > 1
4007                 push    r2, r3
4008         .endif
4009         .endmacro
4010 </verb></tscreen>
4011
4012 There's also a special macro to help writing recursive macros: <tt><ref
4013 id=".EXITMACRO" name=".EXITMACRO"></tt> This command will stop macro expansion
4014 immediately:
4015
4016 <tscreen><verb>
4017         .macro  push    r1, r2, r3, r4, r5, r6, r7
4018         .ifblank        r1
4019                 ; First parameter is empty
4020                 .exitmacro
4021         .else
4022                 lda     r1
4023                 pha
4024         .endif
4025                 push    r2, r3, r4, r5, r6, r7
4026         .endmacro
4027 </verb></tscreen>
4028
4029 When expanding this macro, the expansion will push all given parameters
4030 until an empty one is encountered. The macro may be called like this:
4031
4032 <tscreen><verb>
4033         push    $20, $21, $32           ; Push 3 ZP locations
4034         push    $21                     ; Push one ZP location
4035 </verb></tscreen>
4036
4037
4038 <sect1>Local symbols inside macros<p>
4039
4040 Now, with recursive macros, <tt><ref id=".IFBLANK" name=".IFBLANK"></tt> and
4041 <tt><ref id=".PARAMCOUNT" name=".PARAMCOUNT"></tt>, what else do you need?
4042 Have a look at the inc16 macro above. Here is it again:
4043
4044 <tscreen><verb>
4045         .macro  inc16   addr
4046                 clc
4047                 lda     addr
4048                 adc     #$01
4049                 sta     addr
4050                 lda     addr+1
4051                 adc     #$00
4052                 sta     addr+1
4053         .endmacro
4054 </verb></tscreen>
4055
4056 If you have a closer look at the code, you will notice, that it could be
4057 written more efficiently, like this:
4058
4059 <tscreen><verb>
4060         .macro  inc16   addr
4061                 inc     addr
4062                 bne     Skip
4063                 inc     addr+1
4064         Skip:
4065         .endmacro
4066 </verb></tscreen>
4067
4068 But imagine what happens, if you use this macro twice? Since the label
4069 "Skip" has the same name both times, you get a "duplicate symbol" error.
4070 Without a way to circumvent this problem, macros are not as useful, as
4071 they could be. One solution is, to start a new lexical block inside the
4072 macro:
4073
4074 <tscreen><verb>
4075         .macro  inc16   addr
4076         .proc
4077                 inc     addr
4078                 bne     Skip
4079                 inc     addr+1
4080         Skip:
4081         .endproc
4082         .endmacro
4083 </verb></tscreen>
4084
4085 Now the label is local to the block and not visible outside. However,
4086 sometimes you want a label inside the macro to be visible outside. To make
4087 that possible, there's a new command that's only usable inside a macro
4088 definition: <tt><ref id=".LOCAL" name=".LOCAL"></tt>. <tt/.LOCAL/ declares one
4089 or more symbols as local to the macro expansion. The names of local variables
4090 are replaced by a unique name in each separate macro expansion. So we could
4091 also solve the problem above by using <tt/.LOCAL/:
4092
4093 <tscreen><verb>
4094         .macro  inc16   addr
4095                 .local  Skip            ; Make Skip a local symbol
4096                 clc
4097                 lda     addr
4098                 adc     #$01
4099                 sta     addr
4100                 bcc     Skip
4101                 inc     addr+1
4102         Skip:                           ; Not visible outside
4103         .endmacro
4104 </verb></tscreen>
4105
4106
4107 <sect1>C style macros<p>
4108
4109 Starting with version 2.5 of the assembler, there is a second macro type
4110 available: C style macros using the <tt/.DEFINE/ directive. These macros are
4111 similar to the classic macro type described above, but behaviour is sometimes
4112 different:
4113
4114 <itemize>
4115
4116 <item>  Macros defined with <tt><ref id=".DEFINE" name=".DEFINE"></tt> may not
4117         span more than a line. You may use line continuation (see <tt><ref
4118         id=".LINECONT" name=".LINECONT"></tt>) to spread the definition over
4119         more than one line for increased readability, but the macro itself
4120         may not contain an end-of-line token.
4121
4122 <item>  Macros defined with <tt><ref id=".DEFINE" name=".DEFINE"></tt> share
4123         the name space with classic macros, but they are detected and replaced
4124         at the scanner level. While classic macros may be used in every place,
4125         where a mnemonic or other directive is allowed, <tt><ref id=".DEFINE"
4126         name=".DEFINE"></tt> style macros are allowed anywhere in a line. So
4127         they are more versatile in some situations.
4128
4129 <item>  <tt><ref id=".DEFINE" name=".DEFINE"></tt> style macros may take
4130         parameters. While classic macros may have empty parameters, this is
4131         not true for <tt><ref id=".DEFINE" name=".DEFINE"></tt> style macros.
4132         For this macro type, the number of actual parameters must match
4133         exactly the number of formal parameters.
4134
4135         To make this possible, formal parameters are enclosed in braces when
4136         defining the macro. If there are no parameters, the empty braces may
4137         be omitted.
4138
4139 <item>  Since <tt><ref id=".DEFINE" name=".DEFINE"></tt> style macros may not
4140         contain end-of-line tokens, there are things that cannot be done. They
4141         may not contain several processor instructions for example. So, while
4142         some things may be done with both macro types, each type has special
4143         usages. The types complement each other.
4144
4145 </itemize>
4146
4147 Let's look at a few examples to make the advantages and disadvantages
4148 clear.
4149
4150 To emulate assemblers that use "<tt/EQU/" instead of "<tt/=/" you may use the
4151 following <tt/.DEFINE/:
4152
4153 <tscreen><verb>
4154         .define EQU     =
4155
4156         foo     EQU     $1234           ; This is accepted now
4157 </verb></tscreen>
4158
4159 You may use the directive to define string constants used elsewhere:
4160
4161 <tscreen><verb>
4162         ; Define the version number
4163         .define VERSION         "12.3a"
4164
4165         ; ... and use it
4166         .asciiz VERSION
4167 </verb></tscreen>
4168
4169 Macros with parameters may also be useful:
4170
4171 <tscreen><verb>
4172         .define DEBUG(message)  .out    message
4173
4174         DEBUG   "Assembling include file #3"
4175 </verb></tscreen>
4176
4177 Note that, while formal parameters have to be placed in braces, this is
4178 not true for the actual parameters. Beware: Since the assembler cannot
4179 detect the end of one parameter, only the first token is used. If you
4180 don't like that, use classic macros instead:
4181
4182 <tscreen><verb>
4183         .macro  message
4184                 .out    message
4185         .endmacro
4186 </verb></tscreen>
4187
4188 (This is an example where a problem can be solved with both macro types).
4189
4190
4191 <sect1>Characters in macros<p>
4192
4193 When using the <ref id="option-t" name="-t"> option, characters are translated
4194 into the target character set of the specific machine. However, this happens
4195 as late as possible. This means that strings are translated if they are part
4196 of a <tt><ref id=".BYTE" name=".BYTE"></tt> or <tt><ref id=".ASCIIZ"
4197 name=".ASCIIZ"></tt> command. Characters are translated as soon as they are
4198 used as part of an expression.
4199
4200 This behaviour is very intuitive outside of macros but may be confusing when
4201 doing more complex macros. If you compare characters against numeric values,
4202 be sure to take the translation into account.
4203
4204
4205 <sect1>Deleting macros<p>
4206
4207 Macros can be deleted. This will not work if the macro that should be deleted
4208 is currently expanded as in the following non working example:
4209
4210 <tscreen><verb>
4211         .macro  notworking
4212                 .delmacro       notworking
4213         .endmacro
4214
4215         notworking              ; Will not work
4216 </verb></tscreen>
4217
4218 The commands to delete classic and define style macros differ. Classic macros
4219 can be deleted by use of <tt><ref id=".DELMACRO" name=".DELMACRO"></tt>, while
4220 for <tt><ref id=".DEFINE" name=".DEFINE"></tt> style macros, <tt><ref
4221 id=".UNDEFINE" name=".UNDEFINE"></tt> must be used. Example:
4222
4223 <tscreen><verb>
4224         .define value   1
4225         .macro  mac
4226                 .byte   2
4227         .endmacro
4228
4229                 .byte   value           ; Emit one byte with value 1
4230                 mac                     ; Emit another byte with value 2
4231
4232         .undefine value
4233         .delmacro mac
4234
4235                 .byte   value           ; Error: Unknown identifier
4236                 mac                     ; Error: Missing ":"
4237 </verb></tscreen>
4238
4239 A separate command for <tt>.DEFINE</tt> style macros was necessary, because
4240 the name of such a macro is replaced by its replacement list on a very low
4241 level. To get the actual name, macro replacement has to be switched off when
4242 reading the argument to <tt>.UNDEFINE</tt>. This does also mean that the
4243 argument to <tt>.UNDEFINE</tt> is not allowed to come from another
4244 <tt>.DEFINE</tt>. All this is not necessary for classic macros, so having two
4245 different commands increases flexibility.
4246
4247
4248 <sect>Macro packages<label id="macropackages"><p>
4249
4250 Using the <tt><ref id=".MACPACK" name=".MACPACK"></tt> directive, predefined
4251 macro packages may be included with just one command. Available macro packages
4252 are:
4253
4254
4255 <sect1><tt>.MACPACK generic</tt><p>
4256
4257 This macro package defines macros that are useful in almost any program.
4258 Currently defined macros are:
4259
4260 <tscreen><verb>
4261         .macro  add     Arg
4262                 clc
4263                 adc     Arg
4264         .endmacro
4265
4266         .macro  sub     Arg
4267                 sec
4268                 sbc     Arg
4269         .endmacro
4270
4271         .macro  bge     Arg
4272                 bcs     Arg
4273         .endmacro
4274
4275         .macro  blt     Arg
4276                 bcc     Arg
4277         .endmacro
4278
4279         .macro  bgt     Arg
4280                 .local  L
4281                 beq     L
4282                 bcs     Arg
4283         L:
4284         .endmacro
4285
4286         .macro  ble     Arg
4287                 beq     Arg
4288                 bcc     Arg
4289         .endmacro
4290 </verb></tscreen>
4291
4292
4293 <sect1><tt>.MACPACK longbranch</tt><p>
4294
4295 This macro package defines long conditional jumps. They are named like the
4296 short counterpart but with the 'b' replaced by a 'j'. Here is a sample
4297 definition for the "<tt/jeq/" macro, the other macros are built using the same
4298 scheme:
4299
4300 <tscreen><verb>
4301         .macro  jeq     Target
4302                 .if     .def(Target) .and ((*+2)-(Target) <= 127)
4303                 beq     Target
4304                 .else
4305                 bne     *+5
4306                 jmp     Target
4307                 .endif
4308         .endmacro
4309 </verb></tscreen>
4310
4311 All macros expand to a short branch, if the label is already defined (back
4312 jump) and is reachable with a short jump. Otherwise the macro expands to a
4313 conditional branch with the branch condition inverted, followed by an absolute
4314 jump to the actual branch target.
4315
4316 The package defines the following macros:
4317
4318 <tscreen><verb>
4319         jeq, jne, jmi, jpl, jcs, jcc, jvs, jvc
4320 </verb></tscreen>
4321
4322
4323
4324 <sect1><tt>.MACPACK atari</tt><p>
4325
4326 The atari macro package will define a macro named <tt/scrcode/. It takes a
4327 string as argument and places this string into memory translated into screen
4328 codes.
4329
4330
4331 <sect1><tt>.MACPACK cbm</tt><p>
4332
4333 The cbm macro package will define a macro named <tt/scrcode/. It takes a
4334 string as argument and places this string into memory translated into screen
4335 codes.
4336
4337
4338 <sect1><tt>.MACPACK cpu</tt><p>
4339
4340 This macro package does not define any macros but constants used to examine
4341 the value read from the <tt/<ref id=".CPU" name=".CPU">/ pseudo variable. For
4342 each supported CPU a constant similar to
4343
4344 <tscreen><verb>
4345     CPU_6502
4346     CPU_65SC02
4347     CPU_65C02
4348     CPU_65816
4349     CPU_SUNPLUS
4350     CPU_SWEET16
4351     CPU_HUC6280
4352 </verb></tscreen>
4353
4354 is defined. These constants may be used to determine the exact type of the
4355 currently enabled CPU. In addition to that, for each CPU instruction set,
4356 another constant is defined:
4357
4358 <tscreen><verb>
4359     CPU_ISET_6502
4360     CPU_ISET_65SC02
4361     CPU_ISET_65C02
4362     CPU_ISET_65816
4363     CPU_ISET_SUNPLUS
4364     CPU_ISET_SWEET16
4365     CPU_ISET_HUC6280
4366 </verb></tscreen>
4367
4368 The value read from the <tt/<ref id=".CPU" name=".CPU">/ pseudo variable may
4369 be checked with <tt/<ref id="operators" name=".BITAND">/ to determine if the
4370 currently enabled CPU supports a specific instruction set. For example the
4371 65C02 supports all instructions of the 65SC02 CPU, so it has the
4372 <tt/CPU_ISET_65SC02/ bit set in addition to its native <tt/CPU_ISET_65C02/
4373 bit. Using
4374
4375 <tscreen><verb>
4376         .if (.cpu .bitand CPU_ISET_65SC02)
4377                 lda     (sp)
4378         .else
4379                 ldy     #$00
4380                 lda     (sp),y
4381         .endif
4382 </verb></tscreen>
4383
4384 it is possible to determine if the
4385
4386 <tscreen><verb>
4387                 lda     (sp)
4388 </verb></tscreen>
4389
4390 instruction is supported, which is the case for the 65SC02, 65C02 and 65816
4391 CPUs (the latter two are upwards compatible to the 65SC02).
4392
4393
4394
4395 <sect>Predefined constants<label id="predefined-constants"><p>
4396
4397 For better orthogonality, the assembler defines similar symbols as the
4398 compiler, depending on the target system selected:
4399
4400 <itemize>
4401 <item><tt/__APPLE2__/ - Target system is <tt/apple2/
4402 <item><tt/__APPLE2ENH__/ - Target system is <tt/apple2enh/
4403 <item><tt/__ATARI__/ - Target system is <tt/atari/
4404 <item><tt/__ATMOS__/ - Target system is <tt/atmos/
4405 <item><tt/__BBC__/ - Target system is <tt/bbc/
4406 <item><tt/__C128__/ - Target system is <tt/c128/
4407 <item><tt/__C16__/ - Target system is <tt/c16/
4408 <item><tt/__C64__/ - Target system is <tt/c64/
4409 <item><tt/__CBM__/ - Target is a Commodore system
4410 <item><tt/__CBM510__/ - Target system is <tt/cbm510/
4411 <item><tt/__CBM610__/ - Target system is <tt/cbm610/
4412 <item><tt/__GEOS__/ - Target system is <tt/geos/
4413 <item><tt/__LUNIX__/ - Target system is <tt/lunix/
4414 <item><tt/__NES__/ - Target system is <tt/nes/
4415 <item><tt/__PET__/ - Target system is <tt/pet/
4416 <item><tt/__PLUS4__/ - Target system is <tt/plus4/
4417 <item><tt/__SUPERVISION__/ - Target system is <tt/supervision/
4418 <item><tt/__VIC20__/ - Target system is <tt/vic20/
4419 </itemize>
4420
4421
4422 <sect>Structs and unions<label id="structs"><p>
4423
4424 <sect1>Structs and unions Overview<p>
4425
4426 Structs and unions are special forms of <ref id="scopes" name="scopes">.  They
4427 are to some degree comparable to their C counterparts. Both have a list of
4428 members. Each member allocates storage and may optionally have a name, which,
4429 in case of a struct, is the offset from the beginning and, in case of a union,
4430 is always zero.
4431
4432
4433 <sect1>Declaration<p>
4434
4435 Here is an example for a very simple struct with two members and a total size
4436 of 4 bytes:
4437
4438 <tscreen><verb>
4439       .struct Point
4440               xcoord  .word
4441               ycoord  .word
4442       .endstruct
4443 </verb></tscreen>
4444
4445 A union shares the total space between all its members, its size is the same
4446 as that of the largest member. The offset of all members relative to the union
4447 is zero.
4448
4449 <tscreen><verb>
4450       .union  Entry
4451               index   .word
4452               ptr     .addr
4453       .endunion
4454 </verb></tscreen>
4455
4456 A struct or union must not necessarily have a name. If it is anonymous, no
4457 local scope is opened, the identifiers used to name the members are placed
4458 into the current scope instead.
4459
4460 A struct may contain unnamed members and definitions of local structs. The
4461 storage allocators may contain a multiplier, as in the example below:
4462
4463 <tscreen><verb>
4464       .struct Circle
4465               .struct Point
4466                       .word   2         ; Allocate two words
4467               .endstruct
4468               Radius  .word
4469       .endstruct
4470 </verb></tscreen>
4471
4472
4473 <sect1>The <tt/.TAG/ keyword<p>
4474
4475 Using the <ref id=".TAG" name=".TAG"> keyword, it is possible to reserve space
4476 for an already defined struct or unions within another struct:
4477
4478 <tscreen><verb>
4479       .struct Point
4480               xcoord  .word
4481               ycoord  .word
4482       .endstruct
4483
4484       .struct Circle
4485               Origin  .tag    Point
4486               Radius  .byte
4487       .endstruct
4488 </verb></tscreen>
4489
4490 Space for a struct or union may be allocated using the <ref id=".TAG"
4491 name=".TAG"> directive.
4492
4493 <tscreen><verb>
4494         C:      .tag    Circle
4495 </verb></tscreen>
4496
4497 Currently, members are just offsets from the start of the struct or union. To
4498 access a field of a struct, the member offset has to be added to the address
4499 of the struct itself:
4500
4501 <tscreen><verb>
4502         lda     C+Circle::Radius        ; Load circle radius into A
4503 </verb></tscreen>
4504
4505 This may change in a future version of the assembler.
4506
4507
4508 <sect1>Limitations<p>
4509
4510 Structs and unions are currently implemented as nested symbol tables (in fact,
4511 they were a by-product of the improved scoping rules). Currently, the
4512 assembler has no idea of types. This means that the <ref id=".TAG"
4513 name=".TAG"> keyword will only allocate space. You won't be able to initialize
4514 variables declared with <ref id=".TAG" name=".TAG">, and adding an embedded
4515 structure to another structure with <ref id=".TAG" name=".TAG"> will not make
4516 this structure accessible by using the '::' operator.
4517
4518
4519
4520 <sect>Module constructors/destructors<label id="condes"><p>
4521
4522 <em>Note:</em> This section applies mostly to C programs, so the explanation
4523 below uses examples from the C libraries. However, the feature may also be
4524 useful for assembler programs.
4525
4526
4527 <sect1>Module constructors/destructors Overview<p>
4528
4529 Using the <tt><ref id=".CONSTRUCTOR" name=".CONSTRUCTOR"></tt>, <tt><ref
4530 id=".DESTRUCTOR" name=".DESTRUCTOR"></tt> and <tt><ref id=".INTERRUPTOR"
4531 name=".INTERRUPTOR"></tt> keywords it it possible to export functions in a
4532 special way. The linker is able to generate tables with all functions of a
4533 specific type. Such a table will <em>only</em> include symbols from object
4534 files that are linked into a specific executable. This may be used to add
4535 initialization and cleanup code for library modules, or a table of interrupt
4536 handler functions.
4537
4538 The C heap functions are an example where module initialization code is used.
4539 All heap functions (<tt>malloc</tt>, <tt>free</tt>, ...) work with a few
4540 variables that contain the start and the end of the heap, pointers to the free
4541 list and so on. Since the end of the heap depends on the size and start of the
4542 stack, it must be initialized at runtime. However, initializing these
4543 variables for programs that do not use the heap are a waste of time and
4544 memory.
4545
4546 So the central module defines a function that contains initialization code and
4547 exports this function using the <tt/.CONSTRUCTOR/ statement. If (and only if)
4548 this module is added to an executable by the linker, the initialization
4549 function will be placed into the table of constructors by the linker. The C
4550 startup code will call all constructors before <tt/main/ and all destructors
4551 after <tt/main/, so without any further work, the heap initialization code is
4552 called once the module is linked in.
4553
4554 While it would be possible to add explicit calls to initialization functions
4555 in the startup code, the new approach has several advantages:
4556
4557 <enum>
4558 <item>
4559 If a module is not included, the initialization code is not linked in and not
4560 called. So you don't pay for things you don't need.
4561
4562 <item>
4563 Adding another library that needs initialization does not mean that the
4564 startup code has to be changed. Before we had module constructors and
4565 destructors, the startup code for all systems had to be adjusted to call the
4566 new initialization code.
4567
4568 <item>
4569 The feature saves memory: Each additional initialization function needs just
4570 two bytes in the table (a pointer to the function).
4571
4572 </enum>
4573
4574
4575 <sect1>Calling order<p>
4576
4577 The symbols are sorted in increasing priority order by the linker when using
4578 one of the builtin linker configurations, so the functions with lower
4579 priorities come first and are followed by those with higher priorities. The C
4580 library runtime subroutine that walks over the function tables calls the
4581 functions starting from the top of the table - which means that functions with
4582 a high priority are called first.
4583
4584 So when using the C runtime, functions are called with high priority functions
4585 first, followed by low priority functions.
4586
4587
4588 <sect1>Pitfalls<p>
4589
4590 When using these special symbols, please take care of the following:
4591
4592 <itemize>
4593
4594 <item>
4595 The linker will only generate function tables, it will not generate code to
4596 call these functions. If you're using the feature in some other than the
4597 existing C environments, you have to write code to call all functions in a
4598 linker generated table yourself. See the <tt/condes/ and <tt/callirq/ modules
4599 in the C runtime for an example on how to do this.
4600
4601 <item>
4602 The linker will only add addresses of functions that are in modules linked to
4603 the executable. This means that you have to be careful where to place the
4604 condes functions. If initialization or an irq handler is needed for a group of
4605 functions, be sure to place the function into a module that is linked in
4606 regardless of which function is called by the user.
4607
4608 <item>
4609 The linker will generate the tables only when requested to do so by the
4610 <tt/FEATURE CONDES/ statement in the linker config file. Each table has to
4611 be requested separately.
4612
4613 <item>
4614 Constructors and destructors may have priorities. These priorities determine
4615 the order of the functions in the table. If your initialization or cleanup code
4616 does depend on other initialization or cleanup code, you have to choose the
4617 priority for the functions accordingly.
4618
4619 <item>
4620 Besides the <tt><ref id=".CONSTRUCTOR" name=".CONSTRUCTOR"></tt>, <tt><ref
4621 id=".DESTRUCTOR" name=".DESTRUCTOR"></tt> and <tt><ref id=".INTERRUPTOR"
4622 name=".INTERRUPTOR"></tt> statements, there is also a more generic command:
4623 <tt><ref id=".CONDES" name=".CONDES"></tt>. This allows to specify an
4624 additional type. Predefined types are 0 (constructor), 1 (destructor) and 2
4625 (interruptor). The linker generates a separate table for each type on request.
4626
4627 </itemize>
4628
4629
4630 <sect>Porting sources from other assemblers<p>
4631
4632 Sometimes it is necessary to port code written for older assemblers to ca65.
4633 In some cases, this can be done without any changes to the source code by
4634 using the emulation features of ca65 (see <tt><ref id=".FEATURE"
4635 name=".FEATURE"></tt>). In other cases, it is necessary to make changes to the
4636 source code.
4637
4638 Probably the biggest difference is the handling of the <tt><ref id=".ORG"
4639 name=".ORG"></tt> directive. ca65 generates relocatable code, and placement is
4640 done by the linker. Most other assemblers generate absolute code, placement is
4641 done within the assembler and there is no external linker.
4642
4643 In general it is not a good idea to write new code using the emulation
4644 features of the assembler, but there may be situations where even this rule is
4645 not valid.
4646
4647 <sect1>TASS<p>
4648
4649 You need to use some of the ca65 emulation features to simulate the behaviour
4650 of such simple assemblers.
4651
4652 <enum>
4653 <item>Prepare your sourcecode like this:
4654
4655 <tscreen><verb>
4656         ; if you want TASS style labels without colons
4657         .feature labels_without_colons
4658
4659         ; if you want TASS style character constants
4660         ; ("a" instead of the default 'a')
4661         .feature loose_char_term
4662
4663                 .word *+2       ; the cbm load address
4664
4665                 [yourcode here]
4666 </verb></tscreen>
4667
4668 notice that the two emulation features are mostly useful for porting
4669 sources originally written in/for TASS, they are not needed for the
4670 actual "simple assembler operation" and are not recommended if you are
4671 writing new code from scratch.
4672
4673 <item>Replace all program counter assignments (which are not possible in ca65
4674 by default, and the respective emulation feature works different from what
4675 you'd expect) by another way to skip to memory locations, for example the
4676 <tt><ref id=".RES" name=".RES"></tt> directive.
4677
4678 <tscreen><verb>
4679         ; *=$2000
4680         .res $2000-*    ; reserve memory up to $2000
4681 </verb></tscreen>
4682
4683 Please note that other than the original TASS, ca65 can never move the program
4684 counter backwards - think of it as if you are assembling to disk with TASS.
4685
4686 <item>Conditional assembly (<tt/.ifeq//<tt/.endif//<tt/.goto/ etc.) must be
4687 rewritten to match ca65 syntax. Most importantly notice that due to the lack
4688 of <tt/.goto/, everything involving loops must be replaced by
4689 <tt><ref id=".REPEAT" name=".REPEAT"></tt>.
4690
4691 <item>To assemble code to a different address than it is executed at, use the
4692 <tt><ref id=".ORG" name=".ORG"></tt> directive instead of
4693 <tt/.offs/-constructs.
4694
4695 <tscreen><verb>
4696         .org $1800
4697
4698         [floppy code here]
4699
4700         .reloc  ; back to normal
4701 </verb></tscreen>
4702
4703 <item>Then assemble like this:
4704
4705 <tscreen><verb>
4706         cl65 --start-addr 0x0ffe -t none myprog.s -o myprog.prg
4707 </verb></tscreen>
4708
4709 Note that you need to use the actual start address minus two, since two bytes
4710 are used for the cbm load address.
4711
4712 </enum>
4713
4714
4715 <sect>Bugs/Feedback<p>
4716
4717 If you have problems using the assembler, if you find any bugs, or if
4718 you're doing something interesting with the assembler, I would be glad to
4719 hear from you. Feel free to contact me by email
4720 (<htmlurl url="mailto:uz@cc65.org" name="uz@cc65.org">).
4721
4722
4723
4724 <sect>Copyright<p>
4725
4726 ca65 (and all cc65 binutils) are (C) Copyright 1998-2003 Ullrich von
4727 Bassewitz. For usage of the binaries and/or sources the following
4728 conditions do apply:
4729
4730 This software is provided 'as-is', without any expressed or implied
4731 warranty.  In no event will the authors be held liable for any damages
4732 arising from the use of this software.
4733
4734 Permission is granted to anyone to use this software for any purpose,
4735 including commercial applications, and to alter it and redistribute it
4736 freely, subject to the following restrictions:
4737
4738 <enum>
4739 <item>  The origin of this software must not be misrepresented; you must not
4740         claim that you wrote the original software. If you use this software
4741         in a product, an acknowledgment in the product documentation would be
4742         appreciated but is not required.
4743 <item>  Altered source versions must be plainly marked as such, and must not
4744         be misrepresented as being the original software.
4745 <item>  This notice may not be removed or altered from any source
4746         distribution.
4747 </enum>
4748
4749
4750
4751 </article>
4752
4753
4754