git.sur5r.net Git - cc65/blob - doc/ca65.sgml

   1 <!doctype linuxdoc system>
   2
   3 <article>
   4 <title>ca65 Users Guide
   5 <author>Ullrich von Bassewitz, <htmlurl url="mailto:uz@cc65.org" name="uz@cc65.org">
   6 <date>19.07.2000, 29.11.2000, 02.10.2001
   7
   8 <abstract>
   9 ca65 is a powerful macro assembler for the 6502, 65C02 and 65816 CPUs. It is
  10 used as a companion assembler for the cc65 crosscompiler, but it may also be
  11 used as a standalone product.
  12 </abstract>
  13
  14 <!-- Table of contents -->
  15 <toc>
  16
  17 <!-- Begin the document -->
  18
  19 <sect>Overview<p>
  20
  21 ca65 is a replacement for the ra65 assembler that was part of the cc65 C
  22 compiler, originally developed by John R. Dunning. I had some problems with
  23 ra65 and the copyright does not permit some things which I wanted to be
  24 possible, so I decided to write a completely new assembler/linker/archiver
  25 suite for the cc65 compiler. ca65 is part of this suite.
  26
  27 Some parts of the assembler (code generation and some routines for symbol
  28 table handling) are taken from an older crossassembler named a816 written
  29 by me a long time ago.
  30
  31
  32 <sect1>Design criteria<p>
  33
  34 Here's a list of the design criteria, that I considered important for the
  35 development:
  36
  37 <itemize>
  38
  39 <item>  The assembler must support macros. Macros are not essential, but they
  40         make some things easier, especially when you use the assembler in the
  41         backend of a compiler.
  42 <item>  The assembler must support the newer 65C02 and 65816 CPUs. I have been
  43         thinking about a 65816 backend for the C compiler, and even my old
  44         a816 assembler had support for these CPUs, so this wasn't really a
  45         problem.
  46 <item>  The assembler must produce relocatable code. This is necessary for the
  47         compiler support, and it is more convenient.
  48 <item>  Conditional assembly must be supported. This is a must for bigger
  49         projects written in assembler (like Elite128).
  50 <item>  The assembler must support segments, and it must support more than
  51         three segments (this is the count, most other assemblers support).
  52         Having more than one code segments helps developing code for systems
  53         with a divided ROM area (like the C64).
  54 <item>  The linker must be able to resolve arbitrary expressions. It should
  55         be able to get things like
  56 <tscreen><verb>
  57         .import S1, S2
  58         .export Special
  59         Special = 2*S1 + S2/7
  60 </verb></tscreen>
  61         right.
  62 <item>  True lexical nesting for symbols. This is very convenient for larger
  63         assembly projects.
  64 <item>  "Cheap" local symbols without lexical nesting for those quick, late
  65         night hacks.
  66 <item>  I liked the idea of "options" as Anre Fachats .o65 format has it, so I
  67         introduced the concept into the object file format use by the new cc65
  68         binutils.
  69 <item>  The assembler will be a one pass assembler. There was no real need for
  70         this decision, but I've written several multipass assemblers, and it
  71         started to get boring. A one pass assembler needs much more elaborated
  72         data structures, and because of that it's much more fun:-)
  73 <item>  Non-GPLed code that may be used in any project without restrictions or
  74         fear of "GPL infecting" other code.
  75 </itemize>
  76 <p>
  77
  78
  79 <sect>Usage<p>
  80
  81
  82 <sect1>Command line option overview<p>
  83
  84 The assembler accepts the following options:
  85
  86 <tscreen><verb>
  87 ---------------------------------------------------------------------------
  88 Usage: ca65 [options] file
  89 Short options:
  90   -D name[=value]       Define a symbol
  91   -I dir                Set an include directory search path
  92   -U                    Mark unresolved symbols as import
  93   -V                    Print the assembler version
  94   -W n                  Set warning level n
  95   -g                    Add debug info to object file
  96   -h                    Help (this text)
  97   -i                    Ignore case of symbols
  98   -l                    Create a listing if assembly was ok
  99   -o name               Name the output file
 100   -s                    Enable smart mode
 101   -t sys                Set the target system
 102   -v                    Increase verbosity
 103
 104 Long options:
 105   --auto-import         Mark unresolved symbols as import
 106   --cpu type            Set cpu type
 107   --debug-info          Add debug info to object file
 108   --feature name        Set an emulation feature
 109   --help                Help (this text)
 110   --ignore-case         Ignore case of symbols
 111   --include-dir dir     Set an include directory search path
 112   --list-bytes n        Maximum number of bytes per listing line
 113   --listing             Create a listing if assembly was ok
 114   --pagelength n        Set the page length for the listing
 115   --smart               Enable smart mode
 116   --target sys          Set the target system
 117   --verbose             Increase verbosity
 118   --version             Print the assembler version
 119 ---------------------------------------------------------------------------
 120 </verb></tscreen>
 121
 122
 123 <sect1>Command line options in detail<p>
 124
 125 Here is a description of all the command line options:
 126
 127 <descrip>
 128
 129   <label id="option--cpu">
 130   <tag><tt>--cpu type</tt></tag>
 131
 132   Set the default for the CPU type. The option takes a parameter, which
 133   may be one of
 134
 135         6502, 65SC02, 65C02, 65816, sunplus, sweet16
 136
 137   The sunplus cpu is not available in the freeware version, because the
 138   instruction set is "proprietary and confidential".
 139
 140
 141   <label id="option--feature">
 142   <tag><tt>--feature name</tt></tag>
 143
 144   Enable an emulation feature. This is identical as using <tt/.FEATURE/
 145   in the source with two exceptions: Feature names must be lower case, and
 146   each feature must be specified by using an extra <tt/--feature/ option,
 147   comma separated lists are not allowed.
 148
 149   See the discussion of the <tt><ref id=".FEATURE" name=".FEATURE"></tt>
 150   command for a list of emulation features.
 151
 152
 153   <label id="option-g">
 154   <tag><tt>-g, --debug-info</tt></tag>
 155
 156   When this option (or the equivalent control command <tt/.DEBUGINFO/) is
 157   used, the assembler will add a section to the object file that contains
 158   all symbols (including local ones) together with the symbol values and
 159   source file positions. The linker will put these additional symbols into
 160   the VICE label file, so even local symbols can be seen in the VICE
 161   monitor.
 162
 163
 164   <tag><tt>-h, --help</tt></tag>
 165
 166   Print the short option summary shown above.
 167
 168
 169   <tag><tt>-i, --ignore-case</tt></tag>
 170
 171   This option makes the assembler case insensitive on identifiers and labels.
 172   This option will override the default, but may itself be overridden by the
 173   <tt><ref id=".CASE" name=".CASE"></tt> control command.
 174
 175
 176   <tag><tt>-l, --listing</tt></tag>
 177
 178   Generate an assembler listing. The listing file will always have the
 179   name of the main input file with the extension replaced by ".lst". This
 180   may change in future versions.
 181
 182
 183   <tag><tt>--list-bytes n</tt></tag>
 184
 185   Set the maximum number of bytes printed in the listing for one line of
 186   input. See the <tt><ref id=".LISTBYTES" name=".LISTBYTES"></tt> directive
 187   for more information. The value zero can be used to encode an unlimited
 188   number of printed bytes.
 189
 190
 191   <tag><tt>-o name</tt></tag>
 192
 193   The default output name is the name of the input file with the extension
 194   replaced by ".o". If you don't like that, you may give another name with
 195   the -o option. The output file will be placed in the same directory as
 196   the source file, or, if -o is given, the full path in this name is used.
 197
 198
 199   <tag><tt>--pagelength n</tt></tag>
 200
 201   sets the length of a listing page in lines. See the <tt><ref
 202   id=".PAGELENGTH" name=".PAGELENGTH"></tt> directive for more information.
 203
 204
 205   <tag><tt>-s, --smart-mode</tt></tag>
 206
 207   In smart mode (enabled by -s or the <tt><ref id=".SMART" name=".SMART"></tt>
 208   pseudo instruction) the assembler will track usage of the <tt/REP/ and
 209   <tt/SEP/ instructions in 65816 mode and update the operand sizes
 210   accordingly. If the operand of such an instruction cannot be evaluated by
 211   the assembler (for example, because the operand is an imported symbol), a
 212   warning is issued.
 213
 214   Beware: Since the assembler cannot trace the execution flow this may
 215   lead to false results in some cases. If in doubt, use the .ixx and .axx
 216   instructions to tell the assembler about the current settings. Smart
 217   mode is off by default.
 218
 219
 220   <label id="option-t">
 221   <tag><tt>-t sys, --target sys</tt></tag>
 222
 223   Set the target system. This will enable translation of character strings
 224   and character constants into the character set of the target platform.
 225   The default for the target system is "none", which means that no translation
 226   will take place. The assembler supports the same target systems as the
 227   compiler, see there for a list.
 228
 229
 230   <tag><tt>-v, --verbose</tt></tag>
 231
 232   Increase the assembler verbosity. Usually only needed for debugging
 233   purposes. You may use this option more than one time for even more
 234   verbose output.
 235
 236
 237   <tag><tt>-D</tt></tag>
 238
 239   This option allows you to define symbols on the command line. Without a
 240   value, the symbol is defined with the value zero. When giving a value,
 241   you may use the '&dollar;' prefix for hexadecimal symbols. Please note
 242   that for some operating systems, '&dollar;' has a special meaning, so
 243   you may have to quote the expression.
 244
 245
 246   <tag><tt>-I dir, --include-dir dir</tt></tag>
 247
 248   Name a directory which is searched for include files. The option may be
 249   used more than once to specify more than one directory to search. The
 250   current directory is always searched first before considering any
 251   additional directories.
 252
 253
 254   <tag><tt>-U, --auto-import</tt></tag>
 255
 256   Mark symbols that are not defined in the sources as imported symbols. This
 257   should be used with care since it delays error messages about typos and such
 258   until the linker is run. The compiler uses the equivalent of this switch
 259   (<tt><ref id=".AUTOIMPORT" name=".AUTOIMPORT"></tt>) to enable auto imported
 260   symbols for the runtime library. However, the compiler is supposed to
 261   generate code that runs through the assembler without problems, something
 262   which is not always true for assembler programmers.
 263
 264
 265   <tag><tt>-V, --version</tt></tag>
 266
 267   Print the version number of the assembler. If you send any suggestions
 268   or bugfixes, please include the version number.
 269
 270
 271   <label id="option-W">
 272   <tag><tt>-Wn</tt></tag>
 273
 274   Set the warning level for the assembler. Using -W2 the assembler will
 275   even warn about such things like unused imported symbols. The default
 276   warning level is 1, and it would probably be silly to set it to
 277   something lower.
 278
 279 </descrip>
 280 <p>
 281
 282
 283 <sect>Input format<p>
 284
 285 <sect1>Assembler syntax<p>
 286
 287 The assembler accepts the standard 6502/65816 assembler syntax. One line may
 288 contain a label (which is identified by a colon), and, in addition to the
 289 label, an assembler mnemonic, a macro, or a control command (see section <ref
 290 id="control-commands" name="Control Commands"> for supported control
 291 commands). Alternatively, the line may contain a symbol definition using
 292 the '=' token. Everything after a semicolon is handled as a comment (that is,
 293 it is ignored).
 294
 295 Here are some examples for valid input lines:
 296
 297 <tscreen><verb>
 298         Label:                          ; A label and a comment
 299                 lda     #$20            ; A 6502 instruction plus comment
 300         L1:     ldx     #$20            ; Same with label
 301         L2:     .byte   "Hello world"   ; Label plus control command
 302                 mymac   $20             ; Macro expansion
 303                 MySym = 3*L1            ; Symbol definition
 304         MaSym   = Label                 ; Another symbol
 305 </verb></tscreen>
 306
 307 The assembler accepts
 308
 309 <itemize>
 310 <item>all valid 6502 mnemonics when in 6502 mode (the default or after the
 311       <tt><ref id=".P02" name=".P02"></tt> command was given).
 312 <item>all valid 6502 mnemonics plus a set of illegal instructions when in
 313       <ref id="6502X-mode" name="6502X mode">.
 314 <item>all valid 65SC02 mnemonics when in 65SC02 mode (after the
 315       <tt><ref id=".PSC02" name=".PSC02"></tt> command was given).
 316 <item>all valid 65C02 mnemonics when in 65C02 mode (after the
 317       <tt><ref id=".PC02" name=".PC02"></tt> command was given).
 318 <item>all valid 65618 mnemonics when in 65816 mode (after the
 319       <tt><ref id=".P816" name=".P816"></tt> command was given).
 320 <item>all valid SunPlus mnemonics when in SunPlus mode (after the
 321       <tt><ref id=".SUNPLUS" name=".SUNPLUS"></tt> command was given).
 322 </itemize>
 323
 324
 325 <sect1>65816 mode<p>
 326
 327 In 65816 mode several aliases are accepted in addition to the official
 328 mnemonics:
 329
 330 <tscreen><verb>
 331         BGE is an alias for BCS
 332         BLT is an alias for BCC
 333         CPA is an alias for CMP
 334         DEA is an alias for DEC A
 335         INA is an alias for INC A
 336         SWA is an alias for XBA
 337         TAD is an alias for TCD
 338         TAS is an alias for TCS
 339         TDA is an alias for TDC
 340         TSA is an alias for TSC
 341 </verb></tscreen>
 342
 343
 344
 345 <sect1>6502X mode<label id="6502X-mode"><p>
 346
 347 6502X mode is an extension to the normal 6502 mode. In this mode, several
 348 mnemonics for illegal instructions of the NMOS 6502 CPUs are accepted. Since
 349 these instructions are illegal, there are no official mnemonics for them. The
 350 unofficial ones are taken from <htmlurl
 351 url="http://oxyron.net/graham/opcodes02.html"
 352 name="http://oxyron.net/graham/opcodes02.html">. Please note that only the
 353 ones marked as "stable" are supported. The following table uses information
 354 from the mentioned web page, for more information, see there.
 355
 356 <itemize>
 357 <item><tt>ALR: A:=(A and #{imm})*2;</tt>
 358 <item><tt>ANC: A:=A and #{imm};</tt> Generates opcode &dollar;0B.
 359 <item><tt>ARR: A:=(A and #{imm})/2;</tt>
 360 <item><tt>AXS: X:=A and X-#{imm};</tt>
 361 <item><tt>DCP: {adr}:={adr}-1; A-{adr};</tt>
 362 <item><tt>ISC: {adr}:={adr}+1; A:=A-{adr};</tt>
 363 <item><tt>LAS: A,X,S:={adr} and S;</tt>
 364 <item><tt>LAX: A,X:={adr};</tt>
 365 <item><tt>RLA: {adr}:={adr}rol; A:=A and {adr};</tt>
 366 <item><tt>RRA: {adr}:={adr}ror; A:=A adc {adr};</tt>
 367 <item><tt>SAX: {adr}:=A and X;</tt>
 368 <item><tt>SLO: {adr}:={adr}*2; A:=A or {adr};</tt>
 369 <item><tt>SRE: {adr}:={adr}/2; A:=A xor {adr};</tt>
 370 </itemize>
 371
 372
 373
 374 <sect1>sweet16 mode<label id="sweet16-mode"><p>
 375
 376 SWEET 16 is an interpreter for a pseudo 16 bit CPU written by Steve Wozniak
 377 for the Apple ][ machines. It is available in the Apple ][ ROM. ca65 can
 378 generate code for this pseudo CPU when switched into sweet16 mode. The
 379 following is special in sweet16 mode:
 380
 381 <itemize>
 382
 383 <item>The '@' character denotes indirect addressing and is no longer available
 384 for cheap local labels. If you need cheap local labels, you will have to
 385 switch to another lead character using the <tt/<ref id=".LOCALCHAR"
 386 name=".LOCALCHAR">/ command.
 387
 388 <item>Registers are specified using <tt/R0/ .. <tt/R15/. In sweet16 mode,
 389 these identifiers are reserved words.
 390
 391 </itemize>
 392
 393 Please note that the assembler does neither supply the interpreter needed for
 394 SWEET 16 code, nor the zero page locations needed for the SWEET 16 registers,
 395 nor does it call the interpreter. All this must be done by your program. Apple
 396 ][ programmers do probably know how to use sweet16 mode.
 397
 398 For more information about SWEET 16, see
 399 <htmlurl url="http://www.6502.org/source/interpreters/sweet16.htm"
 400 name="http://www.6502.org/source/interpreters/sweet16.htm">.
 401
 402
 403 <sect1>Number format<p>
 404
 405 For literal values, the assembler accepts the widely used number formats: A
 406 preceding '&dollar;' or a trailing 'h' denotes a hex value, a preceding '%'
 407 denotes a binary value, and a bare number is interpreted as a decimal. There
 408 are currently no octal values and no floats.
 409
 410
 411 <sect1>Conditional assembly<p>
 412
 413 Please note that when using the conditional directives (<tt/.IF/ and friends),
 414 the input must consist of valid assembler tokens, even in <tt/.IF/ branches
 415 that are not assembled. The reason for this behaviour is that the assembler
 416 must still be able to detect the ending tokens (like <tt/.ENDIF/), so
 417 conversion of the input stream into tokens still takes place. As a consequence
 418 conditional assembly directives may <bf/not/ be used to prevent normal text
 419 (used as a comment or similar) from being assembled. <p>
 420
 421
 422 <sect>Expressions<p>
 423
 424
 425 <sect1>Expression evaluation<p>
 426
 427 All expressions are evaluated with (at least) 32 bit precision. An
 428 expression may contain constant values and any combination of internal and
 429 external symbols. Expressions that cannot be evaluated at assembly time
 430 are stored inside the object file for evaluation by the linker.
 431 Expressions referencing imported symbols must always be evaluated by the
 432 linker.
 433
 434
 435 <sect1>Size of an expression result<p>
 436
 437 Sometimes, the assembler must know about the size of the value that is the
 438 result of an expression. This is usually the case, if a decision has to be
 439 made, to generate a zero page or an absolute memory references. In this
 440 case, the assembler has to make some assumptions about the result of an
 441 expression:
 442
 443 <itemize>
 444 <item>  If the result of an expression is constant, the actual value is
 445         checked to see if it's a byte sized expression or not.
 446 <item>  If the expression is explicitly casted to a byte sized expression by
 447         one of the '&gt;', '&lt;' or '^' operators, it is a byte expression.
 448 <item>  If this is not the case, and the expression contains a symbol,
 449         explicitly declared as zero page symbol (by one of the .importzp or
 450         .exportzp instructions), then the whole expression is assumed to be
 451         byte sized.
 452 <item>  If the expression contains symbols that are not defined, and these
 453         symbols are local symbols, the enclosing scopes are searched for a
 454         symbol with the same name. If one exists and this symbol is defined,
 455         it's attributes are used to determine the result size.
 456 <item>  In all other cases the expression is assumed to be word sized.
 457 </itemize>
 458
 459 Note: If the assembler is not able to evaluate the expression at assembly
 460 time, the linker will evaluate it and check for range errors as soon as
 461 the result is known.
 462
 463
 464 <sect1>Boolean expressions<p>
 465
 466 In the context of a boolean expression, any non zero value is evaluated as
 467 true, any other value to false. The result of a boolean expression is 1 if
 468 it's true, and zero if it's false. There are boolean operators with extreme
 469 low precedence with version 2.x (where x &gt; 0). The <tt/.AND/ and <tt/.OR/
 470 operators are shortcut operators. That is, if the result of the expression is
 471 already known, after evaluating the left hand side, the right hand side is
 472 not evaluated.
 473
 474
 475 <sect1>Constant expressions<p>
 476
 477 Sometimes an expression must evaluate to a constant without looking at any
 478 further input. One such example is the <tt/<ref id=".IF" name=".IF">/ command
 479 that decides if parts of the code are assembled or not. An expression used in
 480 the <tt/.IF/ command cannot reference a symbol defined later, because the
 481 decision about the <tt/.IF/ must be made at the point when it is read. If the
 482 expression used in such a context contains only constant numerical values,
 483 there is no problem. When unresolvable symbols are involved it may get harder
 484 for the assembler to determine if the expression is actually constant, and it
 485 is even possible to create expressions that aren't recognized as constant.
 486 Simplifying the expressions will often help.
 487
 488 In cases where the result of the expression is not needed immediately, the
 489 assembler will delay evaluation until all input is read, at which point all
 490 symbols are known. So using arbitrary complex constant expressions is no
 491 problem in most cases.
 492
 493
 494
 495 <sect1>Available operators<label id="operators"><p>
 496
 497 Available operators sorted by precedence:
 498
 499 <tscreen><verb>
 500     Op          Description                             Precedence
 501   -------------------------------------------------------------------
 502                 Builtin string functions                0
 503
 504                 Builtin pseudo variables                1
 505                 Builtin pseudo functions                1
 506     +           Unary plus                              1
 507     -           Unary minus                             1
 508     ~           Unary bitwise not                       1
 509     .BITNOT     Unary bitwise not                       1
 510     &lt;           Low byte operator                       1
 511     &gt;           High byte operator                      1
 512     ^           Bank byte operator                      1
 513
 514     *           Multiplication                          2
 515     /           Division                                2
 516     .MOD        Modulo operation                        2
 517     &amp;           Bitwise and                             2
 518     .BITAND     Bitwise and                             2
 519     ^           Bitwise xor                             2
 520     .BITXOR     Bitwise xor                             2
 521     &lt;&lt;          Shift left operator                     2
 522     .SHL        Shift left operator                     2
 523     &gt;&gt;          Shift right operator
 524     .SHR        Shift right operator                    2
 525
 526     +           Binary plus                             3
 527     -           Binary minus                            3
 528     |           Binary or                               3
 529     .BITOR      Binary or                               3
 530
 531     =           Compare operation (equal)               4
 532     &lt;&gt;          Compare operation (not equal)           4
 533     &lt;           Compare operation (less)                4
 534     &gt;           Compare operation (greater)             4
 535     &lt;=          Compare operation (less or equal)       4
 536     &gt;=          Compare operation (greater or equal)    4
 537
 538     &amp;&amp;          Boolean and                             5
 539     .AND        Boolean and                             5
 540     .XOR        Boolean xor                             5
 541
 542     ||          Boolean or                              6
 543     .OR         Boolean or                              6
 544
 545     !           Boolean not                             7
 546     .NOT        Boolean not                             7
 547 </verb></tscreen>
 548
 549
 550 To force a specific order of evaluation, braces may be used as usual.
 551
 552 <p>
 553
 554
 555
 556 <sect>Symbols and labels<p>
 557
 558 The assembler allows you to use symbols instead of naked values to make
 559 the source more readable. There are a lot of different ways to define and
 560 use symbols and labels, giving a lot of flexibility.
 561
 562
 563 <sect1>Numeric constants<p>
 564
 565 Numeric constants are defined using the equal sign or the label assignment
 566 operator. After doing
 567
 568 <tscreen><verb>
 569       two = 2
 570 </verb></tscreen>
 571
 572 may use the symbol "two" in every place where a number is expected, and it is
 573 evaluated to the value 2 in this context. The label assignment operator causes
 574 the same, but causes the symbol to be marked as a label, which may cause a
 575 different handling in the debugger:
 576
 577 <tscreen><verb>
 578       io := $d000
 579 </verb></tscreen>
 580
 581 The right side can of course be an expression:
 582
 583 <tscreen><verb>
 584       four = two * two
 585 </verb></tscreen>
 586
 587
 588 <sect1>Standard labels<p>
 589
 590 A label is defined by writing the name of the label at the start of the line
 591 (before any instruction mnemonic, macro or pseudo directive), followed by a
 592 colon. This will declare a symbol with the given name and the value of the
 593 current program counter.
 594
 595
 596 <sect1>Local labels and symbols<p>
 597
 598 Using the <tt><ref id=".PROC" name=".PROC"></tt> directive, it is possible to
 599 create regions of code where the names of labels and symbols are local to this
 600 region. They are not known outside of this region and cannot be accessed from
 601 there. Such regions may be nested like PROCEDUREs in Pascal.
 602
 603 See the description of the <tt><ref id=".PROC" name=".PROC"></tt>
 604 directive for more information.
 605
 606
 607 <sect1>Cheap local labels<p>
 608
 609 Cheap local labels are defined like standard labels, but the name of the
 610 label must begin with a special symbol (usually '@', but this can be
 611 changed by the <tt><ref id=".LOCALCHAR" name=".LOCALCHAR"></tt>
 612 directive).
 613
 614 Cheap local labels are visible only between two non cheap labels. As soon as a
 615 standard symbol is encountered (this may also be a local symbol if inside a
 616 region defined with the <tt><ref id=".PROC" name=".PROC"></tt> directive), the
 617 cheap local symbol goes out of scope.
 618
 619 You may use cheap local labels as an easy way to reuse common label
 620 names like "Loop". Here is an example:
 621
 622 <tscreen><verb>
 623         Clear:  lda    #$00             ; Global label
 624                 ldy    #$20
 625         @Loop:  sta    Mem,y            ; Local label
 626                 dey
 627                 bne    @Loop            ; Ok
 628                 rts
 629         Sub:    ...                     ; New global label
 630                 bne    @Loop            ; ERROR: Unknown identifier!
 631 </verb></tscreen>
 632
 633 <sect1>Unnamed labels<p>
 634
 635 If you really want to write messy code, there are also unnamed labels. These
 636 labels do not have a name (you guessed that already, didn't you?). A colon is
 637 used to mark the absence of the name.
 638
 639 Unnamed labels may be accessed by using the colon plus several minus or plus
 640 characters as a label designator. Using the '-' characters will create a back
 641 reference (use the n'th label backwards), using '+' will create a forward
 642 reference (use the n'th label in forward direction). An example will help to
 643 understand this:
 644
 645 <tscreen><verb>
 646         :       lda     (ptr1),y        ; #1
 647                 cmp     (ptr2),y
 648                 bne     :+              ; -> #2
 649                 tax
 650                 beq     :+++            ; -> #4
 651                 iny
 652                 bne     :-              ; -> #1
 653                 inc     ptr1+1
 654                 inc     ptr2+1
 655                 bne     :-              ; -> #1
 656
 657         :       bcs     :+              ; #2 -> #3
 658                 ldx     #$FF
 659                 rts
 660
 661         :       ldx     #$01            ; #3
 662         :       rts                     ; #4
 663 </verb></tscreen>
 664
 665 As you can see from the example, unnamed labels will make even short
 666 sections of code hard to understand, because you have to count labels
 667 to find branch targets (this is the reason why I for my part do
 668 prefer the "cheap" local labels). Nevertheless, unnamed labels are
 669 convenient in some situations, so it's your decision.
 670
 671
 672 <sect1>Using macros to define labels and constants<p>
 673
 674 While there are drawbacks with this approach, it may be handy in some
 675 situations. Using <tt><ref id=".DEFINE" name=".DEFINE"></tt>, it is
 676 possible to define symbols or constants that may be used elsewhere. Since
 677 the macro facility works on a very low level, there is no scoping. On the
 678 other side, you may also define string constants this way (this is not
 679 possible with the other symbol types).
 680
 681 Example:
 682
 683 <tscreen><verb>
 684         .DEFINE two     2
 685         .DEFINE version "SOS V2.3"
 686
 687         four = two * two        ; Ok
 688         .byte   version         ; Ok
 689
 690         .PROC                   ; Start local scope
 691         two = 3                 ; Will give "2 = 3" - invalid!
 692         .ENDPROC
 693 </verb></tscreen>
 694
 695
 696 <sect1>Symbols and <tt>.DEBUGINFO</tt><p>
 697
 698 If <tt><ref id=".DEBUGINFO" name=".DEBUGINFO"></tt> is enabled (or <ref
 699 id="option-g" name="-g"> is given on the command line), global, local and
 700 cheap local labels are written to the object file and will be available in the
 701 symbol file via the linker. Unnamed labels are not written to the object file,
 702 because they don't have a name which would allow to access them.
 703
 704
 705
 706 <sect>Scopes<label id="scopes"><p>
 707
 708 ca65 implements several sorts of scopes for symbols.
 709
 710 <sect1>Global scope<p>
 711
 712 All (non cheap local) symbols that are declared outside of any nested scopes
 713 are in global scope.
 714
 715
 716 <sect1>Cheap locals<p>
 717
 718 A special scope is the scope for cheap local symbols. It lasts from one non
 719 local symbol to the next one, without any provisions made by the programmer.
 720 All other scopes differ in usage but use the same concept internally.
 721
 722
 723 <sect1>Generic nested scopes<p>
 724
 725 A nested scoped for generic use is started with <tt/<ref id=".SCOPE"
 726 name=".SCOPE">/ and closed with <tt/<ref id=".ENDSCOPE" name=".ENDSCOPE">/.
 727 The scope can have a name, in which case it is accessible from the outside by
 728 using <ref id="scopesyntax" name="explicit scopes">. If the scope does not
 729 have a name, all symbols created within the scope are local to the scope, and
 730 aren't accessible from the outside.
 731
 732 A nested scope can access symbols from the local or from enclosing scopes by
 733 name without using explicit scope names. In some cases there may be
 734 ambiguities, for example if there is a reference to a local symbol that is not
 735 yet defined, but a symbol with the same name exists in outer scopes:
 736
 737 <tscreen><verb>
 738         .scope  outer
 739                 foo     = 2
 740                 .scope  inner
 741                         lda     #foo
 742                         foo     = 3
 743                 .endscope
 744         .endscope
 745 </verb></tscreen>
 746
 747 In the example above, the <tt/lda/ instruction will load the value 3 into the
 748 accumulator, because <tt/foo/ is redefined in the scope. However:
 749
 750 <tscreen><verb>
 751         .scope  outer
 752                 foo     = $1234
 753                 .scope  inner
 754                         lda     foo,x
 755                         foo     = $12
 756                 .endscope
 757         .endscope
 758 </verb></tscreen>
 759
 760 Here, <tt/lda/ will still load from <tt/$12,x/, but since it is unknown to the
 761 assembler that <tt/foo/ is a zeropage symbol when translating the instruction,
 762 absolute mode is used instead. In fact, the assembler will not use absolute
 763 mode by default, but it will search through the enclosing scopes for a symbol
 764 with the given name. If one is found, the address size of this symbol is used.
 765 This may lead to errors:
 766
 767 <tscreen><verb>
 768         .scope  outer
 769                 foo     = $12
 770                 .scope  inner
 771                         lda     foo,x
 772                         foo     = $1234
 773                 .endscope
 774         .endscope
 775 </verb></tscreen>
 776
 777 In this case, when the assembler sees the symbol <tt/foo/ in the <tt/lda/
 778 instruction, it will search for an already defined symbol <tt/foo/. It will
 779 find <tt/foo/ in scope <tt/outer/, and a close look reveals that it is a
 780 zeropage symbol. So the assembler will use zeropage addressing mode. If
 781 <tt/foo/ is redefined later in scope <tt/inner/, the assembler tries to change
 782 the address in the <tt/lda/ instruction already translated, but since the new
 783 value needs absolute addressing mode, this fails, and an error message "Range
 784 error" is output.
 785
 786 Of course the most simple solution for the problem is to move the definition
 787 of <tt/foo/ in scope <tt/inner/ upwards, so it precedes its use. There may be
 788 rare cases when this cannot be done. In these cases, you can use one of the
 789 address size override operators:
 790
 791 <tscreen><verb>
 792         .scope  outer
 793                 foo     = $12
 794                 .scope  inner
 795                         lda     a:foo,x
 796                         foo     = $1234
 797                 .endscope
 798         .endscope
 799 </verb></tscreen>
 800
 801 This will cause the <tt/lda/ instruction to be translated using absolute
 802 addressing mode, which means changing the symbol reference later does not
 803 cause any errors.
 804
 805
 806 <sect1>Nested procedures<p>
 807
 808 A nested procedure is created by use of <tt/<ref id=".PROC" name=".PROC">/. It
 809 differs from a <tt/<ref id=".SCOPE" name=".SCOPE">/ in that it must have a
 810 name, and a it will introduce a symbol with this name in the enclosing scope.
 811 So
 812
 813 <tscreen><verb>
 814         .proc   foo
 815                 ...
 816         .endscope
 817 </verb></tscreen>
 818
 819 is actually the same as
 820
 821 <tscreen><verb>
 822         foo:
 823         .scope  foo
 824                 ...
 825         .endscope
 826 </verb></tscreen>
 827
 828 This is the reason why a procedure must have a name. If you want a scope
 829 without a name, use <tt/<ref id=".SCOPE" name=".SCOPE">/.
 830
 831 <bf/Note:/ As you can see from the example above, scopes and symbols live in
 832 different namespaces. There can be a symbol named <tt/foo/ and a scope named
 833 <tt/foo/ without any conflicts (but see the section titled <ref
 834 id="scopesearch" name="&quot;Scope search order&quot;">).
 835
 836
 837 <sect1>Structs, unions and enums<p>
 838
 839 Structs, unions and enums are explained in a <ref id="structs" name="separate
 840 section">, I do only cover them here, because if they are declared with a
 841 name, they open a nested scope, similar to <tt/<ref id=".SCOPE"
 842 name=".SCOPE">/. However, when no name is specified, the behaviour is
 843 different: In this case, no new scope will be opened, symbols declared within
 844 a struct, union, or enum declaration will then be added to the enclosing scope
 845 instead.
 846
 847
 848 <sect1>Explicit scope specification<label id="scopesyntax"><p>
 849
 850 Accessing symbols from other scopes is possible by using an explicit scope
 851 specification, provided that the scope where the symbol lives in has a name.
 852 The namespace token (<tt/::/) is used to access other scopes:
 853
 854 <tscreen><verb>
 855         .scope  foo
 856         bar:    .word   0
 857         .endscope
 858
 859                 ...
 860                 lda     foo::bar        ; Access foo in scope bar
 861 </verb></tscreen>
 862
 863 The only way to deny access to a scope from the outside is to declare a scope
 864 without a name (using the <tt/<ref id=".SCOPE" name=".SCOPE">/ command).
 865
 866 A special syntax is used to specify the global scope: If a symbol or scope is
 867 preceded by the namespace token, the global scope is searched:
 868
 869 <tscreen><verb>
 870         bar     = 3
 871
 872         .scope  foo
 873                 bar     = 2
 874                 lda     #::bar  ; Access the global bar (which is 3)
 875         .endscope
 876 </verb></tscreen>
 877
 878
 879 <sect1>Scope search order<label id="scopesearch"><p>
 880
 881 The assembler searches for a scope in a similar way as for a symbol. First, it
 882 looks in the current scope, and then it walks up the enclosing scopes until
 883 the scope is found.
 884
 885 However, one important thing to note when using explicit scope syntax is, that
 886 a symbol may be accessed before it is defined, but a scope may <bf/not/ be
 887 used without a preceding definition. This means that in the following
 888 example:
 889
 890 <tscreen><verb>
 891         .scope  foo
 892                 bar     = 3
 893         .endscope
 894
 895         .scope  outer
 896                 lda     #foo::bar  ; Will load 3, not 2!
 897                 .scope  foo
 898                         bar     = 2
 899                 .endscope
 900         .endscope
 901 </verb></tscreen>
 902
 903 the reference to the scope <tt/foo/ will use the global scope, and not the
 904 local one, because the local one is not visible at the point where it is
 905 referenced.
 906
 907 Things get more complex if a complete chain of scopes is specified:
 908
 909 <tscreen><verb>
 910         .scope  foo
 911                 .scope  outer
 912                         .scope  inner
 913                                 bar = 1
 914                         .endscope
 915                 .endscope
 916                 .scope  another
 917                         .scope  nested
 918                                 lda     #outer::inner::bar      ; 1
 919                         .endscope
 920                 .endscope
 921         .endscope
 922
 923         .scope  outer
 924                 .scope  inner
 925                         bar = 2
 926                 .endscope
 927         .endscope
 928 </verb></tscreen>
 929
 930 When <tt/outer::inner::bar/ is referenced in the <tt/lda/ instruction, the
 931 assembler will first search in the local scope for a scope named <tt/outer/.
 932 Since none is found, the enclosing scope (<tt/another/) is checked. There is
 933 still no scope named <tt/outer/, so scope <tt/foo/ is checked, and finally
 934 scope <tt/outer/ is found. Within this scope, <tt/inner/ is searched, and in
 935 this scope, the assembler looks for a symbol named <tt/bar/.
 936
 937 Please note that once the anchor scope is found, all following scopes
 938 (<tt/inner/ in this case) are expected to be found exactly in this scope. The
 939 assembler will search the scope tree only for the first scope (if it is not
 940 anchored in the root scope). Starting from there on, there is no flexibility,
 941 so if the scope named <tt/outer/ found by the assembler does not contain a
 942 scope named <tt/inner/, this would be an error, even if such a pair does exist
 943 (one level up in global scope).
 944
 945 Ambiguities that may be introduced by this search algorithm may be removed by
 946 anchoring the scope specification in the global scope. In the example above,
 947 if you want to access the "other" symbol <tt/bar/, you would have to write:
 948
 949 <tscreen><verb>
 950         .scope  foo
 951                 .scope  outer
 952                         .scope  inner
 953                                 bar = 1
 954                         .endscope
 955                 .endscope
 956                 .scope  another
 957                         .scope  nested
 958                                 lda     #::outer::inner::bar    ; 2
 959                         .endscope
 960                 .endscope
 961         .endscope
 962
 963         .scope  outer
 964                 .scope  inner
 965                         bar = 2
 966                 .endscope
 967         .endscope
 968 </verb></tscreen>
 969
 970
 971 <sect>Address sizes and memory models<label id="address-sizes"><p>
 972
 973 <sect1>Address sizes<p>
 974
 975 ca65 assigns each segment and each symbol an address size. This is true, even
 976 if the symbol is not used as an address. You may also think of a value range
 977 of the symbol instead of an address size.
 978
 979 Possible address sizes are:
 980
 981 <itemize>
 982 <item>Zeropage or direct (8 bits)
 983 <item>Absolute (16 bits)
 984 <item>Far (24 bits)
 985 <item>Long (32 bits)
 986 </itemize>
 987
 988 Since the assembler uses default address sizes for the segments and symbols,
 989 it is usually not necessary to override the default behaviour. In cases, where
 990 it is necessary, the following keywords may be used to specify address sizes:
 991
 992 <itemize>
 993 <item>DIRECT, ZEROPAGE or ZP for zeropage addressing (8 bits).
 994 <item>ABSOLUTE, ABS or NEAR for absolute addressing (16 bits).
 995 <item>FAR for far addressing (24 bits).
 996 <item>LONG or DWORD for long addressing (32 bits).
 997 </itemize>
 998
 999
1000 <sect1>Address sizes of segments<p>
1001
1002 The assembler assigns an address size to each segment. Since the
1003 representation of a label within this segment is "segment start + offset",
1004 labels will inherit the address size of the segment they are declared in.
1005
1006 The address size of a segment may be changed, by using an optional address
1007 size modifier. See the <tt/<ref id=".SEGMENT" name="segment directive">/ for
1008 an explanation on how this is done.
1009
1010
1011 <sect1>Address sizes of symbols<p>
1012
1013
1014
1015
1016 <sect1>Memory models<p>
1017
1018 The default address size of a segment depends on the memory model used. Since
1019 labels inherit the address size from the segment they are declared in,
1020 changing the memory model is an easy way to change the address size of many
1021 symbols at once.
1022
1023
1024
1025
1026 <sect>Pseudo variables<label id="pseudo-variables"><p>
1027
1028 Pseudo variables are readable in all cases, and in some special cases also
1029 writable.
1030
1031 <sect1><tt>*</tt><p>
1032
1033   Reading this pseudo variable will return the program counter at the start
1034   of the current input line.
1035
1036   Assignment to this variable is possible when <tt/<ref id=".FEATURE"
1037   name=".FEATURE pc_assignment">/ is used. Note: You should not use
1038   assignments to <tt/*/, use <tt/<ref id=".ORG" name=".ORG">/ instead.
1039
1040
1041 <sect1><tt>.CPU</tt><label id=".CPU"><p>
1042
1043   Reading this pseudo variable will give a constant integer value that
1044   tells which CPU is currently enabled. It can also tell which instruction
1045   set the CPU is able to translate. The value read from the pseudo variable
1046   should be further examined by using one of the constants defined by the
1047   "cpu" macro package (see <tt/<ref id=".MACPACK" name=".MACPACK">/).
1048
1049   It may be used to replace the .IFPxx pseudo instructions or to construct
1050   even more complex expressions.
1051
1052   Example:
1053
1054   <tscreen><verb>
1055         .macpack        cpu
1056         .if     (.cpu .bitand CPU_ISET_65816)
1057                 phx
1058                 phy
1059         .else
1060                 txa
1061                 pha
1062                 tya
1063                 pha
1064         .endif
1065   </verb></tscreen>
1066
1067
1068 <sect1><tt>.PARAMCOUNT</tt><label id=".PARAMCOUNT"><p>
1069
1070   This builtin pseudo variable is only available in macros. It is replaced by
1071   the actual number of parameters that were given in the macro invocation.
1072
1073   Example:
1074
1075   <tscreen><verb>
1076         .macro  foo     arg1, arg2, arg3
1077         .if     .paramcount <> 3
1078         .error  "Too few parameters for macro foo"
1079         .endif
1080         ...
1081         .endmacro
1082   </verb></tscreen>
1083
1084   See section <ref id="macros" name="Macros">.
1085
1086
1087 <sect1><tt>.TIME</tt><label id=".TIME"><p>
1088
1089   Reading this pseudo variable will give a constant integer value that
1090   represents the current time in POSIX standard (as seconds since the
1091   Epoch).
1092
1093   It may be used to encode the time of translation somewhere in the created
1094   code.
1095
1096   Example:
1097
1098   <tscreen><verb>
1099         .dword  .time   ; Place time here
1100   </verb></tscreen>
1101
1102
1103 <sect1><tt>.VERSION</tt><label id=".VERSION"><p>
1104
1105   Reading this pseudo variable will give the assembler version according to
1106   the following formula:
1107
1108         VER_MAJOR*$100 + VER_MINOR*$10 + VER_PATCH
1109
1110   It may be used to encode the assembler version or check the assembler for
1111   special features not available with older versions.
1112
1113   Example:
1114
1115   Version 2.11.1 of the assembler will return $2B1 as numerical constant when
1116   reading the pseudo variable <tt/.VERSION/.
1117
1118
1119
1120 <sect>Pseudo functions<label id="pseudo-functions"><p>
1121
1122 Pseudo functions expect their arguments in parenthesis, and they have a result,
1123 either a string or an expression.
1124
1125
1126 <sect1><tt>.BANKBYTE</tt><label id=".BANKBYTE"><p>
1127
1128   The function returns the bank byte (that is, bits 16-23) of its argument.
1129   It works identical to the '^' operator.
1130
1131   See: <tt><ref id=".HIBYTE" name=".HIBYTE"></tt>,
1132        <tt><ref id=".LOBYTE" name=".LOBYTE"></tt>
1133
1134
1135 <sect1><tt>.BLANK</tt><label id=".BLANK"><p>
1136
1137   Builtin function. The function evaluates its argument in braces and yields
1138   "false" if the argument is non blank (there is an argument), and "true" if
1139   there is no argument.  The token list that makes up the function argument
1140   may optionally be enclosed in curly braces. This allows the inclusion of
1141   tokens that would otherwise terminate the list (the closing right
1142   parenthesis). The curly braces are not considered part of the list, a list
1143   just consisting of curly braces is considered to be empty.
1144
1145   As an example, the <tt/.IFBLANK/ statement may be replaced by
1146
1147   <tscreen><verb>
1148         .if     .blank({arg})
1149   </verb></tscreen>
1150
1151
1152
1153 <sect1><tt>.CONCAT</tt><label id=".CONCAT"><p>
1154
1155   Builtin string function. The function allows to concatenate a list of string
1156   constants separated by commas. The result is a string constant that is the
1157   concatenation of all arguments. This function is most useful in macros and
1158   when used together with the <tt/.STRING/ builtin function. The function may
1159   be used in any case where a string constant is expected.
1160
1161   Example:
1162
1163   <tscreen><verb>
1164         .include        .concat ("myheader", ".", "inc")
1165   </verb></tscreen>
1166
1167   This is the same as the command
1168
1169   <tscreen><verb>
1170         .include        "myheader.inc"
1171   </verb></tscreen>
1172
1173
1174 <sect1><tt>.CONST</tt><label id=".CONST"><p>
1175
1176   Builtin function. The function evaluates its argument in braces and
1177   yields "true" if the argument is a constant expression (that is, an
1178   expression that yields a constant value at assembly time) and "false"
1179   otherwise. As an example, the .IFCONST statement may be replaced by
1180
1181   <tscreen><verb>
1182         .if     .const(a + 3)
1183   </verb></tscreen>
1184
1185
1186 <sect1><tt>.HIBYTE</tt><label id=".HIBYTE"><p>
1187
1188   The function returns the high byte (that is, bits 8-15) of its argument.
1189   It works identical to the '>' operator.
1190
1191   See: <tt><ref id=".LOBYTE" name=".LOBYTE"></tt>,
1192        <tt><ref id=".BANKBYTE" name=".BANKBYTE"></tt>
1193
1194
1195 <sect1><tt>.HIWORD</tt><label id=".HIWORD"><p>
1196
1197   The function returns the high word (that is, bits 16-31) of its argument.
1198
1199   See: <tt><ref id=".LOWORD" name=".LOWORD"></tt>
1200
1201
1202 <sect1><tt>.IDENT</tt><label id=".IDENT"><p>
1203
1204   The function expects a string as its argument, and converts this argument
1205   into an identifier. If the string starts with the current <tt/<ref
1206   id=".LOCALCHAR" name=".LOCALCHAR">/, it will be converted into a cheap local
1207   identifier, otherwise it will be converted into a normal identifier.
1208
1209   Example:
1210
1211   <tscreen><verb>
1212         .macro  makelabel       arg1, arg2
1213                 .ident (.concat (arg1, arg2)):
1214         .endmacro
1215
1216                 makelabel       "foo", "bar"
1217
1218                 .word           foobar          ; Valid label
1219   </verb></tscreen>
1220
1221
1222 <sect1><tt>.LEFT</tt><label id=".LEFT"><p>
1223
1224   Builtin function. Extracts the left part of a given token list.
1225
1226   Syntax:
1227
1228   <tscreen><verb>
1229         .LEFT (&lt;int expr&gt;, &lt;token list&gt;)
1230   </verb></tscreen>
1231
1232   The first integer expression gives the number of tokens to extract from
1233   the token list. The second argument is the token list itself. The token
1234   list may optionally be enclosed into curly braces. This allows the
1235   inclusion of tokens that would otherwise terminate the list (the closing
1236   right paren in the given case).
1237
1238   Example:
1239
1240   To check in a macro if the given argument has a '#' as first token
1241   (immediate addressing mode), use something like this:
1242
1243   <tscreen><verb>
1244         .macro  ldax    arg
1245                 ...
1246                 .if (.match (.left (1, {arg}), #))
1247
1248                 ; ldax called with immediate operand
1249                 ...
1250
1251                 .endif
1252                 ...
1253         .endmacro
1254   </verb></tscreen>
1255
1256   See also the <tt><ref id=".MID" name=".MID"></tt> and <tt><ref id=".RIGHT"
1257   name=".RIGHT"></tt> builtin functions.
1258
1259
1260 <sect1><tt>.LOBYTE</tt><label id=".LOBYTE"><p>
1261
1262   The function returns the low byte (that is, bits 0-7) of its argument.
1263   It works identical to the '<' operator.
1264
1265   See: <tt><ref id=".HIBYTE" name=".HIBYTE"></tt>,
1266        <tt><ref id=".BANKBYTE" name=".BANKBYTE"></tt>
1267
1268
1269 <sect1><tt>.LOWORD</tt><label id=".LOWORD"><p>
1270
1271   The function returns the low word (that is, bits 0-15) of its argument.
1272
1273   See: <tt><ref id=".HIWORD" name=".HIWORD"></tt>
1274
1275
1276 <sect1><tt>.MATCH</tt><label id=".MATCH"><p>
1277
1278   Builtin function. Matches two token lists against each other. This is
1279   most useful within macros, since macros are not stored as strings, but
1280   as lists of tokens.
1281
1282   The syntax is
1283
1284   <tscreen><verb>
1285         .MATCH(&lt;token list #1&gt;, &lt;token list #2&gt;)
1286   </verb></tscreen>
1287
1288   Both token list may contain arbitrary tokens with the exception of the
1289   terminator token (comma resp. right parenthesis) and
1290
1291   <itemize>
1292   <item>end-of-line
1293   <item>end-of-file
1294   </itemize>
1295
1296   The token lists may optionally be enclosed into curly braces. This allows
1297   the inclusion of tokens that would otherwise terminate the list (the closing
1298   right paren in the given case). Often a macro parameter is used for any of
1299   the token lists.
1300
1301   Please note that the function does only compare tokens, not token
1302   attributes. So any number is equal to any other number, regardless of the
1303   actual value. The same is true for strings. If you need to compare tokens
1304   <em/and/ token attributes, use the <tt><ref id=".XMATCH"
1305   name=".XMATCH"></tt> function.
1306
1307   Example:
1308
1309   Assume the macro <tt/ASR/, that will shift right the accumulator by one,
1310   while honoring the sign bit. The builtin processor instructions will allow
1311   an optional "A" for accu addressing for instructions like <tt/ROL/ and
1312   <tt/ROR/. We will use the <tt><ref id=".MATCH" name=".MATCH"></tt> function
1313   to check for this and print and error for invalid calls.
1314
1315   <tscreen><verb>
1316         .macro  asr     arg
1317
1318                 .if (.not .blank(arg)) .and (.not .match ({arg}, a))
1319                 .error "Syntax error"
1320                 .endif
1321
1322                 cmp     #$80            ; Bit 7 into carry
1323                 lsr     a               ; Shift carry into bit 7
1324
1325         .endmacro
1326   </verb></tscreen>
1327
1328   The macro will only accept no arguments, or one argument that must be the
1329   reserved keyword "A".
1330
1331   See: <tt><ref id=".XMATCH" name=".XMATCH"></tt>
1332
1333
1334 <sect1><tt>.MID</tt><label id=".MID"><p>
1335
1336   Builtin function. Takes a starting index, a count and a token list as
1337   arguments. Will return part of the token list.
1338
1339   Syntax:
1340
1341   <tscreen><verb>
1342         .MID (&lt;int expr&gt;, &lt;int expr&gt;, &lt;token list&gt;)
1343   </verb></tscreen>
1344
1345   The first integer expression gives the starting token in the list (the first
1346   token has index 0). The second integer expression gives the number of tokens
1347   to extract from the token list. The third argument is the token list itself.
1348   The token list may optionally be enclosed into curly braces. This allows the
1349   inclusion of tokens that would otherwise terminate the list (the closing
1350   right paren in the given case).
1351
1352   Example:
1353
1354   To check in a macro if the given argument has a '<tt/#/' as first token
1355   (immediate addressing mode), use something like this:
1356
1357     <tscreen><verb>
1358         .macro  ldax    arg
1359                 ...
1360                 .if (.match (.mid (0, 1, {arg}), #))
1361
1362                 ; ldax called with immediate operand
1363                 ...
1364
1365                 .endif
1366                 ...
1367         .endmacro
1368   </verb></tscreen>
1369
1370   See also the <tt><ref id=".LEFT" name=".LEFT"></tt> and <tt><ref id=".RIGHT"
1371   name=".RIGHT"></tt> builtin functions.
1372
1373
1374 <sect1><tt>.REF, .REFERENCED</tt><label id=".REFERENCED"><p>
1375
1376   Builtin function. The function expects an identifier as argument in braces.
1377   The argument is evaluated, and the function yields "true" if the identifier
1378   is a symbol that has already been referenced somewhere in the source file up
1379   to the current position. Otherwise the function yields false. As an example,
1380   the <tt><ref id=".IFREF" name=".IFREF"></tt> statement may be replaced by
1381
1382   <tscreen><verb>
1383         .if     .referenced(a)
1384   </verb></tscreen>
1385
1386   See: <tt><ref id=".DEFINED" name=".DEFINED"></tt>
1387
1388
1389 <sect1><tt>.RIGHT</tt><label id=".RIGHT"><p>
1390
1391   Builtin function. Extracts the right part of a given token list.
1392
1393   Syntax:
1394
1395   <tscreen><verb>
1396         .RIGHT (&lt;int expr&gt;, &lt;token list&gt;)
1397   </verb></tscreen>
1398
1399   The first integer expression gives the number of tokens to extract from the
1400   token list. The second argument is the token list itself.  The token list
1401   may optionally be enclosed into curly braces. This allows the inclusion of
1402   tokens that would otherwise terminate the list (the closing right paren in
1403   the given case).
1404
1405   See also the <tt><ref id=".LEFT" name=".LEFT"></tt> and <tt><ref id=".MID"
1406   name=".MID"></tt> builtin functions.
1407
1408
1409 <sect1><tt>.SIZEOF</tt><label id=".SIZEOF"><p>
1410
1411   <tt/.SIZEOF/ is a pseudo function that returns the size of its argument. The
1412   argument can be a struct/union, a struct member, a procedure, or a label. In
1413   case of a procedure or label, its size is defined by the amount of data
1414   placed in the segment where the label is relative to. If a line of code
1415   switches segments (for example in a macro) data placed in other segments
1416   does not count for the size.
1417
1418   Please note that a symbol or scope must exist, before it is used together with
1419   <tt/.SIZEOF/ (this may get relaxed later, but will always be true for scopes).
1420   A scope has preference over a symbol with the same name, so if the last part
1421   of a name represents both, a scope and a symbol, the scope is chosen over the
1422   symbol.
1423
1424   After the following code:
1425
1426   <tscreen><verb>
1427         .struct Point                   ; Struct size = 4
1428                 xcoord  .word
1429                 xcoord  .word
1430         .endstruct
1431
1432         P:      .tag    Point           ; Declare a point
1433         @P:     .tag    Point           ; Declare another point
1434
1435         .code
1436         .proc   Code
1437                 nop
1438                 .proc   Inner
1439                         nop
1440                 .endproc
1441                 nop
1442         .endproc
1443
1444         .proc   Data
1445         .data                           ; Segment switch!!!
1446                 .res    4
1447         .endproc
1448   </verb></tscreen>
1449
1450   <descrip>
1451     <tag><tt/.sizeof(Point)/</tag>
1452     will have the value 4, because this is the size of struct <tt/Point/.
1453
1454     <tag><tt/.sizeof(Point::xcoord)/</tag>
1455     will have the value 2, because this is the size of the member <tt/xcoord/
1456     in struct <tt/Point/.
1457
1458     <tag><tt/.sizeof(P)/</tag>
1459     will have the value 4, this is the size of the data declared on the same
1460     source line as the label <tt/P/, which is in the same segment that <tt/P/
1461     is relative to.
1462
1463     <tag><tt/.sizeof(@P)/</tag>
1464     will have the value 4, see above. The example demonstrates that <tt/.SIZEOF/
1465     does also work for cheap local symbols.
1466
1467     <tag><tt/.sizeof(Code)/</tag>
1468     will have the value 3, since this is amount of data emitted into the code
1469     segment, the segment that was active when <tt/Code/ was entered. Note that
1470     this value includes the amount of data emitted in child scopes (in this
1471     case <tt/Code::Inner/).
1472
1473     <tag><tt/.sizeof(Code::Inner)/</tag>
1474     will have the value 1 as expected.
1475
1476     <tag><tt/.sizeof(Data)/</tag>
1477     will have the value 0. Data is emitted within the scope <tt/Data/, but since
1478     the segment is switched after entry, this data is emitted into another
1479     segment.
1480   </descrip>
1481
1482
1483 <sect1><tt>.STRAT</tt><label id=".STRAT"><p>
1484
1485   Builtin function. The function accepts a string and an index as
1486   arguments and returns the value of the character at the given position
1487   as an integer value. The index is zero based.
1488
1489   Example:
1490
1491   <tscreen><verb>
1492         .macro  M       Arg
1493                 ; Check if the argument string starts with '#'
1494                 .if (.strat (Arg, 0) = '#')
1495                 ...
1496                 .endif
1497         .endmacro
1498   </verb></tscreen>
1499
1500
1501 <sect1><tt>.SPRINTF</tt><label id=".SPRINTF"><p>
1502
1503   Builtin function. It expects a format string as first argument. The number
1504   and type of the following arguments depend on the format string. The format
1505   string is similar to the one of the C <tt/printf/ function. Missing things
1506   are: Length modifiers, variable width.
1507
1508   The result of the function is a string.
1509
1510   Example:
1511
1512   <tscreen><verb>
1513         num     = 3
1514
1515         ; Generate an identifier:
1516         .ident (.sprintf ("%s%03d", "label", num)):
1517   </verb></tscreen>
1518
1519
1520 <sect1><tt>.STRING</tt><label id=".STRING"><p>
1521
1522   Builtin function. The function accepts an argument in braces and converts
1523   this argument into a string constant. The argument may be an identifier, or
1524   a constant numeric value.
1525
1526   Since you can use a string in the first place, the use of the function may
1527   not be obvious. However, it is useful in macros, or more complex setups.
1528
1529   Example:
1530
1531   <tscreen><verb>
1532         ; Emulate other assemblers:
1533         .macro  section name
1534                 .segment        .string(name)
1535         .endmacro
1536   </verb></tscreen>
1537
1538
1539 <sect1><tt>.STRLEN</tt><label id=".STRLEN"><p>
1540
1541   Builtin function. The function accepts a string argument in braces and
1542   evaluates to the length of the string.
1543
1544   Example:
1545
1546   The following macro encodes a string as a pascal style string with
1547   a leading length byte.
1548
1549   <tscreen><verb>
1550         .macro  PString Arg
1551                 .byte   .strlen(Arg), Arg
1552         .endmacro
1553   </verb></tscreen>
1554
1555
1556 <sect1><tt>.TCOUNT</tt><label id=".TCOUNT"><p>
1557
1558   Builtin function. The function accepts a token list in braces. The function
1559   result is the number of tokens given as argument. The token list may
1560   optionally be enclosed into curly braces which are not considered part of
1561   the list and not counted. Enclosement in curly braces allows the inclusion
1562   of tokens that would otherwise terminate the list (the closing right paren
1563   in the given case).
1564
1565   Example:
1566
1567   The <tt/ldax/ macro accepts the '#' token to denote immediate addressing (as
1568   with the normal 6502 instructions). To translate it into two separate 8 bit
1569   load instructions, the '#' token has to get stripped from the argument:
1570
1571   <tscreen><verb>
1572         .macro  ldax    arg
1573                 .if (.match (.mid (0, 1, {arg}), #))
1574                 ; ldax called with immediate operand
1575                 lda     #<(.right (.tcount ({arg})-1, {arg}))
1576                 ldx     #>(.right (.tcount ({arg})-1, {arg}))
1577                 .else
1578                 ...
1579                 .endif
1580         .endmacro
1581   </verb></tscreen>
1582
1583
1584 <sect1><tt>.XMATCH</tt><label id=".XMATCH"><p>
1585
1586   Builtin function. Matches two token lists against each other. This is
1587   most useful within macros, since macros are not stored as strings, but
1588   as lists of tokens.
1589
1590   The syntax is
1591
1592   <tscreen><verb>
1593         .XMATCH(&lt;token list #1&gt;, &lt;token list #2&gt;)
1594   </verb></tscreen>
1595
1596   Both token list may contain arbitrary tokens with the exception of the
1597   terminator token (comma resp. right parenthesis) and
1598
1599   <itemize>
1600   <item>end-of-line
1601   <item>end-of-file
1602   </itemize>
1603
1604   The token lists may optionally be enclosed into curly braces. This allows
1605   the inclusion of tokens that would otherwise terminate the list (the closing
1606   right paren in the given case). Often a macro parameter is used for any of
1607   the token lists.
1608
1609   The function compares tokens <em/and/ token values. If you need a function
1610   that just compares the type of tokens, have a look at the <tt><ref
1611   id=".MATCH" name=".MATCH"></tt> function.
1612
1613   See: <tt><ref id=".MATCH" name=".MATCH"></tt>
1614
1615
1616
1617 <sect>Control commands<label id="control-commands"><p>
1618
1619 Here's a list of all control commands and a description, what they do:
1620
1621
1622 <sect1><tt>.A16</tt><label id=".A16"><p>
1623
1624   Valid only in 65816 mode. Switch the accumulator to 16 bit.
1625
1626   Note: This command will not emit any code, it will tell the assembler to
1627   create 16 bit operands for immediate accumulator addressing mode.
1628
1629   See also: <tt><ref id=".SMART" name=".SMART"></tt>
1630
1631
1632 <sect1><tt>.A8</tt><label id=".A8"><p>
1633
1634   Valid only in 65816 mode. Switch the accumulator to 8 bit.
1635
1636   Note: This command will not emit any code, it will tell the assembler to
1637   create 8 bit operands for immediate accu addressing mode.
1638
1639   See also: <tt><ref id=".SMART" name=".SMART"></tt>
1640
1641
1642 <sect1><tt>.ADDR</tt><label id=".ADDR"><p>
1643
1644   Define word sized data. In 6502 mode, this is an alias for <tt/.WORD/ and
1645   may be used for better readability if the data words are address values. In
1646   65816 mode, the address is forced to be 16 bit wide to fit into the current
1647   segment. See also <tt><ref id=".FARADDR" name=".FARADDR"></tt>. The command
1648   must be followed by a sequence of (not necessarily constant) expressions.
1649
1650   Example:
1651
1652   <tscreen><verb>
1653         .addr   $0D00, $AF13, _Clear
1654   </verb></tscreen>
1655
1656   See: <tt><ref id=".FARADDR" name=".FARADDR"></tt>, <tt><ref id=".WORD"
1657        name=".WORD"></tt>
1658
1659
1660 <sect1><tt>.ALIGN</tt><label id=".ALIGN"><p>
1661
1662   Align data to a given boundary. The command expects a constant integer
1663   argument that must be a power of two, plus an optional second argument
1664   in byte range. If there is a second argument, it is used as fill value,
1665   otherwise the value defined in the linker configuration file is used
1666   (the default for this value is zero).
1667
1668   Since alignment depends on the base address of the module, you must
1669   give the same (or a greater) alignment for the segment when linking.
1670   The linker will give you a warning, if you don't do that.
1671
1672   Example:
1673
1674   <tscreen><verb>
1675         .align  256
1676   </verb></tscreen>
1677
1678
1679 <sect1><tt>.ASCIIZ</tt><label id=".ASCIIZ"><p>
1680
1681   Define a string with a trailing zero.
1682
1683   Example:
1684
1685   <tscreen><verb>
1686         Msg:    .asciiz "Hello world"
1687   </verb></tscreen>
1688
1689   This will put the string "Hello world" followed by a binary zero into
1690   the current segment. There may be more strings separated by commas, but
1691   the binary zero is only appended once (after the last one).
1692
1693
1694 <sect1><tt>.ASSERT</tt><label id=".ASSERT"><p>
1695
1696   Add an assertion. The command is followed by an expression, an action
1697   specifier, and an optional message that is output in case the assertion
1698   fails. If no message was given, the string "Assertion failed" is used. The
1699   action specifier may be one of <tt/warning/ or <tt/error/. The assertion is
1700   evaluated by the assembler if possible, and also passed to the linker in the
1701   object file (if one is generated). The linker will then evaluate the
1702   expression when segment placement has been done.
1703
1704   Example:
1705
1706   <tscreen><verb>
1707         .assert         * = $8000, error, "Code not at $8000"
1708   </verb></tscreen>
1709
1710   The example assertion will check that the current location is at $8000,
1711   when the output file is written, and abort with an error if this is not
1712   the case. More complex expressions are possible. The action specifier
1713   <tt/warning/ outputs a warning, while the <tt/error/ specifier outputs
1714   an error message. In the latter case, generation if the output file is
1715   suppressed in both the assembler and linker.
1716
1717
1718 <sect1><tt>.AUTOIMPORT</tt><label id=".AUTOIMPORT"><p>
1719
1720   Is followed by a plus or a minus character. When switched on (using a
1721   +), undefined symbols are automatically marked as import instead of
1722   giving errors. When switched off (which is the default so this does not
1723   make much sense), this does not happen and an error message is
1724   displayed. The state of the autoimport flag is evaluated when the
1725   complete source was translated, before outputting actual code, so it is
1726   <em/not/ possible to switch this feature on or off for separate sections
1727   of code. The last setting is used for all symbols.
1728
1729   You should probably not use this switch because it delays error
1730   messages about undefined symbols until the link stage. The cc65
1731   compiler (which is supposed to produce correct assembler code in all
1732   circumstances, something which is not true for most assembler
1733   programmers) will insert this command to avoid importing each and every
1734   routine from the runtime library.
1735
1736   Example:
1737
1738   <tscreen><verb>
1739         .autoimport     +       ; Switch on auto import
1740   </verb></tscreen>
1741
1742
1743 <sect1><tt>.BSS</tt><label id=".BSS"><p>
1744
1745   Switch to the BSS segment. The name of the BSS segment is always "BSS",
1746   so this is a shortcut for
1747
1748   <tscreen><verb>
1749         .segment  "BSS"
1750   </verb></tscreen>
1751
1752   See also the <tt><ref id=".SEGMENT" name=".SEGMENT"></tt> command.
1753
1754
1755 <sect1><tt>.BYT, .BYTE</tt><label id=".BYTE"><p>
1756
1757   Define byte sized data. Must be followed by a sequence of (byte ranged)
1758   expressions or strings.
1759
1760   Example:
1761
1762   <tscreen><verb>
1763         .byte   "Hello "
1764         .byt    "world", $0D, $00
1765   </verb></tscreen>
1766
1767
1768 <sect1><tt>.CASE</tt><label id=".CASE"><p>
1769
1770   Switch on or off case sensitivity on identifiers. The default is off
1771   (that is, identifiers are case sensitive), but may be changed by the
1772   -i switch on the command line.
1773   The command must be followed by a '+' or '-' character to switch the
1774   option on or off respectively.
1775
1776   Example:
1777
1778   <tscreen><verb>
1779         .case   -               ; Identifiers are not case sensitive
1780   </verb></tscreen>
1781
1782
1783 <sect1><tt>.CHARMAP</tt><label id=".CHARMAP"><p>
1784
1785   Apply a custom mapping for characters. The command is followed by two
1786   numbers in the range 1..255. The first one is the index of the source
1787   character, the second one is the mapping. The mapping applies to all
1788   character and string constants when they generate output, and overrides
1789   a mapping table specified with the <tt><ref id="option-t" name="-t"></tt>
1790   command line switch.
1791
1792   Example:
1793
1794   <tscreen><verb>
1795         .charmap        $41, $61        ; Map 'A' to 'a'
1796   </verb></tscreen>
1797
1798
1799 <sect1><tt>.CODE</tt><label id=".CODE"><p>
1800
1801   Switch to the CODE segment. The name of the CODE segment is always
1802   "CODE", so this is a shortcut for
1803
1804   <tscreen><verb>
1805         .segment  "CODE"
1806   </verb></tscreen>
1807
1808   See also the <tt><ref id=".SEGMENT" name=".SEGMENT"></tt> command.
1809
1810
1811 <sect1><tt>.CONDES</tt><label id=".CONDES"><p>
1812
1813   Export a symbol and mark it in a special way. The linker is able to build
1814   tables of all such symbols. This may be used to automatically create a list
1815   of functions needed to initialize linked library modules.
1816
1817   Note: The linker has a feature to build a table of marked routines, but it
1818   is your code that must call these routines, so just declaring a symbol with
1819   <tt/.CONDES/ does nothing by itself.
1820
1821   All symbols are exported as an absolute (16 bit) symbol. You don't need to
1822   use an additional <tt><ref id=".EXPORT" name=".EXPORT"></tt> statement, this
1823   is implied by <tt/.CONDES/.
1824
1825   <tt/.CONDES/ is followed by the type, which may be <tt/constructor/,
1826   <tt/destructor/ or a numeric value between 0 and 6 (where 0 is the same as
1827   specifying <tt/constructor/ and 1 is equal to specifying <tt/destructor/).
1828   The <tt><ref id=".CONSTRUCTOR" name=".CONSTRUCTOR"></tt>, <tt><ref
1829   id=".DESTRUCTOR" name=".DESTRUCTOR"></tt> and <tt><ref id=".INTERRUPTOR"
1830   name=".INTERRUPTOR"></tt> commands are actually shortcuts for <tt/.CONDES/
1831   with a type of <tt/constructor/ resp. <tt/destructor/ or <tt/interruptor/.
1832
1833   After the type, an optional priority may be specified. Higher numeric values
1834   mean higher priority. If no priority is given, the default priority of 7 is
1835   used. Be careful when assigning priorities to your own module constructors
1836   so they won't interfere with the ones in the cc65 library.
1837
1838   Example:
1839
1840   <tscreen><verb>
1841         .condes         ModuleInit, constructor
1842         .condes         ModInit, 0, 16
1843   </verb></tscreen>
1844
1845   See the <tt><ref id=".CONSTRUCTOR" name=".CONSTRUCTOR"></tt>, <tt><ref
1846   id=".DESTRUCTOR" name=".DESTRUCTOR"></tt> and <tt><ref id=".INTERRUPTOR"
1847   name=".INTERRUPTOR"></tt> commands and the separate section <ref id="condes"
1848   name="Module constructors/destructors"> explaining the feature in more
1849   detail.
1850
1851
1852 <sect1><tt>.CONSTRUCTOR</tt><label id=".CONSTRUCTOR"><p>
1853
1854   Export a symbol and mark it as a module constructor. This may be used
1855   together with the linker to build a table of constructor subroutines that
1856   are called by the startup code.
1857
1858   Note: The linker has a feature to build a table of marked routines, but it
1859   is your code that must call these routines, so just declaring a symbol as
1860   constructor does nothing by itself.
1861
1862   A constructor is always exported as an absolute (16 bit) symbol. You don't
1863   need to use an additional <tt/.export/ statement, this is implied by
1864   <tt/.constructor/. It may have an optional priority that is separated by a
1865   comma. Higher numeric values mean a higher priority. If no priority is
1866   given, the default priority of 7 is used. Be careful when assigning
1867   priorities to your own module constructors so they won't interfere with the
1868   ones in the cc65 library.
1869
1870   Example:
1871
1872   <tscreen><verb>
1873         .constructor    ModuleInit
1874         .constructor    ModInit, 16
1875   </verb></tscreen>
1876
1877   See the <tt><ref id=".CONDES" name=".CONDES"></tt> and <tt><ref
1878   id=".DESTRUCTOR" name=".DESTRUCTOR"></tt> commands and the separate section
1879   <ref id="condes" name="Module constructors/destructors"> explaining the
1880   feature in more detail.
1881
1882
1883 <sect1><tt>.DATA</tt><label id=".DATA"><p>
1884
1885   Switch to the DATA segment. The name of the DATA segment is always
1886   "DATA", so this is a shortcut for
1887
1888   <tscreen><verb>
1889         .segment  "DATA"
1890   </verb></tscreen>
1891
1892   See also the <tt><ref id=".SEGMENT" name=".SEGMENT"></tt> command.
1893
1894
1895 <sect1><tt>.DBYT</tt><label id=".DBYT"><p>
1896
1897   Define word sized data with the hi and lo bytes swapped (use <tt/.WORD/ to
1898   create word sized data in native 65XX format). Must be followed by a
1899   sequence of (word ranged) expressions.
1900
1901   Example:
1902
1903   <tscreen><verb>
1904         .dbyt   $1234, $4512
1905   </verb></tscreen>
1906
1907   This will emit the bytes
1908
1909   <tscreen><verb>
1910         $12 $34 $45 $12
1911   </verb></tscreen>
1912
1913   into the current segment in that order.
1914
1915
1916 <sect1><tt>.DEBUGINFO</tt><label id=".DEBUGINFO"><p>
1917
1918   Switch on or off debug info generation. The default is off (that is,
1919   the object file will not contain debug infos), but may be changed by the
1920   -g switch on the command line.
1921   The command must be followed by a '+' or '-' character to switch the
1922   option on or off respectively.
1923
1924   Example:
1925
1926   <tscreen><verb>
1927         .debuginfo      +       ; Generate debug info
1928   </verb></tscreen>
1929
1930
1931 <sect1><tt>.DEFINE</tt><label id=".DEFINE"><p>
1932
1933   Start a define style macro definition. The command is followed by an
1934   identifier (the macro name) and optionally by a list of formal arguments
1935   in braces.
1936   See section <ref id="macros" name="Macros">.
1937
1938
1939 <sect1><tt>.DEF, .DEFINED</tt><label id=".DEFINED"><p>
1940
1941   Builtin function. The function expects an identifier as argument in braces.
1942   The argument is evaluated, and the function yields "true" if the identifier
1943   is a symbol that is already defined somewhere in the source file up to the
1944   current position. Otherwise the function yields false. As an example, the
1945   <tt><ref id=".IFDEF" name=".IFDEF"></tt> statement may be replaced by
1946
1947   <tscreen><verb>
1948         .if     .defined(a)
1949   </verb></tscreen>
1950
1951
1952 <sect1><tt>.DESTRUCTOR</tt><label id=".DESTRUCTOR"><p>
1953
1954   Export a symbol and mark it as a module destructor. This may be used
1955   together with the linker to build a table of destructor subroutines that
1956   are called by the startup code.
1957
1958   Note: The linker has a feature to build a table of marked routines, but it
1959   is your code that must call these routines, so just declaring a symbol as
1960   constructor does nothing by itself.
1961
1962   A destructor is always exported as an absolute (16 bit) symbol. You don't
1963   need to use an additional <tt/.export/ statement, this is implied by
1964   <tt/.destructor/. It may have an optional priority that is separated by a
1965   comma. Higher numerical values mean a higher priority. If no priority is
1966   given, the default priority of 7 is used. Be careful when assigning
1967   priorities to your own module destructors so they won't interfere with the
1968   ones in the cc65 library.
1969
1970   Example:
1971
1972   <tscreen><verb>
1973         .destructor     ModuleDone
1974         .destructor     ModDone, 16
1975   </verb></tscreen>
1976
1977   See the <tt><ref id=".CONDES" name=".CONDES"></tt> and <tt><ref
1978   id=".CONSTRUCTOR" name=".CONSTRUCTOR"></tt> commands and the separate
1979   section <ref id="condes" name="Module constructors/destructors"> explaining
1980   the feature in more detail.
1981
1982
1983 <sect1><tt>.DWORD</tt><label id=".DWORD"><p>
1984
1985   Define dword sized data (4 bytes) Must be followed by a sequence of
1986   expressions.
1987
1988   Example:
1989
1990   <tscreen><verb>
1991         .dword  $12344512, $12FA489
1992   </verb></tscreen>
1993
1994
1995 <sect1><tt>.ELSE</tt><label id=".ELSE"><p>
1996
1997   Conditional assembly: Reverse the current condition.
1998
1999
2000 <sect1><tt>.ELSEIF</tt><label id=".ELSEIF"><p>
2001
2002   Conditional assembly: Reverse current condition and test a new one.
2003
2004
2005 <sect1><tt>.END</tt><label id=".END"><p>
2006
2007   Forced end of assembly. Assembly stops at this point, even if the command
2008   is read from an include file.
2009
2010
2011 <sect1><tt>.ENDENUM</tt><label id=".ENDENUM"><p>
2012
2013   End a <tt><ref id=".ENUM" name=".ENUM"></tt> declaration.
2014
2015
2016 <sect1><tt>.ENDIF</tt><label id=".ENDIF"><p>
2017
2018   Conditional assembly: Close a <tt><ref id=".IF" name=".IF..."></tt> or
2019   <tt><ref id=".ELSE" name=".ELSE"></tt> branch.
2020
2021
2022 <sect1><tt>.ENDMAC, .ENDMACRO</tt><label id=".ENDMACRO"><p>
2023
2024   End of macro definition (see section <ref id="macros" name="Macros">).
2025
2026
2027 <sect1><tt>.ENDPROC</tt><label id=".ENDPROC"><p>
2028
2029   End of local lexical level (see <tt><ref id=".PROC" name=".PROC"></tt>).
2030
2031
2032 <sect1><tt>.ENDREP, .ENDREPEAT</tt><label id=".ENDREPEAT"><p>
2033
2034   End a <tt><ref id=".REPEAT" name=".REPEAT"></tt> block.
2035
2036
2037 <sect1><tt>.ENDSCOPE</tt><label id=".ENDSCOPE"><p>
2038
2039   End of local lexical level (see <tt/<ref id=".SCOPE" name=".SCOPE">/).
2040
2041
2042 <sect1><tt>.ENDSTRUCT</tt><label id=".ENDSTRUCT"><p>
2043
2044   Ends a struct definition. See the <tt/<ref id=".STRUCT" name=".STRUCT">/
2045   command and the separate section named <ref id="structs" name="&quot;Structs
2046   and unions&quot;">.
2047
2048
2049 <sect1><tt>.ENUM</tt><label id=".ENUM"><p>
2050
2051   Start an enumeration. This directive is very similar to the C <tt/enum/
2052   keyword. If a name is given, a new scope is created for the enumeration,
2053   otherwise the enumeration members are placed in the enclosing scope.
2054
2055   In the enumeration body, symbols are declared. The first symbol has a value
2056   of zero, and each following symbol will get the value of the preceding plus
2057   one. This behaviour may be overridden by an explicit assignment. Two symbols
2058   may have the same value.
2059
2060   Example:
2061
2062   <tscreen><verb>
2063         .enum   errorcodes
2064                 no_error
2065                 file_error
2066                 parse_error
2067         .endenum
2068   </verb></tscreen>
2069
2070   Above example will create a new scope named <tt/errorcodes/ with three
2071   symbols in it that get the values 0, 1 and 2 respectively. Another way
2072   to write this would have been:
2073
2074   <tscreen><verb>
2075         .scope  errorcodes
2076                 no_error        = 0
2077                 file_error      = 1
2078                 parse_error     = 2
2079         .endscope
2080   </verb></tscreen>
2081
2082   Please note that explicit scoping must be used to access the identifiers:
2083
2084   <tscreen><verb>
2085         .word   errorcodes::no_error
2086   </verb></tscreen>
2087
2088   A more complex example:
2089
2090   <tscreen><verb>
2091         .enum
2092                 EUNKNOWN        = -1
2093                 EOK
2094                 EFILE
2095                 EBUSY
2096                 EAGAIN
2097                 EWOULDBLOCK     = EAGAIN
2098         .endenum
2099   </verb></tscreen>
2100
2101   In this example, the enumeration does not have a name, which means that the
2102   members will be visible in the enclosing scope and can be used in this scope
2103   without explicit scoping. The first member (<tt/EUNKNOWN/) has the value -1.
2104   The value for the following members is incremented by one, so <tt/EOK/ would
2105   be zero and so on. <tt/EWOULDBLOCK/ is an alias for <tt/EGAIN/, so it has an
2106   override for the value using an already defined symbol.
2107
2108
2109 <sect1><tt>.ERROR</tt><label id=".ERROR"><p>
2110
2111   Force an assembly error. The assembler will output an error message
2112   preceded by "User error" and will <em/not/ produce an object file.
2113
2114   This command may be used to check for initial conditions that must be
2115   set before assembling a source file.
2116
2117   Example:
2118
2119   <tscreen><verb>
2120         .if     foo = 1
2121         ...
2122         .elseif bar = 1
2123         ...
2124         .else
2125         .error  "Must define foo or bar!"
2126         .endif
2127   </verb></tscreen>
2128
2129   See also the <tt><ref id=".WARNING" name=".WARNING"></tt> and <tt><ref
2130   id=".OUT" name=".OUT"></tt> directives.
2131
2132
2133 <sect1><tt>.EXITMAC, .EXITMACRO</tt><label id=".EXITMACRO"><p>
2134
2135   Abort a macro expansion immediately. This command is often useful in
2136   recursive macros. See separate section <ref id="macros" name="Macros">.
2137
2138
2139 <sect1><tt>.EXPORT</tt><label id=".EXPORT"><p>
2140
2141   Make symbols accessible from other modules. Must be followed by a comma
2142   separated list of symbols to export, with each one optionally followed by
2143   an address specification. The default is to export the symbol with the
2144   address size it actually has. The assembler will issue a warning, if the
2145   symbol is exported with an address size smaller than the actual address
2146   size.
2147
2148   Example:
2149
2150   <tscreen><verb>
2151         .export foo
2152         .export bar: far
2153   </verb></tscreen>
2154
2155   See: <tt><ref id=".EXPORTZP" name=".EXPORTZP"></tt>
2156
2157
2158 <sect1><tt>.EXPORTZP</tt><label id=".EXPORTZP"><p>
2159
2160   Make symbols accessible from other modules. Must be followed by a comma
2161   separated list of symbols to export. The exported symbols are explicitly
2162   marked as zero page symbols.
2163
2164   Example:
2165
2166   <tscreen><verb>
2167         .exportzp  foo, bar
2168   </verb></tscreen>
2169
2170   See: <tt><ref id=".EXPORT" name=".EXPORT"></tt>
2171
2172
2173 <sect1><tt>.FARADDR</tt><label id=".FARADDR"><p>
2174
2175   Define far (24 bit) address data. The command must be followed by a
2176   sequence of (not necessarily constant) expressions.
2177
2178   Example:
2179
2180   <tscreen><verb>
2181         .faraddr        DrawCircle, DrawRectangle, DrawHexagon
2182   </verb></tscreen>
2183
2184   See: <tt><ref id=".ADDR" name=".ADDR"></tt>
2185
2186
2187 <sect1><tt>.FEATURE</tt><label id=".FEATURE"><p>
2188
2189   This directive may be used to enable one or more compatibility features
2190   of the assembler. While the use of <tt/.FEATURE/ should be avoided when
2191   possible, it may be useful when porting sources written for other
2192   assemblers. There is no way to switch a feature off, once you have
2193   enabled it, so using
2194
2195   <tscreen><verb>
2196         .FEATURE        xxx
2197   </verb></tscreen>
2198
2199   will enable the feature until end of assembly is reached.
2200
2201   The following features are available:
2202
2203   <descrip>
2204
2205   <tag><tt>at_in_identifiers</tt></tag>
2206
2207     Accept the at character (`@') as a valid character in identifiers. The
2208     at character is not allowed to start an identifier, even with this
2209     feature enabled.
2210
2211   <tag><tt>dollar_in_identifiers</tt></tag>
2212
2213     Accept the dollar sign (`&dollar;') as a valid character in identifiers. The
2214     at character is not allowed to start an identifier, even with this
2215     feature enabled.
2216
2217   <tag><tt>dollar_is_pc</tt></tag>
2218
2219     The dollar sign may be used as an alias for the star (`*'), which
2220     gives the value of the current PC in expressions.
2221     Note: Assignment to the pseudo variable is not allowed.
2222
2223   <tag><tt>labels_without_colons</tt></tag>
2224
2225     Allow labels without a trailing colon. These labels are only accepted,
2226     if they start at the beginning of a line (no leading white space).
2227
2228   <tag><tt>leading_dot_in_identifiers</tt></tag>
2229
2230     Accept the dot (`.') as the first character of an identifier. This may be
2231     used for example to create macro names that start with a dot emulating
2232     control directives of other assemblers. Note however, that none of the
2233     reserved keywords built into the assembler, that starts with a dot, may be
2234     overridden. When using this feature, you may also get into trouble if
2235     later versions of the assembler define new keywords starting with a dot.
2236
2237   <tag><tt>loose_char_term</tt></tag>
2238
2239     Accept single quotes as well as double quotes as terminators for char
2240     constants.
2241
2242   <tag><tt>loose_string_term</tt></tag>
2243
2244     Accept single quotes as well as double quotes as terminators for string
2245     constants.
2246
2247   <tag><tt>missing_char_term</tt></tag>
2248
2249     Accept single quoted character constants where the terminating quote is
2250     missing.
2251     <tscreen><verb>
2252         lda     #'a
2253     </verb></tscreen>
2254     <bf/Note:/ This does not work in conjunction with <tt/.FEATURE
2255     loose_string_term/, since in this case the input would be ambiguous.
2256
2257   <tag><tt>pc_assignment</tt></tag>
2258
2259     Allow assignments to the PC symbol (`*' or `&dollar;' if <tt/dollar_is_pc/
2260     is enabled). Such an assignment is handled identical to the <tt><ref
2261     id=".ORG" name=".ORG"></tt> command (which is usually not needed, so just
2262     removing the lines with the assignments may also be an option when porting
2263     code written for older assemblers).
2264
2265   <tag><tt>ubiquitous_idents</tt></tag>
2266
2267     Allow the use of instructions names as names for macros and symbols. This
2268     makes it possible to "overload" instructions by defining a macro with the
2269     same name. This does also make it possible to introduce hard to find errors
2270     in your code, so be careful!
2271
2272   </descrip>
2273
2274   It is also possible to specify features on the command line using the
2275   <tt><ref id="option--feature" name="--feature"></tt> command line option.
2276   This is useful when translating sources written for older assemblers, when
2277   you don't want to change the source code.
2278
2279   As an example, to translate sources written for Andre Fachats xa65
2280   assembler, the features
2281
2282   <verb>
2283         labels_without_colons, pc_assignment, loose_char_term
2284   </verb>
2285
2286   may be helpful. They do not make ca65 completely compatible, so you may not
2287   be able to translate the sources without changes, even when enabling these
2288   features. However, I have found several sources that translate without
2289   problems when enabling these features on the command line.
2290
2291
2292 <sect1><tt>.FILEOPT, .FOPT</tt><label id=".FOPT"><p>
2293
2294   Insert an option string into the object file. There are two forms of
2295   this command, one specifies the option by a keyword, the second
2296   specifies it as a number. Since usage of the second one needs knowledge
2297   of the internal encoding, its use is not recommended and I will only
2298   describe the first form here.
2299
2300   The command is followed by one of the keywords
2301
2302   <tscreen><verb>
2303         author
2304         comment
2305         compiler
2306   </verb></tscreen>
2307
2308   a comma and a string. The option is written into the object file
2309   together with the string value. This is currently unidirectional and
2310   there is no way to actually use these options once they are in the
2311   object file.
2312
2313   Examples:
2314
2315   <tscreen><verb>
2316         .fileopt        comment, "Code stolen from my brother"
2317         .fileopt        compiler, "BASIC 2.0"
2318         .fopt           author, "J. R. User"
2319   </verb></tscreen>
2320
2321
2322 <sect1><tt>.FORCEIMPORT</tt><label id=".FORCEIMPORT"><p>
2323
2324   Import an absolute symbol from another module. The command is followed by a
2325   comma separated list of symbols to import. The command is similar to <tt>
2326   <ref id=".IMPORT" name=".IMPORT"></tt>, but the import reference is always
2327   written to the generated object file, even if the symbol is never referenced
2328   (<tt><ref id=".IMPORT" name=".IMPORT"></tt> will not generate import
2329   references for unused symbols).
2330
2331   Example:
2332
2333   <tscreen><verb>
2334         .forceimport    needthisone, needthistoo
2335   </verb></tscreen>
2336
2337   See: <tt><ref id=".IMPORT" name=".IMPORT"></tt>
2338
2339
2340 <sect1><tt>.GLOBAL</tt><label id=".GLOBAL"><p>
2341
2342   Declare symbols as global. Must be followed by a comma separated list of
2343   symbols to declare. Symbols from the list, that are defined somewhere in the
2344   source, are exported, all others are imported. Additional <tt><ref
2345   id=".IMPORT" name=".IMPORT"></tt> or <tt><ref id=".EXPORT"
2346   name=".EXPORT"></tt> commands for the same symbol are allowed.
2347
2348   Example:
2349
2350   <tscreen><verb>
2351         .global foo, bar
2352   </verb></tscreen>
2353
2354
2355 <sect1><tt>.GLOBALZP</tt><label id=".GLOBALZP"><p>
2356
2357   Declare symbols as global. Must be followed by a comma separated list of
2358   symbols to declare. Symbols from the list, that are defined somewhere in the
2359   source, are exported, all others are imported. Additional <tt><ref
2360   id=".IMPORTZP" name=".IMPORTZP"></tt> or <tt><ref id=".EXPORTZP"
2361   name=".EXPORTZP"></tt> commands for the same symbol are allowed. The symbols
2362   in the list are explicitly marked as zero page symbols.
2363
2364   Example:
2365
2366   <tscreen><verb>
2367         .globalzp foo, bar
2368   </verb></tscreen>
2369
2370
2371 <sect1><tt>.I16</tt><label id=".I16"><p>
2372
2373   Valid only in 65816 mode. Switch the index registers to 16 bit.
2374
2375   Note: This command will not emit any code, it will tell the assembler to
2376   create 16 bit operands for immediate operands.
2377
2378   See also the <tt><ref id=".I8" name=".I8"></tt> and <tt><ref id=".SMART"
2379   name=".SMART"></tt> commands.
2380
2381
2382 <sect1><tt>.I8</tt><label id=".I8"><p>
2383
2384   Valid only in 65816 mode. Switch the index registers to 8 bit.
2385
2386   Note: This command will not emit any code, it will tell the assembler to
2387   create 8 bit operands for immediate operands.
2388
2389   See also the <tt><ref id=".I16" name=".I16"></tt> and <tt><ref id=".SMART"
2390   name=".SMART"></tt> commands.
2391
2392
2393 <sect1><tt>.IF</tt><label id=".IF"><p>
2394
2395   Conditional assembly: Evaluate an expression and switch assembler output
2396   on or off depending on the expression. The expression must be a constant
2397   expression, that is, all operands must be defined.
2398
2399   A expression value of zero evaluates to FALSE, any other value evaluates
2400   to TRUE.
2401
2402
2403 <sect1><tt>.IFBLANK</tt><label id=".IFBLANK"><p>
2404
2405   Conditional assembly: Check if there are any remaining tokens in this line,
2406   and evaluate to FALSE if this is the case, and to TRUE otherwise. If the
2407   condition is not true, further lines are not assembled until an <tt><ref
2408   id=".ELSE" name=".ESLE"></tt>, <tt><ref id=".ELSEIF" name=".ELSEIF"></tt> or
2409   <tt><ref id=".ENDIF" name=".ENDIF"></tt> directive.
2410
2411   This command is often used to check if a macro parameter was given. Since an
2412   empty macro parameter will evaluate to nothing, the condition will evaluate
2413   to FALSE if an empty parameter was given.
2414
2415   Example:
2416
2417   <tscreen><verb>
2418         .macro     arg1, arg2
2419         .ifblank   arg2
2420                    lda     #arg1
2421         .else
2422                    lda     #arg2
2423         .endif
2424         .endmacro
2425   </verb></tscreen>
2426
2427   See also: <tt><ref id=".BLANK" name=".BLANK"></tt>
2428
2429
2430 <sect1><tt>.IFCONST</tt><label id=".IFCONST"><p>
2431
2432   Conditional assembly: Evaluate an expression and switch assembler output
2433   on or off depending on the constness of the expression.
2434
2435   A const expression evaluates to to TRUE, a non const expression (one
2436   containing an imported or currently undefined symbol) evaluates to
2437   FALSE.
2438
2439   See also: <tt><ref id=".CONST" name=".CONST"></tt>
2440
2441
2442 <sect1><tt>.IFDEF</tt><label id=".IFDEF"><p>
2443
2444   Conditional assembly: Check if a symbol is defined. Must be followed by
2445   a symbol name. The condition is true if the the given symbol is already
2446   defined, and false otherwise.
2447
2448   See also: <tt><ref id=".DEFINED" name=".DEFINED"></tt>
2449
2450
2451 <sect1><tt>.IFNBLANK</tt><label id=".IFNBLANK"><p>
2452
2453   Conditional assembly: Check if there are any remaining tokens in this line,
2454   and evaluate to TRUE if this is the case, and to FALSE otherwise. If the
2455   condition is not true, further lines are not assembled until an <tt><ref
2456   id=".ELSE" name=".ELSE"></tt>, <tt><ref id=".ELSEIF" name=".ELSEIF"></tt> or
2457   <tt><ref id=".ENDIF" name=".ENDIF"></tt> directive.
2458
2459   This command is often used to check if a macro parameter was given.
2460   Since an empty macro parameter will evaluate to nothing, the condition
2461   will evaluate to FALSE if an empty parameter was given.
2462
2463   Example:
2464
2465   <tscreen><verb>
2466         .macro     arg1, arg2
2467                    lda     #arg1
2468         .ifnblank  arg2
2469                    lda     #arg2
2470         .endif
2471         .endmacro
2472   </verb></tscreen>
2473
2474   See also: <tt><ref id=".BLANK" name=".BLANK"></tt>
2475
2476
2477 <sect1><tt>.IFNDEF</tt><label id=".IFNDEF"><p>
2478
2479   Conditional assembly: Check if a symbol is defined. Must be followed by
2480   a symbol name. The condition is true if the the given symbol is not
2481   defined, and false otherwise.
2482
2483   See also: <tt><ref id=".DEFINED" name=".DEFINED"></tt>
2484
2485
2486 <sect1><tt>.IFNREF</tt><label id=".IFNREF"><p>
2487
2488   Conditional assembly: Check if a symbol is referenced. Must be followed
2489   by a symbol name. The condition is true if if the the given symbol was
2490   not referenced before, and false otherwise.
2491
2492   See also: <tt><ref id=".REFERENCED" name=".REFERENCED"></tt>
2493
2494
2495 <sect1><tt>.IFP02</tt><label id=".IFP02"><p>
2496
2497   Conditional assembly: Check if the assembler is currently in 6502 mode
2498   (see <tt><ref id=".P02" name=".P02"></tt> command).
2499
2500
2501 <sect1><tt>.IFP816</tt><label id=".IFP816"><p>
2502
2503   Conditional assembly: Check if the assembler is currently in 65816 mode
2504   (see <tt><ref id=".P816" name=".P816"></tt> command).
2505
2506
2507 <sect1><tt>.IFPC02</tt><label id=".IFPC02"><p>
2508
2509   Conditional assembly: Check if the assembler is currently in 65C02 mode
2510   (see <tt><ref id=".PC02" name=".PC02"></tt> command).
2511
2512
2513 <sect1><tt>.IFPSC02</tt><label id=".IFPSC02"><p>
2514
2515   Conditional assembly: Check if the assembler is currently in 65SC02 mode
2516   (see <tt><ref id=".PSC02" name=".PSC02"></tt> command).
2517
2518
2519 <sect1><tt>.IFREF</tt><label id=".IFREF"><p>
2520
2521   Conditional assembly: Check if a symbol is referenced. Must be followed
2522   by a symbol name. The condition is true if if the the given symbol was
2523   referenced before, and false otherwise.
2524
2525   This command may be used to build subroutine libraries in include files
2526   (you may use separate object modules for this purpose too).
2527
2528   Example:
2529
2530   <tscreen><verb>
2531         .ifref  ToHex                   ; If someone used this subroutine
2532         ToHex:  tay                     ; Define subroutine
2533                 lda     HexTab,y
2534                 rts
2535         .endif
2536   </verb></tscreen>
2537
2538   See also: <tt><ref id=".REFERENCED" name=".REFERENCED"></tt>
2539
2540
2541 <sect1><tt>.IMPORT</tt><label id=".IMPORT"><p>
2542
2543   Import a symbol from another module. The command is followed by a comma
2544   separated list of symbols to import, with each one optionally followed by
2545   an address specification.
2546
2547   Example:
2548
2549   <tscreen><verb>
2550         .import foo
2551         .import bar: zeropage
2552   </verb></tscreen>
2553
2554   See: <tt><ref id=".IMPORTZP" name=".IMPORTZP"></tt>
2555
2556
2557 <sect1><tt>.IMPORTZP</tt><label id=".IMPORTZP"><p>
2558
2559   Import a symbol from another module. The command is followed by a comma
2560   separated list of symbols to import. The symbols are explicitly imported
2561   as zero page symbols (that is, symbols with values in byte range).
2562
2563   Example:
2564
2565   <tscreen><verb>
2566         .importzp       foo, bar
2567   </verb></tscreen>
2568
2569   See: <tt><ref id=".IMPORT" name=".IMPORT"></tt>
2570
2571
2572 <sect1><tt>.INCBIN</tt><label id=".INCBIN"><p>
2573
2574   Include a file as binary data. The command expects a string argument
2575   that is the name of a file to include literally in the current segment.
2576   In addition to that, a start offset and a size value may be specified,
2577   separated by commas. If no size is specified, all of the file from the
2578   start offset to end-of-file is used. If no start position is specified
2579   either, zero is assumed (which means that the whole file is inserted).
2580
2581   Example:
2582
2583   <tscreen><verb>
2584         ; Include whole file
2585         .incbin         "sprites.dat"
2586
2587         ; Include file starting at offset 256
2588         .incbin         "music.dat", $100
2589
2590         ; Read 100 bytes starting at offset 200
2591         .incbin         "graphics.dat", 200, 100
2592   </verb></tscreen>
2593
2594
2595 <sect1><tt>.INCLUDE</tt><label id=".INCLUDE"><p>
2596
2597   Include another file. Include files may be nested up to a depth of 16.
2598
2599   Example:
2600
2601   <tscreen><verb>
2602         .include        "subs.inc"
2603   </verb></tscreen>
2604
2605
2606 <sect1><tt>.INTERRUPTOR</tt><label id=".INTERRUPTOR"><p>
2607
2608   Export a symbol and mark it as an interruptor. This may be used together
2609   with the linker to build a table of interruptor subroutines that are called
2610   in an interrupt.
2611
2612   Note: The linker has a feature to build a table of marked routines, but it
2613   is your code that must call these routines, so just declaring a symbol as
2614   interruptor does nothing by itself.
2615
2616   An interruptor is always exported as an absolute (16 bit) symbol. You don't
2617   need to use an additional <tt/.export/ statement, this is implied by
2618   <tt/.interruptor/. It may have an optional priority that is separated by a
2619   comma. Higher numeric values mean a higher priority. If no priority is
2620   given, the default priority of 7 is used. Be careful when assigning
2621   priorities to your own module constructors so they won't interfere with the
2622   ones in the cc65 library.
2623
2624   Example:
2625
2626   <tscreen><verb>
2627         .interruptor    IrqHandler
2628         .interruptor    Handler, 16
2629   </verb></tscreen>
2630
2631   See the <tt><ref id=".CONDES" name=".CONDES"></tt> command and the separate
2632   section <ref id="condes" name="Module constructors/destructors"> explaining
2633   the feature in more detail.
2634
2635
2636 <sect1><tt>.LINECONT</tt><label id=".LINECONT"><p>
2637
2638   Switch on or off line continuations using the backslash character
2639   before a newline. The option is off by default.
2640   Note: Line continuations do not work in a comment. A backslash at the
2641   end of a comment is treated as part of the comment and does not trigger
2642   line continuation.
2643   The command must be followed by a '+' or '-' character to switch the
2644   option on or off respectively.
2645
2646   Example:
2647
2648   <tscreen><verb>
2649         .linecont       +               ; Allow line continuations
2650
2651         lda     \
2652                 #$20                    ; This is legal now
2653   </verb></tscreen>
2654
2655
2656 <sect1><tt>.LIST</tt><label id=".LIST"><p>
2657
2658   Enable output to the listing. The command must be followed by a boolean
2659   switch ("on", "off", "+" or "-") and will enable or disable listing
2660   output.
2661   The option has no effect if the listing is not enabled by the command line
2662   switch -l. If -l is used, an internal counter is set to 1. Lines are output
2663   to the listing file, if the counter is greater than zero, and suppressed if
2664   the counter is zero. Each use of <tt/.LIST/ will increment or decrement the
2665   counter.
2666
2667   Example:
2668
2669   <tscreen><verb>
2670         .list   on              ; Enable listing output
2671   </verb></tscreen>
2672
2673
2674 <sect1><tt>.LISTBYTES</tt><label id=".LISTBYTES"><p>
2675
2676   Set, how many bytes are shown in the listing for one source line. The
2677   default is 12, so the listing will show only the first 12 bytes for any
2678   source line that generates more than 12 bytes of code or data.
2679   The directive needs an argument, which is either "unlimited", or an
2680   integer constant in the range 4..255.
2681
2682   Examples:
2683
2684   <tscreen><verb>
2685         .listbytes      unlimited       ; List all bytes
2686         .listbytes      12              ; List the first 12 bytes
2687         .incbin         "data.bin"      ; Include large binary file
2688   </verb></tscreen>
2689
2690
2691 <sect1><tt>.LOCAL</tt><label id=".LOCAL"><p>
2692
2693   This command may only be used inside a macro definition. It declares a
2694   list of identifiers as local to the macro expansion.
2695
2696   A problem when using macros are labels: Since they don't change their name,
2697   you get a "duplicate symbol" error if the macro is expanded the second time.
2698   Labels declared with <tt><ref id=".LOCAL" name=".LOCAL"></tt> have their
2699   name mapped to an internal unique name (<tt/___ABCD__/) with each macro
2700   invocation.
2701
2702   Some other assemblers start a new lexical block inside a macro expansion.
2703   This has some drawbacks however, since that will not allow <em/any/ symbol
2704   to be visible outside a macro, a feature that is sometimes useful. The
2705   <tt><ref id=".LOCAL" name=".LOCAL"></tt> command is in my eyes a better way
2706   to address the problem.
2707
2708   You get an error when using <tt><ref id=".LOCAL" name=".LOCAL"></tt> outside
2709   a macro.
2710
2711
2712 <sect1><tt>.LOCALCHAR</tt><label id=".LOCALCHAR"><p>
2713
2714   Defines the character that start "cheap" local labels. You may use one
2715   of '@' and '?' as start character. The default is '@'.
2716
2717   Cheap local labels are labels that are visible only between two non
2718   cheap labels. This way you can reuse identifiers like "<tt/loop/" without
2719   using explicit lexical nesting.
2720
2721   Example:
2722
2723   <tscreen><verb>
2724         .localchar      '?'
2725
2726         Clear:  lda     #$00            ; Global label
2727         ?Loop:  sta     Mem,y           ; Local label
2728                 dey
2729                 bne     ?Loop           ; Ok
2730                 rts
2731         Sub:    ...                     ; New global label
2732                 bne     ?Loop           ; ERROR: Unknown identifier!
2733   </verb></tscreen>
2734
2735
2736 <sect1><tt>.MACPACK</tt><label id=".MACPACK"><p>
2737
2738   Insert a predefined macro package. The command is followed by an
2739   identifier specifying the macro package to insert. Available macro
2740   packages are:
2741
2742   <tscreen><verb>
2743         generic         Defines generic macros like add and sub.
2744         longbranch      Defines conditional long jump macros.
2745         cbm             Defines the scrcode macro
2746         cpu             Defines constants for the .CPU variable
2747   </verb></tscreen>
2748
2749   Including a macro package twice, or including a macro package that
2750   redefines already existing macros will lead to an error.
2751
2752   Example:
2753
2754   <tscreen><verb>
2755         .macpack        longbranch      ; Include macro package
2756
2757                 cmp     #$20            ; Set condition codes
2758                 jne     Label           ; Jump long on condition
2759   </verb></tscreen>
2760
2761   Macro packages are explained in more detail in section <ref
2762   id="macropackages" name="Macro packages">.
2763
2764
2765 <sect1><tt>.MAC, .MACRO</tt><label id=".MAC"><p>
2766
2767   Start a classic macro definition. The command is followed by an identifier
2768   (the macro name) and optionally by a comma separated list of identifiers
2769   that are macro parameters.
2770
2771   See section <ref id="macros" name="Macros">.
2772
2773
2774 <sect1><tt>.ORG</tt><label id=".ORG"><p>
2775
2776   Start a section of absolute code. The command is followed by a constant
2777   expression that gives the new PC counter location for which the code is
2778   assembled. Use <tt><ref id=".RELOC" name=".RELOC"></tt> to switch back to
2779   relocatable code.
2780
2781   Please note that you <em/do not need/ this command in most cases. Placing
2782   code at a specific address is the job of the linker, not the assembler, so
2783   there is usually no reason to assemble code to a specific address.
2784
2785   You may not switch segments while inside a section of absolute code.
2786
2787   Example:
2788
2789   <tscreen><verb>
2790         .org    $7FF            ; Emit code starting at $7FF
2791   </verb></tscreen>
2792
2793
2794 <sect1><tt>.OUT</tt><label id=".OUT"><p>
2795
2796   Output a string to the console without producing an error. This command
2797   is similar to <tt/.ERROR/, however, it does not force an assembler error
2798   that prevents the creation of an object file.
2799
2800   Example:
2801
2802   <tscreen><verb>
2803         .out    "This code was written by the codebuster(tm)"
2804   </verb></tscreen>
2805
2806   See also the <tt><ref id=".WARNING" name=".WARNING"></tt> and <tt><ref
2807   id=".ERROR" name=".ERROR"></tt> directives.
2808
2809
2810 <sect1><tt>.P02</tt><label id=".P02"><p>
2811
2812   Enable the 6502 instruction set, disable 65SC02, 65C02 and 65816
2813   instructions. This is the default if not overridden by the
2814   <tt><ref id="option--cpu" name="--cpu"></tt> command line option.
2815
2816   See: <tt><ref id=".PC02" name=".PC02"></tt>, <tt><ref id=".PSC02"
2817   name=".PSC02"></tt> and <tt><ref id=".P816" name=".P816"></tt>
2818
2819
2820 <sect1><tt>.P816</tt><label id=".P816"><p>
2821
2822   Enable the 65816 instruction set. This is a superset of the 65SC02 and
2823   6502 instruction sets.
2824
2825   See: <tt><ref id=".P02" name=".P02"></tt>, <tt><ref id=".PSC02"
2826   name=".PSC02"></tt> and <tt><ref id=".PC02" name=".PC02"></tt>
2827
2828
2829 <sect1><tt>.PAGELEN, .PAGELENGTH</tt><label id=".PAGELENGTH"><p>
2830
2831   Set the page length for the listing. Must be followed by an integer
2832   constant. The value may be "unlimited", or in the range 32 to 127. The
2833   statement has no effect if no listing is generated. The default value is -1
2834   (unlimited) but may be overridden by the <tt/--pagelength/ command line
2835   option. Beware: Since ca65 is a one pass assembler, the listing is generated
2836   after assembly is complete, you cannot use multiple line lengths with one
2837   source. Instead, the value set with the last <tt/.PAGELENGTH/ is used.
2838
2839   Examples:
2840
2841   <tscreen><verb>
2842         .pagelength     66              ; Use 66 lines per listing page
2843
2844         .pagelength     unlimited       ; Unlimited page length
2845   </verb></tscreen>
2846
2847
2848 <sect1><tt>.PC02</tt><label id=".PC02"><p>
2849
2850   Enable the 65C02 instructions set. This instruction set includes all
2851   6502 and 65SC02 instructions.
2852
2853   See: <tt><ref id=".P02" name=".P02"></tt>, <tt><ref id=".PSC02"
2854   name=".PSC02"></tt> and <tt><ref id=".P816" name=".P816"></tt>
2855
2856
2857 <sect1><tt>.POPSEG</tt><label id=".POPSEG"><p>
2858
2859   Pop the last pushed segment from the stack, and set it.
2860
2861   This command will switch back to the segment that was last pushed onto the
2862   segment stack using the <tt><ref id=".PUSHSEG" name=".PUSHSEG"></tt>
2863   command, and remove this entry from the stack.
2864
2865   The assembler will print an error message if the segment stack is empty
2866   when this command is issued.
2867
2868   See: <tt><ref id=".PUSHSEG" name=".PUSHSEG"></tt>
2869
2870
2871 <sect1><tt>.PROC</tt><label id=".PROC"><p>
2872
2873   Start a nested lexical level with the given name and adds a symbol with this
2874   name to the enclosing scope. All new symbols from now on are in the local
2875   lexical level and are accessible from outside only via <ref id="scopesyntax"
2876   name="explicit scope specification">. Symbols defined outside this local
2877   level may be accessed as long as their names are not used for new symbols
2878   inside the level. Symbols names in other lexical levels do not clash, so you
2879   may use the same names for identifiers. The lexical level ends when the
2880   <tt><ref id=".ENDPROC" name=".ENDPROC"></tt> command is read. Lexical levels
2881   may be nested up to a depth of 16 (this is an artificial limit to protect
2882   against errors in the source).
2883
2884   Note: Macro names are always in the global level and in a separate name
2885   space. There is no special reason for this, it's just that I've never
2886   had any need for local macro definitions.
2887
2888   Example:
2889
2890   <tscreen><verb>
2891         .proc   Clear           ; Define Clear subroutine, start new level
2892                 lda     #$00
2893         L1:     sta     Mem,y   ; L1 is local and does not cause a
2894                                 ; duplicate symbol error if used in other
2895                                 ; places
2896                 dey
2897                 bne     L1      ; Reference local symbol
2898                 rts
2899         .endproc                ; Leave lexical level
2900   </verb></tscreen>
2901
2902   See: <tt/<ref id=".ENDPROC" name=".ENDPROC">/ and <tt/<ref id=".SCOPE"
2903   name=".SCOPE">/
2904
2905
2906 <sect1><tt>.PSC02</tt><label id=".PSC02"><p>
2907
2908   Enable the 65SC02 instructions set. This instruction set includes all
2909   6502 instructions.
2910
2911   See: <tt><ref id=".P02" name=".P02"></tt>, <tt><ref id=".PC02"
2912   name=".PC02"></tt> and <tt><ref id=".P816" name=".P816"></tt>
2913
2914
2915 <sect1><tt>.PUSHSEG</tt><label id=".PUSHSEG"><p>
2916
2917   Push the currently active segment onto a stack. The entries on the stack
2918   include the name of the segment and the segment type. The stack has a size
2919   of 16 entries.
2920
2921   <tt/.PUSHSEG/ allows together with <tt><ref id=".POPSEG" name=".POPSEG"></tt>
2922   to switch to another segment and to restore the old segment later, without
2923   even knowing the name and type of the current segment.
2924
2925   The assembler will print an error message if the segment stack is already
2926   full, when this command is issued.
2927
2928   See: <tt><ref id=".POPSEG" name=".POPSEG"></tt>
2929
2930
2931 <sect1><tt>.REPEAT</tt><label id=".REPEAT"><p>
2932
2933   Repeat all commands between <tt/.REPEAT/ and <tt><ref id=".ENDREPEAT"
2934   name=".ENDREPEAT"></tt> constant number of times. The command is followed by
2935   a constant expression that tells how many times the commands in the body
2936   should get repeated. Optionally, a comma and an identifier may be specified.
2937   If this identifier is found in the body of the repeat statement, it is
2938   replaced by the current repeat count (starting with zero for the first time
2939   the body is repeated).
2940
2941   <tt/.REPEAT/ statements may be nested. If you use the same repeat count
2942   identifier for a nested <tt/.REPEAT/ statement, the one from the inner
2943   level will be used, not the one from the outer level.
2944
2945   Example:
2946
2947   The following macro will emit a string that is "encrypted" in that all
2948   characters of the string are XORed by the value $55.
2949
2950   <tscreen><verb>
2951         .macro  Crypt   Arg
2952                 .repeat .strlen(Arg), I
2953                 .byte   .strat(Arg, I) ^ $55
2954                 .endrep
2955         .endmacro
2956   </verb></tscreen>
2957
2958   See: <tt><ref id=".ENDREPEAT" name=".ENDREPEAT"></tt>
2959
2960
2961 <sect1><tt>.RELOC</tt><label id=".RELOC"><p>
2962
2963   Switch back to relocatable mode. See the <tt><ref id=".ORG"
2964   name=".ORG"></tt> command.
2965
2966
2967 <sect1><tt>.RES</tt><label id=".RES"><p>
2968
2969   Reserve storage. The command is followed by one or two constant
2970   expressions. The first one is mandatory and defines, how many bytes of
2971   storage should be defined. The second, optional expression must by a
2972   constant byte value that will be used as value of the data. If there
2973   is no fill value given, the linker will use the value defined in the
2974   linker configuration file (default: zero).
2975
2976   Example:
2977
2978   <tscreen><verb>
2979         ; Reserve 12 bytes of memory with value $AA
2980         .res    12, $AA
2981   </verb></tscreen>
2982
2983
2984 <sect1><tt>.RODATA</tt><label id=".RODATA"><p>
2985
2986   Switch to the RODATA segment. The name of the RODATA segment is always
2987   "RODATA", so this is a shortcut for
2988
2989   <tscreen><verb>
2990         .segment  "RODATA"
2991   </verb></tscreen>
2992
2993   The RODATA segment is a segment that is used by the compiler for
2994   readonly data like string constants.
2995
2996   See also the <tt><ref id=".SEGMENT" name=".SEGMENT"></tt> command.
2997
2998
2999 <sect1><tt>.SCOPE</tt><label id=".SCOPE"><p>
3000
3001   Start a nested lexical level with the given name. All new symbols from now
3002   on are in the local lexical level and are accessible from outside only via
3003   <ref id="scopesyntax" name="explicit scope specification">. Symbols defined
3004   outside this local level may be accessed as long as their names are not used
3005   for new symbols inside the level. Symbols names in other lexical levels do
3006   not clash, so you may use the same names for identifiers. The lexical level
3007   ends when the <tt><ref id=".ENDSCOPE" name=".ENDSCOPE"></tt> command is
3008   read. Lexical levels may be nested up to a depth of 16 (this is an
3009   artificial limit to protect against errors in the source).
3010
3011   Note: Macro names are always in the global level and in a separate name
3012   space. There is no special reason for this, it's just that I've never
3013   had any need for local macro definitions.
3014
3015   Example:
3016
3017   <tscreen><verb>
3018         .scope  Error                   ; Start new scope named Error
3019                 None = 0                ; No error
3020                 File = 1                ; File error
3021                 Parse = 2               ; Parse error
3022         .endproc                        ; Close lexical level
3023
3024                 ...
3025                 lda #Error::File        ; Use symbol from scope Error
3026   </verb></tscreen>
3027
3028   See: <tt/<ref id=".ENDSCOPE" name=".ENDSCOPE">/ and <tt/<ref id=".PROC"
3029   name=".PROC">/
3030
3031
3032 <sect1><tt>.SEGMENT</tt><label id=".SEGMENT"><p>
3033
3034   Switch to another segment. Code and data is always emitted into a
3035   segment, that is, a named section of data. The default segment is
3036   "CODE". There may be up to 254 different segments per object file
3037   (and up to 65534 per executable). There are shortcut commands for
3038   the most common segments ("CODE", "DATA" and "BSS").
3039
3040   The command is followed by a string containing the segment name (there are
3041   some constraints for the name - as a rule of thumb use only those segment
3042   names that would also be valid identifiers). There may also be an optional
3043   address size separated by a colon. See the section covering <tt/<ref
3044   id="address-sizes" name="address sizes">/ for more information.
3045
3046   The default address size for a segment depends on the memory model specified
3047   on the command line. The default is "absolute", which means that you don't
3048   have to use an address size modifier in most cases.
3049
3050   "absolute" means that the is a segment with 16 bit (absolute) addressing.
3051   That is, the segment will reside somewhere in core memory outside the zero
3052   page. "zeropage" (8 bit) means that the segment will be placed in the zero
3053   page and direct (short) addressing is possible for data in this segment.
3054
3055   Beware: Only labels in a segment with the zeropage attribute are marked
3056   as reachable by short addressing. The `*' (PC counter) operator will
3057   work as in other segments and will create absolute variable values.
3058
3059   Please note that a segment cannot have two different address sizes. A
3060   segment specified as zeropage cannot be declared as being absolute later.
3061
3062   Examples:
3063
3064   <tscreen><verb>
3065         .segment "ROM2"                 ; Switch to ROM2 segment
3066         .segment "ZP2": zeropage        ; New direct segment
3067         .segment "ZP2"                  ; Ok, will use last attribute
3068         .segment "ZP2": absolute        ; Error, redecl mismatch
3069   </verb></tscreen>
3070
3071   See: <tt><ref id=".BSS" name=".BSS"></tt>, <tt><ref id=".CODE"
3072   name=".CODE"></tt>, <tt><ref id=".DATA" name=".DATA"></tt> and <tt><ref
3073   id=".RODATA" name=".RODATA"></tt>
3074
3075
3076 <sect1><tt>.SETCPU</tt><label id=".SETCPU"><p>
3077
3078   Switch the CPU instruction set. The command is followed by a string that
3079   specifies the CPU. Possible values are those that can also be supplied to
3080   the <tt><ref id="option--cpu" name="--cpu"></tt> command line option,
3081   namely: 6502, 6502X, 65SC02, 65C02, 65816 and sunplus. Please note that
3082   support for the sunplus CPU is not available in the freeware version,
3083   because the instruction set of the sunplus CPU is "proprietary and
3084   confidential".
3085
3086   See: <tt><ref id=".CPU" name=".CPU"></tt>,
3087        <tt><ref id=".IFP02" name=".IFP02"></tt>,
3088        <tt><ref id=".IFP816" name=".IFP816"></tt>,
3089        <tt><ref id=".IFPC02" name=".IFPC02"></tt>,
3090        <tt><ref id=".IFPSC02" name=".IFPSC02"></tt>,
3091        <tt><ref id=".P02" name=".P02"></tt>,
3092        <tt><ref id=".P816" name=".P816"></tt>,
3093        <tt><ref id=".PC02" name=".PC02"></tt>,
3094        <tt><ref id=".PSC02" name=".PSC02"></tt>
3095
3096
3097 <sect1><tt>.SMART</tt><label id=".SMART"><p>
3098
3099   Switch on or off smart mode. The command must be followed by a '+' or '-'
3100   character to switch the option on or off respectively. The default is off
3101   (that is, the assembler doesn't try to be smart), but this default may be
3102   changed by the -s switch on the command line.
3103
3104   In smart mode the assembler will do the following:
3105
3106   <itemize>
3107   <item>Track usage of the <tt/REP/ and <tt/SEP/ instructions in 65816 mode
3108         and update the operand sizes accordingly. If the operand of such an
3109         instruction cannot be evaluated by the assembler (for example, because
3110         the operand is an imported symbol), a warning is issued. Beware: Since
3111         the assembler cannot trace the execution flow this may lead to false
3112         results in some cases. If in doubt, use the <tt/.Inn/ and <tt/.Ann/
3113         instructions to tell the assembler about the current settings.
3114   <item>In 65816 mode, replace a <tt/RTS/ instruction by <tt/RTL/ if it is
3115         used within a procedure declared as <tt/far/, or if the procedure has
3116         no explicit address specification, but it is <tt/far/ because of the
3117         memory model used.
3118   </itemize>
3119
3120   Example:
3121
3122   <tscreen><verb>
3123         .smart                          ; Be smart
3124         .smart  -                       ; Stop being smart
3125   </verb></tscreen>
3126
3127   See: <tt><ref id=".A16" name=".A16"></tt>,
3128        <tt><ref id=".A8" name=".A8"></tt>,
3129        <tt><ref id=".I16" name=".I16"></tt>,
3130        <tt><ref id=".I8" name=".I8"></tt>
3131
3132
3133 <sect1><tt>.STRUCT</tt><label id=".STRUCT"><p>
3134
3135   Starts a struct definition. Structs are covered in a separate section named
3136   <ref id="structs" name="&quot;Structs and unions&quot;">.
3137
3138   See: <tt><ref id=".ENDSTRUCT" name=".ENDSTRUCT"></tt>
3139
3140
3141 <sect1><tt>.SUNPLUS</tt><label id=".SUNPLUS"><p>
3142
3143   Enable the SunPlus instructions set. This command will not work in the
3144   freeware version of the assembler, because the instruction set is
3145   "proprietary and confidential".
3146
3147   See: <tt><ref id=".P02" name=".P02"></tt>, <tt><ref id=".PSC02"
3148   name=".PSC02"></tt>, <tt><ref id=".PC02" name=".PC02"></tt>, and
3149   <tt><ref id=".P816" name=".P816"></tt>
3150
3151
3152 <sect1><tt>.TAG</tt><label id=".TAG"><p>
3153
3154   Allocate space for a struct or union.
3155
3156   Example:
3157
3158   <tscreen><verb>
3159         .struct Point
3160                 xcoord  .word
3161                 ycoord  .word
3162         .endstruct
3163
3164         .bss
3165                 .tag    Point           ; Allocate 4 bytes
3166   </verb></tscreen>
3167
3168
3169 <sect1><tt>.WARNING</tt><label id=".WARNING"><p>
3170
3171   Force an assembly warning. The assembler will output a warning message
3172   preceded by "User warning". This warning will always be output, even if
3173   other warnings are disabled with the <tt><ref id="option-W" name="-W0"></tt>
3174   command line option.
3175
3176   This command may be used to output possible problems when assembling
3177   the source file.
3178
3179   Example:
3180
3181   <tscreen><verb>
3182         .macro  jne     target
3183                 .local L1
3184                 .ifndef target
3185                 .warning "Forward jump in jne, cannot optimize!"
3186                 beq     L1
3187                 jmp     target
3188         L1:
3189                 .else
3190                 ...
3191                 .endif
3192         .endmacro
3193   </verb></tscreen>
3194
3195   See also the <tt><ref id=".ERROR" name=".ERROR"></tt> and <tt><ref id=".OUT"
3196   name=".OUT"></tt> directives.
3197
3198
3199 <sect1><tt>.WORD</tt><label id=".WORD"><p>
3200
3201   Define word sized data. Must be followed by a sequence of (word ranged,
3202   but not necessarily constant) expressions.
3203
3204   Example:
3205
3206   <tscreen><verb>
3207         .word   $0D00, $AF13, _Clear
3208   </verb></tscreen>
3209
3210
3211 <sect1><tt>.ZEROPAGE</tt><label id=".ZEROPAGE"><p>
3212
3213   Switch to the ZEROPAGE segment and mark it as direct (zeropage) segment.
3214   The name of the ZEROPAGE segment is always "ZEROPAGE", so this is a
3215   shortcut for
3216
3217   <tscreen><verb>
3218         .segment  "ZEROPAGE", zeropage
3219   </verb></tscreen>
3220
3221   Because of the "zeropage" attribute, labels declared in this segment are
3222   addressed using direct addressing mode if possible. You <em/must/ instruct
3223   the linker to place this segment somewhere in the address range 0..$FF
3224   otherwise you will get errors.
3225
3226   See: <tt><ref id=".SEGMENT" name=".SEGMENT"></tt>
3227
3228
3229
3230 <sect>Macros<label id="macros"><p>
3231
3232
3233 <sect1>Introduction<p>
3234
3235 Macros may be thought of as "parametrized super instructions". Macros are
3236 sequences of tokens that have a name. If that name is used in the source
3237 file, the macro is "expanded", that is, it is replaced by the tokens that
3238 were specified when the macro was defined.
3239
3240
3241 <sect1>Macros without parameters<p>
3242
3243 In it's simplest form, a macro does not have parameters. Here's an
3244 example:
3245
3246 <tscreen><verb>
3247         .macro  asr             ; Arithmetic shift right
3248                 cmp     #$80    ; Put bit 7 into carry
3249                 ror             ; Rotate right with carry
3250         .endmacro
3251 </verb></tscreen>
3252
3253 The macro above consists of two real instructions, that are inserted into
3254 the code, whenever the macro is expanded. Macro expansion is simply done
3255 by using the name, like this:
3256
3257 <tscreen><verb>
3258         lda     $2010
3259         asr
3260         sta     $2010
3261 </verb></tscreen>
3262
3263
3264 <sect1>Parametrized macros<p>
3265
3266 When using macro parameters, macros can be even more useful:
3267
3268 <tscreen><verb>
3269         .macro  inc16   addr
3270                 clc
3271                 lda     addr
3272                 adc     #$01
3273                 sta     addr
3274                 lda     addr+1
3275                 adc     #$00
3276                 sta     addr+1
3277         .endmacro
3278 </verb></tscreen>
3279
3280 When calling the macro, you may give a parameter, and each occurrence of
3281 the name "addr" in the macro definition will be replaced by the given
3282 parameter. So
3283
3284 <tscreen><verb>
3285         inc16   $1000
3286 </verb></tscreen>
3287
3288 will be expanded to
3289
3290 <tscreen><verb>
3291                 clc
3292                 lda     $1000
3293                 adc     #$01
3294                 sta     $1000
3295                 lda     $1000+1
3296                 adc     #$00
3297                 sta     $1000+1
3298 </verb></tscreen>
3299
3300 A macro may have more than one parameter, in this case, the parameters
3301 are separated by commas. You are free to give less parameters than the
3302 macro actually takes in the definition. You may also leave intermediate
3303 parameters empty. Empty parameters are replaced by empty space (that is,
3304 they are removed when the macro is expanded). If you have a look at our
3305 macro definition above, you will see, that replacing the "addr" parameter
3306 by nothing will lead to wrong code in most lines. To help you, writing
3307 macros with a variable parameter list, there are some control commands:
3308
3309 <tt><ref id=".IFBLANK" name=".IFBLANK"></tt> tests the rest of the line and
3310 returns true, if there are any tokens on the remainder of the line. Since
3311 empty parameters are replaced by nothing, this may be used to test if a given
3312 parameter is empty. <tt><ref id=".IFNBLANK" name=".IFNBLANK"></tt> tests the
3313 opposite.
3314
3315 Look at this example:
3316
3317 <tscreen><verb>
3318         .macro  ldaxy   a, x, y
3319         .ifnblank       a
3320                 lda     #a
3321         .endif
3322         .ifnblank       x
3323                 ldx     #x
3324         .endif
3325         .ifnblank       y
3326                 ldy     #y
3327         .endif
3328         .endmacro
3329 </verb></tscreen>
3330
3331 This macro may be called as follows:
3332
3333 <tscreen><verb>
3334         ldaxy   1, 2, 3         ; Load all three registers
3335
3336         ldaxy   1, , 3          ; Load only a and y
3337
3338         ldaxy   , , 3           ; Load y only
3339 </verb></tscreen>
3340
3341 There's another helper command for determining, which macro parameters are
3342 valid: <tt><ref id=".PARAMCOUNT" name=".PARAMCOUNT"></tt> This command is
3343 replaced by the parameter count given, <em/including/ intermediate empty macro
3344 parameters:
3345
3346 <tscreen><verb>
3347         ldaxy   1               ; .PARAMCOUNT = 1
3348         ldaxy   1,,3            ; .PARAMCOUNT = 3
3349         ldaxy   1,2             ; .PARAMCOUNT = 2
3350         ldaxy   1,              ; .PARAMCOUNT = 2
3351         ldaxy   1,2,3           ; .PARAMCOUNT = 3
3352 </verb></tscreen>
3353
3354 Macro parameters may optionally be enclosed into curly braces. This allows the
3355 inclusion of tokens that would otherwise terminate the parameter (the comma in
3356 case of a macro parameter).
3357
3358 <tscreen><verb>
3359         .macro  foo     arg1, arg2
3360                 ...
3361         .endmacro
3362
3363                 foo     ($00,x)         ; Two parameters passed
3364                 foo     {($00,x)}       ; One parameter passed
3365 </verb></tscreen>
3366
3367 In the first case, the macro is called with two parameters: '<tt/(&dollar;00/'
3368 and 'x)'. The comma is not passed to the macro, since it is part of the
3369 calling sequence, not the parameters.
3370
3371 In the second case, '(&dollar;00,x)' is passed to the macro, this time
3372 including the comma.
3373
3374
3375 <sect1>Detecting parameter types<p>
3376
3377 Sometimes it is nice to write a macro that acts differently depending on the
3378 type of the argument supplied. An example would be a macro that loads a 16 bit
3379 value from either an immediate operand, or from memory. The <tt/<ref
3380 id=".MATCH" name=".MATCH">/ and <tt/<ref id=".XMATCH" name=".XMATCH">/
3381 functions will allow you to do exactly this:
3382
3383 <tscreen><verb>
3384         .macro  ldax    arg
3385                 .if (.match (.left (1, {arg}), #))
3386                     ; immediate mode
3387                     lda     #<(.right (.tcount ({arg})-1, {arg}))
3388                     ldx     #>(.right (.tcount ({arg})-1, {arg}))
3389                 .else
3390                     ; assume absolute or zero page
3391                     lda     arg
3392                     ldx     1+(arg)
3393                 .endif
3394         .endmacro
3395 </verb></tscreen>
3396
3397 Using the <tt/<ref id=".MATCH" name=".MATCH">/ function, the macro is able to
3398 check if its argument begins with a hash mark. If so, two immediate loads are
3399 emitted, Otherwise a load from an absolute zero page memory location is
3400 assumed. Please note how the curly braces are used to enclose parameters to
3401 pseudo functions handling token lists. This is necessary, because the token
3402 lists may include commas or parens, which would be treated by the assembler
3403 as end-of-list.
3404
3405 The macro can be used as
3406
3407 <tscreen><verb>
3408         foo:    .word   $5678
3409         ...
3410                 ldax    #$1234          ; X=$12, A=$34
3411         ...
3412                 ldax    foo             ; X=$56, A=$78
3413 </verb></tscreen>
3414
3415
3416 <sect1>Recursive macros<p>
3417
3418 Macros may be used recursively:
3419
3420 <tscreen><verb>
3421         .macro  push    r1, r2, r3
3422                 lda     r1
3423                 pha
3424         .if     .paramcount > 1
3425                 push    r2, r3
3426         .endif
3427         .endmacro
3428 </verb></tscreen>
3429
3430 There's also a special macro to help writing recursive macros: <tt><ref
3431 id=".EXITMACRO" name=".EXITMACRO"></tt> This command will stop macro expansion
3432 immediately:
3433
3434 <tscreen><verb>
3435         .macro  push    r1, r2, r3, r4, r5, r6, r7
3436         .ifblank        r1
3437                 ; First parameter is empty
3438                 .exitmacro
3439         .else
3440                 lda     r1
3441                 pha
3442         .endif
3443                 push    r2, r3, r4, r5, r6, r7
3444         .endmacro
3445 </verb></tscreen>
3446
3447 When expanding this macro, the expansion will push all given parameters
3448 until an empty one is encountered. The macro may be called like this:
3449
3450 <tscreen><verb>
3451         push    $20, $21, $32           ; Push 3 ZP locations
3452         push    $21                     ; Push one ZP location
3453 </verb></tscreen>
3454
3455
3456 <sect1>Local symbols inside macros<p>
3457
3458 Now, with recursive macros, <tt><ref id=".IFBLANK" name=".IFBLANK"></tt> and
3459 <tt><ref id=".PARAMCOUNT" name=".PARAMCOUNT"></tt>, what else do you need?
3460 Have a look at the inc16 macro above. Here is it again:
3461
3462 <tscreen><verb>
3463         .macro  inc16   addr
3464                 clc
3465                 lda     addr
3466                 adc     #$01
3467                 sta     addr
3468                 lda     addr+1
3469                 adc     #$00
3470                 sta     addr+1
3471         .endmacro
3472 </verb></tscreen>
3473
3474 If you have a closer look at the code, you will notice, that it could be
3475 written more efficiently, like this:
3476
3477 <tscreen><verb>
3478         .macro  inc16   addr
3479                 inc     addr
3480                 bne     Skip
3481                 inc     addr+1
3482         Skip:
3483         .endmacro
3484 </verb></tscreen>
3485
3486 But imagine what happens, if you use this macro twice? Since the label
3487 "Skip" has the same name both times, you get a "duplicate symbol" error.
3488 Without a way to circumvent this problem, macros are not as useful, as
3489 they could be. One solution is, to start a new lexical block inside the
3490 macro:
3491
3492 <tscreen><verb>
3493         .macro  inc16   addr
3494         .proc
3495                 inc     addr
3496                 bne     Skip
3497                 inc     addr+1
3498         Skip:
3499         .endproc
3500         .endmacro
3501 </verb></tscreen>
3502
3503 Now the label is local to the block and not visible outside. However,
3504 sometimes you want a label inside the macro to be visible outside. To make
3505 that possible, there's a new command that's only usable inside a macro
3506 definition: <tt><ref id=".LOCAL" name=".LOCAL"></tt>. <tt/.LOCAL/ declares one
3507 or more symbols as local to the macro expansion. The names of local variables
3508 are replaced by a unique name in each separate macro expansion. So we could
3509 also solve the problem above by using <tt/.LOCAL/:
3510
3511 <tscreen><verb>
3512         .macro  inc16   addr
3513                 .local  Skip            ; Make Skip a local symbol
3514                 clc
3515                 lda     addr
3516                 adc     #$01
3517                 sta     addr
3518                 bcc     Skip
3519                 inc     addr+1
3520         Skip:                           ; Not visible outside
3521         .endmacro
3522 </verb></tscreen>
3523
3524
3525 <sect1>C style macros<p>
3526
3527 Starting with version 2.5 of the assembler, there is a second macro type
3528 available: C style macros using the <tt/.DEFINE/ directive. These macros are
3529 similar to the classic macro type described above, but behaviour is sometimes
3530 different:
3531
3532 <itemize>
3533
3534 <item>  Macros defined with <tt><ref id=".DEFINE" name=".DEFINE"></tt> may not
3535         span more than a line. You may use line continuation (see <tt><ref
3536         id=".LINECONT" name=".LINECONT"></tt>) to spread the definition over
3537         more than one line for increased readability, but the macro itself
3538         may not contain an end-of-line token.
3539
3540 <item>  Macros defined with <tt><ref id=".DEFINE" name=".DEFINE"></tt> share
3541         the name space with classic macros, but they are detected and replaced
3542         at the scanner level. While classic macros may be used in every place,
3543         where a mnemonic or other directive is allowed, <tt><ref id=".DEFINE"
3544         name=".DEFINE"></tt> style macros are allowed anywhere in a line. So
3545         they are more versatile in some situations.
3546
3547 <item>  <tt><ref id=".DEFINE" name=".DEFINE"></tt> style macros may take
3548         parameters. While classic macros may have empty parameters, this is
3549         not true for <tt><ref id=".DEFINE" name=".DEFINE"></tt> style macros.
3550         For this macro type, the number of actual parameters must match
3551         exactly the number of formal parameters.
3552
3553         To make this possible, formal parameters are enclosed in braces when
3554         defining the macro. If there are no parameters, the empty braces may
3555         be omitted.
3556
3557 <item>  Since <tt><ref id=".DEFINE" name=".DEFINE"></tt> style macros may not
3558         contain end-of-line tokens, there are things that cannot be done. They
3559         may not contain several processor instructions for example. So, while
3560         some things may be done with both macro types, each type has special
3561         usages. The types complement each other.
3562
3563 </itemize>
3564
3565 Let's look at a few examples to make the advantages and disadvantages
3566 clear.
3567
3568 To emulate assemblers that use "<tt/EQU/" instead of "<tt/=/" you may use the
3569 following <tt/.DEFINE/:
3570
3571 <tscreen><verb>
3572         .define EQU     =
3573
3574         foo     EQU     $1234           ; This is accepted now
3575 </verb></tscreen>
3576
3577 You may use the directive to define string constants used elsewhere:
3578
3579 <tscreen><verb>
3580         ; Define the version number
3581         .define VERSION         "12.3a"
3582
3583         ; ... and use it
3584         .asciiz VERSION
3585 </verb></tscreen>
3586
3587 Macros with parameters may also be useful:
3588
3589 <tscreen><verb>
3590         .define DEBUG(message)  .out    message
3591
3592         DEBUG   "Assembling include file #3"
3593 </verb></tscreen>
3594
3595 Note that, while formal parameters have to be placed in braces, this is
3596 not true for the actual parameters. Beware: Since the assembler cannot
3597 detect the end of one parameter, only the first token is used. If you
3598 don't like that, use classic macros instead:
3599
3600 <tscreen><verb>
3601         .macro  message
3602                 .out    message
3603         .endmacro
3604 </verb></tscreen>
3605
3606 (This is an example where a problem can be solved with both macro types).
3607
3608
3609 <sect1>Characters in macros<p>
3610
3611 When using the <ref id="option-t" name="-t"> option, characters are translated
3612 into the target character set of the specific machine. However, this happens
3613 as late as possible. This means that strings are translated if they are part
3614 of a <tt><ref id=".BYTE" name=".BYTE"></tt> or <tt><ref id=".ASCIIZ"
3615 name=".ASCIIZ"></tt> command. Characters are translated as soon as they are
3616 used as part of an expression.
3617
3618 This behaviour is very intuitive outside of macros but may be confusing when
3619 doing more complex macros. If you compare characters against numeric values,
3620 be sure to take the translation into account.
3621
3622
3623
3624
3625 <sect>Macro packages<label id="macropackages"><p>
3626
3627 Using the <tt><ref id=".MACPACK" name=".MACPACK"></tt> directive, predefined
3628 macro packages may be included with just one command. Available macro packages
3629 are:
3630
3631
3632 <sect1><tt>.MACPACK generic</tt><p>
3633
3634 This macro package defines macros that are useful in almost any program.
3635 Currently, two macros are defined:
3636
3637 <tscreen><verb>
3638         .macro  add     Arg
3639                 clc
3640                 adc     Arg
3641         .endmacro
3642
3643         .macro  sub     Arg
3644                 sec
3645                 sbc     Arg
3646         .endmacro
3647 </verb></tscreen>
3648
3649
3650 <sect1><tt>.MACPACK longbranch</tt><p>
3651
3652 This macro package defines long conditional jumps. They are named like the
3653 short counterpart but with the 'b' replaced by a 'j'. Here is a sample
3654 definition for the "<tt/jeq/" macro, the other macros are built using the same
3655 scheme:
3656
3657 <tscreen><verb>
3658         .macro  jeq     Target
3659                 .if     .def(Target) .and ((*+2)-(Target) <= 127)
3660                 beq     Target
3661                 .else
3662                 bne     *+5
3663                 jmp     Target
3664                 .endif
3665         .endmacro
3666 </verb></tscreen>
3667
3668 All macros expand to a short branch, if the label is already defined (back
3669 jump) and is reachable with a short jump. Otherwise the macro expands to a
3670 conditional branch with the branch condition inverted, followed by an absolute
3671 jump to the actual branch target.
3672
3673 The package defines the following macros:
3674
3675 <tscreen><verb>
3676         jeq, jne, jmi, jpl, jcs, jcc, jvs, jvc
3677 </verb></tscreen>
3678
3679
3680
3681 <sect1><tt>.MACPACK cbm</tt><p>
3682
3683 The cbm macro package will define a macro named <tt/scrcode/. It takes a
3684 string as argument and places this string into memory translated into screen
3685 codes.
3686
3687
3688 <sect1><tt>.MACPACK cpu</tt><p>
3689
3690 This macro package does not define any macros but constants used to examine
3691 the value read from the <tt/<ref id=".CPU" name=".CPU">/ pseudo variable. For
3692 each supported CPU a constant similar to
3693
3694 <tscreen><verb>
3695     CPU_6502
3696     CPU_65SC02
3697     CPU_65C02
3698     CPU_65816
3699     CPU_SUNPLUS
3700     CPU_SWEET16
3701 </verb></tscreen>
3702
3703 is defined. These constants may be used to determine the exact type of the
3704 currently enabled CPU. In addition to that, for each CPU instruction set,
3705 another constant is defined:
3706
3707 <tscreen><verb>
3708     CPU_ISET_6502
3709     CPU_ISET_65SC02
3710     CPU_ISET_65C02
3711     CPU_ISET_65816
3712     CPU_ISET_SUNPLUS
3713     CPU_ISET_SWEET16
3714 </verb></tscreen>
3715
3716 The value read from the <tt/<ref id=".CPU" name=".CPU">/ pseudo variable may
3717 be checked with <tt/<ref id="operators" name=".BITAND">/ to determine if the
3718 currently enabled CPU supports a specific instruction set. For example the
3719 65C02 supports all instructions of the 65SC02 CPU, so it has the
3720 <tt/CPU_ISET_65SC02/ bit set in addition to its native <tt/CPU_ISET_65C02/
3721 bit. Using
3722
3723 <tscreen><verb>
3724         .if (.cpu .bitand CPU_ISET_65SC02)
3725                 lda     (sp)
3726         .else
3727                 ldy     #$00
3728                 lda     (sp),y
3729         .endif
3730 </verb></tscreen>
3731
3732 it is possible to determine if the
3733
3734 <tscreen><verb>
3735                 lda     (sp)
3736 </verb></tscreen>
3737
3738 instruction is supported, which is the case for the 65SC02, 65C02 and 65816
3739 CPUs (the latter two are upwards compatible to the 65SC02).
3740
3741
3742
3743 <sect>Predefined constants<label id="predefined-constants"><p>
3744
3745 For better orthogonality, the assembler defines similar symbols as the
3746 compiler, depending on the target system selected:
3747
3748 <itemize>
3749 <item><tt/__ACE__/ - Target system is <tt/ace/
3750 <item><tt/__APPLE2__",/ - Target system is <tt/apple2/
3751 <item><tt/__APPLE2ENH__",/ - Target system is <tt/apple2enh/
3752 <item><tt/__ATARI__/ - Target system is <tt/atari/
3753 <item><tt/__ATMOS__",/ - Target system is <tt/atmos/
3754 <item><tt/__BBC__",/ - Target system is <tt/bbc/
3755 <item><tt/__C128__/ - Target system is <tt/c128/
3756 <item><tt/__C16__/ - Target system is <tt/c16/
3757 <item><tt/__C64__/ - Target system is <tt/c64/
3758 <item><tt/__CBM__/ - Target is a Commodore system
3759 <item><tt/__CBM510__/ - Target system is <tt/cbm510/
3760 <item><tt/__CBM610__/ - Target system is <tt/cbm610/
3761 <item><tt/__GEOS__",/ - Target system is <tt/geos/
3762 <item><tt/__LUNIX__",/ - Target system is <tt/lunix/
3763 <item><tt/__NES__",/ - Target system is <tt/nes/
3764 <item><tt/__PET__/ - Target system is <tt/pet/
3765 <item><tt/__PLUS4__/ - Target system is <tt/plus4/
3766 <item><tt/__SUPERVISION__",/ - Target system is <tt/supervision/
3767 <item><tt/__VIC20__/ - Target system is <tt/vic20/
3768 </itemize>
3769
3770
3771 <sect>Structs and unions<label id="structs"><p>
3772
3773 <sect1>Overview<p>
3774
3775 Structs and unions are special forms of <ref id="scopes" name="scopes">.  They
3776 are to some degree comparable to their C counterparts. Both have a list of
3777 members. Each member allocates storage and may optionally have a name, which,
3778 in case of a struct, is the offset from the beginning and, in case of a union,
3779 is always zero.
3780
3781
3782 <sect1>Declaration<p>
3783
3784 Here is an example for a very simple struct with two members and a total size
3785 of 4 bytes:
3786
3787 <tscreen><verb>
3788       .struct Point
3789               xcoord  .word
3790               ycoord  .word
3791       .endstruct
3792 </verb></tscreen>
3793
3794 A union shares the total space between all its members, its size is the same
3795 as that of the largest member.
3796
3797 A struct or union must not necessarily have a name. If it is anonymous, no
3798 local scope is opened, the identifiers used to name the members are placed
3799 into the current scope instead.
3800
3801 A struct may contain unnamed members and definitions of local structs. The
3802 storage allocators may contain a multiplier, as in the example below:
3803
3804 <tscreen><verb>
3805       .struct Circle
3806               .struct Point
3807                       .word   2         ; Allocate two words
3808               .endstruct
3809               Radius  .word
3810       .endstruct
3811 </verb></tscreen>
3812
3813
3814 <sect1>The <tt/.TAG/ keyword<p>
3815
3816 Using the <ref id=".TAG" name=".TAG"> keyword, it is possible to reserve space
3817 for an already defined struct or unions within another struct:
3818
3819 <tscreen><verb>
3820       .struct Point
3821               xcoord  .word
3822               ycoord  .word
3823       .endstruct
3824
3825       .struct Circle
3826               Origin  .tag    Point
3827               Radius  .byte
3828       .endstruct
3829 </verb></tscreen>
3830
3831 Space for a struct or union may be allocated using the <ref id=".TAG"
3832 name=".TAG"> directive.
3833
3834 <tscreen><verb>
3835         C:      .tag    Circle
3836 </verb></tscreen>
3837
3838 Currently, members are just offsets from the start of the struct or union. To
3839 access a field of a struct, the member offset has to be added to the address
3840 of the struct itself:
3841
3842 <tscreen><verb>
3843         lda     C+Circle::Radius        ; Load circle radius into A
3844 </verb></tscreen>
3845
3846 This may change in a future version of the assembler.
3847
3848
3849 <sect1>Limitations<p>
3850
3851 Structs and unions are currently implemented as nested symbol tables (in fact,
3852 they were a by-product of the improved scoping rules). Currently, the
3853 assembler has no idea of types. This means that the <ref id=".TAG"
3854 name=".TAG"> keyword will only allocate space. You won't be able to initialize
3855 variables declared with <ref id=".TAG" name=".TAG">, and adding an embedded
3856 structure to another structure with <ref id=".TAG" name=".TAG"> will not make
3857 this structure accessible by using the '::' operator.
3858
3859
3860
3861 <sect>Module constructors/destructors<label id="condes"><p>
3862
3863 <em>Note:</em> This section applies mostly to C programs, so the explanation
3864 below uses examples from the C libraries. However, the feature may also be
3865 useful for assembler programs.
3866
3867
3868 <sect1>Overview<p>
3869
3870 Using the <tt><ref id=".CONSTRUCTOR" name=".CONSTRUCTOR"></tt>, <tt><ref
3871 id=".DESTRUCTOR" name=".DESTRUCTOR"></tt> and <tt><ref id=".INTERRUPTOR"
3872 name=".INTERRUPTOR"></tt> keywords it it possible to export functions in a
3873 special way. The linker is able to generate tables with all functions of a
3874 specific type. Such a table will <em>only</em> include symbols from object
3875 files that are linked into a specific executable. This may be used to add
3876 initialization and cleanup code for library modules, or a table of interrupt
3877 handler functions.
3878
3879 The C heap functions are an example where module initialization code is used.
3880 All heap functions (<tt>malloc</tt>, <tt>free</tt>, ...) work with a few
3881 variables that contain the start and the end of the heap, pointers to the free
3882 list and so on. Since the end of the heap depends on the size and start of the
3883 stack, it must be initialized at runtime. However, initializing these
3884 variables for programs that do not use the heap are a waste of time and
3885 memory.
3886
3887 So the central module defines a function that contains initialization code and
3888 exports this function using the <tt/.CONSTRUCTOR/ statement. If (and only if)
3889 this module is added to an executable by the linker, the initialization
3890 function will be placed into the table of constructors by the linker. The C
3891 startup code will call all constructors before <tt/main/ and all destructors
3892 after <tt/main/, so without any further work, the heap initialization code is
3893 called once the module is linked in.
3894
3895 While it would be possible to add explicit calls to initialization functions
3896 in the startup code, the new approach has several advantages:
3897
3898 <enum>
3899 <item>
3900 If a module is not included, the initialization code is not linked in and not
3901 called. So you don't pay for things you don't need.
3902
3903 <item>
3904 Adding another library that needs initialization does not mean that the
3905 startup code has to be changed. Before we had module constructors and
3906 destructors, the startup code for all systems had to be adjusted to call the
3907 new initialization code.
3908
3909 <item>
3910 The feature saves memory: Each additional initialization function needs just
3911 two bytes in the table (a pointer to the function).
3912
3913 </enum>
3914
3915
3916 <sect1>Calling order<p>
3917
3918 The symbols are sorted in increasing priority order by the linker when using
3919 one of the builtin linker configurations, so the functions with lower
3920 priorities come first and are followed by those with higher priorities. The C
3921 library runtime subroutine that walks over the function tables calls the
3922 functions starting from the top of the table - which means that functions with
3923 a high priority are called first.
3924
3925 So when using the C runtime, functions are called with high priority functions
3926 first, followed by low priority functions.
3927
3928
3929 <sect1>Pitfalls<p>
3930
3931 When using these special symbols, please take care of the following:
3932
3933 <itemize>
3934
3935 <item>
3936 The linker will only generate function tables, it will not generate code to
3937 call these functions. If you're using the feature in some other than the
3938 existing C environments, you have to write code to call all functions in a
3939 linker generated table yourself. See the <tt/condes/ and <tt/callirq/ modules
3940 in the C runtime for an example on how to do this.
3941
3942 <item>
3943 The linker will only add addresses of functions that are in modules linked to
3944 the executable. This means that you have to be careful where to place the
3945 condes functions. If initialization or an irq handler is needed for a group of
3946 functions, be sure to place the function into a module that is linked in
3947 regardless of which function is called by the user.
3948
3949 <item>
3950 The linker will generate the tables only when requested to do so by the
3951 <tt/FEATURE CONDES/ statement in the linker config file. Each table has to
3952 be requested separately.
3953
3954 <item>
3955 Constructors and destructors may have priorities. These priorities determine
3956 the order of the functions in the table. If your initialization or cleanup code
3957 does depend on other initialization or cleanup code, you have to choose the
3958 priority for the functions accordingly.
3959
3960 <item>
3961 Besides the <tt><ref id=".CONSTRUCTOR" name=".CONSTRUCTOR"></tt>, <tt><ref
3962 id=".DESTRUCTOR" name=".DESTRUCTOR"></tt> and <tt><ref id=".INTERRUPTOR"
3963 name=".INTERRUPTOR"></tt> statements, there is also a more generic command:
3964 <tt><ref id=".CONDES" name=".CONDES"></tt>. This allows to specify an
3965 additional type. Predefined types are 0 (constructor), 1 (destructor) and 2
3966 (interruptor). The linker generates a separate table for each type on request.
3967
3968 </itemize>
3969
3970
3971 <sect>Porting sources from other assemblers<p>
3972
3973 Sometimes it is necessary to port code written for older assemblers to ca65.
3974 In some cases, this can be done without any changes to the source code by
3975 using the emulation features of ca65 (see <tt><ref id=".FEATURE"
3976 name=".FEATURE"></tt>). In other cases, it is necessary to make changes to the
3977 source code.
3978
3979 Probably the biggest difference is the handling of the <tt><ref id=".ORG"
3980 name=".ORG"></tt> directive. ca65 generates relocatable code, and placement is
3981 done by the linker. Most other assemblers generate absolute code, placement is
3982 done within the assembler and there is no external linker.
3983
3984 In general it is not a good idea to write new code using the emulation
3985 features of the assembler, but there may be situations where even this rule is
3986 not valid.
3987
3988 <sect1>TASS<p>
3989
3990 You need to use some of the ca65 emulation features to simulate the behaviour
3991 of such simple assemblers.
3992
3993 <enum>
3994 <item>Prepare your sourcecode like this:
3995
3996 <tscreen><verb>
3997         ; if you want TASS style labels without colons
3998         .feature labels_without_colons
3999
4000         ; if you want TASS style character constants
4001         ; ("a" instead of the default 'a')
4002         .feature loose_char_term
4003
4004                 .word *+2       ; the cbm load address
4005
4006                 [yourcode here]
4007 </verb></tscreen>
4008
4009 notice that the two emulation features are mostly useful for porting
4010 sources originally written in/for TASS, they are not needed for the
4011 actual "simple assembler operation" and are not recommended if you are
4012 writing new code from scratch.
4013
4014 <item>Replace all program counter assignments (which are not possible in ca65
4015 by default, and the respective emulation feature works different from what
4016 you'd expect) by another way to skip to another memory location, for example
4017 the <tt><ref id=".RES" name=".RES"></tt>directive.
4018
4019 <tscreen><verb>
4020         ; *=$2000
4021         .res $2000-*    ; reserve memory up to $2000
4022 </verb></tscreen>
4023
4024 notice that other than the original TASS, ca65 can never move the
4025 programmcounter backwards - think of it as if you are assembling to disc with
4026 TASS.
4027
4028 <item>Conditional assembly (<tt/.ifeq//<tt/.endif//<tt/.goto/ etc.) must be
4029 rewritten to match ca65 syntax. Most importantly notice that due to the lack
4030 of <tt/.goto/, everything involving loops must be replaced by
4031 <tt><ref id=".REPEAT" name=".REPEAT"></tt>.
4032
4033 <item>To assemble code to a different address than it is executed at, use the
4034 <tt><ref id=".ORG" name=".ORG"></tt> directive instead of
4035 <tt/.offs/-constructs.
4036
4037 <tscreen><verb>
4038         .org $1800
4039
4040         [floppy code here]
4041
4042         .reloc  ; back to normal
4043 </verb></tscreen>
4044
4045 <item>Then assemble like this:
4046
4047 <tscreen><verb>
4048         cl65 --start-addr 0x0ffe -t none myprog.s -o myprog.prg
4049 </verb></tscreen>
4050
4051 notice that you need to use the actual start address minus two, since two
4052 bytes are used for the cbm load address.
4053
4054 </enum>
4055
4056
4057 <sect>Bugs/Feedback<p>
4058
4059 If you have problems using the assembler, if you find any bugs, or if
4060 you're doing something interesting with the assembler, I would be glad to
4061 hear from you. Feel free to contact me by email
4062 (<htmlurl url="mailto:uz@cc65.org" name="uz@cc65.org">).
4063
4064
4065
4066 <sect>Copyright<p>
4067
4068 ca65 (and all cc65 binutils) are (C) Copyright 1998-2003 Ullrich von
4069 Bassewitz. For usage of the binaries and/or sources the following
4070 conditions do apply:
4071
4072 This software is provided 'as-is', without any expressed or implied
4073 warranty.  In no event will the authors be held liable for any damages
4074 arising from the use of this software.
4075
4076 Permission is granted to anyone to use this software for any purpose,
4077 including commercial applications, and to alter it and redistribute it
4078 freely, subject to the following restrictions:
4079
4080 <enum>
4081 <item>  The origin of this software must not be misrepresented; you must not
4082         claim that you wrote the original software. If you use this software
4083         in a product, an acknowledgment in the product documentation would be
4084         appreciated but is not required.
4085 <item>  Altered source versions must be plainly marked as such, and must not
4086         be misrepresented as being the original software.
4087 <item>  This notice may not be removed or altered from any source
4088         distribution.
4089 </enum>
4090
4091
4092
4093 </article>
4094
4095
4096