Table of Contents
Like most assemblers, each NASM source line contains (unless it is a macro, a preprocessor directive or an assembler directive: see Chapter 4) some combination of the four fields
label: instruction operands ; comment
As usual, most of these fields are optional; the presence or absence of any combination of a label, an instruction and a comment is allowed. Of course, the operand field is either required or forbidden by the presence and nature of the instruction field.
NASM uses backslash (\) as the line continuation character; if a line ends with backslash, the next line is considered to be a part of the backslash-ended line.
NASM places no restrictions on white space within a line: labels may have white space
before them, or instructions may have no space before them, or anything. The colon after
a label is also optional. Note that this means that if you intend to code lodsb alone on a line, and type lodab
by accident, then that's still a valid source line which does nothing but define a label.
Running NASM with the command-line option -w+orphan-labels
will cause it to warn you if you define a label alone on a line without a trailing
colon.
Valid characters in labels are letters, numbers, _, $, #, @,
~, ., and ?. The only characters which may be used as the first character of an identifier are letters, . (with special meaning: see Section 2.10), _ and ?. An identifier may also be prefixed with a $ to indicate that it is intended to be read as an identifier and
not a reserved word; thus, if some other module you are linking with defines a symbol
called eax, you can refer to $eax in NASM code to distinguish the symbol from the register.
The instruction field may contain any machine instruction: Pentium and P6
instructions, FPU instructions, MMX instructions and even undocumented instructions are
all supported. The instruction may be prefixed by LOCK,
REP, REPE/REPZ or REPNE/REPNZ, in the usual way. Explicit address-size and operand-size
prefixes A16, A32, O16 and O32 are provided. You can
also use the name of a segment register as an instruction prefix: coding es mov [bx],ax is equivalent to coding mov
[es:bx],ax. We recommend the latter syntax, since it is consistent with other
syntactic features of the language, but for instructions such as LODSB, which has no operands and yet can require a segment
override, there is no clean syntactic way to proceed apart from es
lodsb.
An instruction is not required to use a prefix: prefixes such as CS, A32, LOCK or REPE can appear on a line by
themselves, and NASM will just generate the prefix bytes.
In addition to actual machine instructions, NASM also supports a number of pseudo-instructions, described in Section 2.2.
Instruction operands may take a number of forms: they can be registers, described
simply by the register name (e.g. AX, BP, EBX, CR0: NASM does not use the gas-style syntax in which register names must be prefixed by a
% sign), or they can be effective addresses (see Section 2.3), constants
(Section 2.5) or expressions (Section 2.6).
For floating-point instructions, NASM accepts a wide range of syntaxes: you can use two-operand forms like MASM supports, or you can use NASM's native single-operand forms in most cases. For example, you can code:
fadd st1 ; this sets st0 := st0 + st1
fadd st0, st1 ; so does this
fadd st1, st0 ; this sets st1 := st1 + st0
fadd to st1 ; so does this
Almost any floating-point instruction that references memory must use one of the
prefixes DWORD, QWORD, TWORD, DDQWORD, or OWORD to indicate what size of memory operand it refers to.