%Useful Macros \newcommand{\id}{\text{id }} \newcommand{\ids}{\text{id}} \newcommand{\ints}{\text{int}} \newcommand{\intsp}{\text{int }} \newcommand{\strings}{\text{string}} \newcommand{\stringsp}{\text{string }} \newcommand{\kw}[1]{\text{\bf #1\ }} \newcommand{\kws}[1]{\text{\bf #1}} \newcommand{\pd}[1]{\text{\em #1\ }} \newcommand{\pds}[1]{\text{\em #1}} \newcommand{\bundleT}[1]{\{#1\}} \newcommand{\info}{[\pds{info}]\ } \newcommand{\version}{0.1.3} \title{Specification for the FIRRTL Language:\\ Version \version \\ PRE-RELEASE VERSION - DO NOT DISTRIBUTE} \author{Patrick S. Li \\ \href{mailto:psli@eecs.berkeley.edu}{psli@eecs.berkeley.edu} \and Adam M. Izraelevitz \\ \href{mailto:adamiz@eecs.berkeley.edu}{adamiz@eecs.berkeley.edu} \and Jonathan Bachrach \\ \href{mailto:jrb@eecs.berkeley.edu}{jrb@eecs.berkeley.edu} } \documentclass[12pt]{article} \usepackage{listings} \usepackage{amsmath} \usepackage{proof} \usepackage{amsfonts} \usepackage{enumitem} \usepackage{hyperref} \hypersetup{ colorlinks=true, linkcolor=blue, filecolor=magenta, urlcolor=cyan, } \usepackage[pdftex]{graphicx} \usepackage{fancyhdr} \pagestyle{fancy} \lhead{Specification for the FIRRTL Language} \rhead{Version \version} \cfoot{\thepage \\ \em{PRE-RELEASE VERSION - DO NOT DISTRIBUTE}} \renewcommand{\headrulewidth}{0.4pt} \renewcommand{\footrulewidth}{0.4pt} \lstset{basicstyle=\footnotesize\ttfamily,breaklines=true} \begin{document} \maketitle \tableofcontents \newpage \section{Introduction} \subsection{Background} The ideas for FIRRTL originated from a different UC Berkeley project, Chisel, which embedded a hardware description language in Scala and was used to write highly-parameterized circuit design generators. Users could manipulate circuit components using Scala functions, encode their interfaces into custom Scala types, and use Scala's object-orientation to write their own circuit libraries. All of these features enabled expressive, reliable and type-safe generators that improved RTL design productivity and robustness. At UC Berkeley, Chisel was a critical part of the infrastructure supporting computer architecture research. Many of these research projects, including vector machines, out-of-order processors, silicon photonics and cache coherency, drove ten different silicon tape-outs over a three year period with under 12 graduate student researchers. The research productivity gains proved to the graduate students and faculty of the validity of this design methodology. However, Chisel's external rate of adoption was slow for the following reasons: \begin{enumerate}[topsep=3pt,itemsep=-0.5ex,partopsep=1ex,parsep=1ex] \item Learning a functional language (Scala) was a large barrier to entry \item Conceptually separating the Chisel HDL from the host language was difficult for new users \item Verilog generation was unreadable and slow \item Writing transformational passes required insider knowledge of the Chisel compiler \item Compiler design was unstructured, making error checking difficult and error messages often incomprehensible \item Chisel IR semantics were ill-defined and thus impossible to target from other languages \end{enumerate} As a consequence, Chisel needed to be redesigned from its ground up to standardize its IR and semantics, modularize its compilation process for robustness, and cleanly separate its front-end (Chisel + Scala), internal representation (FIRRTL), and backends. \subsection{Design Philosophy} FIRRTL represents the formalized elaborated graph that the Chisel DSL produces, prior to any simplification. By including complicated constructs like vector types, bundle types, and when statements in FIRRTL, the Chisel/Scala front-end can be very light-weight. In addition, other front-ends written in languages other than Scala could be simple to write and increase external adoption. Lowered FIRRTL (LoFIRRTL) represents a simplified FIRRTL circuit with structural invariants, making it essentially a netlist. This form enables straightforward translation into another language (e.g., Verilog) by a light-weight backend. By defining LoFIRRTL as a structured subset of FIRRTL, an external user can write a transformational pass whose input is restricted, but whose output is full-featured. After a custom pass, the resulting circuit should undergo lowering prior to passing it to a backend or another custom pass. \section{Acknowledgements - IN PROGRESS} The FIRRTL language could not have been developed without the help of many of the faculty and students in the ASPIRE lab, including but not limited to XXXX. We'd also like to thank our sponsors XXXX, and the University of California, Berkeley. \section{FIRRTL Language Definition} \subsection{Abstract Syntax Tree} { \fontsize{11pt}{1.15em}\selectfont \[ \begin{array}{rrll} \pd{circuit} &= &\kw{circuit} \id \kw{:} (\pd{module*}) &\text{Circuit}\\ \pd{module} &= &\info \kw{module} \id \kw{:} (\pd{port*} \pd{stmt}) &\text{Module}\\ &\vert &\info \kw{extmodule} \id \kw{:} (\pd{port*}) &\text{External Module}\\ \pd{port} &= &\info \pd{kind} \id \kw{:} \pd{type} &\text{Port}\\ \pd{kind} &= &\kws{input} \vert \kws{output} &\text{Port Kind}\\ \pd{type} &= &\kws{UInt} \kws{$<$} \pds{width} \kws{$>$} &\text{Unsigned Integer}\\ &\vert &\kws{SInt} \kws{$<$} \pds{width} \kws{$>$} &\text{Signed Integer}\\ &\vert &\kws{Clock} &\text{Clock}\\ &\vert &\bundleT{\pd{field*}} &\text{Bundle}\\ &\vert &\pds{type}[\ints] &\text{Vector}\\ \pd{field} &= &\pd{orientation} \id \kw{:} \pd{type} &\text{Bundle Field}\\ \pd{orientation}&= &\kws{default} \vert \kws{reverse} &\text{Orientation}\\ \pd{width} &= &\ints \vert \kw{?} &\text{Known/Unknown Integer Width}\\ \pd{stmt} &= &\info \kw{wire} \id \kw{:} \pd{type} &\text{Wire Declaration}\\ &\vert &\info \kw{reg} \id \kw{:} \pds{type} , \pds{exp} , \pds{exp} &\text{Register Declaration}\\ &\vert &\info \kw{smem} \id \kw{:} \pds{type} , \ints &\text{Sequential Memory Declaration}\\ &\vert &\info \kw{cmem} \id \kw{:} \pds{type} , \ints &\text{Combinational Memory Declaration}\\ &\vert &\info \kw{inst} \id \kw{:} \id &\text{Instance Declaration}\\ &\vert &\info \kw{poison} \id \kw{:} \id &\text{Poison Declaration}\\ &\vert &\info \kw{node} \id = \pd{exp} &\text{Node Declaration}\\ &\vert &\info \pd{dir} \kw{accessor} \id = \pds{exp}[\pds{exp}] , \pds{exp} &\text{Accessor Declaration}\\ &\vert &\info \pd{exp} \kw{$<$=} \pd{exp} &\text{Connect}\\ &\vert &\info \kw{onreset} \pd{exp} \kw{$<$=} \pd{exp} &\text{OnReset Connect}\\ &\vert &\info \pd{exp} \kw{$<$--} \pd{exp} &\text{Partial Connect}\\ &\vert &\info \kw{when} \pd{exp} \kw{:} \pd{stmt} \kw{else :} \pd{stmt} &\text{Conditional}\\ &\vert &\info \kw{stop} &\text{Stop Statement}\\ &\vert &\info \kw{printf}(\strings,\pds{exp*}) &\text{Printf Statement}\\ &\vert &\info \kw{skip} &\text{Empty Statement}\\ &\vert &\info (\pd{stmt*}) &\text{Statement Group}\\ \pd{dir} &= &\kws{infer} \vert \kws{read} \vert \kws{write} \vert \kw{rdwr} &\text{Accessor Direction}\\ \pd{exp} &= &\info \kws{UInt} \kws{$<$} \pds{width} \kws{$>$}(\ints) &\text{Literal Unsigned Integer}\\ &\vert &\info \kws{SInt} \kws{$<$} \pds{width} \kws{$>$}(\ints) &\text{Literal Signed Integer}\\ &\vert &\info \id &\text{Reference}\\ &\vert &\info \pds{exp}.\id &\text{Subfield}\\ &\vert &\info \pds{exp}[\ints] &\text{Subindex}\\ &\vert &\info \pds{primop}(\pds{exp*}, \ints\text{*}) &\text{Primitive Operation}\\ \pd{info} &= &\text{filename } \kw{:} \text{line} . \text{col} &\text{File Location}\\ &\vert &\kw{noinfo} &\text{No File Location}\\ \end{array} \] } \[ { \fontsize{11pt}{1.15em}\selectfont \begin{array}{rrll} \pd{primop} &= &\kws{add} &\text{Unsigned/Signed Add}\\ &\vert &\kws{sub} &\text{Unsigned/Signed Subtract}\\ &\vert &\kws{addw} &\text{Unsigned/Signed Add Wrap}\\ &\vert &\kws{subw} &\text{Unsigned/Signed Subtract Wrap}\\ &\vert &\kws{mul} &\text{Unsigned/Signed Multiply}\\ &\vert &\kws{div} &\text{Unsigned/Signed Divide}\\ &\vert &\kws{mod} &\text{Unsigned/Signed Modulo}\\ &\vert &\kws{quo} &\text{Unsigned/Signed Quotient}\\ &\vert &\kws{rem} &\text{Unsigned/Signed Remainder}\\ &\vert &\kws{lt} &\text{Unsigned/Signed Less Than}\\ &\vert &\kws{leq} &\text{Unsigned/Signed Less or Equal}\\ &\vert &\kws{gt} &\text{Unsigned/Signed Greater Than}\\ &\vert &\kws{geq} &\text{Unsigned/Signed Greater or Equal}\\ &\vert &\kws{eq} &\text{Unsigned/Signed Equal}\\ &\vert &\kws{neq} &\text{Unsigned/Signed Not-Equal}\\ &\vert &\kws{eqv} &\text{Unsigned/Signed Equivalence}\\ &\vert &\kws{neqv} &\text{Unsigned/Signed Not-Equivalence}\\ &\vert &\kws{mux} &\text{Unsigned/Signed Multiplex}\\ &\vert &\kws{pad} &\text{Unsigned/Signed Pad to Length}\\ &\vert &\kws{asUInt} &\text{Unsigned/Signed Reinterpret Bits as UInt}\\ &\vert &\kws{asSInt} &\text{Unsigned/Signed Reinterpret Bits as SInt}\\ &\vert &\kws{shl} &\text{Unsigned/Signed Shift Left}\\ &\vert &\kws{shr} &\text{Unsigned/Signed Shift Right}\\ &\vert &\kws{dshl} &\text{Unsigned/Signed Dynamic Shift Left}\\ &\vert &\kws{dshr} &\text{Unsigned/Signed Dynamic Shift Right}\\ &\vert &\kws{cvt} &\text{Unsigned/Signed to Signed Logical Conversion}\\ &\vert &\kws{neg} &\text{Unsigned/Signed Negate}\\ &\vert &\kws{not} &\text{Unsigned Not}\\ &\vert &\kws{and} &\text{Unsigned And}\\ &\vert &\kws{or} &\text{Unsigned Or}\\ &\vert &\kws{xor} &\text{Unsigned Xor}\\ &\vert &\kws{andr} &\text{Unsigned And Reduce}\\ &\vert &\kws{orr} &\text{Unsigned Or Reduce}\\ &\vert &\kws{xorr} &\text{Unsigned Xor Reduce}\\ &\vert &\kws{cat} &\text{Unsigned Concatenation}\\ &\vert &\kws{bit} &\text{Single Bit Extraction}\\ &\vert &\kws{bits} &\text{Multiple Bit Extraction}\\ \end{array} } \] \subsection{Notation} The above definition specifies the structure of the abstract syntax tree corresponding to a FIRRTL circuit. Nodes in the abstract syntax tree are {\em italicized}. Keywords are shown in {\bf bold}. The special productions id, int, and string, indicates an identifier, an integer literal, and a string respectively. Tokens followed by an asterisk, {\em e.g.} \pds{field}*, indicates a list formed from repeated occurrences of the token. Keep in the mind that the above definition is only the {\em abstract} syntax tree, and is a representation of the in-memory FIRRTL data structure. Readers and writers are provided for converting a FIRRTL data structure into a purely textual representation, which is defined in Section \ref{concrete}. \section{Circuits and Modules} \[ \begin{array}{rrl} \pd{circuit} &= &\kw{circuit} \text{toplevel-module } \kw{:} (\text{modules*}) \\ \pd{module} &= &\kw{module} \text{name } \kw{:} (\text{ports* } \text{body}) \\ &\vert &\kw{extmodule} \text{name } \kw{:} (\text{ports* }) \\ \pd{port} &= &\pd{kind} \id \kw{:} \pd{type} \\ \pd{kind} &= &\kws{input} \vert \kws{output} \\ \end{array} \] All FIRRTL circuits consist of a flat list of modules, each representing one hardware block. Each module has a given name, a list of ports, and a statement representing the circuit connections within the module. Externally defined modules consist of a given name, and a list of ports, whose types must match the types defined in the associated Verilog. Module names exist in their own namespace, and all modules must have a unique name. The name of the top-level module must be specified for a circuit. A module port is specified by its \pd{kind}, which may be input or output, a name, and the data type for the port. The port names exist in the identifier namespace for the module, and must be unique. In addition, all references within a module must be unique. The following example is the port declaration of a module that spans two clock domains. \[ \begin{aligned} &\kw{module} TwoClock : \\ &\quad \kw{input} clk1 : \kw{Clock}\\ &\quad \kw{input} clk2 : \kw{Clock}\\ &\quad ... \\ \end{aligned} \] \section{Types} \subsection{Ground Types} \[ \begin{array}{rrl} \pd{type} &= &\kws{UInt}\kws{$<$} \pds{width} \kws{$>$} \\ &\vert &\kws{SInt}\kws{$<$} \pds{width} \kws{$>$} \\ &\vert &\kws{Clock} \\ \pd{width} &= &\ints \\ &\vert &\kw{?} \\ \end{array} \] There are only three ground types in FIRRTL, an unsigned, signed integer type, and clock type. Both unsigned and signed integer types require a given bit width, which may be some known integer width, which must be non-negative and greater than zero, or an unknown width. Unknown widths are a declaration for the width to be computed by the FIRRTL width inferencer, instead of manually given by the programmer. Zero-valued widths are currently not supported, but future versions will likely support them. Clock types have a restricted usage, where they can only be connected to other clock types or be referenced to in the \kws{reg}, \kws{accessor}, and \kws{inst} declarations, as explained in Section \ref{statements}. They cannot be used in primitive operations. \subsection{Vector Types} \[ \begin{array}{rrl} \pd{type} &= &\pds{type}[\ints] \\ \end{array} \] Vector types in FIRRTL indicate a structure consisting of multiple elements of some given type. This is akin to array types in the C programming language. Note that the number of elements must be known, and non-negative. As an example, the type $\kws{UInt}\kws{$<$} 16 \kws{$>$}[10]$ indicates a ten element vector of 16-bit unsigned integers. The type $\kws{UInt}\kws{$<$} \kws{?} \kws{$>$}[10]$ indicates a ten element vector of unsigned integers, with unknown but the same bit widths. Vector types may be nested ad infinitum. The type $\kws{UInt}\kws{$<$} 16 \kws{$>$}[10][5]$ indicates a five element vector {\em of} ten element vectors of 16-bit unsigned integers. \subsection{Bundle Types} \[ \begin{array}{rrl} \pd{type} &= &\bundleT{\pd{field*}} \\ \pd{field} &= &\pd{orientation} \text{name } \kw{:} \pd{type} \\ \pd{orientation}&= &\kws{default} \vert \kws{reverse} \\ \end{array} \] Bundle types in FIRRTL are composite types formed from an ordered sequence of named, nested types. All fields in a bundle must have a given orientation, name, and type. The following is an example of a possible type for representing a complex number. \[ \bundleT{\kw{default} \text{real } \kw{:} \kws{SInt}\kws{$<$} 10 \kws{$>$}, \kw{default} \text{imag } \kw{:} \kws{SInt}\kws{$<$} 10 \kws{$>$}} \] It has two fields, real, and imag, both 10-bit signed integers. By convention, we specify the directions within a bundle type with their relative orientation. For this reason, the real and imag fields for the complex number bundle type are both specified to be {\em default}. The following bundle type has a data field that is specified to be a 10-bit unsigned integer type, a valid signal that must be a 1-bit unsigned integer type, and a reversed ready signal that must be a 1-bit unsigned integer type. \[ \begin{aligned} \{ \kw{default} &\text{data } \kw{:} \kws{UInt}\kws{$<$} 10 \kws{$>$}, \\ \kw{default} &\text{valid } \kw{:} \kws{UInt}\kws{$<$} 1 \kws{$>$}, \\ \kw{reverse} &\text{ready } \kw{:} \kws{UInt}\kws{$<$} 1 \kws{$>$}\} \\ \end{aligned} \] If an output port had this bundle type, the {\em ready} field would be an input to the module. Note that all field names within a bundle type must be unique. As in the case of vector types, bundle types may also be nested ad infinitum (i.e., the types of the fields themselves may also be bundle types, which will in turn contain more fields, etc.) \section{Statements} \label{statements} FIRRTL circuit components are instantiated and connected together using {\em statements}. \subsection{Wires} A wire is a named combinational circuit element that can be connected to using the connect statement. A wire with a given name and type can be instantiated with the following statement. \[ \kw{wire} \text{name } \kw{:} \pd{type} \\ \] Declared wires are {\em bidirectional}, which means that they can be used as both an input (by being on the left-hand side of a connect statement), or as an output (by being on the right-hand side of a connect statement). \subsection{Registers} A register is a named stateful circuit element. A register with a given name, type, clock reference, and reset reference, can be instantiated with the following statement. \[ \kw{reg} \text{name } \kw{:} \pds{type},\pds{clk,} \pds{reset} \\ \] Like wires, registers are also {\em bidirectional}, which means that they can be used as both an input (by being on the left-hand side of a connect statement), or as an output (by being on the right-hand side of a connect statement). The onreset statement is used to specify the initialization value for a register, which is assigned to the register when the declared \pds{reset} signal is asserted. smem read from same address as write \subsection{Memories} A memory is a stateful circuit element containing multiple elements. The type for a memory must be completely specified; it cannot contain any unknown widths or bundle types with reverse fields. Unlike registers, memories can {\em only} be read from or written to through {\em accessors}, and cannot be initialized using a special FIRRTL construct Instead, the circuit itself must contain the proper logic to initialize the memory. Additionally, if a memory is written via two or more accessors to the same memory address, the resulting stored value is undefined. Memories always have a synchronous write, but can either be declared to be read combinatorially or synchronously. A combinatorially read memory with a given name, type, and size integer can be instantiated with the following statement. \[ \begin{aligned} \kw{cmem} \text{name } \kw{:} \pds{type} , size\\ \end{aligned} \] A synchronously read memory with a given name, type, and size integer can be instantiated with the following statement. \[ \begin{aligned} \kw{smem} \text{name } \kw{:} \pds{type} , size \\ \end{aligned} \] A synchronously read memory has the additional restriction that a read to an address on the same cycle its written returns an undefined value. \subsection{Poisons} A poison component is a named combinational circuit element that holds a random/garbage value. It cannot be connected to using the connect statement, and its type cannot contain a bundle with a flipped value. A poison component with a given name and type can be instantiated with the following statement. \[ \kw{poison} \text{name } \kw{:} \pd{type} \\ \] Declared poisons are {\em unidirectional}, which means that they can only be used as a source (being on the right-hand side of a connect statement). \subsection{Nodes} A node is simply a named intermediate value in a circuit, and is akin to a pointer in the C programming language. A node with a given name and value can be instantiated with the following statement. \[ \kw{node} \text{name } = \pd{exp} \\ \] Unlike wires, nodes can only be used in {\em output} directions. They can be connected from, but not connected to. Consequentially, their expression cannot be a bundle type with any reversed fields. \subsection{Accessors} Accessors are used for either connecting to or from a vector-typed expression, from some {\em variable} index. \[ \begin{aligned} &\pd{dir} \kw{accessor} \text{name} = \pds{exp}[\text{index}] \pds{,clk} \\ &\pd{dir} = \kws{infer} \vert \kws{read} \vert \kws{write} \vert \kw{rdwr} \\ \end{aligned} \] Given an accessor direction, a name, an expression to access, the index at which to access, and the clock domain it is in, the above statement creates an accessor that may be used for connecting to or from the expression. The expression must have a vector type, and the index must be a variable of UInt type. A read, write, and inferred accessor is conceptually one-way; it must be consistently used to connect to, or to connect from, but not both. A read-write accessor (\kws{rdwr}) is conceptually two-way; it can be used to connect to, to connect from, or both, {\em but not on the same cycle}. If it is written to and read from on the same cycle, its behavior is undefined. The following example demonstrates using accessors to read and write to a memory. The accessor, \pds{reader}, acts as a memory read port that reads from the index specified by the wire \pds{i}. The accessor, \pds{writer}, acts as a memory write port that writes 42 to the index specified by wire \pds{j}. \[ \begin{aligned} &\kw{wire} i : \kws{UInt}\kws{$<$} 5 \kws{$>$} \\ &\kw{wire} j : \kws{UInt}\kws{$<$} 5 \kws{$>$} \\ &\kw{cmem} m : \kws{UInt}\kws{$<$} 10 \kws{$>$},10 \\ &\kw{read} \kw{accessor} reader = m[i] , clk \\ &\kw{write} \kw{accessor} writer = m[j] , clk \\ &writer <= \kws{UInt}\kws{$<$} \kws{?} \kws{$>$}(42) \\ &\kw{node} temp = reader \\ \end{aligned} \] As mentioned previously, the only way to read from or write to a memory is through an accessor. However, accessors are not restricted to accessing memories. They can be used to access {\em any} cmem, smem, or wire/reg with vector-valued type. An accessor passed a poisoned value as an index writes or reads a poisoned value. \subsection{Instances} An instance refers to a particular instantiation of a FIRRTL module. An instance is constructed with a given name and a given module name. \[ \begin{aligned} \kw{inst} \text{name } \kw{:} \text{module} \end{aligned} \] The resulting instance has a bundle type, where the given module's ports are fields and can be accessed using the subfield expression. The orientation of the {\em output} ports are {\em default}, and the orientation of the {\em input} ports are {\em reverse}. An instance may be directly connected to another element, but it must be on the right-hand side of the connect statement. The following example illustrates directly connecting an instance to a wire: { \fontsize{11pt}{1.15em}\selectfont \[ \begin{aligned} &\kw{extmodule} Queue \ \kws{:} \\ &\quad \kw{input} clk \ \kw{:} \kws{Clock} \\ &\quad \kw{input} in \ \kw{:} \kws{UInt$<$}16\kws{$>$} \\ &\quad \kw{output} out \ \kw{:} \kws{UInt$<$}16\kws{$>$} \\ &\kw{module} Top \ \kws{:} \\ &\quad \kw{input} clk \ \kw{:} \kws{Clock} \\ &\quad \kw{inst} queue \ \kw{:} Queue \\ &\quad \kw{wire} connect \ \kw{:} \bundleT{\kw{default} out \ \kw{:} \kws{UInt$<$}16\kws{$>$},\kw{reverse} in \ \kw{:} \ \kws{UInt$<$}16\kws{$>$},\kw{reverse} clk \ \kw{:} \ \kws{Clock}} \\ &\quad connect \ \kw{$<$=} queue \\ \end{aligned} \] } The output ports of an instance may only be connected from, e.g., the right-hand side of a connect statement. Conversely, the input ports of an instance may only be connected to, e.g., the left-hand side of a connect statement. The following example illustrates a proper use of creating instances with different clock domains: { \fontsize{11pt}{1.15em}\selectfont \[ \begin{aligned} &\kw{extmodule} AsyncQueue \ \kws{:} \\ &\quad \kw{input} clk1 \ \kw{:} \kws{Clock} \\ &\quad \kw{input} clk2 \ \kw{:} \kws{Clock} \\ &\quad \kw{input} in \ \kw{:} \bundleT{\kw{default} data \ \kw{:} \kws{UInt$<$}16\kws{$>$},\kw{reverse} ready \ \kw{:} \kws{UInt$<$}1\kws{$>$}} \\ &\quad \kw{output} out \ \kw{:} \bundleT{\kw{default} data \ \kw{:} \kws{UInt$<$}16\kws{$>$},\kw{reverse} ready \ \kw{:} \kws{UInt$<$}1\kws{$>$}} \\ &\kw{extmodule} Source \ \kws{:} \\ &\quad \kw{input} clk \ \kw{:} \kws{Clock} \\ &\quad \kw{output} packet \ \kw{:} \bundleT{\kw{default} data \ \kw{:} \kws{UInt$<$}16\kws{$>$},\kw{reverse} ready \ \kw{:} \kws{UInt$<$}1\kws{$>$}} \\ &\kw{extmodule} Sink \ \kws{:} \\ &\quad \kw{input} clk \ \kw{:} \kws{Clock} \\ &\quad \kw{input} packet \ \kw{:} \bundleT{\kw{default} data \ \kw{:} \kws{UInt$<$}16\kws{$>$},\kw{reverse} ready \ \kw{:} \kws{UInt$<$}1\kws{$>$}} \\ &\kw{module} TwoClock \ \kws{:} \\ &\quad \kw{input} clk1 \ \kw{:} \kws{Clock} \\ &\quad \kw{input} clk2 \ \kw{:} \kws{Clock} \\ &\quad \kw{inst} src \ \kw{:} Source \\ &\quad src.clk \ \kw{$<$=} clk1 \\ &\quad \kw{inst} snk \ \kw{:} Sink \\ &\quad snk.clk \ \kw{$<$=} clk2 \\ &\quad \kw{inst} queue \ \kw{:} AsyncQueue \\ &\quad queue.clk1 \ \kw{$<$=} clk1 \\ &\quad queue.clk2 \ \kw{$<$=} clk2 \\ &\quad queue.in \ \kw{$<$=} src.packet \\ &\quad snk.packet \ \kw{$<$=} queue.out \\ \end{aligned} \] } There are restrictions upon which modules the user is allowed to instantiate, so as not to create infinitely recursive hardware. We define a module with no instances as a {\em level 0} module. A module containing only instances of {\em level 0} modules is a {\em level 1} module, and a module containing only instances of {\em level 1} or below modules is a {\em level 2} module. In general, a {\em level n} module is only allowed to contain instances of modules of level $n-1$ or below. \subsection{The Connect Statement} The connect statement is used to specify a physical wired connection between one hardware component to another, and is the most important statement in FIRRTL. The following statement is used to connect the output of some component, to the input of another component. \[ \text{input } \kw{$<$=} \text{output} \] For a connection to be legal, the types of the two expressions must match exactly, including all field orientations if the elements contain bundle types. However, the widths of the types do not need to be equivalent. If the {\em output} expression has a smaller width than the {\em input} expression, the {\em output} is padded according to its type. If the {\em output} expression has a larger width than the {\em input} expression, this triggers an error. If the {\em input} width is unknown, it is inferred to be the width of the largest {\em output} that it is connected to. If the {\em output} width is unknown, it cannot inferred from this connection. The component on the right-hand side must be able to be used as an output, and the component on the left-hand side must be able to be used as an input. \subsection{The OnReset Connect Statement} The onreset connect statement is used to specify the default value for a \kws{reg} element. \[ \kw{onreset} \text{r } \kw{$<$=} \text{output} \] For a connection to be legal, the types of the two expressions must match exactly, including all field orientations if the elements contain bundle types. The component on the right-hand side must be able to be used as an output, and the component on the left-hand side must be a \kws{reg} element. The widths of the types may mismatch, and the semantics are the same as the connect statements. Memories cannot be initialized with this construct. By default, a \kws{reg} will not have an initialization value and will maintain its current value under the reset signal specified in their declaration. The following example demonstrates declaring a \kws{reg}, and changing its initialization value to forty-two. \[ \begin{aligned} & \kw{reg} r : \kws{UInt}\kws{$<$} 10 \kws{$>$} \kws{(} clk, \ reset \kws{)}\\ & \kw{onreset} r <= \kws{UInt}\kws{$<$} \kws{?} \kws{$>$}(42) \end{aligned} \] \subsection{The Partial Connect Statement} The partial connect statement is a connect statement that does not require both expressions to be the same type. During the lowering pass, the partial connect will expand to some number of connect statements, possibly zero statements. The following statement is used to connect the output of some component, to the input of another component. \[ \text{input } \kw{$<$--} \text{output} \] For a partial connect between two components of a bundle-type, fields that are of the same type, orientation, and name will be connected. Fields that do not match will not be connected. For a partial connect between two components of a vector-type, the number of connected elements will be equal to the length of the shorter vector. A partial connect between two components of the same ground type is equivalent to a normal connect statement. All other combinations of types will not error, but will not generate any connect statements. \subsection{The Conditional Statement} The conditional statement is used to specify a condition that must be asserted under which a list of statements hold. The condition must be a 1-bit unsigned integer. The following statement states that the {\em conseq} statements hold only when {\em condition} is assert high, otherwise the {\em alt} statements hold instead. \[ \begin{aligned} \kw{when} \text{condition } \kw{:} \text{conseq } \kw{else :} \text{alt} \end{aligned} \] Notationally, for convenience, we omit the \kws{else} branch if it is an empty statement. \subsubsection{Initialization Coverage} Because of the conditional statement, it is possible for wires to be only partially connected to an expression. In the following example, the wire {\em w} is connected to 42 when enable is asserted high, but it is not specified what {\em w} is connected to when enable is low. This is an illegal FIRRTL circuit, and will throw a \kws{wire not initialized} error during compilation. \[ \begin{aligned} &\kw{wire} w : \kws{UInt}\kws{$<$} \kws{?} \kws{$>$} \\ &\kw{when} enable : \\ &\quad w <= \kws{UInt}\kws{$<$} \kws{?} \kws{$>$}(42) \\ \end{aligned} \] \subsubsection{Scoping} The conditional statement creates a new {\em scope} within its consequent and alternative branches. It is an error to refer to any component declared within a branch after the branch has ended. Note that there is still only a single identifier namespace in a module. Thus, there cannot be two components with identical names in the same module, {\em even if} they are in separate scopes. This is to facilitate writing transformational passes, by ensuring that the component name and module name is sufficient to uniquely identify a component. \subsubsection{Conditional Connect Semantics} Inside a when, a connection to a component is conditional only if the component is declared outside the when statement. If the component is both declared and connected to inside a when, the connection is {\em not} conditional on that when. Conceptually, a when creates a mux between the stuff outside and the stuff inside - it acts as type of "conditional barrier". Thus, if you draw a line between a component's declaration and a connection to it, that connection is dependent on all intersected when predicates being true. The following example shows a {\em conditional} connection inside a when statement, where the register \pd{r} is assigned the value of 42 only if \pds{enable} is true. \[ \begin{aligned} &\kw{reg} r : \kws{UInt}\kws{$<$} \kws{6} \kws{$>$} \\ &\kw{when} enable : \\ &\quad r <= \kws{UInt}\kws{$<$} \kws{6} \kws{$>$}(42) \\ \end{aligned} \] The following shows an {\em unconditional} connection inside a when statement, where the register \pd{r} is assigned the value of 42 {\em every cycle}. \[ \begin{aligned} &\kw{when} enable : \\ &\quad \kw{reg} r : \kws{UInt}\kws{$<$} \kws{6} \kws{$>$} \\ &\quad r <= \kws{UInt}\kws{$<$} \kws{6} \kws{$>$}(42) \\ \end{aligned} \] \subsection{Statement Groups} Several statements can be grouped into one using the following construct. \[ \begin{aligned} (\pd{stmt*}) \end{aligned} \] Ordering is important in a statement group. Later connect statements take precedence over earlier connect statements, and circuit components cannot be referred to before they are instantiated. \subsubsection{Last Connect Semantics} Because of the connect statement, FIRRTL statements are {\em ordering} dependent. Later connections take precedence over earlier connections. In the following example, the wire w is connected to 42, not 20. \[ \begin{aligned} &\kw{wire} w : \kws{UInt}\kws{$<$} \kws{?} \kws{$>$} \\ &w <= \kws{UInt}\kws{$<$} ? \kws{$>$}(20) \\ &w <= \kws{UInt}\kws{$<$} ? \kws{$>$}(42) \\ \end{aligned} \] By coupling the conditional statement with last connect semantics, many circuits can be expressed in a natural style. In the following example, the wire w is connected to 20 unless the enable expression is asserted high, in which case w is connected to 42. \[ \begin{aligned} &\kw{wire} w : \kws{UInt}\kws{$<$} \kws{?} \kws{$>$} \\ &w <= \kws{UInt}\kws{$<$} \kws{?} \kws{$>$}(20) \\ &\kw{when} enable : \\ &\quad w <= \kws{UInt}\kws{$<$} \kws{?} \kws{$>$}(42) \\ \end{aligned} \] \subsection{The Stop Statement} The stop statement is used to halt simulations of the circuit. \[ \begin{aligned} \kw{stop} \end{aligned} \] If a backend does not support stop, it will omit emitting this node. \subsection{The Printf Statement} The printf statement is used to print a formatted string during simulations of the circuit. \[ \begin{aligned} \kw{printf}(\strings,\pds{exp*}) \end{aligned} \] If a backend does not support printf, it will omit emitting this node. \subsection{The Empty Statement} The empty statement is specified using the following. \[ \begin{aligned} \kw{skip} \end{aligned} \] The empty statement does nothing and is used simply as a placeholder where a statement is expected. It is typically used as the alternative branch in a conditional statement. In addition, it is useful for transformation pass writers. \section{Expressions} FIRRTL expressions are used for creating values corresponding to the ground types, for referring to a declared circuit component, for accessing a nested element within a component, and for performing primitive operations. \subsection{Unsigned Integers} A value of type \kws{UInt} can be directly created using the following expression. \[ \kws{UInt}\kws{$<$} \pds{width} \kws{$>$}(\text{value}) \] The given value must be non-negative, and the given width, if known, must be large enough to hold the value. If the width is specified as unknown, then FIRRTL infers the minimum possible width necessary to hold the value. \subsection{Signed Integers} A value of type \kws{SInt} can be directly created using the following expression. \[ \kws{SInt}\kws{$<$} \pds{width} \kws{$>$}(\text{value}) \] The given width, if known, must be large enough to hold the given value in two's complement format. If the width is specified as unknown, then FIRRTL infers the minimum possible width necessary to hold the value. \subsection{References} \[ \text{name} \] A reference is simply a name that refers to some declared circuit component. A reference may refer to a port, a node, a wire, a register, an instance, a memory, a node, or a structural register. \subsection{Subfields} \[ \pds{exp}.\text{name} \] The subfield expression may be used for one of two purposes: \begin{enumerate} \item To refer to a specific port of an instance, using instance-name.port-name. \item To refer to a specific field within a bundle-typed expression. \end{enumerate} \subsection{Subindex} \[ \pds{exp}[\text{index}] \] The subindex expression is used for referring to a specific element within a vector-valued expression. It is legal to use the subindex expression on any vector-valued expression, except for memories. \subsection{Primitive Operation} \[ \pds{primop}(\pds{exp*}, \ints\text{*}) \] There are a number of different primitive operations supported by FIRRTL. Each operation takes some number of expressions, along with some number of integer literals. Section \ref{primitives} will describe the format and semantics of each operation. \section{Primitive Operations} \label{primitives} All primitive operations expression operands must be ground types. In addition, some operations allow all permutations of operand ground types, while others on allow subsets. When well defined, input arguments are allowed to be differing widths, with the semantics of sign-extending the input arguments prior to executing the operation. \subsection{Add Operation} \[ \begin{array}{rll} \kws{Input Types} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{add}(\pds{op1}:UInt, \pds{op2}:UInt) & UInt & max(width(op1),width(op2)) + 1 \\ \kws{add}(\pds{op1}:UInt, \pds{op2}:SInt) & SInt & max(width(op1),width(op2)) + 1 \\ \kws{add}(\pds{op1}:SInt, \pds{op2}:UInt) & SInt & max(width(op1),width(op2)) + 1 \\ \kws{add}(\pds{op1}:SInt, \pds{op2}:SInt) & SInt & max(width(op1),width(op2)) + 1 \\ \end{array} \] The resultant's value is 1-bit larger than the wider of the two operands and has a signed type if either operand is signed (otherwise is unsigned). \subsection{Subtract Operation} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{sub}(\pds{op1}:UInt, \pds{op2}:UInt) & SInt & max(width(op1),width(op2)) + 1 \\ \kws{sub}(\pds{op1}:UInt, \pds{op2}:SInt) & SInt & max(width(op1),width(op2)) + 1 \\ \kws{sub}(\pds{op1}:SInt, \pds{op2}:UInt) & SInt & max(width(op1),width(op2)) + 1 \\ \kws{sub}(\pds{op1}:SInt, \pds{op2}:SInt) & SInt & max(width(op1),width(op2)) + 1 \\ \end{array} \] The subtraction operation works similarly to the add operation, but always returns a signed integer with a width that is 1-bit wider than the max of the widths of the two operands. \subsection{Add Wrap Operation} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{addw}(\pds{op1}:UInt, \pds{op2}:UInt) & UInt & max(width(op1),width(op2)) \\ \kws{addw}(\pds{op1}:UInt, \pds{op2}:SInt) & SInt & max(width(op1),width(op2)) \\ \kws{addw}(\pds{op1}:SInt, \pds{op2}:UInt) & SInt & max(width(op1),width(op2)) \\ \kws{addw}(\pds{op1}:SInt, \pds{op2}:SInt) & SInt & max(width(op1),width(op2)) \\ \end{array} \] The add wrap operation works identically to the normal add operation except that the resultant width is the maximum of the width of the two operands, instead of 1 bit greater than the maximum. In the case of overflow, the result silently rolls over. \subsection{Subtract Wrap Operation} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{subw}(\pds{op1}:UInt, \pds{op2}:UInt) & UInt & max(width(op1),width(op2)) \\ \kws{subw}(\pds{op1}:UInt, \pds{op2}:SInt) & SInt & max(width(op1),width(op2)) \\ \kws{subw}(\pds{op1}:SInt, \pds{op2}:UInt) & SInt & max(width(op1),width(op2)) \\ \kws{subw}(\pds{op1}:SInt, \pds{op2}:SInt) & SInt & max(width(op1),width(op2)) \\ \end{array} \] Similarly to the add wrap operation, the subtract wrap operation works identically to the normal subtract operation except that the resultant width is the maximum of the width of the two operands. In the case of overflow, the result silently rolls over. \subsection{Multiply Operation} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{mul}(\pds{op1}:UInt, \pds{op2}:UInt) & UInt & width(op1) + width(op2) \\ \kws{mul}(\pds{op1}:UInt, \pds{op2}:SInt) & SInt & width(op1) + width(op2) \\ \kws{mul}(\pds{op1}:SInt, \pds{op2}:UInt) & SInt & width(op1) + width(op2) \\ \kws{mul}(\pds{op1}:SInt, \pds{op2}:SInt) & SInt & width(op1) + width(op2) \\ \end{array} \] The resultant value has width equal to the sum of the widths of its two operands. \subsection{Divide Operation} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{div}(\pds{op1}:UInt, \pds{op2}:UInt) & UInt & width(op1) \\ \kws{div}(\pds{op1}:UInt, \pds{op2}:SInt) & SInt & width(op1) + 1 \\ \kws{div}(\pds{op1}:SInt, \pds{op2}:UInt) & SInt & width(op1) \\ \kws{div}(\pds{op1}:SInt, \pds{op2}:SInt) & SInt & width(op1) + 1 \\ \end{array} \] The first argument is the dividend, the second argument is the divisor. The resultant width of a divide operation is equal to the width of the dividend, plus one if the divisor is an SInt. The resultant value follows the following formula : div(a,b) = round-towards-zero(a/b) + mod(a,b) \subsection{Modulus Operation} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{mod}(\pds{op1}:UInt, \pds{op2}:UInt) & UInt & width(op2) \\ \kws{mod}(\pds{op1}:UInt, \pds{op2}:SInt) & UInt & width(op2) \\ \kws{mod}(\pds{op1}:SInt, \pds{op2}:UInt) & SInt & width(op2) + 1 \\ \kws{mod}(\pds{op1}:SInt, \pds{op2}:SInt) & SInt & width(op2) \\ \end{array} \] The first argument is the dividend, the second argument is the divisor. The resultant width of a modulus operation is equal to the width of the divisor, except when the modulus is positive and the result can be negative. The resultant value follows the following formula : div(a,b) = round-towards-zero(a/b) + mod(a,b) \subsection{Quotient Operation} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{quo}(\pds{op1}:UInt, \pds{op2}:UInt) & UInt & width(op1) + 1 \\ \kws{quo}(\pds{op1}:UInt, \pds{op2}:SInt) & SInt & width(op1) \\ \kws{quo}(\pds{op1}:SInt, \pds{op2}:UInt) & SInt & width(op1) + 1 \\ \kws{quo}(\pds{op1}:SInt, \pds{op2}:SInt) & SInt & width(op1) \\ \end{array} \] The first argument is the dividend, the second argument is the divisor. The resultant width of a quotient operation is equal to the width of the dividend, plus one if the divisor is an SInt. The resultant value follows the following formula : quo(a,b) = floor(a/b) + rem(a,b) \subsection{Remainder Operation} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{rem}(\pds{op1}:UInt, \pds{op2}:UInt) & UInt & width(op2) \\ \kws{rem}(\pds{op1}:UInt, \pds{op2}:SInt) & SInt & width(op2) \\ \kws{rem}(\pds{op1}:SInt, \pds{op2}:UInt) & UInt & width(op2) + 1 \\ \kws{rem}(\pds{op1}:SInt, \pds{op2}:SInt) & SInt & width(op2) \\ \end{array} \] The first argument is the dividend, the second argument is the divisor. The resultant width of a modulus operation is equal to the width of the divisor, except when the divisor is positive and the result can be negative. The resultant value follows the following formula : quo(a,b) = floor(a/b) + rem(a,b) \subsection{Comparison Operations} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{lt} (\pds{op1}:UInt, \pds{op2}:UInt) & UInt & 1 \\ \kws{lt} (\pds{op1}:UInt, \pds{op2}:SInt) & UInt & 1 \\ \kws{lt} (\pds{op1}:SInt, \pds{op2}:UInt) & UInt & 1 \\ \kws{lt} (\pds{op1}:SInt, \pds{op2}:SInt) & UInt & 1 \\ \kws{leq} (\pds{op1}:UInt, \pds{op2}:UInt) & UInt & 1 \\ \kws{leq} (\pds{op1}:UInt, \pds{op2}:SInt) & UInt & 1 \\ \kws{leq} (\pds{op1}:SInt, \pds{op2}:UInt) & UInt & 1 \\ \kws{leq} (\pds{op1}:SInt, \pds{op2}:SInt) & UInt & 1 \\ \kws{gt} (\pds{op1}:UInt, \pds{op2}:UInt) & UInt & 1 \\ \kws{gt} (\pds{op1}:UInt, \pds{op2}:SInt) & UInt & 1 \\ \kws{gt} (\pds{op1}:SInt, \pds{op2}:UInt) & UInt & 1 \\ \kws{gt} (\pds{op1}:SInt, \pds{op2}:SInt) & UInt & 1 \\ \kws{geq} (\pds{op1}:UInt, \pds{op2}:UInt) & UInt & 1 \\ \kws{geq} (\pds{op1}:UInt, \pds{op2}:SInt) & UInt & 1 \\ \kws{geq} (\pds{op1}:SInt, \pds{op2}:UInt) & UInt & 1 \\ \kws{geq} (\pds{op1}:SInt, \pds{op2}:SInt) & UInt & 1 \\ \end{array} \] Each operation accepts any combination of SInt or UInt input arguments, and always returns a single-bit unsigned integer. \subsection{Equality Comparison} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{eq}(\pds{op1}:UInt, \pds{op2}:UInt) & UInt & 1 \\ \kws{eq}(\pds{op1}:UInt, \pds{op2}:SInt) & UInt & 1 \\ \kws{eq}(\pds{op1}:SInt, \pds{op2}:UInt) & UInt & 1 \\ \kws{eq}(\pds{op1}:SInt, \pds{op2}:SInt) & UInt & 1 \\ \end{array} \] The equality comparison operator accepts either two unsigned or signed integers and checks whether they are arithmetically equal. The resulting value is a 1-bit unsigned integer. \subsection{Not-Equality Comparison} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{neq}(\pds{op1}:UInt, \pds{op2}:UInt) & UInt & 1 \\ \kws{neq}(\pds{op1}:UInt, \pds{op2}:SInt) & UInt & 1 \\ \kws{neq}(\pds{op1}:SInt, \pds{op2}:UInt) & UInt & 1 \\ \kws{neq}(\pds{op1}:SInt, \pds{op2}:SInt) & UInt & 1 \\ \end{array} \] The not-equality comparison operator accepts either two unsigned or signed integers and checks whether they are arithmetically not equal. The resulting value is a 1-bit unsigned integer. \subsection{Equivalence Comparison} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{eqv}(\pds{op1}:UInt, \pds{op2}:UInt) & UInt & 1 \\ \kws{eqv}(\pds{op1}:SInt, \pds{op2}:SInt) & UInt & 1 \\ \end{array} \] The equivalence comparison operator accepts either two unsigned or two signed integers and checks whether they are bitwise equivalent. The resulting value is a 1-bit unsigned integer. For an arithmetic equals between a signed and unsigned integer, use the equality operator. \subsection{Not-Equivalence Comparison} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{neqv}(\pds{op1}:UInt, \pds{op2}:UInt) & UInt & 1 \\ \kws{neqv}(\pds{op1}:SInt, \pds{op2}:SInt) & UInt & 1 \\ \end{array} \] The not-equivalent comparison operator accepts either two unsigned or two signed integers and checks whether they are not bitwise equivalent. The resulting value is a 1-bit unsigned integer. If an arithmetic not-equals between a signed and unsigned integer is desired, use the not-equals operator. \subsection{Multiplex} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{mux} (\pds{condition}:UInt, \pds{op1}:UInt, \pds{op2}:UInt) & UInt & max(width(op1),width(op2)) \\ \kws{mux} (\pds{condition}:UInt, \pds{op1}:SInt, \pds{op2}:SInt) & SInt & max(width(op1),width(op2)) \\ \end{array} \] The multiplex operation accepts three signals, a 1-bit unsigned integer for the condition expression, followed by either two unsigned integers, or two signed integers. If the condition is high, then the result is equal to the first of the two following operands. If the condition is low, then the result is the second of the two following operands. The output is of the same width as the max width of the inputs. \subsection{Padding Operation} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{pad}(\pds{op}:UInt, \text{num}) & UInt & num \\ \kws{pad}(\pds{op}:SInt, \text{num}) & SInt & num \\ \end{array} \] A pad operation is provided which either zero-extends or sign-extends an expression to a specified width. The given width, num, must be equal to or greater than the existing width of the expression. \subsection{Reinterpret Bits as UInt} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{asUInt}(\pds{op1}:UInt) & UInt & width(op1) \\ \kws{asUInt}(\pds{op1}:SInt) & UInt & width(op1) \\ \end{array} \] Regardless of input type, primop returns a UInt with the same width as the operand. \subsection{Reinterpret Bits as SInt} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{asSInt}(\pds{op1}:UInt) & SInt & width(op1) \\ \kws{asSInt}(\pds{op1}:SInt) & SInt & width(op1) \\ \end{array} \] Regardless of input type, primop returns a SInt with the same width as the operand. \subsection{Shift Left Operation} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{shl}(\pds{op}:UInt, \text{num}) & UInt & width(op) + num \\ \kws{shl}(\pds{op}:SInt, \text{num}) & SInt & width(op) + num \\ \end{array} \] The shift left operation accepts either an unsigned or a signed integer, plus a non-negative integer literal specifying the number of bits to shift. The resultant value has the same type as the operand. The output of a shift left operation is equal to the original signal concatenated with $n$ zeros at the end, where $n$ is the shift amount. \subsection{Shift Right Operation} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{shr}(\pds{op}:UInt, \text{num}) & UInt & width(op) - num \\ \kws{shr}(\pds{op}:SInt, \text{num}) & SInt & width(op) - num \\ \end{array} \] The shift right operation accepts either an unsigned or a signed integer, plus a non-negative integer literal specifying the number of bits to shift. The resultant value has the same type as the operand. The shift amount must be less than or equal to the width of the operand. The output of a shift right operation is equal to the original signal with the least significant $num$ bits truncated, where $num$ is the shift amount. \subsection{Dynamic Shift Left Operation} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{dshl}(\pds{op1}:UInt, \pds{op2}:UInt) & UInt & width(op1) + pow(2,width(op2)) \\ \kws{dshl}(\pds{op1}:SInt, \pds{op2}:UInt) & SInt & width(op1) + pow(2,width(op2)) \\ \end{array} \] The dynamic shift left operation accepts either an unsigned or a signed integer, plus an unsigned integer dynamically specifying the number of bits to shift. The resultant value has the same type as the operand. The output of a dynamic shift left operation is equal to the original signal concatenated with $n$ zeros at the end, where $n$ is the dynamic shift amount. The output width of a dynamic shift left operation is the width of the original signal plus 2 raised to the width of the dynamic shift amount. \subsection{Dynamic Shift Right Operation} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{dshr}(\pds{op}:UInt, \pds{op2}:UInt) & UInt & width(op) \\ \kws{dshr}(\pds{op}:SInt, \pds{op2}:UInt) & SInt & width(op) \\ \end{array} \] The shift right operation accepts either an unsigned or a signed integer, plus a non-negative integer literal specifying the number of bits to shift. The resultant value has the same type as the operand. The shift amount must be less than or equal to the width of the operand. The output of a shift right operation is equal to the original signal with the least significant $n$ bits truncated, where $n$ is the dynamic shift amount. The output width of a dynamic shift right operation is the width of the original signal. \subsection{Logical Convert to Signed} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{cvt}(\pds{op}:UInt) & SInt & width(op) + 1 \\ \kws{cvt}(\pds{op}:SInt) & SInt & width(op) \\ \end{array} \] The convert operation accepts either an unsigned or a signed integer. The resultant value is always a signed integer. The output of a convert operation will be the same arithmetic value as the input value. The output width is the same as the input width if the input is signed, and increased by one if the input is unsigned. \subsection{Negate} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{neg}(\pds{op1}:UInt) & SInt & width(op1) + 1 \\ \kws{neg}(\pds{op1}:SInt) & SInt & width(op1) \\ \end{array} \] If the input type is UInt, primop returns the negative value as an SInt with the width of the operand plus one. If the input type is SInt, primop returns -1 * input value, as an SInt with the same width of the operand. \subsection{Bitwise Operations} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{not} (\pds{op1:UInt}) & UInt & width(op1) \\ \kws{and} (\pds{op1:UInt}, \pds{op2:UInt}) & UInt & max(width(op1),width(op2)) \\ \kws{or} (\pds{op1:UInt}, \pds{op2:UInt}) & UInt & max(width(op1),width(op2)) \\ \kws{xor} (\pds{op1:UInt}, \pds{op2:UInt}) & UInt & max(width(op1),width(op2)) \\ \end{array} \] The above operations correspond to bitwise not, and, or, and exclusive or respectively. The operands must be unsigned integers, and the resultant width is equal to the width of the wider of the two operands. \subsection{Reduce Bitwise Operations} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{andr} (\pds{op:UInt}) & UInt & 1 \\ \kws{orr} (\pds{op:UInt}) & UInt & 1 \\ \kws{xorr} (\pds{op:UInt}) & UInt & 1 \\ \end{array} \] The above operations correspond to bitwise not, and, or, and exclusive or respectively, reduced over every bit of a single unsigned integer. The resultant width is always one. \subsection{Concatenation} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{cat}(\pds{op1}:UInt, \pds{op2}:UInt) & UInt & width(op1) + width(op2) \\ \end{array} \] The concatenation operator accepts two unsigned integers and returns the bitwise concatenation of the two values as an unsigned integer. The resultant width is the sum of the widths of the two operands. \subsection{Bit Extraction Operation} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{bit}(\pds{op}:UInt, \text{index}) & UInt & 1 \\ \end{array} \] The bit extraction operation accepts an unsigned integer, plus an integer literal specifying the index of the bit to extract. The resultant value is a 1-bit unsigned integer. The index must be non-negative and less than the width of the operand. An index of zero indicates the least significant bit in the operand, and an index of one less than the width the operand indicates the most significant bit in the operand. \subsection{Bit Range Extraction Operation} \[ \begin{array}{rll} \kws{primop} & \kws{Resultant Type} & \kws{Resultant Width} \\ \kws{bits}(\pds{op}:UInt, \text{high}, \text{low}) & UInt & high - low + 1 \\ \end{array} \] The bit range extraction operation accepts either an unsigned integer, plus two integer literals that specify the high (inclusive) and low (inclusive) index of the bit range to extract. The index must be non-negative and less than the width of the operand. Regardless of the type of the operand, the resultant value is a $n$-bit unsigned integer, where $n = \text{high} - \text{low} + 1$. \section{FIRRTL Forms} To simplify the writing of transformation passes, any FIRRTL implementation will provide a {\em resolving} pass, which resolves all types, widths, and checks the legality of the circuit, and a {\em lowering} pass, which rewrites any FIRRTL circuit into an equivalent {\em lowered form}, or LoFIRRTL. \subsection{Resolved Form} The resolved form is guaranteed to be well-formed, meaning all restrictions to a FIRRTL circuit have been checked. In addition, all unknown widths and inferred accessor directions have been resolved. \subsection{Lowered Form} The lowered form, LoFIRRTL, is a structured subset of FIRRTL, making it a minimal representation that is convenient for low-level transforms. The body of a lowered module consists of a list of declarations, connect statements, and {\em predicated single connect statements}. A predicated single connect statement is a conditional statement containing a single connect statement and no else branch. The following circuit is lowered: \[ \begin{aligned} &\kw{module} \text{MyCounter} : \\ &\quad \kw{clock} \text{clk : UInt$<$1$>$} \\ &\quad \kw{input} \text{reset : UInt$<$1$>$} \\ &\quad \kw{input} \text{inc : UInt$<$1$>$} \\ &\quad \kw{output} \text{out : UInt$<$3$>$} \\ &\quad \kw{reg} \text{counter : UInt$<$3$>$, clk, reset} \\ &\quad \kw{when} \text{inc : counter $<$= addw(counter, UInt$<$1$>$(1))} \\ &\quad \text{out $<$= counter} \\ \end{aligned} \] The following restrictions also hold for modules in LoFIRRTL. \begin{itemize} \item \kws{No Nested Expressions} : In the declaration of the structural elements, the only nested expressions allowed are references, and unsigned and signed literals. All other nested expressions must be lifted to a named node, and referred to through a reference. \item \kws{No Composite Types} : No module port or wire may be declared with a bundle or vector type. The lowering pass will recursively expand ports into its constituent elements until all ports are declared with ground types. \item \kws{Single Connect} : Every declared component can only be connected to once within a module. This connect could be a predicated single connect. \item \kws{No Nested Whens} : Other than predicated single connect statements, no other conditional statements are allowed. \item \kws{Inlined Lowered Form} : A further (and optional) pass provided by FIRRTL is the inlining pass, which recursively inlines all instances in the top-level module until the top-level module is the only remaining module in the circuit. Inlined LoFIRRTL is essentially a flat netlist which specifies every component used in a circuit and their input connections. \end{itemize} \section{Annotations - IN PROGRESS} Supporting annotations is a critical piece of FIRRTL, yet is a very difficult problem to solve properly. We are in the experimental phase of supporting annotations, and our philosophy is outlined below. It remains to be seen whether our philosophy is correct - if not, we will certainly devise a new strategy. \begin{enumerate}[topsep=3pt,itemsep=-0.5ex,partopsep=1ex,parsep=1ex] \item Writing a correct circuit is difficult - avoid silent failures at all costs. \item If annotations are held in the graph, every pass must properly propagate all possible annotations. \item A pass incorrectly propagating an annotation cannot be easily detected (silent failure). \item If annotations are held in an exteral data structure mapping names to annotations, the structure must updated after every pass. \item Incorrectly updating the structure will cause a mismatching of names between circuit components and annotation entries, which is easily detected. \item Thus, we feel the ability to detect failure outweighs the additional burden on annotation writers. \end{enumerate} To implement this philosophy, we encourage passes to either preserve names in the graph, use simple algorithms to transform names, or provide a rename table after a pass. The annotation writer then updates their data structure accordingly. \section{Concrete Syntax}\label{concrete} This section describes the text format for FIRRTL that is supported by the provided readers and writers. \subsection*{General Principles} FIRRTL's text format is human-readable and uses indentation to indicate block structuring. The following characters are allowed in identifiers: upper and lower case letters, digits, as well as the punctuation characters \verb|~!@#$%^*-_+=?/|. Identifiers cannot begin with a digit. Comments begin with a semicolon and extend until the end of the line. Commas are treated as whitespace, and may be used by the user for clarity if desired. Statements are grouped into statement groups using parenthesis, however a colon at the end of a line will automatically assume the next indented region is a statement group. This mechanism is used for indicating block structuring. The following circuit, module, port and statement examples all exclude the info token \verb|@[filename:line.col]|, which can be optionally included at the beginning of the first line of each elements' concrete syntax. \subsection*{Circuits and Modules} A circuit is specified the following way. \begin{verbatim} circuit name : (modules ...) \end{verbatim} Or by taking advantage of indentation structuring: \begin{verbatim} circuit name : modules ... \end{verbatim} A module is specified the following way. \begin{verbatim} module name : (ports ... stmts ...) \end{verbatim} The module body consists of a sequence of ports followed immediately by a sequence of statements. If there is more than one statement they are grouped into a statement group by the parser. By using indentation structuring: \begin{verbatim} module name : ports ... stmts ... \end{verbatim} The following shows an example of a simple module. \begin{verbatim} module mymodule : input a: UInt<1> output b: UInt<1> clock clk: UInt<1> b <= a \end{verbatim} \subsection*{Types} The unsigned and signed integer types are specified the following way. The following examples demonstrate an unsigned integer with known bit width, signed integer with known bit width, an unsigned integer with unknown bit width, and signed integer with unknown bit width. \begin{verbatim} UInt<42> SInt<42> UInt SInt \end{verbatim} The bundle type consists of a number of fields surrounded with braces. The following shows an example of a decoupled bundle type. Note that the commas are for clarity only and are not necessary. \begin{verbatim} {default data: UInt<10>, default valid: UInt<1>, reverse ready: UInt<1>} \end{verbatim} The vector type is specified by immediately postfixing a type with a bracketed integer literal. The following example demonstrates a ten-element vector of 16-bit unsigned integers. \begin{verbatim} UInt<16>[10] \end{verbatim} \subsection*{Statements} The following examples demonstrate declaring wires, registers, memories, nodes, instances, poisons, and accessors. \begin{verbatim} wire mywire : UInt<10> reg myreg : UInt<10>, clk, reset cmem mycombmem : UInt<10>,16 smem myseqmem : UInt<10>,16 inst myinst : MyModule poison mypoison : UInt<10> infer accessor myaccessor = e[i],clk \end{verbatim} The connect statement is specified using the \verb|<=| operator. \begin{verbatim} x <= y \end{verbatim} The onreset connect statement is specified using the onreset keyword and the \verb|<=| operator. \begin{verbatim} onreset x <= y \end{verbatim} The partial connect statement is specified using the \verb|<-| operator. \begin{verbatim} x <- y \end{verbatim} The assert statement is specified using the assert keyword. \begin{verbatim} assert x \end{verbatim} The conditional statement is specified with the \verb|when| keyword. \begin{verbatim} when x : x <= y else : x <= z \end{verbatim} Or by using indentation structuring: \begin{verbatim} when x : x <= y else : x <= z \end{verbatim} If there is no alternative branch specified, the parser will automatically insert an empty statement. \begin{verbatim} when x : x <= y \end{verbatim} For convenience when expressing nested conditional statements, the colon following the \verb|else| keyword may be elided if the next statement is another conditional statement. \begin{verbatim} when x : x <= y else when y : x <= z else : x <= w \end{verbatim} \subsection*{Expressions} The UInt and SInt constructors create literal integers from a given value and bit width. The following examples demonstrate creating literal integers of both known and unknown bit width. \begin{verbatim} UInt<4>(42) SInt<4>(-42) UInt(42) SInt(-42) \end{verbatim} References are specified with an identifier. \begin{verbatim} x \end{verbatim} Subfields are expressed using the dot operator. \begin{verbatim} x.data \end{verbatim} Subindices are expressed using the \verb|[]| operator. \begin{verbatim} x[10] \end{verbatim} Primitive operations are expressed by following the name of the primitive with a list containing the operands. \begin{verbatim} add(x, y) add(x, add(x, y)) shl(x, 42) \end{verbatim} \section{Future Plans} Some choices were made during the design of this specification which were intentionally conservative, so that future versions could lift the restrictions if suitable semantics and implementations are determined. By restricting this version and potentially lifting these restrictions in future versions, all existing FIRRTL circuits will remain valid. The following design decisions could potentially be changed in future spec revisions: \begin{enumerate}[topsep=3pt,itemsep=-0.5ex,partopsep=1ex,parsep=1ex] \item Disallowing zero-width types \item Always expanding memories into smaller memories (if its type is a non-ground-type) \item Not including a \kws{ROM} node \item Custom annotations are not held in FIRRTL nodes \item Not requiring that all names are unique \end{enumerate} \section{Questions and Answers} \begin{enumerate}[topsep=3pt,itemsep=-0.5ex,partopsep=1ex,parsep=1ex] \item Why are there three connect operators? Each is needed for a particular use case - the better question is why did we chose to create multiple connect statements instead of other constructs. Statements, as opposed to expressions, are very restricted in how they nest. Thus, the desired supported behavior (partial connects, full connects, and resets) will never be used in an arbitrary nested expression where the semantics would be unintuitive. In addition, both the implementation and the user only needs to look at the single statement to implement it. \item Aren't there a lot of idiosyncrasies in FIRRTL? The FIRRTL specification is an ongoing process, and as we push more code through it, it is likely to change. In our opinion, the idiosyncrasies are necessary for a cohesive design (and all languages have idiosyncrasies). It remains an unknown whether there are too many idiosyncrasies for frontend writers. Because the spec is not frozen, we can certainly adapt it if necessary. However, at this point, we just need to push more code through. \item Why have a separate construct for initializing a register? The problem is initializing a register with a vector/bundle type, where a subset of the fields are initialized. If the initial value is kept with the declaration, we would need a new construct to specify a subset of values of ALL (potentially) nested vector/bundle types. It makes much more sense to separate initialization from the declaration, and use something like a <= to initialize the fields/vector sub-components of the register. The next question is why not just have users specify the initial value using their own "when reset :" statement. This doesn't work because of last connect semantics - the user could easily clobber their initialization when statement without knowing. Creating an onreset statement does two things: (1) specifies to the USER exactly what the reset value will be for any sub-component of a register, (2) encapsulates the reset value in a way that is easy for the implementation to special case it (so it doesn't get clobbered). \item Why do operations allow inputs of differing widths? We tried restricting widths, but it actually complicated width inference and made supporting front-ends with more lax width restrictions very difficult. Because there is perfectly well defined semantics, we opted to allow differing widths. In line with the Linux "funnel" philosophy of being accepting with your inputs and restrictive with your outputs. \item Why require all names unique? Passes usually need unique names, so there needs to be a renaming pass somewhere. Standardizing how names gets mangled requires a lot of thought, and we didn't feel comfortable putting this into the spec at the moment and potentially regretting it later. For now, names have to be unique, and it is the front-end's responsibility to do this. \item Why allow declaring components in when statements? We want the important property that a module is just a box of components inside - for any jumble of components, you can always lace them in the box, and it will preserve the semantics. You need to declare wires inside whens - because generators could run within a when in a front-end. You should always be able to pull them into a module if we want. Now its inconsistent if you can't declare registers in the scope. \item Why not just have LoFIRRTL? LoFIRRTL leaves out general when usage, vector and bundle types, and requires a single connect. For performance backends, we will need to emit arrays and structs. If there is only a lowered circuit, we lose that ability. We cannot simply add vector/bundle types to LoFIRRTL as front-ends cannot easily remove whens without removing the complex types as well. Instead, one will need the expressiveness in FIRRTL to write a performant backend which does not need to operate on LoFIRRTL. \item Why the stop statement have no arguements? Like the enable for write-accessors, the lowering step will preserve the sequence of when statements under which a simulation will stop. \item Why disallow zero-width wires? Very tricky to get the semantics correct. On the todo list. \item Why not require default value for wires? Isn't this a SAT problem? We do the same thing that is done in Java, and is standard programming language practice. \item Why did/didn't you include XXX primop? Up for debate. \item How do you support subword assignment? We decided to not support subword assignment directly, and instead require the user to separate the subword assignment into a vector type. Then, the user uses the subindex expression to assign to an element in the vector. \end{enumerate} \end{document}