CADR LISP Machine and CADR Processor

June 15, 2023(updated on July 09, 2023)

Table of Contents


Introduction

This post contains an overview of the MIT CADR LISP Machine and its processor. The technical information is brief; however, it might be more readable if you have some background in computer architecture. I will repeat things written in the original documents, but I aim to give more or less a complete picture quicker than reading all available documents. I minimized the detailed information about microinstructions because they are mostly similar to a general-purpose processor, but I explained macroinstructions in more detail. These sections can be skipped if you are not interested in this level of detail. I explained the I/O board and diagnostic interface in detail to give a better sense of the system.

It was not initially my intention, but because that is the only way to experience the CADR LISP Machine, I also provide some information and a few hints for the configuration of usim CADR emulator used by the LM-3 project.

What is interesting about the LISP machine besides its historical importance? For example:

  • Its primary programming language is LISP, so the majority of its operating system is written in LISP.

  • It runs LISP quite directly. Not in the sense that the processor runs LISP directly at the hardware level, but I am also not sure what this means. If you do not see the microcode, as you do not see it in modern processors, then it is also valid to say the LISP machine runs LISP natively as much as a modern processor runs its machine code natively.

I believe there is no such computer presently, having its operating system almost totally written in a symbolic/high-level language like LISP and executing this language directly at the hardware level.

There are actually four related LISP machines. CONS is built first at MIT, and then CADR is developed as an improvement. Then the commercialization started in 1980: Symbolics released LM-2 and Lisp Machines, Inc. (LMI) released LMI-CADR, both with a license from MIT. All of these four LISP machines are very similar, particularly CADR, LM-2 and LMI-CADR.

Since only about 30 CADR, about 100 LM-2 and about 25 LMI-CADR were produced 40+ years ago, it is not possible to use any of these at the moment. However, it is possible to experience one with the emulator. Because the CADR LISP Machine is not a very complex system compared to a modern computer, studying the emulator source code is not difficult and is also entertaining and informative.

I would like to thank to Alfred M. Szmidt for the clarifications regarding various points in this post.

Clarifications

I use the acronym LISP, not Lisp. I think before about 1975, it was called LISP and then Lisp, as it means LISt Processor. If it is not LisP, LISP makes more sense to me.

I try to use the terms’ microinstruction, microcode, and microprogram consistently. A microinstruction is just a single microinstruction. A microcode contains some microinstructions. A microprogram is a microcode doing a certain job, like a specific function. All of these are also valid when micro is replaced by macro.

In modern processors, only when specifically talking about microcode, the term microinstruction is used. Instruction (without adding micro) means what a developer sees in the instruction set of this processor (such as mov in x86). In this post, I try to be clear by explicitly saying micro or macro every time. However, if I forget to use neither micro nor macro, it should be understood from the context.

Instruction memory, control memory, or control store refer to the same thing. It is the memory where the microcode is stored.

The original (MIT) documents about the LISP machine use octal numbers. I think we are not that familiar with octal numbers anymore (definitely less than hexadecimal numbers). So, I have converted them to hexadecimal numbers in this post, but occasionally I indicate the octal values as well.

Because the memory (main memory and others) is almost always word (32-bit) addressed in the CADR LISP machine, the unit of the memory capacity is Words, not Bytes. For example, 1MW (1 Mega-Words) main memory is physically 4MB (plus parity). It is more complicated for the internal memory areas within the processor because the size of a word changes. For example, 16K instruction memory is physically, I think, 24KB (plus parity) because instructions are 48-bit in CADR. When I write only the magnitude, such as K or M, the unit is Words and the word size depends on the memory type. Thus, 16K instruction memory means 16KW, not 16KB.

In the official documents, there is a term called order code. It means the macrocode (of a compiled LISP expression). I do not know yet where this term “order code” comes from. I am not using it in this post.

There is a term called load band in CADR documents. Load band is a partition (LOD) on the disk, and it contains an image of memory pages at a time, like a snapshot of the current LISP environment. This is used during boot to load the initial system.

LISP

A programming system called LISP (for LISt Processor) has been developed for the IBM 704 computer by the Artificial Intelligence Group at MIT.

John McCarthy in 1959

A milestone in the history of computing is the LISP programming language. LISP was invented by AI pioneer John McCarthy at MIT in the late 1950s and early 1960s. It makes LISP the second major programming language in history after FORTRAN. The first versions of the LISP programming system, LISP I and LISP 1.5, were implemented on an IBM 704 computer.

IBM 704 in 1957 (source: wikipedia)

IBM 704 in 1957 (source: wikipedia)

In History of LISP (1978), McCarthy gives a great historical overview. Here is what I find interesting in this paper:

  • Initially, the goal was not to develop a new programming language. For example, implementing symbolic computation in FORTRAN was an option. Later, after working on what needed to be done, it became clear that a new programming language was needed.

  • The initial ideas revealed an “elegant mathematical system as well as a practical programming language”.

  • IBM 704 was initially chosen because IBM was setting up a computation center at MIT, so it was accessible, and McCarthy was consulting IBM on a different project.

  • LISP uses prefix notation because the main connective must be determined first before deciding what to do next. If infix were used, a translation program would be needed.

  • From the start, explicit memory management was not an option (“it was clearly unaesthetic to use explicit erasure as did IPL”). For automatic memory management, there were two alternatives: reference counting and garbage collection. For reference counting, only 6 bits were available for the “count” when it was stored together with the pointer, and it was too small. So, garbage collection was picked. Moreover, it was possible to postpone the actual garbage collection implementation because only simple programs were being done, so there was no need to free the memory. The first implementation was a stop-the-world collector, which runs only when all the memory is exhausted.

  • Why does LISP represent programs as lists? Actually, it was not initially thought so. However, when writing the eval function, the LISP program was required to be represented as LISP data, which is represented as lists.

  • IBM 704 registers have address and decrement parts and specific instructions to operate on them. LISP I was implemented using these registers, where the first element is pointed by the address field and the remaining is pointed by the decrement field. Thus, the names of functions car and cdr come from IBM 704 instructions: car means contents of the address part of register number and cdr means contents of the decrement part of register number.

  • It was not planned, but after implementing the eval function, it was realized that it could function as an interpreter.

  • After LISP I, it became clear that some improvements and some additional features were needed. This starts the LISP 1.5 work.

  • There was a LISP 2 project, but it was dropped before it started.

After the first LISP implementations, in the late 60s and 70s, MACLISP at MIT and Interlisp at BBN/Xerox PARC became the most common LISP dialects. Both of these were running on DEC PDP minicomputers. MACLISP was descended from LISP 1.5. “MAC” comes from MIT’s Project MAC.

In fact, the half word instructions and the stack instructions of these machines (DEC PDP-6 and PDP-10) were developed with LISP’s requirements in mind.

History of LISP, McCarthy, 1978

DEC PDP-10 (source: historybit.it)

DEC PDP-10 (source: historybit.it)

Why is LISP different from many other programming languages? McCarthy concludes History of LISP (1978) by giving two facts about the longevity of LISP. Among all the reasons he described, I think that having lists representing both data and the program itself gives it a unique position. Interestingly, McCarthy and others have regarded this as a disadvantage.

CADR LISP Machine

The main problem of using LISP on these mainstream general-purpose computers is the limited amount of memory and low performance. PDP-6 has 36-bit data word and 18-bit addressing (max. 256K words), PDP-10 (KI10) has 36-bit data word and 22-bit addressing with paging (max. 4M words), and PDP-11 has 32-bit data word and 22-bit addressing with paging (max. 4 MB).

The idea of a “LISP machine” has been kicked around more or less constantly
since t=-∞.

Richard Greenblatt, 1975

It appears that the idea of a LISP machine has been in people’s minds since LISP was invented. Probably, it only became technologically possible around 1975. The idea of a LISP machine with more memory and faster execution speed was attractive.

Based on Richard Greenblatt’s work, Tom Knight developed the CONS LISP Machine, also called the Knight Machine, in 1974. Then, in 1976, a revised second version called the CADR LISP Machine was built. This machine was well received, and about 30 CADR Machines were built at MIT. The story of Symbolics and Lisp Machines, Inc. (LMI) starts at this point.

CADR LISP Machine (source: wikipedia)

CADR LISP Machine (source: wikipedia)

CADR was designed to be a LISP machine; however CADR processor was a microprogrammed general purpose processor. CADR processor, together with its microcode, was “interpreting” LISP. Thus, LISP Machine means the processor and the microcode together.

CONS and CADR are very similar, but because CADR is an improved design, and is much more famous, I focus on CADR in this post.

R. Greenblatt (right) and T. Knight (left) with CADR LISP Machine at MIT (source: Computer History Museum)

R. Greenblatt (right) and T. Knight (left) with CADR LISP Machine at MIT (source: Computer History Museum)

System Overview

CADR LISP machine has 32-bit data word, and 24-bit virtual, and 22-bit physical addressing. It also has Unibus and Xbus. Unibus is the standard 16-bit PDP-11 bus.

It is a bit difficult to find information about the CADR LISP machine as a system, so I also use the LM-2 documents. I believe it comes in a cabinet of ~21U rack height. This cabinet includes two cages: a cage for 6 processor boards, and a cage for Unibus and Xbus cards or boards. The disk drives are separate (and big and heavy) units. By default, I think, it comes with 3 boards:

  • I/O board (also called IOB) (Unibus): for keyboard and mouse and a few other things
  • disk interface board (Xbus): to connect a disk drive
  • bitmap interface board (Xbus): to connect a display

There are plenty of available slots for expansion. On LM-2, there are 10 Unibus slots, and I think there are 22 Xbus slots. Some cards may consume the space of 2 slots due to their construction. Lots of expansion slots are needed because, for example, the main memory is also installed on Xbus.

It is possible to connect a disk drive unit to the disk interface board. With a multiplexer board, 7 additional disk drive units can be connected.

Here is a system diagram from the official documents; I made a redraw for clarity. Disk interface board is shown as Disk control, and bitmap interface board is shown as TV control.

CADR LISP Machine System Diagram (redrawn based on AI Memo 528)

CADR LISP Machine System Diagram (redrawn based on AI Memo 528)

It is probably not obvious that the processor is not a single (VLSI) IC chip. It is built from many other basic ICs, since VLSI technology was just starting in the 1970s. The same is true for the functionality implemented by the boards.

  • For example, the 32-bit ALU of the processor is implemented using the 74181 4-bit ALU IC, and there are 9 of these and some other ICs just for implementing the ALU block.

  • For example, the bitmap memory of the TV control is implemented with 4116 16 Kbit memory ICs; thus, there are 64 of these memory ICs to implement 1024 Kbit memory to support, for example, an 800×1024 monochrome display.

CADR schematics are available.

All the memory blocks of CADR LISP machine have parity checking. A parity error in internal memories can cause the machine to be halted. A parity error in main memory can halt the machine, cause a microcode trap, or can be ignored, depending on mode flags.

LISP as a System Language

These are some useful points from LISP Machine Progress Report, 1977, LISP as a System Language section:

  • The software in LISP machine is coded only with LISP and CADR machine microcode.

  • For some low-level operations, some “sub-primitive” functions exists, and their names start with a %.

  • Memory paging is handled by microcode and is not visible to LISP code.

  • The input/output software is totally written in LISP, using only %UNIBUS-READ and %UNIBUS-WRITE sub-primitive functions.

  • There are also some microcoded display-related functions to speed up the drawing of characters. These are microcoded for performance reasons.

Unibus

I/O Board (IOB)

Having an IOB seems like a must, as it supports the following basic devices (ordered by Unibus locations/addresses):

  • Keyboard: The data from the keyboard is encoded as a 32-bit word read from Unibus locations 0x3E840 (low 16-bit) and 0x3E842 (high 16-bit) (764100 and 764102 octal). It generates an interrupt, vector number 260. The LM-2 keyboard has 100 keys. This keyboard was originally called Space Cadet keyboard. You can also watch a youtube review.
Symbolics LM-2 Keyboard (source: wikipedia)

Symbolics LM-2 Keyboard (source: wikipedia)

  • Mouse: Also called “Display Cursor Positioner”. Because there are no arrow keys, I think it was often used for this purpose. The X-position (as a relative position) is at Unibus location 0x3E846 (764106) and the Y-position and the buttons are at 0x3E844 (764104). It is polled 60 times per second. It can also generate an interrupt (vector number 260, same as the keyboard), but I think the interrupt is not used.
Symbolics LM-2 Keyboard, Mouse and Display (source: Symbolics LM-2 Brochure)

Symbolics LM-2 Keyboard, Mouse and Display (source: Symbolics LM-2 Brochure)

  • Speaker: There is a simple speaker inside the keyboard. The beep function can be used for this. Reading the Unibus location at 0x3E848 (764110) inverts the state of the bit that generates the sound.

  • Microsecond Clock: It is a 32-bit register that increments every microsecond. Its value can be accessed using the functions time:microsecond-time and time:fixnum-microsecond-time. These functions read from the Unibus locations at 0x3E850 (low 16 bits) and at 0x3E852 (high 16 bits) (764120 and 764122 octal).

  • 60-Hz Clock: It is a 16-bit register that increments every 1/60 second. This clock is driven by the AC power line, so the accuracy is poor in the short term, but good in the long term. Its value can be read at Unibus location 0x3E854 (764124 octal). Because the register is not synchronized to the bus clock, it has to be read multiple times, and its value is valid only when two consecutive values are the same.

  • Interval Timer: It is a 16-bit register that decrements every 16 microseconds and generates a Unibus interrupt when it reaches zero. It can be programmed by writing to Unibus location 0x3E854 (764124 octal). Its value cannot be read, but it generates an interrupt when it reaches 0. The interrupt vector number is 274.

  • General Purpose I/O Port: This is a 16-bit parallel bidirectional port. It is not used by the system, but it can be used by the user using %unibus-read and %unibus-write. The 16-bit register for this port is at Unibus location 0x3E856 (764126 octal).

  • Chaosnet: This is a pre-Ethernet local computer network. The chaosnet interface is at Unibus location 0x3E860-0x3E86A (764140-764152 octal), and the interrupt vector number is 270. The IOB contains two DIP switches to set the Chaosnet network address, controlling the subnet number and the host number. The address is formed by concatenating the subnet number (8-bit) with the host number (8-bit). The address can be read from the Unibus location at 0x3E862 (764142 octal).

  • Serial I/O Port: It is a general-purpose synchronous or asynchronous (RS-232) serial data port. It is located at Unibus location 0x3E870-0x3E876 (764160-764166 octal). The serial port uses interrupt vector number 264.

Diagnostic Interface

The Diagnostic Interface is accessible on Unibus. It has 16 locations for reading, and 8 locations for writing. The same address for read and write does not mean they refer to the same thing.

My understanding is that this interface has at least two purposes. One is taking part in the boot process, particularly using the mode register. The second is to facilitate debugging, probably by connecting another computer, a PDP-11, to this interface.

Below, I just summarize what is in the original documents to give a sense of this interface.

Read Locations

These readable locations are also called “Spy Feature”, as it allows one to spy internal machine operations.

LocationDescription
0x3EC00IR[15:0], IR is Instruction Register
0x3EC02IR[31:16]
0x3EC04IR[47:32]
0x3EC06not used
0x3EC08the last OPC Register
(containing the value of PC 8 clocks ago)
0x3EC0APC, PC is Program Counter
0x3EC0COB[15:0], OB is Output Bus
0x3EC0EOB[31:16]
0x3EC10Flag Register 1
0x3EC12Flag Register 2
0x3EC14M-source[15:0], selected by IR
0x3EC16M-source[31:16], selected by IR
0x3EC18A-source[15:0], selected by IR
0x3EC1AA-source[31:16], selected by IR
0x3EC18ST[15:0], ST is Statistics Counter
0x3EC1AST[31:16]
Flag Register 1
BitNameDescription
15~WAIT1 if the machine is running, 0 if it is waiting for the memory
14~V1PE0 if level-2 map had a parity error at the last clock
13~V0PE0 if level-1 map had a parity error at the last clock
12HIGHOK1 if power-supply check is OK
11~STATHALT0 if the machine has been stopped by the statistics counter
10ERR1 if an error condition is present
9SSDONE1 if a single-step has been completed.
8SRUN1 if the machine is trying to run (but it may be stopped)
7~HIGHERR1 if there was HIGHOK at the last clock
6~MEMPE0 if there was a main memory parity error
5~IPE0 if there was a control memory parity error
4~DPE0 if there was a dispatch memory parity error
3~SPE0 if there was an SPC stack parity error
2~PDLPE0 if there was a PDL-buffer parity error
1~MPE0 if there was a M-memory parity error
0~APE0 if there was a A-memory parity error
Flag Register 2
BitNameDescription
15unused
14unused
13VMAPDprevious cycle indicated a write to the map
12DESTSPCDprevious cycle indicated a write to SPC stack
11IWRITED
10IMODD
9PDLWRITEDprevious cycle caused a write into PDL-buffer
8SPUSHDprevious cycle caused a write into SPC stack
7unused
6unused
5IR[48]parity bit of IR
4NOP1 if the current instruction is a NOP
3~VMAOKThe last main memory cycle was failed because the map indicated a page fault
2JCOND1 if the jump-condition is satisfied
1:0PCS1-0Next PC selector
0: SPC Stack
1: IR[25:12], address specified by a JUMP2: DPC[13:0], the dispatch memory
3: IPC[13:0], PC+1
Write Locations

Here are 8 writable locations:

LocationDescription
0x3EC00DEBUG-IR[15:0]
0x3EC02DEBUG-IR[31:16]
0x3EC04DEBUG-IR[47:32]
0x3EC06Clock control register
0x3EC08OPC control register
0x3EC0AMode register
0x3EC0Cnot used
0x3EC0Enot used
Clock Control Register
BitNameDescription
4LDSTATsetting this to 1, then clocking the machine, causes the statistics counter to load from IWR[31:0].
3IDEBUGsetting this to 1 causes IR to load DEBUG-IR instead of the PROM or the instruction memory
2NOP11setting this to 1 forces NOP in this cycle.
1STEPsetting this to 1 when SSDONE is 0 causes the processor clock to run for one cycle and then set SSDONE.
0RUNsetting this to 1 causes the machine to start running
OPC

There are 8 OPC registers that keeps the last 8 PC values. Only the last OPC register that keeps the PC value 8 clocks ago can be read.

OPC control register contains bits that is used to restore the state of the machine from a saved state. Its fields are:

BitNameDescription
2OPCINHsetting this to 1 inhibits the OPCs from being clocked by the processor clock. it is used for loading OPCs by executing JUMPs
1OPCCLKsetting this to 1 and then 0 (thus making a high-low transition) generates a clock to shift OPCs, it is used for reading OPCs
0LPC.HOLDsetting this to 1 prevents LPC register from loading the PC register when the machine is clocked. it is used by IR<25> in DISPATCH instruction.
Mode Register
BitNameDescription
7PROG.BOOTsetting this to 1 starts a bootstrap sequence
6PROG.RESETsetting this to 1 resets the machine
5PROMDISABLE1 disables the PROM. 0 replaces the the first 1K locations of control memory with the PROM.
4TRAPENBsetting this to 1 causes main memory parity errors to cause microcode traps at location 0.
3STATHENBsetting this to 1 enables overflow of the statistics counter to halt the machine
2ERRSTOPsetting this to 1 enables hardware errors (HIGHERR and various parity errors) to halt the machine.
1:0SPEED0: Extra Slow, 1: Slow, 2: Normal, 3: Fast

In addition to SPEED bits, ILONG bit in the instruction affects the speed (by slowing it down by 40 nanoseconds).

Xbus

The main memory, the disk, and the display are connected to Xbus. Xbus is 32-bit wide.

It is possible to install up to 4M words (22-bit) of main memory. It is possible to use 16M words (24-bit) of virtual memory with paging on the disk.

The disk drives can be T-80 80 MB (shown on the diagram) or T-300 300 MB disks (maybe also others, but these are the ones I always saw in various documents). T-300 is a 3600 rpm disk drive using IBM 3336-11 disk packs. As a reminder that we are talking about the 1970s, T-300 weighs 218kg.

Trident T-300 Disk Drive (source: CalComp T-300 Brochure, 1976)

Trident T-300 Disk Drive (source: CalComp T-300 Brochure, 1976)

It is also possible to attach a magnetic tape drive and controller (I guess the controller is a Xbus board).

A bitmap interface board is needed to attach a display. Then either a high resolution 768×900 monochrome or a 512×512×8bpp color monitor can be used. LMI documents also mention an 800×1024 display.

On CADR documents, it is mentioned that the raster memory for display is up to 864×1168 monochrome or 512×512 4-bit pixels.

PROM and Bootstrapping

When CADR LISP machine is powered on, it resets itself, but it does not automatically start. The start/boot can be initiated in various ways (even from the chaos network interface). The most common is to use the keyboard, as the machine recognizes a special set of keyboard commands to start booting, typically left and right control-meta keys.

When boot is initiated, the machine resets and halts. Then it turns on “RUN”, but nothing works because the clock is not running. After starting the clock at the slowest speed, it disables checks and halts, and then enables PROM. PROM is a read-only control memory. The control jumps to location 0 of PROM. The code in PROM:

  • clears all internal memories (and fills the correct parity bits)
  • resets the Unibus
  • enables checks and halts
  • sets the machine speed to normal
  • runs diagnostic checks
  • loads the microcode from the disk (from an MCR partition)
  • loads the initial contents of main memory from the disk (from an LOD partition)
  • transfers control to the microcode at its start address, which is taken from the diagnostic interface

I explain the disk partitions in the next section.

The switch from PROM to instruction memory is done with the PROGDISABLE bit of the Mode register, which is modified with a write on Unibus.

On a real LISP machine, PROM is a real IC, but on the emulator, a PROM dump has to be given. On LM-3 System 100, it is stored as sys/ubin/promh.mcr. Naturally, this should contain microcode (hence the mcr extension). The development of the usim emulator started with the PROM images. Brad Parker wrote: “… Certainly if Al (Kossow) had not sent me the PROM images I would never have started. …” (source: unya/usim repository).

Disk Partitions

PROM starts and boots the LISP Machine by loading microcode and the initial contents of memory from the disk.

The disk controller is said to be Trident compatible, so it supports Trident disk drives. usim emulates a Trident T-300 disk drive.

The disk contains a label in the first block (1 block=256 words=1024 bytes). The label contains information about the disk (number of cylinders, heads, blocks per track etc.) and the partition table. Each partition has a name, start block, size in blocks, partition type and a comment or description. The number of partitions is not fixed (but I only saw one example, so I am not sure if there is any limitation somewhere else). The layout of the disk that comes with LM-3 has 18 partitions named: MCR1 to MCR8 (Microcode), PAGE, LOD1 to LOD8 (Load) and FILE. An MCR partition contains a microcode dump. An LOD partition contains an initial system, a memory dump. So, the boot process is as simple as:

  • restoring the microcode dump from an MCR partition on the disk to the Instruction Memory
  • restoring the memory dump from an LOD partition on the disk to the PAGE partition on the disk (why PAGE, continue reading)

One of the MCR and one of the LOD partitions are used during boot. These are called the current MCR and current LOD partitions and this information is written on the disk label as well.

Other than the MCR and LOD partitions, there is a PAGE partition. This is the virtual memory paging area. This is why the current LOD is copied to the PAGE during boot. An LOD partition is also called a “load band”.

The process above describes cold-booting. In warm-booting, the memory dump stored in the LOD partition is not loaded to PAGE, thus the existing PAGE partition (existing memory content) is retained and used.

After MCR and LOD partitions are copied, the machine starts from (thus LC is set to) whatever %INITIAL-FEF is mapped to. This is normally the function LISP-TOP-LEVEL in SYS:SYS;LTOP.LISP. (Thanks to Alfred M. Szmidt for the clarification)

usim emulator has a diskmaker tool for creating and manipulating T-300 disk images. On LM-3 System 100, it shows:

$ ./usim/diskmaker -p -f disk-sys-100-0.img 
##
output:
disk-sys-100-0.img
##
label:
cyls	815
heads	19
blockspertrack	17
brand	Trident T-300
text	MIT-LISPM-1
comment	System 100 Distribution Tape
mcr	MCR1
lod	LOD2
##
partitions:
MCR1	021	0224		UCADR 323       
MCR2	0245	0224		                
MCR3	0471	0224		                
MCR4	0715	0224		                
MCR5	01141	0224		                
MCR6	01365	0224		                
MCR7	01611	0224		                
MCR8	02035	0224		                
PAGE	02414	0200000		                
LOD1	0202414	057241		cold 3/23/23    
LOD2	0261655	057241		Exp 100.0       
LOD3	0341116	057241		                
LOD4	0420357	057241		                
LOD5	0477620	057241		                
LOD6	0557061	057241		                
LOD7	0636322	057241		                
LOD8	0715563	057241		                
FILE	0775024	05071		            

You can see at the top comment System 100 Distribution Tape, which I think indicates this is restored from a tape. Also, mcr MCR1 and lod LOD2 shows the names of current MCR and LOD partitios that are used at boot. I guess when you get one of these LISP machines, it comes with one or more tapes, and they are restored to a disk first.

Above, it shows that all MCR partitions are 224 blocks and all LOD blocks are 57241 blocks. PAGE is 200000 (200K) blocks. “Each disk block contains one Lisp Machine page worth of data, i.e. 256 words or 1024 bytes” says the comment in usim/disk.c. The virtual memory addresses are 24-bits, so 16M words. The PAGE partition has 200000 blocks, which is 51.2M (200000 * 256) words, more than the maximum addressable virtual memory size.

Instead of using Chaosnet, the FILE partition can also contain a local file system. (Thanks to Alfred M. Szmidt for the clarification)

Indicators

On a real LISP machine, there are 5 octal displays showing the current value of PC, and the decimal points of the display show:

  • PROMENABLE, indicating if the current instruction is from PROM
  • IPE, DPE, TILT0, and TILT1, indicating parity errors in various memories

Price

Since Symbolics Price List from 1980 is available, let’s check how expensive it was:

  • The base Lisp Machine Model 2 (LM-2) with 12K microcode memory was ~80K USD.
  • An 64K word memory module costs ~5K USD.
  • Adding I/O-bus module costs 6K USD. The keyboard, mouse, and chaos net interface cost ~7K USD. Total ~13K$USD.
  • Adding a T-300 disk (300 MB) costs 5K USD for the interface and 30K USD for the disk drive, for a total of 35K USD.
  • Adding a monochrome display costs ~7K USD for the bitmap interface and ~5K USD for the monitor, for a total of 12K USD.
  • LM-2 operating system costs 20K USD.

So, an LM-2 with 1M memory, and 300MB disk costs ~240K USD.

CADR Processor

CADR processor is a general-purpose processor, but its architecture is optimized for interpreting LISP macrocode.

I would say the processor is a von Neumann architecture, as the data and (macro) instructions are fetched over the same bus; however, there are two types of instructions, and they stay in different memories. The microinstructions are what the processor actually executes (in hardware), and they are stored in a separate (16K) instruction memory. The macroinstructions stay in the main memory (up to 4M physical, 16M virtual).

CADR processor:

  • fetches a macroinstruction from the main memory over XBus
  • finds the location of its “micro” implementation within the instruction memory
  • calls this location to execute this microprogram
  • when the microprogram is done, after it returns, it fetches another macroinstruction

It is like the CADR processor has a von Neumann architecture for macroinstructions, but a Harvard architecture for microinstructions.

It can be said that the CADR processor is emulating the macroinstruction with the microinstructions. Today, we can say it could be done (and it is done with the emulator) that the emulation could be done purely in software. However, it is much faster with hardware assistance (particularly in the 1970s and in the 1980s).

I am not sure if it is a fair comparison, but below you can find CADR Processor vs. Motorola 68000. Motorola 68000 was introduced in 1979, and it was quite an advanced general-purpose processor (for example, it was used in Amiga 1000 in 1985). Below is a comparison of basic features, not taking into account that the architecture of CADR is optimized for executing LISP:

FeatureCADRMotorola 68000
Instruction Set16-bit macro
48-bit micro
32-bit
17-bit micro
Data BusXBus 32-bit
Unibus 16-bit
16-bit
Registers1024 32-bit register memory8 32-bit data registers
8 24-bit address registers
Address Bus24-bit virtual
22-bit physical
24-bit
ALU32-bit
plus Hardware Shifter and Masker
16-bit
Instruction Cycletypically 180ns (variable clock)250 ns (4MHz)

A full 32-bit version of 68000, 68020, was launched in 1984.

68000 is a complex instruction set computer (CISC) microprocessor, and it also has a microcode, but the microcode is not visible to the user. Using the microcode, 68000 provides its instruction set.

In early microprocessors (1980s), the microcode stays in a read-only memory (ROM) or programmable logic array (PLA). In a CADR processor, microcode has its own memory and is fully programmable.

Looking at it this way, it can be said, the instruction set of the CADR processor is macroinstructions. That is why I wrote before that it is not strange to say the CADR processor is executing LISP directly.

Features

CADR Processor has the following (uncommon) features:

FeatureCADR
A scratchpad/memory1024 (1K) locations
32-bit wide
M scratchpad/memory32 locations
32-bit wide
PDL/Stack buffer1024 (1K) locations
32-bit wide
SPC memory,
subroutine return stack
32 locations
19-bit wide
PC (Program Counter)14-bit wide
Instruction Memory16K locations
48-bit wide
DPC, dispatch memory2048 (2K) locations
17-bit wide
IR (Instruction Register)48-bit wide microcode
Q register32-bit
LC (Location Counter)26-bit wide
ST (Statistics Counter)32-bit wide

The processor has a 32-bit data path, a 24-bit (virtual) address path, and a 32-bit ALU including a Masker and a Shifter.

PC (Program Counter) points to the next microinstruction in instruction memory. Instruction memory is 16K, and hence PC is 14-bit wide.

LC (Location Counter) points to the next macroinstruction in the main memory. It is byte addressed and supports byte or half-word addressing. In the case of half-word addressing, the last bit is 0. Since the virtual address width of the system is 24-bit, the size of LC is 26-bits.

You might ask, where are the registers? There are no registers similar to our current processors. However, A and M scratchpads are used like registers. In that case, there are a lot of (1024) registers in the CADR processor. M scratchpad is actually a copy of the first 32 locations of A scratchpad. It is like having a dual port memory for the first 32 locations, thus making it possible to use these in more flexible ways.

SPC keeps the PC values and is used as a subroutine return stack. Not similar to modern processors, where the stack is in the main memory, SPC is implemented separately, so its size is fixed. This means only 32 subroutine calls can be nested. The subroutine here is a subroutine of microcode, not a LISP (macro) function. In addition to PC values (14-bit), there are 5 “software” bits. The 14th bit (one of the software bits) is used to mark the end of a macroinstruction execution.

Dispatch memory is quite interesting. It contains 17-bit entries, where each entry is formed as R bit, P bit and N bit, and a 14 bit address for PC.

  • R bit is to ignore PC but pop one from the stack (SPC) (for RETURN operation)
  • P bit is to push PC to the stack (SPC) (for CALL operation).
  • N bit is to inhibit the execution of next microinstruction (more about this later)

If P and R are both zero, then it is like a BRANCH or unconditional JUMP instruction. So, a single entry can encode a RETURN, CALL or unconditional JUMP/BRANCH operation.

If P and R are both one, then DISPATCH is not performed. However, with miscellaneous functions, it is possible to write to the dispatch memory.

PDL buffer is a memory with a hardware “push-down” pointer that improves the performance of stack operations as they are very common in LISP execution. It is basically a hardware stack.

A microinstruction is 48-bit wide; hence IR is 48-bits.

Q register is primarily used to help with multiplication and division.

Statistics Counter (ST) is incremented if the statistics bit (46th bit) of a microinstruction is 1. If ST overflows from -1 (0xFFFFFFFF) to 0, and if STATHENB of the mode register is 1, then the machine stops. Bit 46 is 0 for all PROM microinstructions. Statistics Counting can be used for metering, debugging and breakpointing.

Other than these internal memories, there is no instruction or data cache as in modern processors.

Pipeline

It is not described like this in the original documents, but I think the CADR processor can be considered a 2-stage fully pipelined processor. A microinstruction is fetched at every cycle, and the fetch of the next microinstruction overlaps the execution of the current instruction. Because of this, when a branch is taken, the next instruction becomes invalid, and its execution is inhibited.

C: cycle
F: fetch
E: execute or store

---- time --->
C1 C2 C3 C4 C5
F2 F3 F4 F8 F9
E1 E2 E3 -- E8

At C3, due to E3, a branch is taken (to PC=8), thus E4 is inhibited.

Naturally, the inhibited execution cycle is wasted. The inhibition is signaled by using the N bit in the change of control instructions, where:

  • N bit is stored with the instruction for a conditional JUMP instruction
  • N bit is stored in the dispatch memory entry for a DISPATCH instruction

Clock and Cycle Timing

CADR processor has one clock signal. As far as I understand, the clock is generated and controlled separately for the processor and the bus.

The control signals of the A and M scratchpads, PDL buffer, and SPC stack are asynchronous, but it is not visible to the user. The reason for the pass-arounds in data paths (shown in the next section) is this, and these are also not visible to the user.

I think it can be said that the clock is active high, but both halves (or transitions) are used. The first half is the read phase and the second half is the write phase. Unlike any modern processor, the clock cycle is not fixed. The duration of the first half (when the signal is high) is controlled by ILONG bit of the instruction, so an instruction may indicate it requires more or less time, and by two “speed” bits from the diagnostic interface. The duration of the second half (when the signal is low) is normally fixed.

The clock can be stopped at the end of the read phase, and this is called a wait. A wait happens because:

  • halt is requested from the diagnostic interface
  • single-step is requested from the diagnostic interface
  • an error (such as parity) happened
  • statistics counter is overflowing
  • memory-wait condition is happened

When any of these happen, the processor clock is stopped, but the external clock to the bus interface continues because the bus operations should complete. When the stop condition disappears (e.g., bus operation completes), the processor clock starts synchronously to the external (bus) clock.

The clock stop can also happen at the end of the write phase, and this is called hang. A hang happens only for memory reads. If a memory read is not completed, then a stop happens after the write phase. During the hang, both clocks stop and start synchronously when the hang condition disappears. To be honest, I did not totally understand this yet (why do both clocks stop?), but it is described like this in the original documents.

CADR Cycle Timing (source: MIT AI Memo 528)

CADR Cycle Timing (source: MIT AI Memo 528)

The top diagram in the figure above shows fetch of the next microinstruction and store (or execute) of the current instruction in the same cycle, but in different halves of the clock.

The bottom diagram in the figure above shows the read and write phases and the typical cycle times.

Control and Data Paths

The control paths (things involving PC) of the CADR processor are shown below.

CADR Processor Control Paths (source: MIT AI Memo 528)

CADR Processor Control Paths (source: MIT AI Memo 528)

You can see the dispatch memory (DPC) on top left, the SPC stack on bottom left, and the instruction memory on bottom right.

The data paths of the CADR processor are shown below.

CADR Processor Data Paths (source: MIT AI Memo 528)

CADR Processor Data Paths (source: MIT AI Memo 528)

You can see the A scratchpad (top middle), the M scratchpad (top middle), and the PDL buffer (bottom middle). Both the A and M buses are 32-bits. The A bus is reserved for the A scratchpad, but in addition to the M scratchpad, many things can be on the M bus. (see Microinstructions for more info)

The output of an execution can be directed to (or, in other words, the output bus is connected to) many things. (see Microinstructions for more info)

The output can also be sent to the Q register, but it is selected differently.

Virtual Memory

CADR processor has 22-bit physical address space (2^22=0x400000) and it is organized like this:

Physical Address RangeDevice
3E0000:3FFFFFUnibus
3C0000:3DFFFFXbus I/O
000000:3BFFFFXbus memory

Because the Unibus is 16-bit, only the low 16 bits (15:0) of the 32-bit data bus is used.

The VMA (virtual memory address) and MD (memory data) register are used for all operations on Unibus and Xbus, not only for main memory.

  • VMA register is 24-bit wide (actually 32-bit but 8-bit is ignored)
  • MD register is 32-bit wide

Unibus and Xbus is accessed by a two-level virtual memory system, which maps a 24-bit virtual address into a 22-bit physical address.

  • The first level map has 2048 5-bit wide locations. It is addressed by VMA[23:13] or MD[23:13] (10-bit wide addressing for 2048 locations).

  • The second level map has 1024 14-bit wide locations. It is addressed by the concatenation of first level map output (5-bit) and by VMA[12:8] or MD[12:8] (10-bit wide addressing for 1024 locations).

The virtual address space consists of 2048 blocks (11-bit addressing), and 32 pages (5-bit addressing) in each block, and 256 (8-bit addressing) words in each page (11+5+8=24). The first level finds the block. The second level finds the page. The physical address is the concatenation of the physical page number (14-bit, 16K pages) and VMA[7:0] (8-bit, 256 words/page).

Virtual Address (24-bit)
----------------------------------------------------
|23                  13|12       8|7              0|
----------------------------------------------------
          |                  |            |
          |                  |            |
          | (11-bit)         |            |
          |                  |            |
    +----------------+       |            |
    | LEVEL 1 (2048) |       |            |
    +----------------+       |            | 
          |                  |            | 
          | (5-bit)          | (5-bit)    |
          |                  |            |
        +----------------------+          |
        |    LEVEL 2 (1024)    |          |
        +----------------------+          |
                |                         |
                | (14-bit)                | (8-bit)
                |                         |
              +-----------------------------+
              |  Physical Address (22-bit)  |
              +-----------------------------+

MD can also be used for VMA[23:13] and VMA[12:8].

There are 2048 blocks, 32 pages/block, 256 words/page.

The output of the second level map also includes access (read) and write permissions for the page.

Because the second level map only has 1024 locations (1024 pages=32 blocks), only 32 blocks can be described. So, most of the entries in the first level map should point to a “no information available” entry. This is done by reserving the last 32 locations of the second level map with a “no information available” record. Most of the first level map entries point to these “no information available” records. It is the last 32 entries because the first 5-bit comes from the output of the first level map, it will be independent of the value of VMA[12:8].

Instruction Stream Feature

It is still not clear to me how this exactly works; I will update this section when I learn more.

I took the name of this section, “Instruction Stream Feature”, directly from MIT AI Memo 528.

There is obviously a need to coordinate the “interpretation” of the macroinstructions with the microinstructions. After the boot is completed, the interpretation of the macroinstructions should start.

To fetch a macroinstruction from the main memory, LC divided by 4 (word boundary) is transferred to the VMA and a memory read cycle is started. This returns a word to MD, and the correct half-word is selected by the M-rotate field. LC is incremented by 2 (LISP macroinstructions are half-word) because it always points to the next macroinstruction.

The end of a macroinstruction execution is signaled by a RETURN from the SPC stack when the bit-14 of the SPC entry is set. This initiates a new macroinstruction fetch.

Microinstructions

I minimized the information on this section. For more information, please check the original MIT CADR documents or usim/ucode.c source code.

CADR processor has only 4 microinstructions (2-bit opcode field). They are more like instruction groups that can do different things depending on the bit fields. The four opcodes are:

  • Opcode 0: ALU. Destination receives the result of a boolean or arithmetic operation.

Example: A_memory[1] = A_memory[1] + M_memory[2]

  • Opcode 1: (CONDITIONAL) JUMP. Transfer of control conditional on different things.

Example: If (A_memory[1] > M_memory[2]) jump PC+10

  • Opcode 2: DISPATCH. Transfer of control based on an entry in dispatch memory, addressed by 7-bits of a byte in the M bus.

Example: Call subroutine at Dispatch_Memory[M_memory[0] + 10]

  • Opcode 3: BYTE. Destination receives the result of a byte extraction, byte deposit, or selective field substitution from one source to another. This works by a rotation and a mask combination.

Example: A_memory[1] = (A_memory[1] & 0xFFFFFF00) | (M_memory[2] & 0x000000FF)

The format of these instructions is very regular.

All instructions have these fields:

  • an odd parity bit
  • statistics bit; when set, the statistics counter is incremented
  • ILONG bit; indicates a longer write phase for the clock
  • 2 bits Opcode (0=ALU, 1=JUMP, 2=DISPATCH, 3=BYTE)
  • a bit to indicate POPJ transfer
  • (source 1) A bus select: A memory address
  • (source 2) M bus select: M memory address or a functional source
  • 2 bits miscellaneous function select
  • destination can be A memory or a functional destination

Basically, it is similar to this:

   +---------------------------------------+
   |                                       |
   |                       +-----------+   |
A memory --------------A-->|           |---+
                           | Micro     |   |
M memory -----------+      | Operation |   |
                    |--M-->|           |   |
Functional source --+      +-----------+   |
   or destination                          |
   |                                       |
   +---------------------------------------|

(as a reminder, M memory is actually the first 32 locations of A memory)

The functional source on the M bus can be many things:

  • 0x00: Dispatch constant (10 bits)
  • 0x01: SPC pointer (5 bits), SPC data (19 bits)
  • 0x02: PDL pointer (10 bits)
  • 0x03: PDL index (10 bits)
  • 0x05: PDL buffer, addressed by index
  • 0x06: OPC registers (14 bits)
  • 0x07: Q register
  • 0x0A: VMA (virtual memory address) register
  • 0x0B: Virtual Memory Map[MD]
  • 0x0C: MD (memory data) register
  • 0x0D: LC (location counter)
  • 0x0E: SPC pointer and data, pop
  • 0x14: PDL buffer, addressed by pointer, pop
  • 0x15: PDL buffer, addressed by pointer

Functional destination can be many things:

  • 0x00: None
  • 0x01: LC (location counter)
  • 0x02: Interrupt Control on bits 29:26
  • 0x08: PDL, addressed by pointer
  • 0x09: PDL, addressed by pointer, push
  • 0x0A: PDL, addressed by index
  • 0x0B: PDL index
  • 0x0C: PDL pointer
  • 0x0D: SPC data, push
  • 0x0E: Next instruction modifier (OA register), bits 25:0
  • 0x0F: Next instruction modifier (OA register), bits 47:26
  • 0x10: VMA (virtual memory address) register
  • 0x11: VMA register, start main memory read
  • 0x12: VMA register, start main memory write
  • 0x13: VMA register, write MAP*
  • 0x18: MD (memory data) register
  • 0x19: MD register, start main memory read
  • 0x1A: MD register, start main memory write
  • 0x1B: MD register, write to Virtual Memory Map

Table below shows the normal ALU operations:

CADR Processor ALU Codes (source: MIT AI Memo 528)

CADR Processor ALU Codes (source: MIT AI Memo 528)

It is also possible to make: multiply, divide, remainder correction, and initial divide steps with conditional ALU operation codes.

BYTE instruction also has these fields:

  • a bit to indicate Mask rotate
  • a bit to indicate Source rotate
  • fields to indicate length and rotation count

BYTE instruction operates on a single byte of A and M sources. M source is taken as it is, optionally rotated to produce R. A selector mask is produced based on the rotation count and the length. This mask is used to merge the A source with R bit by bit (if mask bit=0, use A, otherwise, use R). The result of the merge operation is written back to the destination. The selector mask basically has ones starting from bit number rotation count for length+1 bits.

LISP Macrocode

Before jumping to macroinstructions, it will be useful to explain how LISP programs and LISP data are represented.

Program Representation

A program in the LISP machine can have one of the three representations:

  • Interpreted LISP is the same as textual LISP code input. However, this is the slowest form of execution, so it is only used for debugging etc.

  • Compiled LISP is the macrocode representation of LISP programs. It still contains the checks of interpreted code, so it is usually normal to compile every program after typing in the editor.

  • Microcode is the other representation. These are the microprograms that execute macroinstructions or programs that perform low-level system tasks. Some of these cannot be done in LISP because they are very low-level. Some can be done in LISP, but they are hand tuned at the microcode level to have the best performance.

It is possible to “microcompile” a LISP function to a microcode to have the best performance if needed. However, instruction memory holding the microcode is limited (16K), so only a few microcompiled functions can be loaded at a time.

Data Representation

This section is incomplete. It is only meant to give a sense of how data is represented in LISP.

LISP objects are represented by a 5-bit data type field (32 possible data types) and a 24-bit pointer. 24-bit is because CADR has 24-bit virtual addresses. Most of the 32 data types are defined, and they have names like DTP-LIST, DTP-SYMBOL, DTP-FIXNUM etc.

A symbol is stored in four consecutive words. These four words are:

  • PRINT NAME cell which holds a string object
  • VALUE cell which contains the current value of the symbol
  • FUNCTION cell which contains a function to be evaluated when needed
  • PROPERTY LIST cell which contains the property list

The data type of symbol object is DTP-SYMBOL.

A list object has a data type DTP-LIST, and normally, it points to a two-word structure. The first word contains the CAR (first element), and the second contains the CDR (a list containing the rest of the elements). Since a LISP object is only 29 bits (5-bits data type and 24-bits address), two of the remaining bits (because the word is 32-bits) are used to optimize storage and to form a compressed list representation. These two bits are called CDR-CODE and can have the following values:

  • CDR-NORMAL: indicates the two-word use as described above, the two-word structure keeps CAR and CDR.
  • CDR-NEXT and CDR-NIL: indicate the list is stored as a vector, where the first word is the element (the value), and the second word points to the next two-word (when CDR-CODE=CDR-NEXT) or marks that this is the last element (CDR-CODE=CDR-NIL).
  • CDR-ERROR: indicates the address should not be in the list.

Normally, the compressed representation is used unless RPLACD is called on a list.

There are three data types for number objects:

  • DTP-FIX: represents 24-bit signed integers. The pointer part of the object contains the actual value.
  • DTP-EXTENDED-NUMBER: represents other types of numbers such as floating point, arbitrary precision integers, complex numbers etc. The pointer points to the actual storage of the number.
  • DTP-PDL-NUMBER: similar to DTP-EXTENDED-NUMBER but these stay only on the PDL buffer.

There is also a “locative pointer” data type, which is an actual pointer to a memory location. This is used for low-level system programming needs.

LISP Macrocompiler

The LISP macrocompiler, called QCMP, compiles a LISP function to macrocode. This “macrocompilation” process is so simple and straightforward that it is not necessary to save the compilation result. It can be used like a just-in-time compiler.

It is also unnecessary to write macrocode manually because it brings no benefits. Macrocode is well-matched to LISP, so instead of writing macrocode manually, the actual LISP function should be written and compiled.

When a function is macrocompiled by QCMP, a Function Entry Frame (FEF) is produced, and the compiled function is represented by a LISP symbol object whose FUNCTION cell contains a pointer to FEF, which is an object of type DTP-FEF-POINTER.

FEF contains:

  • a header; which is always present, 7 words long
  • pointers to external objects needed by the function: pointers to VALUE cells and FUNCTION cells
  • description of arguments: the Argument Description List (ADL) which contains one entry to each argument.
  • actual macroinstructions, the macrocode of the LISP function

Macroinstructions

This section is incomplete and is provided only to give a sense of macroinstructions. However, it is also not yet in a state that I am happy with; I am still working on it. For more information, please check the original MIT CADR documents or usim/ucode.c source code.

Each macroinstruction is 16-bits. LC points to macrocode in main memory (like PC points to microcode in instruction memory), and LC is incremented by 2 after each fetch because LC has byte-addressing not word-addressing.

Macrocode is executed on a stack-oriented machine. The stack (do not confuse this with SPC or other things within the processor related to the execution of microinstructions) is organized as frames. Each frame contains the arguments, the local variables, a stack for intermediate results, and various other information. The current frame is always in the PDL buffer, so there is no need to access main memory for the arguments and the local variables. Other frames can also be kept in the PDL buffer for performance.

Macroinstructions are grouped into four classes (plus a reserved class).

This is the basic form of macroinstructions:

------------------------------------
|15  13|12     9|8                0|
------------------------------------
         OPCODE

If macroinstruction[12:9]/OPCODE field is:

  • 0x9, 0xA or 0xB: Class II
  • 0xC: Class III
  • 0xD: Class IV
  • 0xE or 0xF: Class V - Reserved
  • otherwise: Class I

Class I - Data Transmission and Function Calling

-------------------------------------
|  3   |   4    |  3   |     6      |
-------------------------------------
  DEST   OPCODE REGISTER   OFFSET

SOURCE, or effective address E, is calculated according to what REGISTER field says and OFFSET is added to it.

REGISTER can be:

REGISTERFUNCTIONDESCRIPTION
0FEFstarting location of FEF
1FEF+100
2FEF+200
3FEF+300
4CONSTANTS PAGEpage of widely used constants, such as T, NIL, small numbers etc. these are not repeated in each FEF and shared by all FEFs
5LOCAL BLOCKaddress of the local block on the PDL, used for addressing local variables
6ARG POINTERargument pointer into the PDL, used for addressing the arguments
7PDLoffset must be 77, the top of the stack is popped and used

DESTINATION can be:

DESTFUNCTIONDESCRIPTION
0IGNOREresult is discarded, but flags are set
1TO STACKpush result to the stack
2TO NEXT
3TO LAST
4TO RETURNreturn the result as the value of this function (return from subroutine)
5TO NEXT, QUOTED
6TO LAST, QUOTED
7TO NEXT LIST

There are 9 macroinstructions in Class I:

OPCODEINSTRUCTIONDESCRIPTION
0CALLcall function, return value goes to DESTINATION
1CALL0call a function with no arguments
2MOVEmove the contents at effective address E to the DESTINATION
3CARput the CAR of the contents at effective address E to the DESTINATION
4CDRput the CAR of the contents at effective address E to the DESTINATION
5CADRput the CAR of the contents at effective address E to the DESTINATION
6CDDRput the CAR of the contents at effective address E to the DESTINATION
7CDARput the CAR of the contents at effective address E to the DESTINATION
8CAARput the CAR of the contents at effective address E to the DESTINATION

Class II - Arithmetic, Boolean, Comparison and Binding Operations

------------------------------------
|      7       |  3   |     6      |
------------------------------------
     OPCODE    REGISTER   OFFSET

Since there is no DESTINATION in these macroinstructions, the result is either pushed to the stack (like TO STACK or TO NEXT destination) or it is stored at the effective address. The calculation of effective address is the same as in Class I instructions.

Class II macroinstructions are further grouped into three subgroups. The subgroup number is encoded in OPCODE[2:0].

In the below tables, C(E) denotes the content at effective address E.

Subgroup 0x9 (11 octal) contains:

OPCODE[6:3]INSTRUCTIONDESCRIPTION
0not used
1+stack.push(stack.pop() + C(E))
2-stack.push(stack.pop() - C(E))
3*stack.push(stack.pop() * C(E))
4/stack.push(stack.pop() / C(E))
5ANDstack.push(stack.pop() bitwise-AND C(E))
6XORstack.push(stack.pop() bitwise-XOR C(E))
7ORstack.push(stack.pop() bitwise-OR C(E))

Subgroup 0xA (12 octal) contains these.

For the compare instructions below, C(E) is compared to top of the stack according to operation, and if the condition is true, NIL indicator is cleared, otherwise it is set. The stack is popped.

OPCODE[6:3]INSTRUCTION
0=
1>
2<
3EQ

| OPCODE[6:3] | INSTRUCTION | DESCRIPTION |
| 4 | SCDR | stores CDR of C(E) to E |
| 5 | SCDDR | stores CDR of CDR of C(E) to E |
| 6 | 1+ | stack.push(stack.pop() + 1) |
| 7 | 1- | stack.push(stack.pop() - 1) |

Subgroup 0xB (13 octal) contains:

OPCODE[6:3]INSTRUCTIONDESCRIPTION
0BINDThe cell at E is bound to itself.
1BINDNILThe cell at E is bound to NIL.
2BINDPOPThe cell at E is bound to a value popped off the stack.
3SETNILStore NIL in E.
4SETZEROStore fixnum 0 in E.
5PUSH-EPush a locative pointer to E on the stack.
6MOVEMMove the data on top of the stack to E.
7POPPop the top of the stack into E.

Class III - Branching

------------------------------------
|  3   |   4    |        9         |  
------------------------------------
 BRANCH  (14)          OFFSET
  CODE

* second field always contains 0xC (octal 14)

Class III macroinstructions are:

BRANCH CODE [2:0]INSTRUCTIONDESCRIPTION
0ALWAYSAlways branch
1NILINDBranch if the NIL indicator is set, else drop through
2NOT NILINDBranch if the NIL indicator is not set, else drop through
3NILIND ELSE POPsame as NOT NILIND but if fails stack is popped
4NOT NILIND ELSE POPsame as NOT NILIND ELSE POP but if fails stack is popped
5ATOMINDBranch if the ATOM indicator is set, else drop through
6NOT ATOMINDBranch if the ATOM indicator is not set, else drop throught

If branch is to be taken, offset is considered as a signed offset and it is added to the PC. However, if the offset is 777, it is considered a long branch and the real offset is in the next halfword (16-bit). The PC added to is always the incremented PC, +1 in short branch and +2 in the long branch.

Class IV - Miscellaneous Functions

------------------------------------
|  3   |   4    |        9         |  
------------------------------------
  DEST   (15)         OPCODE

* second field always contains 0xD (octal 15)

DESTINATION field is used as same as Class I macroinstructions.

Class V - Reserved

The instructions containing octal 16 and 17 (0xE and 0xF) in the second field of macroinstruction format as in Class III and Class IV are reserved.

Examples

These examples are to be run on CADR emulation.

Disassembling a LISP function to Macroinstructions

Disassembling is a strange word to use for this purpose, but what disassemble does is to macrocompile macrocompiling a LISP function, then disassemble the macrocompiled function, and display the macroinstructions.

A simple example:

(disassemble (lambda (x y) (+ x y)))
Definition of closed-over function
 14 MOVE D-PDL ARG|0          ;X
 15 + ARG|1                   ;Y
 16 MOVE D-RETURN PDL-POP

Line by line:

14 MOVE D-PDL ARG|0STACK.push(argument_0=X)
15 + ARG|1STACK.push((add STACK.POP() argument_1=Y))
16 MOVE D-RETURN PDL-POPRETURN STACK.POP()
Compiling a LISP function

To macrocompile a LISP function, compile can be used.

The representation of a LISP function can be checked with describe function. For a macrocompiled function (Compiled LISP), it returns the function definition as a DTP-FEF-POINTER. For a microcompiled function (Microcode), it returns the function definition as a DTP-U-ENTRY. The function definition of an interpreted LISP function is LAMBDA or NAMED-LAMBDA.

See the example below:

Interpreted, Compiled and Microcoded function descriptions

Interpreted, Compiled and Microcoded function descriptions

CADR Emulator

usim

The CADR software emulation is called usim. It was developed by Brad Parker between 2004 and 2006, and MIT gave permission and licensed the distribution of “recovered” CADR system software, including the microcode. I wrote recovered because the software was in old (backup) tapes, and the data in these old tapes had to be recovered.

An incomplete but valuable table shows the release history of actual CADR LISP Machine system software versions (including the microcode version). The latest version, dated 19 September 1983, and a release message from Richard Stallman, is System 97 (97.6) which requires microcode 257.

Note that the microcode version number was wrapped in ~1982; a smaller number does not mean it was released earlier.

You might wonder why the microcode version is also mentioned. The microcode has to be compatible with the system software. Other than the macroinstruction interpretation functions, there are low-level system-related functions within the microcode. A minor change in either microcode or system software might break the backward compatibility.

It seems Brad Parker’s active usim development ended in 2006, with v0.9 (2006-07-08) being the latest version. I am not sure what happened exactly after this, but looking at the changes, it seems Brad Parker made a minor fix in 2011, then a few different people made other changes. It seems like the latest codebase is at tumbleweed.nu as usim, and it is maintained by Alfred M. Szmidt who has been making changes since 2017.

History of the Restoration

(Thanks to Alfred M. Szmidt for the clarifications.)

First, LISP machine backup tape of System (version) 46 (and microcode version 694) was recovered. These tapes were found in Daniel Weinreb’s basement:

Lisp Machine System Software
Reel 1,2,3,4 10/17/81	
9 track - 800bpi	
ITS dump format	

source: http://www.unlambda.com/mit/README

They are probably 2400ft tapes, since the tpc formatted extract of these tapes are over 10MB but less than 20MB. These files contain: system version 46, prom version 9 and microcode version 694.

However, there was no load band found for these tapes, so it cannot be used to boot a system. It contained the system source code and the schematics.

After some time, the tapes of System 78 are found also in Daniel Weinreb’s storage. This was the first loadable system. For a long time, this was the only system used with usim.

tape10i  2400  Distribution - 78.48     (source code for 78.48)
		       ZMail 38.5 Symbolics 8.13
		       Tape 6.5 LMFS 21.34
		       Microcode 841 1/11/82  BEE
		       "(added in pen - ""BROKEN!!"")"

tape11  2400  System 78.48 Band         (LOD1 band for 78.48)
		      1/12/82  BEE
		      "(in pencil - ""couldn't read 1/21/82  Roberts)"

source: https://tumbleweed.nu/r/attic/sys78/index

After a period of no development in this area, Alfred M. Szmidt found the tapes for a complete System 98, which could be used to boot a system. He then found an incomplete System 99 (TID/671 from MIT’s Tapes of Tech Square project), and he released System 100 based on System 99 after fixing a lot things. To be historically accurate, this is not a version a real CADR LISP Machine has ever run.

LM-3

LM-3 — resurrecting the MIT CADR is the project containing not only usim but also everything else such as the system software, microcode, PROM etc. to run an emulated MIT CADR LISP machine.

As mentioned above, LM-3 System 100 (microcode 323) is the latest version.

LM-3 System 100 release contains the following:

  • usim: CADR emulator source code
  • chaos: Chaosnet for Unix source code
  • sys: system software, PROM dump as well as PROM and microcode symbol files
  • disk-sys-100-0.img: disk image containing microcode MCR and load band LOD partitions
  • usim-100-0.ini: sample configuration to use with usim

chaos is used when building usim. With the sample configuration, Chaosnet is run in “local” mode, and sys is mounted as tree on the LISP machine. PROM (microcode) file, PROM symbols, and microcode symbols are also in sys folder and used by usim.

Running usim with LM-3 is as simple as:

  • download LM-3 System 100 release
  • extract the downloaded file: tar zxvf usim-2023-03-24.tar.gz
  • change the current directory into the usim source directory: cd usim-2023-03-24/usim
  • build the CADR emulator: run make, this will build usim, diskmaker and readmcr executables
  • change directory to parent: cd ..
  • run usim with the supplied configuration: ./usim/usim -c usim-100-0.ini

On Ubuntu 22.04, build-essential and libx11-dev packages are needed to build usim. It is also possible to use the system software and disk image from LM-3 System 100 but use the latest source code of usim (and chaos) from their repositories.

It is useful, or possibly a must, to check the Operating the Lisp Machine document to get a basic idea of this computer and how to use it.

Configuring usim

It is possible to build and run usim on the Windows Subsystem for Linux (WSL) running Ubuntu 22.04. However, it is difficult to control the keyboard mapping properly. So, I prefer using usim on a virtual machine.

There are two issues that you need to be aware of and address before being able to properly use the emulator.

4K/HiDPI Display

I have a 4K display, and I usually use it at 200% scale. usim emulates the CADR TV control and display system with a 768x963 bitmap and displays this bitmap inside an X window. I think because it is a raw X window, it is not automatically scaled even if 200% scale is used, and it is pretty impossible to use it like this on a 4K display because the text is too small. The simplest solution is to use a lower-resolution monitor. I am using a 1920x1080 monitor for the virtual machine. Since the bitmap height is 963px and usim window has a title etc., it works OK with a 1080px display height.

Keyboard Mapping

When using usim, a keyboard mapping is required because LISP machines use a special keyboard with many specially labeled function keys and additional modifier keys. There is a default mapping described below that can be further customized.

When you get serious about using the emulator, I highly recommend using a second keyboard, attaching this keyboard directly to the virtual machine, and putting the space-cadet labels on the keys. This is the keyboard I am using:

My CADR keyboard with space-cadet labels

My CADR keyboard with space-cadet labels

The default mapping in usim source code is below:

X11 ModifierCadet Key
ShiftShift
LockShift-Lock
ControlControl
Mod1Meta
Mod4Top
X11 SymbolCadet Key
Escapealtmode
F1system
F2network
F3status
F4terminal
F5help
F6clear
F7break
Inserthelp
Homebreak
EndEnd
Page_UpAbort
Page_DownResume
Lefthand_left
Righthand_right
Uphand_up
Downhand_down
Returnreturn
Tabtab

The character keys A-Z, 0-9, punctuation marks, and other symbols are mapped to the same ones by default.

The supplied ini file configures the keyboard as cadet and configures only two keys: F7 as integral and Mod4 as Super.

I use a different mapping, and this is in my ini file:

[kbd]
Escape = line
F1 = system
F2 = network
F3 = terminal
F4 = null
F5 = call
F6 = null # top
F7 = null # greek
F8 = null # hyper
F9 = status
F10= null
F11= null
F12=clear_screen
Scroll_Lock = hold_output
Pause       = suspend
BackSpace   = rubout
Insert      = help
Delete      = delete
Home        = clear_input
End         = end
Page_Up     = abort
Page_Down   = resume

[kbd.modifiers]
Shift   = Shift
Lock    = CapsLock
Control = Control
Mod1 = Meta
Mod2 = Greek
Mod3 = Hyper
Mod4 = Super
Mod5 = Top

The entries under kbd are in the form of X keysym = LM key. LM means lisp machine.

  • X keysym names are defined in the X11/keysymdef.h file in the system (e.g., /usr/include). For example, #define XK_BackSpace means there is a X keysym named BackSpace. The easiest way to check this is to use the xev utility and press the key.

  • LM key names are defined in the lmch.defs file. For example, X(suspend, 0201) in lmch.defs means there is an LM key named suspend (which sends scan code octal 201 in the original keyboard). The function keys of (space-)cadet keyboard are these (ordered by the code in lmch.defs, between 201-236): break, clear_input, call, terminal, macro, help, rubout, overstrike, tab, line, delete, clear_screen, return, quote, hold_output, stop_output, abort, resume, status, end, roman_i, roman_ii, roman_iii, roman_iv, hand_up, hand_down, hand_left, hand_right, system, network.

These keys are a must to use the LISP machine:

  • help, rubout, abort, resume, end, system

It is not possible to use the CADR LISP machine without these keys (without a major hassle at least). You will probably also need:

  • terminal, clear_input, clear_screen, call, status

I also assign line to Escape; this goes to the next line (like pressing Return) and aligns the cursor position properly (like using Tab).

You most likely do not need these special keys unless you already know what you are doing:

  • quote and stop_output are not used by the system software. Maybe also hold_output, but I am not sure.
  • overstrike moves the cursor back without deleting.
  • delete is not exactly specified. What we now know as delete is called rubout.

Some function keys are reserved for user assignment, the system does not use them. Until you have a use for them, there is no need to map these keys:

  • hand_up, hand_down, hand_left, hand_right, roman_i, roman_ii, roman_iii, roman_iv

The entries under kbd.modifiers are in the form of X modifier = LM modifier. LM means lisp machine.

  • X modifier can be: Shift, Lock, Control, Mod1, Mod2, Mod3, Mod4, Mod5

  • LM modifier can be: Shift, Top, Control, Meta, ShiftLock or CapsLock, ModeLock, Greek, Repeat, AltLock, Hyper, Super.

LISP machine system software does not distinguish between the left and right keys of the same modifier (Left-Control vs. Right-Control) except for warm-booting and cold-booting, but this is not used in the emulator.

ModeLock, CapsLock and AltLock keep their state. However, ModeLock and AltLock are not used by the system software.

At a minimum, you have to learn the locations of these modifiers: Shift, Control, Meta and Super.

Similar to the function keys mentioned above, Hyper modifier is also reserved for the user and not used by the system.

In addition to Shift, Top and Greek (also called Front) keys also modify the character keys to produce different characters. These are printed on top and on front of the keys (see the Symbolics keyboard image), hence their name. Greek is used for printing Greek or mathematical symbols, hence the name.

If you want to assign a normal key (such as F7) as an LM modifier (such as Greek), you have to first use xmodmap to assign F7 as a Mod (such as Mod2), and then assign Mod2 to Greek. It is not possible to assign F7 directly as Greek with usim. I use the following xmodmap file to set modifiers:

clear Mod1
clear Mod2
clear Mod3
clear Mod4
clear Mod5
add Mod1 = Alt_L Alt_R
add Mod2 = F7
add Mod3 = F8
add Mod4 = Super_L Super_R
add Mod5 = F6

The current modifier mapping of Linux can be displayed with xmodmap -pm.

Logging

To see a bit more log messages, I have these in my ini:

[trace]
level=info
facilities=x11 kbd usim

The defaults of trace section is level=notice and facilities=usim.

  • level can be one of these: emerg, alert, crit, err, warning, notice, info, debug.
  • facilities can be one of these: all, none, usim, ucode, microcode, macrocode, int, vm, unibus, xbus, iob, kbd, tv, mouse, disk, chaos, x11, spy, lashup, misc.

Be aware that debug or even info level and certain facilities can output a lot of messages.

Testing usim

When you are done with the config file, you can run the emulator with usim -c usim.ini. You should see a window like this:

usim CADR emulator with LM-3 System 100

usim CADR emulator with LM-3 System 100

To halt the LISP machine, you can Ctrl-C the usim process on the terminal or use (sys:%halt) at LISP listener.

You can also try running a demo to check the installation by typing (hacks:crock) in LISP listener.

Crock demo

Crock demo

References