Anecdotes

This is a collection of (sometimes) mildly amusing anecdotes. You may already have tired of hearing these from me, or they may be new to you. They are all true. Some have technical content, some don't. This page will grow over time as my memory manages to retrieve archived information.

Technical stuff

These have rather more technical content than the ones in the following section.

The phantom job

In my first year as an undergraduate, we were taught BASIC - it was the only language available online, as opposed to in 'batch' mode via punched cards and printer (I ended up teaching that course myself some years later).

After a few weeks of BASIC, I decided to learn assembly language for the mainframe - an ICL 4130, which was a 24 bit word oriented machine with 96kW of memory. In practice, the best type of target program was a 'subsystem' for KOS, the online facility. Users had only 1536 words of working memory (excluding the code itself), and about 600 of those were used for the I/O system and other services. Nevertheless, I wrote a simple linear regression program (mirroring the one we had done in BASIC for an assessment).

Running a subsystem was something for which one had to get permission, as a fault could stop the online system (although a simple operator command would continue it from the stop point). Once I had proved myself, I was allowed to write more programs.

However, I (and others) became increasingly ambitious, and did some naughty things. Fairly early on, I 'acquired' a copy of the (assembler) source code for the online system, KOS. My first venture into the 'naughty' area was to write a program that would simulate the effect of Ctrl-C (well, its equivalent in those days) on a specified terminal. I was able to do this by modifying a status bit in the terminal multiplexer device driver (I had previously found a way to write anywhere in the entire machine's memory, subverting the memory management). Later, I discovered that I could remotely log someone out as well.

Various other programs followed, and a few of us decided it would be a good idea to be able to submit batch work to the batch queue via a terminal, instead of on punched cards. One could then use other languages such as Algol and FORTRAN, which were not available in the on-line system.

This wasn't too difficult:

Load a program that inserted itself between the executive card reader driver and the batch system.
Fulfil requests for 'next card' by getting one from the card reader driver and passing it on - until an end of job card was detected.
Pass on the end of job card, but follow it with card images taken from a previously prepared file.
At the end of that file, reconnect the batch system to the card reader driver and exit the program.
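
For the curious, the splice can be sketched in a few lines of modern Python. This is only an illustration of the four steps above, not the original assembler. The END_OF_JOB marker text, the function names and the sample cards are all invented for the example.

```python
from typing import Callable, Iterator, List

END_OF_JOB = "END OF JOB"  # hypothetical marker; the real card format is not given


def interposed_reader(
    real_next_card: Callable[[], str],
    phantom_cards: List[str],
) -> Iterator[str]:
    """Yield cards to the batch system, splicing in a phantom job.

    Real cards pass straight through until the first end-of-job card.
    That card is passed on too, followed by the prepared card images.
    """
    while True:
        card = real_next_card()
        yield card
        if card == END_OF_JOB:
            break
    # Feed the phantom job from the previously prepared file.
    yield from phantom_cards
    # The generator ends here; at this point the batch system would be
    # reconnected directly to the real card reader driver.


# A fake card reader holding one real job followed by another.
deck = iter(["JOB A", "DATA 1", END_OF_JOB, "JOB B", END_OF_JOB])
phantom = ["JOB PHANTOM", "ALGOL SOURCE", END_OF_JOB]

seen = list(interposed_reader(lambda: next(deck), phantom))
remaining = list(deck)  # cards left for the real reader after the splice
```

In this sketch the batch system sees job A, then the phantom job, while job B is still waiting in the reader, untouched.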

It worked perfectly. Except ... when the computer operators came to collect up all the punched cards for the completed jobs, and reconcile them with their associated printouts ... there were no cards for one lot of printout! They spent some time looking for them, and to avoid further suspicion, action was taken. Cards were quickly typed for the 'phantom' job, and dropped down the back of the table that held the trays of returned cards. Then: "Oh look, what's that down there?".

We got away with that one. Over time we encountered restrictions on running subsystems, and had to write a loader. That is another story.

Loading programs the BASIC way

This follows on from the story above, and it's about the loader. This was a joint effort between (I think) two or three of us.

Writing subsystems (see above) was great fun. They could only be done in assembler, and then they had to be put in a place where they could be loaded. The only place the system was set up to find them was a particular area on 'disc 2'. We could copy our programs there, and they could then be loaded simply by typing their name.

We got up to various antics with subsystems (see above), which eventually resulted in special restrictions being imposed on where one could copy files. Disc 2 was off limits; it wasn't technically possible to put files there except from a batch job (too traceable) or the operator's console. So we were stopped in our tracks.

But not for long. This is where BASIC comes in. The BASIC system had been written locally (as had the entire online system) but of course it was rather limited by the available space for each user (a bit less than 1000 (24 bit) words each). The BASIC system lived in a 'common program' memory area which was separate from that, as indeed did all subsystems.

Because of the space limitations, one could generate a 'BASIC library'. Bear in mind that BASIC itself compiled to real code. One could write a load of BASIC, using high line numbers. A special command generated a text file as output; this file consisted of a stream of octal numbers representing compiled code. This could be fed to the assembler (it was syntactically valid for that) and turned into a callable library, which could be put somewhere that a BASIC user could access it. It was then possible to call up that library, and use the subroutines in it from one's own BASIC programs, the library code being loaded into the common program area, not using up the user's own space. You just had to know the line numbers for the subroutines in the library, as they weren't visible when listing your BASIC program.

So, the hack was this. We wrote a few lines of BASIC based at (say) line 10000. Then we generated the source for the library, as usual. At this point, said source was edited to add rather more assembler - in fact, a program loader. The 'library' was then assembled and put in a suitable place on disc.

Using this was simple. One merely ran BASIC, and loaded the hacked library. Then a command such as:

GOTO 10000

was issued. That entered and ran the loader, which prompted with:

ENTER PROGRAM TO LOAD:

The name of your subsystem (e.g. the one to pre-empt the card reader) was then typed. Hit the carriage return key, and you'd be in your program.

I don't think we ever got caught with that. The main perpetrators were me and a guy who ended up being head of software - the job of the guy we were hiding it all from at the time. And I ended up managing the next mainframe.

A postscript to my exploits

This happened nearly 30 years after the two exploits detailed above. I was hosting a retirement dinner for several staff, all of whom had been around since I was an undergraduate. One of the guests was the former system software manager, who had been one of the people who nearly caught us. I am pretty sure he knew who was responsible, even if he couldn't prove it.

Of course, we got chatting (he had been my boss when I had a part time job), and he asked what I was doing now. I replied that I had been teaching there for many years. He asked what I taught, and I said my primary interest was operating systems.

His droll reply was "Why am I not surprised?"

Hacking the CPU

This isn't really an amusing one, but people seem interested, so here it is.

I studied Electronics as an undergraduate. The degree programme wasn't quite what I expected (it leaned more towards pure electronics than electronic engineering), and I found it had rather too much physics in it, and not enough emphasis on the practical stuff; I think the problem was the inbuilt assumption that I'd have been playing with stuff (and designing it) for quite a while before starting the degree.

Two things happened. One was that I soon became interested in all things related to computers, and the second was that I found I could (sort of) do digital electronics OK (much easier!). I learned a lot of stuff about various systems, in particular our mainframe (see previous item), which ran VMs-within-VMs, effectively.

We had to attend a compulsory Long Vacation course at the end of our second year, doing something useful at university for three weeks over the summer. I was involved in designing and building a plotter interface, which was a spectacular disaster but did teach me what not to do.

My final year project was unusual in that it was a joint one with another student; he ended up doing the actual building part (including PCB design), and I did the logic design and the software (hardware tests). The project was to modify the CPU of a Honeywell DDP-516 minicomputer, which was fitted with something called a memory lockout option (MLO).

I had realised that the MLO provided insufficient facilities and controls to provide a proper virtual machine (as a hardware emulation). It did provide something called restricted mode, which stopped certain instructions (e.g. CPU halt) from being executed. The project was intended to fix this. It was made easier on a practical level by the fact that this was a set of wire wrapped backplanes containing multiple, rather small, circuit boards. There were some spare, uncommitted board slots, which were wired and used for this.

This is not the place to go into a lot of detail, but essentially the modifications were:

In restricted mode, some instructions were silently treated as a no-op (NOP) instruction, rather than causing an interrupt to the operating system. This meant that instruction emulation inside a virtual machine was impossible. This was fixed by the addition of extra logic to cause an interrupt in this situation.
It was also not possible to emulate restricted mode inside a virtual machine, because the instruction to enter restricted mode (ERM) was treated as a no-op if already in restricted mode. This was fixed by the addition of logic to force an interrupt.
This wasn't strictly necessary but made life easier. When an interrupt occurred, it wasn't possible to tell if the interrupt had come from restricted mode, or from normal mode. A latch was added to preserve the previous mode. There was an instruction to save machine flags, etc. on an interrupt; it was, strangely, called 'input keys' (INK). It simply copied the flags to the accumulator so that the operating system could store them somewhere. INK was modified to save the state of the 'previous mode' latch in an unused bit. Note that the 'output keys' instruction (OTK) was naturally not modified, as all it could do would be to write back into the latch!
I am pretty sure there was other stuff, too, but I can't remember right now.
And of course I wrote a set of hardware test programs.
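
The first two modifications can be illustrated with a toy model in Python. It is a sketch of the idea only, and assumes nothing about the real logic design. HLT is used as a stand-in name for the CPU halt instruction, and the function names are invented.

```python
class Trap(Exception):
    """Raised when the (modified) hardware interrupts to the operating system."""


PRIVILEGED = {"HLT", "ERM"}  # HLT is a stand-in name for the CPU halt instruction


def execute(opcode: str, restricted: bool, modified: bool) -> str:
    """Model how one instruction behaves under the original and modified logic."""
    if restricted and opcode in PRIVILEGED:
        if modified:
            # Added logic: interrupt, so the supervisor gets a chance to emulate.
            raise Trap(opcode)
        return "nop"  # original behaviour: silently ignored
    return "executed"


def supervisor_step(opcode: str, modified: bool) -> str:
    """Run one guest instruction in restricted mode, emulating it on a trap."""
    try:
        return execute(opcode, restricted=True, modified=modified)
    except Trap as trap:
        return f"emulated {trap.args[0]}"


original = supervisor_step("ERM", modified=False)
fixed = supervisor_step("ERM", modified=True)
```

With the original behaviour the supervisor never hears about the guest's ERM, so it has nothing to emulate. With the added interrupt it regains control and can decide what the instruction should mean inside the virtual machine.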

Also take a look at the weird experience I had a lot later on.

Here is a picture of just the control panel.

A weird experience

The Hacking the CPU experience happened, as stated, a long time ago. However, there is a little postscript to the story.

As part of a possible project, I was recently trying to get some good photographs and physical details of the Honeywell DDP-516 control panel (this sat on a desk, connected to the main cabinets via a thick cable). I was after pictures, as well as accurate measurements, and the details of how each switch operated (momentary, two state, three state, etc.). I was able to locate some information online; these pictures show what it looked like.

I noticed that the pictures were of an item held by the Science Museum in London, so I decided to contact them. There was a false start when no one got back to me, but after several months I was able to have a discussion with a very helpful lady who arranged for me to have access to the panel. Unfortunately it was actually on display, and because of that (and other reasons) I was obliged to make my visit between 0700 and 1000 (i.e. before Museum opening time) on the day arranged. We settled on 0745 as the earliest feasible time for me, and I got up very early.

On arrival, I was admitted via a side door and went to sign in. We had to traverse a lot of the main floor of the museum, which was quite interesting as it was deserted apart from a couple of cleaners! I was then taken down to a basement lab, where the panel awaited me on a bench. Apparently, due to the proximity of another exhibit, there had been a few problems extracting the panel from the display cabinet; a small member of staff had to crawl in to get it!

At this point, I think I gave the conservator a bit of a shock. As I laid my hand on the panel, I said something like "This is the first time I have touched this panel for over 44 years". Yes, it was the same panel. I had done some research, and discovered that when it left the University of Kent, the DDP-516 had been donated to a local school. Several years later, it had been taken by the Science Museum, although they only retained the panel.

I spent about an hour and a half examining and photographing the panel, and taking measurements. I then had coffee in the Victoria and Albert café across the road, before entering the Museum again as a member of the public. I spent some time touring the 'Information Age' exhibition before returning home to sleep.

Crashing the system by deleting a file

This was another escapade on the ICL 4130 running KOS. By this time I was a (supposedly responsible) postgraduate.

The 4130 was running out of disk space, but it was nearing the end of its 10 year funded life. It had four 2MB disks, but needed more; however, it was not cost effective to buy more disks, and, I believe, an extra disk controller.

Two members of staff (one being Brian Spratt, the Director of the Computing Laboratory) thought up a cunning plan for cheap disk space. This used a PDP-11 as a kind of file server. One of the staff built a hardware interface between the 4130 and the PDP-11, and the other wrote the link software. There was also a little extra software in the PDP-11.

The basic idea was that the PDP-11 appeared as an extra disk - the current disks had single digit numbers, and the disk on the PDP-11 became disk 99. The PDP-11 ran its manufacturer's operating system - a pretty basic one called DOS/BATCH. Filenames on the 4130 were a maximum of eight characters, whereas they could be nine characters on the PDP-11. One could thus directly map filenames from 36 users on the 4130 to a single user on the PDP-11, by using the extra letter or digit to differentiate files for different users (there were a limited number of user accounts in DOS/BATCH).
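
The mapping is easy to sketch in Python. The figure of 36 falls out of 26 letters plus 10 digits. Where the extra character actually went in the name, and which character belonged to which user, I no longer remember, so putting it at the front (and the function names) are assumptions made purely for illustration.

```python
import string
from typing import Tuple

# 26 letters + 10 digits = 36 distinguishing characters, one per 4130 user.
TAGS = string.ascii_uppercase + string.digits


def to_pdp11_name(user_index: int, name: str) -> str:
    """Map a 4130 filename (up to 8 characters) to a 9-character-max PDP-11 name."""
    if not 0 <= user_index < len(TAGS):
        raise ValueError("only 36 users can share one PDP-11 account")
    if not 1 <= len(name) <= 8:
        raise ValueError("4130 filenames are at most eight characters")
    # Assumption: the per-user character is placed in front of the name.
    return TAGS[user_index] + name


def from_pdp11_name(pdp_name: str) -> Tuple[int, str]:
    """Recover the user index and the original 4130 filename."""
    return TAGS.index(pdp_name[0]), pdp_name[1:]


mapped = to_pdp11_name(35, "DATAFILE")
```

Two users with the same filename end up with different names on the PDP-11, and the original name can always be recovered.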

This all worked surprisingly well. The disk on the PDP-11 was an RP02, which was 20 megabytes; this was a vast improvement. A second disk was added later.

Until I came along! One day, I had written a program to do something pretty innocuous; I forget what it was. I accidentally got it into a loop writing to a file, and managed to fill up the rest of the 20 megabytes. I realised what I'd done, so I simply deleted the rather large output file. This would have been OK, but ...

DOS/BATCH used the system of 'block chaining' to construct files in its filing system. Essentially, files were linked lists of blocks, with a bitmap or a free list recording free blocks. When I deleted my large file, its deletion involved laboriously crawling down the very long chain of blocks in the file, returning each one and marking it as free. This took a long time - so long, in fact, that the 4130 thought the PDP-11 had crashed. The link software was very simple and did the easiest thing: it halted the machine.
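
A small Python model shows why deletion was so slow. It is a sketch of block chaining in general, not of the DOS/BATCH on-disk format; the dictionary of links and the free list are stand-ins for structures that really lived on the disk.

```python
from typing import Dict, List, Optional


def delete_chained_file(
    first_block: Optional[int],
    next_block: Dict[int, Optional[int]],
    free_blocks: List[int],
) -> int:
    """Free every block of a chained file; return the number of blocks visited."""
    visited = 0
    block = first_block
    while block is not None:
        # Each link lives in the block itself, so finding the next block
        # stands for one disk read per block of the file.
        following = next_block.pop(block)
        free_blocks.append(block)
        block = following
        visited += 1
    return visited


# A three-block file: 10 -> 11 -> 12 -> end of chain.
links: Dict[int, Optional[int]] = {10: 11, 11: 12, 12: None}
free: List[int] = []
reads = delete_chained_file(10, links, free)
```

The number of reads equals the number of blocks in the file, so deleting a file that fills most of a disk means reading most of the disk, one block at a time.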

So I brought the University mainframe to a standstill by deleting a file.

Crashing the system by editing a file

This happened again on the ICL 4130 running KOS.

I was doing research on software portability, in which I had been interested for some time. I had obtained a portable editor from a postgraduate at the University of Essex, and had implemented it on KOS. It worked in very limited memory (essential on KOS) but had advanced looping and decision constructs which made it very powerful. A select group of people (including me, of course) used it a lot.

KOS was simply a layer on top of the manufacturer's operating system; as such, it had to deal with unexpected error returns from the system. In general, these did not happen very much at all. For development purposes (and KOS was being developed continually), any unexpected error would cause KOS to stop scheduling its timeshared users, print the message LOGICAL ERROR on the operator's console, and pause for operator input. A simple command would allow it to continue, but of course the error had to be investigated first.

My portable editor just occasionally caused a logical error. I tried my best but could never find the fault. Then, one morning, I managed to cause four logical errors. The system manager wasn't happy, and he printed an octal dump of the entire KOS slave (we would call it a virtual machine these days). This was on 11 inch by 8 inch paper, quite thin, and a pile about a foot thick. He dumped it on my desk, with the order "Fix it!"

I took the pile back to my Darwin study bedroom, and left it on the floor for several days. On the Saturday evening, I and several other postgrads gathered in Darwin Bar, and I had quite a lot to drink. At closing time, I staggered back to my room, not at all sleepy. I assume I said to myself, no doubt in a slurred voice: "Ah, fix the editor!"

Apparently, I did so. I have no more recollection of that night, but I woke up the next morning to find paper all over the floor. On the top sheet was written "Uninitialised variable in fourth word of VFILE control block". And so it was.

Remote diagnosis with a twist

When I was an undergraduate, we used a locally developed timesharing system called the Kent Online System (KOS). I was a geek, and was trying out all the programs I could find. One of them seemed fascinating, but all I could make it do was spit out error messages.

Eventually I asked Professor Peter Brown, and it turned out that he'd written it as part of his Ph.D. He explained that it was a general purpose macro processor called ML/I, and showered me with documentation. It transpired that this included instructions for using it to translate itself for a new system. The year after, I did just that on a minicomputer to which I had access in the Electronics department (it was the Honeywell DDP-516 mentioned elsewhere here).

Fast forward another year, and I'm doing a Master's at the University of Essex. My speciality was (and still is, as many people know) operating systems, and I got to know a Ph.D. student researching that area. We chatted, and I found out that he was manually translating that macro processor for a 12 bit minicomputer (it was such a tight fit that it had to be done manually). This was a spare time project for him. For those who are interested, it was a PDP-8.

A few days later, I met him on the stairs. I asked him how the macro processor was doing, and he said it was all fine except for one weird bug. When it produced any error message, it forgot something called the 'insert environment'. Perhaps I could look at the code sometime?

I replied that I didn't need to. He was corrupting the fourth word from the bottom of the stack, probably due to a faulty backward move routine with an off by one error. I then rushed off to my lecture.

Next day, I saw him again. He was most impressed, because I was exactly right, and I hadn't even seen the code.

I never told him that the day before he told me about the bug, I'd been reading the documentation on the data structures used by ML/I ...

Peter Brown eventually handed over maintenance of ML/I to me. It is still in use, and I run a website for it. Peter sadly passed away a while ago.

Experience with the Edinburgh IMP language

I was recently reminded of my first exposure to IMP. This might be interesting to one or two people, and perhaps someone can expand on the bits I don't know.

For those who don't know, IMP was (and indeed is) a high level language, suitable for system programming, which was based on Atlas Autocode. Despite the name, Atlas Autocode was a high level language similar to Algol 60, but with some of the hard to implement parts taken out, and with some useful additions. More details can be found on Wikipedia, and manuals can be found here.

It would have been somewhere around 1975 or 1976. We, at the University of Kent, were looking for a high level language in which to implement a terminal concentrator; this was for the PDP-11. We only found two possibilities; IMP and BCPL. I was tasked with getting the IMP compiler going on a PDP-11 running DOS/BATCH.

I was provided with an RK05 disk containing a system called MUSS-11 (I assume this was derived from a Manchester system?). It ran an IMP compiler, and looked (I think) a bit like a later multi tasking system (from Edinburgh) called DEIMOS. It was probably a forerunner. It contained the compiler source, but I never really understood it (although, much later, I had a good look at Peter Stephens' IMP for EMAS, and could probably go back and understand the MUSS-11 one now). This was almost certainly a Peter Robertson compiler.

I first had to work out a way of getting text and binary files between the MUSS-11 PDP-11 and the DOS/BATCH one (this was nearly ten years before Kermit for us). I used the DOS loader format (checksummed blocks) and some handshaking over serial line; the MUSS-11 end was written in IMP, and the DOS/BATCH end in PDP-11 assembler. That worked OK.

Then I moved the IMP compiler compiled binary over, and I think I wrote some PERMS (intrinsic and support functions) in assembler too. I linked it all with a shim I wrote, and the compiler successfully ran on DOS/BATCH. There was a converter between IMP compiled output and DOS/BATCH binaries.

Postscript: The powers that be went with BCPL. The concentrator was very successful, and was down line loaded into PDP-11/03 machines over serial lines, with about six initialisation overlays! I later hacked up an existing Z80 BCPL compiler, and the whole thing was ported to Z80 systems. We were still using those when EMAS 2900 was in use; the FEP talked to the Cambridge Ring, and so did the Z80s. I still have the compiler somewhere.

Fooling the managers

The University of Kent's ICL 2960 was reasonably reliable - rather more so after we retired VME/K (the manufacturer-supplied operating system) in favour of EMAS, an operating system from the University of Edinburgh. The main points of hardware failure seemed to be fans and power supply units.

When the machine did break down, it caused great disruption to classes, and these were not easily rescheduled. We were thus under great pressure to get back in operation as soon as possible. We had invaluable assistance in this from our site engineers, in particular a lovely man called Harry Sweet, who lived in Herne Bay.

One of the most frustrating things was the time it took a spare PSU to reach us, even with two engineers doing a halfway meet (the trials of being in deepest East Kent). So we had a cunning plan; we kept unofficial on-site spares - unknown to the engineers' manager, who, by judicious use of smoke and mirrors, was persuaded (unwittingly) to provide several spare units, at least one of each kind.

The problem was where to store them. They had to be accessible to the engineers, but could not be kept in the room provided for them, because their manager might have noticed. Instead, they were stored under the false floor in the machine room, scattered in various empty spaces.

Of course, there was then the problem of finding the right unit without lifting half the floor. This was solved by the production of a 'treasure map', the grid corresponding to the floor tile layout. The map was taped to the back of a drawer in the engineers' room...well away from management eyes.

We still found a couple of mislaid PSUs when the machine was decommissioned.

Trials and tribulations with VME/K

The University of Kent's ICL 2960 was supplied with the VME/K operating system. To put it mildly, this system was a crock; it was abandoned by ICL not long after Kent moved to using EMAS. It was, I believe, thrown together rather quickly for use on machines that were not powerful enough to run the monster known as VME/B.

To say that VME/K was unreliable would be putting it kindly. I was tasked with checking that it did what the documentation said, and in little more than two weeks I had submitted over 200 separate bug reports. I was not popular with the development team. The problem was that this was the main University computer, and it was vital to the running of the place.

In sheer numerical terms, it was awful. Some of the failures could be attributed to hardware problems (it wasn't particularly well designed), but the software could not cope well with these. We maintained a rolling average of the combined (hardware and software) MTBF over 13 weeks, and it was about 20 hours. This improved to about 2000 hours when we moved to using EMAS.
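
To put those figures in perspective, here is a rough Python sketch of a rolling MTBF calculation. It assumes the machine was scheduled round the clock and ignores time lost to repairs, and the weekly failure counts are invented to reproduce the averages quoted above; they are not our real logs.

```python
from typing import List

HOURS_PER_WEEK = 7 * 24  # assumes the machine was scheduled round the clock


def rolling_mtbf(weekly_failures: List[int], window: int = 13) -> float:
    """MTBF in hours over the most recent `window` weeks of failure counts."""
    recent = weekly_failures[-window:]
    failures = sum(recent)
    hours = len(recent) * HOURS_PER_WEEK
    if failures == 0:
        return float("inf")  # no failures observed in the window
    return hours / failures


# Invented counts: 109 failures in 13 weeks, then a single failure in 13 weeks.
bad_quarter = [9] * 5 + [8] * 8
good_quarter = [0] * 12 + [1]

mtbf_bad = rolling_mtbf(bad_quarter)
mtbf_good = rolling_mtbf(good_quarter)
```

Thirteen weeks is 2184 hours, so an MTBF of about 20 hours corresponds to something like 109 failures in the window, or more than one a day. An MTBF of about 2000 hours corresponds to roughly one failure per window.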

The system used virtual memory, so there was a swap/page area on disk. One common problem was that if there was an error when reading this area, there was no attempt at recovery or mitigation; the system just crashed in its entirety. One day I was innocently running a blameless program, when it got a FSER 350 (as this swap/page error was known) and crashed. One of the few diagnostics was an identification of the user running at the time. Unfortunately, every time I ran that program that day, FSER 350 resulted. I was blamed. Of course, it would have been a lot better if just the user process that encountered the error had been terminated.

Another problem was the disk controller. This was a monster that occupied two or three 19 inch racks. It was old technology; ICL had recycled an earlier disk controller and made a few modifications, one of which was to paint it '2900 Tango Orange'. This controller was unreliable; it quite often just froze, causing a VME/K crash. EMAS handled this a lot better; it stepped the disk controller into diagnostic mode, to the point where it could issue a command to reboot it and reload the microcode (which was stored on a Compact Cassette). This took about 90 seconds, during which time EMAS queued pending disk transfers; after the reboot and reload was complete, the disk transfers were released and the system continued without loss of work.

One more shortcoming of VME/K was its handling of memory errors. The hardware was quite sophisticated, and it had Hamming correction on each 64 bit double word. This meant that if a single bit was in error, the error could be both detected and corrected, the software being notified via an interrupt. The failing memory could then be read (with correct data) and rewritten to fix the erring bit. If two bits were in error, that was fatal, but could still be detected. The problem was that VME/K did nothing more than rewrite the data; it didn't log or report the error, so the memory chip often just deteriorated further until another bit failed, and the system crashed. Once EMAS arrived, the situation changed; now the errors were logged, and once a day a report was printed for the engineers. This report detailed not only the failing memory board, but also the exact chip that had caused the error.
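
The principle of 'correct one bit, detect two' can be shown in Python with a much smaller word. This sketch uses the textbook extended Hamming code on 4 data bits (8 bits in all). The same scheme applied to a 64 bit word needs eight check bits, but whether the 2960 used exactly this arrangement I cannot say, so treat it as an illustration only.

```python
from typing import List, Tuple


def encode(data: List[int]) -> List[int]:
    """Encode 4 data bits as an 8-bit extended Hamming codeword."""
    d1, d2, d3, d4 = data
    word = [0] * 8  # index 0 = overall parity, 1..7 = Hamming positions
    word[3], word[5], word[6], word[7] = d1, d2, d3, d4
    word[1] = word[3] ^ word[5] ^ word[7]  # covers positions 1, 3, 5, 7
    word[2] = word[3] ^ word[6] ^ word[7]  # covers positions 2, 3, 6, 7
    word[4] = word[5] ^ word[6] ^ word[7]  # covers positions 4, 5, 6, 7
    word[0] = sum(word[1:]) % 2            # makes total parity even
    return word


def decode(word: List[int]) -> Tuple[str, List[int]]:
    """Return (status, data), correcting a single-bit error if there is one."""
    word = list(word)
    syndrome = 0
    for position in range(1, 8):
        if word[position]:
            syndrome ^= position
    odd_parity = sum(word) % 2 == 1
    if syndrome == 0 and not odd_parity:
        status = "ok"
    elif odd_parity:
        # One bit flipped; the syndrome is its position (0 = the parity bit).
        word[syndrome] ^= 1
        status = "corrected"
    else:
        # Parity looks fine but the syndrome is non-zero: two bits flipped.
        status = "uncorrectable"
    return status, [word[3], word[5], word[6], word[7]]


data = [1, 0, 1, 1]
single = encode(data)
single[5] ^= 1           # one failing bit
double = encode(data)
double[2] ^= 1           # two failing bits
double[6] ^= 1
```

The "corrected" case is exactly the one VME/K threw away: the data is fine, but the event is the only warning that a chip is on its way out.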

The engineer doesn't always know best

When the University of Kent's ICL 2960 mainframe was installed, it came with a site engineer. For quite a while, the engineer in question was a Kent graduate. He was something of a 'company man', and was not keen when we abandoned VME/K in favour of EMAS (from the University of Edinburgh).

I managed EMAS; it had a novel way of handling filestore and (for the purposes of this story) peripherals such as printers. These were managed via a Spooler process, which handled all of the exception conditions, farmed out to it by the actual supervisor. Whoever wrote the code at Edinburgh had been a little obsessive about detailed error messages - a good thing, and possible because all of the messages were inside a paged process.

One day, we saw a message we had never seen before. I forget the exact text, but it indicated that a particular fuse had blown in the printer. Edit: I just reviewed the source code; it was "Hammer Driver Fuse Blown". We duly called the engineer from his room. He looked at the message, and shook his head, stating that no such fuse existed and "our" system was wrong.

We pressed him on this, and after casting his eye over the defunct printer he retired to his office and manuals. He returned a few minutes later, bearing a fuse. He silently opened a small panel in the printer casing, and changed the fuse.

Hacking the hardware

Once the University of Kent had moved to using EMAS, it was enjoying a rolling MTBF averaging about 2000 hours over a 13 week period. This was much better than the 20 hours we had been getting from VME/K. People were very happy.

And then one day it all began to fall apart. The machine just stopped. No crash, nothing. The engineer's panel indicated that the microcode had halted. We re-IPLed the system, and an hour or two later it stopped again. Eventually we called the engineers, and they ran tests. Lots of them. They pronounced that there was nothing wrong.

Then the 'crashes' stopped, for a couple of weeks. Then they started again. We couldn't get a handle on what was wrong at all. It was eventually decided that, the next time it happened, I should use the engineer's panel, for as long as it took, to investigate the state of the machine. In the event, I simply dumped out all the target machine registers, and the microcode PC.

Our engineers obligingly left a microcode training manual lying around, together with a microfiche listing of the microcode. Oh, and some circuit diagrams. I retired to a darkened room for much of that day; and the next. Eventually I emerged with the reason for the crashes. Without going into too much technical detail, it seemed that the microcode and the hardware handed off tasks to each other; in particular, a part of the hardware called the 'scheduler' was responsible for validating the type field in the descriptor register during the execution of any instruction that used a descriptor to access an operand. Any invalid type was trapped, and sent back to the microcode to force an exception (known as a 'contingency'). All other type values were considered valid, and passed back to the microcode to be used in accessing a jump table, thence invoking the right bit of microcode for that descriptor type.

So, what was going wrong? It turned out that there was what can only be described as a hardware design error. The scheduler didn't detect one particular invalid type code, so it handed it back to the microcode, which accessed the jump table with it. This of course accessed an entry marked 'can never happen', and the microcode halted. We later discovered that a physicist's errant FORTRAN program was overwriting a descriptor, and generating the bad type value. If the machine stopped, he just submitted the job again until he got fed up and went off for a week or two. Then he tried again, never noticing the causal connection.

We contacted ICL, but we never seemed to reach anyone who either understood what the problem was, or had the power or inclination to get it fixed (which would not have been a quick job, in any case).

So I decided I had better fix this another way. Back to the microcode listing. I found an empty patch area, and hand assembled a new bit of microcode which I linked to the right jump table entry. All this did was generate a 'descriptor error' contingency with a hitherto unused subtype code. I then wrote a tool to extract the microcode from the system disk, patch it, and put it back again. We IPLed the system, and tested it (by this time I had a test program). Success - it correctly triggered the new contingency and the microcode didn't halt!
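
The shape of the fault, and of the fix, can be modelled in a few lines of Python. Everything specific here is invented for illustration: the type code values, the names, and the split into functions. Only the structure (a hardware check, a jump table, a 'can never happen' entry, and a patched entry raising a new contingency) comes from the story above.

```python
from typing import Callable, Dict


class Contingency(Exception):
    """Stands in for the exception the microcode raises to the supervisor."""


class MicrocodeHalt(Exception):
    """Stands in for the microcode stopping dead."""


# All type codes below are invented for illustration.
KNOWN_INVALID = {5, 6}  # invalid codes the hardware check does reject
MISSED_INVALID = 7      # the one invalid code the hardware check lets through


def scheduler_check(type_code: int) -> None:
    """Model of the hardware validation of the descriptor type field."""
    if type_code in KNOWN_INVALID:
        raise Contingency("descriptor error")


def halt(_operand: str) -> str:
    raise MicrocodeHalt("can never happen")


def patched_entry(_operand: str) -> str:
    raise Contingency("descriptor error, new subtype")


def build_jump_table(patched: bool) -> Dict[int, Callable[[str], str]]:
    """Jump table indexed by descriptor type; valid types are 0..4 here."""
    table: Dict[int, Callable[[str], str]] = {
        code: (lambda operand, c=code: f"type {c} access to {operand}")
        for code in range(5)
    }
    table[MISSED_INVALID] = patched_entry if patched else halt
    return table


def access_operand(type_code: int, operand: str, table) -> str:
    scheduler_check(type_code)
    return table[type_code](operand)


def outcome(type_code: int, patched: bool) -> str:
    """Describe what happens for a given type code, before or after the patch."""
    try:
        return access_operand(type_code, "X", build_jump_table(patched))
    except MicrocodeHalt:
        return "machine stops"
    except Contingency as contingency:
        return f"contingency: {contingency}"
```

Before the patch, the missed code stops the machine; after it, the same code produces an ordinary contingency that the operating system and the FORTRAN run-time system can report to the user.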

The only thing left to do was to modify the various components of the operating system to do the right thing, culminating in a change to the FORTRAN run-time system to generate a suitable message. That only took me a few minutes.

We had no more microcode halts and the users were happy.

Dual? What dual?

The University of Kent's ICL 2960 was installed in 1976, and it moved to the EMAS operating system in 1979. EMAS was very efficient, but as the years went on the system was being stretched to its limits. By 1983 the system was fully committed pretty well 24/7. Government policy meant that we wouldn't get a replacement for another three years.

We knew we couldn't afford much of an upgrade, but we found out that there was a spare ICL 2960 OCP lying in a warehouse in Southall (I believe it had been used for the recent Census). It was free to a good home (us) but we had to pay about £350 for transport, etc. ICL kindly supplied the extra bits we needed to hook it up, and by slightly reducing the peripheral configuration (we no longer needed a card reader) we were pretty well able to cover maintenance costs within budget.

The day came, and we IPLed the dual system for the first time. EMAS said 'Dual OCP found' and went to work. Basically, it worked until anything went wrong; under exception conditions the operating system was unable properly to control the second OCP (e.g. to halt it). EMAS had never before been run on a dual 2960 OCP (there wasn't one at Edinburgh), and it turned out that the instructions and image store locations needed to communicate between OCPs were not standard across the 2900 range.

We asked ICL for documentation. No one knew where it could be found (we assume), or perhaps someone decided we shouldn't have it. In any case, we were stuck. Without documentation we couldn't modify the system supervisor to make dual OCPs work as they should.

I had previously learned quite a bit about the ICL 2960 microcode, so I retired once again to a darkened room with the microcode training manual, and a microfiche reader. It took me about a day before I emerged, having read a great deal of microcode and essentially reverse engineered all of the image store locations and bit positions that we needed. I think what I wrote down was quite short, and here is most, if not all, of it. Armed with this, it was the work of minutes to modify the supervisor, rebuild it and re-IPL.

The system worked very well for its final three years.

The hairy PON

EMAS was a message based operating system; that is, each component within it communicated with the other components by sending them messages via a central message pool and 'switchboard'. Each message contained a header stating its source and destination, each being split into a unique number for the component, and another identifying the relevant activity. This is a very flexible system which allows, for example, a component to reside on a completely different machine, in a very transparent manner.

The routine that sent a message was called PON, which stood (apparently) for "Parameters ON queue". There was a corresponding POFF for taking a message off an activity's queue in order to action it.
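The idea can be sketched in a few lines of Python. This is a minimal illustration, not a description of the real EMAS supervisor: the class names, the use of one queue per (component, activity) pair, and the example numbers are all assumptions made for the sketch.

```python
from collections import deque
from dataclasses import dataclass


@dataclass(frozen=True)
class Address:
    component: int  # unique number identifying the component
    activity: int   # the relevant activity within that component


@dataclass(frozen=True)
class Message:
    source: Address
    destination: Address
    params: tuple = ()


class Switchboard:
    """A toy central message pool with one queue per destination address."""

    def __init__(self):
        self._queues = {}

    def pon(self, message):
        """Put the message on the queue belonging to its destination."""
        self._queues.setdefault(message.destination, deque()).append(message)

    def poff(self, address):
        """Take the oldest message off the queue for `address`, or return None."""
        queue = self._queues.get(address)
        return queue.popleft() if queue else None
```

Because the sender names its target only by number, the receiver could live anywhere; the sender neither knows nor cares. For example, something posing as the operator console could do switchboard.pon(Message(Address(1, 0), Address(7, 2), (42,))) and the component at Address(7, 2) would pick it up with poff on its next turn.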

Because EMAS was under continual development, there were occasional glitches. One such occurred when we were trying to run a trial service: a semaphore was getting stuck and locking up access to a file directory. This turned out to be because the system, when in use at Edinburgh, had never managed to fill the directory cache. I was proud that I found this myself, and managed to do so before the person who had designed and written that component! It was relatively easy to fix, but in order to confirm the problem (and its solution) it was necessary to 'signal' the semaphore (i.e. release it) manually. This operation was trivial, because EMAS had an operator console command for issuing an arbitrary PON with any desired destination, activity and parameters. I carefully typed the right numbers, and magically the directory was unlocked.

Clearly, this was a very dangerous facility; get one tiny bit wrong and it would almost certainly be fatal for the system. So, such uses of it became known as "Hairy PONs" (no, there is no 'R' in there).

Hacking an operating system into life

This happened back in my graduate days, but was a precursor to some later stuff.

I was doing some work on a PDP-11, mainly for fun and because I was curious. The machine was shared with students, and one had to book 'slots' to use it. It was a teaching machine, and fairly basic. It had 56kB of memory, two DECtape drives, a teletype and a (slow) dot matrix printer. That was it. The only language it supported was PDP-11 assembly language. It ran an operating system called DOS-11, which was OK, but slow and clunky (in fairness, it ran in a tiny amount of memory with many overlays). A lot of the 'slot' was often taken up with printing. I was a bit frustrated, but it was all there was.

I had previously used a PDP-10, which was a much larger machine. It was very nice to use, with a very usable command interface. I found out that DEC had another PDP-11 operating system called RT-11, and it seemed to have a much better command interface, not unlike that on the PDP-10.

A friend came into possession of the complete source code of the current version of RT-11. But there was no way to run it. Frustrated, I came up with a plan, which I put into operation. It is important to explain at this point that DECtapes worked like disks. They held 360kB, and had individually addressable blocks, just like a disk. This meant that it was possible to run a system off them (albeit slowly) just as if it was on a disk. What I did was this:

I built all of the system binaries using the assembler and linker on DOS-11. This was easy, but it produced binaries in DOS-11 format.
I wrote a program (in assembler of course) to transform the DOS binaries into RT-11 format binaries. This was fairly easy, as they were quite close to being simple memory images.
Next, I wrote a program to 'format' an empty DECtape in RT-11 format, writing all of the file system metadata, etc. This was a little tricky because the programming interface was at the hardware level, and required one to seek the right numbered block and then write the data as it flew by.
I built the system boot block, and wrote a program to put that on the first block of a DECtape.
The trickiest program was next. It had to write files into the file system on the DECtape, updating all necessary metadata. With this, one could write the system files to the DECtape.
At this point, one could boot from the DECtape. RT-11 ran fine. I could then transfer it to the disk. The only issue was that the machine had one small disk, so before use I had to generate an image of the normal system on tape and write it out, so that it could be restored after my session. But that was quick and easy.

The interesting thing about RT-11 was that there were two different versions of the 'monitor' (the operating system). One was the Single Job (SJ) monitor, which was a normal single user system. The other was the Foreground/Background (FB) monitor, which allowed one program to run without interaction. The first thing I did was write a print spooler! It checked the disk (every ten seconds) for any files with an extension of .LPT. If it found one, it printed it and deleted it, while I was happily programming away uninterrupted. It was very useful, and made much better use of my time slot.
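The original was in PDP-11 assembler and is long gone, but the logic of such a spooler is simple enough to sketch in Python. Everything named here (spool_once, run_spooler, the print_file callback, the cycles parameter) is an assumption for illustration; only the ten second interval and the .LPT extension come from the story.

```python
import time
from pathlib import Path


def spool_once(directory, print_file):
    """Print and then delete every .LPT file currently in `directory`.

    `print_file` is any callable that accepts the file contents as bytes;
    it stands in for sending data to the printer.
    Returns the names of the files that were printed.
    """
    printed = []
    for path in sorted(Path(directory).glob("*.LPT")):
        print_file(path.read_bytes())
        path.unlink()  # delete only after the file has been printed
        printed.append(path.name)
    return printed


def run_spooler(directory, print_file, interval=10.0, cycles=None):
    """Poll `directory` every `interval` seconds.

    Runs forever when `cycles` is None; otherwise makes that many passes,
    which is convenient for trying it out.
    """
    done = 0
    while cycles is None or done < cycles:
        spool_once(directory, print_file)
        done += 1
        if cycles is None or done < cycles:
            time.sleep(interval)
```

Deleting the file only after it has been handed to the printer means a crash mid-print leaves the file in place to be printed again, which seems the safer failure mode.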

I didn't dream that I would be doing something like this again in the future. But I did, so see the next anecdote.

Another operating system resurrection

Back in the late 1970s, there was a project (at the University of Cambridge, UK) to develop a portable operating system. It was called TRIPOS. For more details, see here. It initially ran on a PDP-11, although space was a bit tight.

In the pandemic lockdown of 2020, I became bored, and thought I'd like to play with TRIPOS. I was familiar with using a PDP-11 emulator based on SIMH, so I didn't need real hardware (although I do possess four PDP-11s). Most (but not all) of the source code was available online. TRIPOS was written in assembler and BCPL. A BCPL compiler was readily available, and TRIPOS came with an assembler written in BCPL, which I got working on my normal programming platform (FreeBSD).

The implementation process was fairly similar to the earlier one for RT-11, involving multiple stages and file format changes. It worked, and I have a running (but not perfect) TRIPOS system.

Immortalising Bobby Tables

Many Computer Science students will know about Bobby Tables. He appears in this xkcd cartoon. If you don't understand it, then take a look at the explanation.

The University of Kent has a Footsteps Project, which aims to raise money by getting former students to donate chunks of money in return for having a brick laid in a memorial path. Bricks can have any wording required, as long as enough money is provided!

If one counts from the end of the path nearest the Gulbenkian Theatre, I think the Bobby Tables brick is in the sixth row, right hand side...

See also the piece about the misaligned path later.

Rather less technical stuff

These should be accessible to non-computing people!

When I can remember them. And if the period specified by any statute of limitations has elapsed.

Other stuff

This is stuff that doesn't fit well into a category.

Lord Grimond's funeral

First, some background. The Chancellor of the University is its titular head, whose principal duties are ceremonial (e.g. conferring degrees). The Chancellor is usually a public figure. The first Chancellor of the University of Kent was Princess Marina, Duchess of Kent, who sadly passed away just after attending her first degree congregation. She was succeeded by Jo Grimond (later Lord Grimond), the Liberal politician. There have been several Chancellors since; Princess Marina was the only Chancellor that I did not meet. I found them to be of variable quality and commitment, the best being Lord Grimond, and Gavin Esler (the current incumbent as of 2023). Both of these individuals have been very friendly and student oriented.

While we are here, what about the Vice Chancellor? They are the executive head of the University, roughly equivalent to a CEO. They are generally appointed from outside the University. I have met all of the Kent incumbents; as with the Chancellors, they have varied greatly. Some (well, two) have been outgoing and friendly, while the rest have ranged from the reserved to the frankly robotic (at that end of the scale, rarely acknowledging members of staff, let alone students). I will not name names for the latter, but the two I liked were David Ingram and Robin Sibson. Many may express surprise at my selection of the latter; the explanation is that he was rather shy, but quite pleasant when you got to know him.

Now, on with the story. Jo Grimond retired from being Chancellor somewhere around 1990. He died on 24th October 1993. At that time, I was Master of Darwin College, and was surprised (and disturbed) to learn that the University had no plans to send a representative to his funeral. I decided that (management willing) I would go, as Jo had been an honorary Senior Member of Darwin College. I should mention at this stage that the funeral was to be in Orkney!

Time was tight. I forget the exact date, but I remember that I discovered the concerning news regarding the University's lack of involvement early on a Thursday morning; the funeral was to be around noon on the following day. My very efficient Master's Secretary (aka PA) secured agreement that the University would foot the cost of my attendance (this was a significant amount, and not in my College budget). She then booked flights and hotel accommodation, of which more below. I went home to prepare, and get an early night.

Next day, I got up at 3 a.m., had breakfast and drove to Heathrow Airport; I had to leave plenty of time in case of delays, and to give time to track down and pick up my tickets. I boarded the 0700 flight to Aberdeen, and arrived at about 0830. I grabbed a coffee, then boarded the flight to Kirkwall airport in Orkney. This is where I'd had a bit of luck; I had worked a lot with someone at the University of Edinburgh, who had retired some years before to live in Kirkwall, and he picked me up at the airport. He took me back to his house for coffee and a chat, and we then strolled down to St Magnus' Cathedral.

I had gathered from our chat that my friend was quite well connected, with the town council and others. I found that the family were seated in the front two rows, and we were in the third! I was next to Jo's political agent of many years. Jo had married Laura Bonham-Carter, granddaughter of Herbert Henry Asquith, and there were a lot of members of the Bonham-Carter family there; the famous Helena wasn't, as I think she was filming in the USA. There were eulogies from various people, including David Steel. After this, we boarded hired double decker buses for the interment, a few miles away, at Finstown. I remember that it was very cold and windy. We then repaired to a hall nearby for refreshments; I can honestly say that I have never seen so much whisky in my life! I had tea, though. Then, back on the buses and back to my friend's house. My flight was on time; no waiting for a slot at that airport, we just taxied to the end of the runway, turned round and took off. I remember sitting across the aisle from Clement Freud.

The flight back to Heathrow was uneventful. I had arranged for a hotel room for the night, as I had been up for nearly 18 hours and didn't fancy driving home that night. I had been booked a room in the Heathrow Holiday Inn (I think), just north of the airport. I settled in and ordered some food via room service, and went to bed. I should explain that this hotel consisted of a central hub (reception, bar, restaurant) and three long spokes containing rooms. I was in the last room at the end of a spoke, on the top floor.

I had ordered breakfast to my room, so I staggered out of bed in my pyjamas, and put the tray from the previous night outside my door. SLAM! I had locked myself out! The only option was to walk, in bare feet and pyjamas, right down to the lifts in the middle. I then shared the lift with people all dressed for breakfast, down to the ground floor. I then had to walk through the restaurant to reach reception, and get a fresh key. Then back up again.

I had my breakfast, and drove home with no more drama.

The misaligned path

This refers to the brick path built for the University of Kent's Footsteps Project. This aims to raise money by getting former students to donate chunks of money in return for having a brick laid in a memorial path. Bricks can have any wording required, as long as enough money is provided!

Although it doesn't say so any longer, the path is meant to follow the path of the Canterbury and Whitstable railway line, which ran in a tunnel beneath the University. That tunnel collapsed in 1974, causing consternation and inconvenience to many University members, not least the Computing Laboratory (the main computer had to be moved in a hurry). As a nod to the railway line motif, the brick path is bordered by steel rails.

However, the path doesn't follow the railway line - it is quite a few degrees out. Looking from the Gulbenkian Theatre, it should point a bit further to the right.

You can see this for yourself. View this map. It shows the railway line as a double dotted line (the tunnel) underneath what later became the site of the University. In the panel on the left, slowly drag the blue dot to the left, and a modern Google Maps overlay will fade in. You can see the line of the path - it's different!

Never mind. It's probably very sad that I noticed in the first place.

N.B. I have now obtained further details pertaining to the tunnel collapse. I even gave a talk about it, which you can find here.

Right Said Fred

If you don't understand the title by the time you've finished this, look here.

At the University of Kent, the School of Computing used to occupy, inter alia, the first floor of the Cornwallis South building (and a bit of the ground floor). The rest of the ground floor was occupied by Information Services (who provide the central computing service). Prior to the early 1990s, these two entities were one and the same - the Computing Laboratory.

It will be noticed that the ground floor (east of the main entrance and lobby) consists of offices and corridors that surround a central, windowless set of rooms. The largest of these was, at one time, the main Computer Room (!) which contained the University's mainframe computer; it is now the main server room. There used to be windows so that visitors could be shown The Computer; these were on the south side, and if you go into the main entrance, turn right through the door, and walk a little way down the corridor, you can (if you squint, and in the right light) see where the windows were filled in. Originally, this central complex of rooms was completely rectangular.

In the mid 1970s, the University obtained funding for its ICL 2960 replacement mainframe computer. ICL were very good at site surveying (well, most of the time, but that's another story). The site surveyors came, and decreed that there were a number of problems. The most pressing one was building access; the system's OCP was contained in a long cabinet nearly a metre wide and several metres long. It would go in the front door. It could be rotated to go through the door on the right. It would not negotiate the slight dogleg necessary to go through the door and lobby on the corner of the Computer Room.

The solution was to modify the wall. Which is why, to this day, that corner of the server room is cut away a bit.

Glossary

It occurred to me that some of the items above might need to have explanations of one or two of the terms. So here they are.

BCPL: BCPL is a typeless systems programming language developed in the 1960s. It influenced the development of other languages such as B and C. It is still available on many platforms.
DEC: DEC was the Digital Equipment Corporation (of Maynard, Massachusetts). Founded in 1957, it was taken over by Compaq in the 1990s, and later folded into HP (Hewlett Packard). It is no more. The company had a focus on technology and engineering, and produced a number of different computer systems, mostly named as PDP-something. This stood for Programmed Data Processor, as it was felt they wouldn't be credible as a computer manufacturer in their early years.
EMAS: The Edinburgh Multi-Access System. EMAS was an operating system written at the University of Edinburgh, originally for the English Electric System 4 (a near clone of the IBM 360/370 series). It was ported to the ICL 2900 series, and later on to the newer IBM mainframes. For more details, see the Wikipedia article.
ICL 2960: The ICL 2960 was manufactured by International Computers Limited, the British computer company at the time (early 1970s). ICL was the result of serial mergers of a number of companies; for full details, see the Wikipedia article. The 2960 was part of ICL's 'New Range' of computers, intended to replace the ageing System 4 (inherited from English Electric), the 1900 series (inherited from International Computers and Tabulators), and the 4100 series (inherited from Elliott Automation). The machine had a 32 bit (4 byte) word, and some sophisticated facilities, with intelligent peripheral controllers.
ICL 4130: The ICL 4130 was manufactured by International Computers Limited, the British computer company at the time (early 1970s). ICL was the result of serial mergers of a number of companies; for full details, see the Wikipedia article. The 4130 was part of the 4100 series (the only other model being the more basic 4120). The 4130 had a 24 bit word, some interesting shared code features, and some simple base and range style memory management, together with the ability to operate in privileged and non-privileged modes.
Image store: The ICL 2900 architecture included instructions for accessing image store. These instructions were quite restricted (basically just 'read' and 'write'). The image store looked like a special kind of memory, but changes in its contents were used to modify the behaviour of the machine, and for accessing peripheral controllers. Image store instructions can be thought of as a mixture of access to machine control registers, and as input/output instructions (with image store addresses being analogous to input/output port numbers).
IPL: Initial Program Load. The act of loading a small program into a computer, which then loads a larger one, and eventually the operating system. More commonly known as bootstrapping or booting.
KOS: The Kent On-line System. KOS was a simple timesharing system allowing small applications, simple programming (mainly in BASIC), and editing of disk files. It ran alongside a conventional 'punched card and printer' batch system, on the ICL 4130 series of machines.
MTBF: Mean Time Between Failures.
OCP: Order Code Processor. Machines in the ICL 2900 series were built from a series of components; usually, each was in a separate cabinet (or several cabinets in many cases). The OCP was what is normally called a CPU - it was the component that executed instructions (the order code), and did not include peripheral controllers, memory, etc.
PDP-11: A range of 16 bit minicomputers manufactured by the Digital Equipment Corporation of Maynard, Massachusetts. The company name was usually abbreviated to DEC. They were relatively inexpensive (for the time), flexible and easy to interface.
PSU: Power Supply Unit. Power supply units usually convert mains power into a form suitable for powering computer components. Large computers may have several different ones, powering different components.
VM: Shorthand for virtual machine.
VME/K: Virtual Machine Environment K. VME/K was an operating system for smaller machines in the ICL 2900 series. It was introduced as an alternative to the mainstream operating system, VME/B, because the latter was too big and slow on the smaller machines. However, its life was short, and it was retired in the early 1980s due to cost cutting and internal politics.