far.in.net


Seven Saturdays with Singular Learning Theory

I got a grant to work on introductory resources for singular learning theory (SLT) for six weeks. I’ll spend a little time every Saturday and write about my progress, goals, and lessons learned. (There will be seven Saturdays if I include the Saturdays before the first week and after the last week.)

Contents:

Prologue

In late June, I attended the inaugural SLT and Alignment Summit in Berkeley, California. High on motivation towards the end of the summit, I was desperately looking for some funding so that I could spend some time contributing to research on SLT.

I was lining up a visit to Krueger’s lab for a few months later this year. The start date was initially planned for early August, but it was looking like it would be pushed back to more like September. I could wrap up my RA position at Melbourne by the end of July, and so this left an appealing window of time to work on SLT.

All I needed was funding, and if I was going to catch this opportunity, I needed funding fast. Luckily, two new fast-turnaround funding programs, Lightspeed grants and manifund had just launched and were taking applications. I sketched out my application on the flight home and finished it a few days later, just before the lightspeed deadline. I also sent it to a manifund regrantor (and previous collaborator of mine), Adam Gleave, who liked the project enough to fund it!

So it all lined up perfectly! And I’ll see you on the Saturday before the project!

Saturday 0

Saturday, July 22nd, 2023

I just finished my second last week in my RA role at Melbourne. It’s effectively my final week, because next week I am off to ICML to present my CHAI internship paper. So I spent the week nicely wrapping up my contributions to the project (refactoring and documenting code, documenting next steps, etc.). Satisfying work!

One of the days I was on campus, I met with Edmund and we mapped out some of the key ideas to cover in an SLT literature review. This is the main project I’m going to be working on for the 6 weeks.

There’s not a lot more to say yet, other than perhaps to share this Venn diagram, reproduced from the whiteboard in MathSpace where we sketched out the review:

Possible aim of the literature review: help people from camp A to camp B.

More next week!

Saturday 1 (take 1…)

Saturday, July 29th, 2023

Writing from the AirBnB in Waikiki. I’m not sure how anyone manages to get any work done during conferences. Well, I guess I did get moments here and there, but I spent them working towards a seemingly more urgent deadline. Anyway, I will adjust my actions and my expectations in the future (like a good Fristonian). For now, I will extend the grant by a week. (I’ll figure out what to do about the clash with Cambridge later).

I can share a lesson that I started to learn at the SLT summit and that has been reinforced here at ICML: research is not a personal enterprise. Publishing a paper at a conference isn’t just a matter of dumping facts into some abstract research record with your name attached to them. The positive accumulation of human knowledge that Sarton told me about is actually implemented as, wouldn’t you know it, a community of humans. The conversations and connections at the conference are as important (perhaps more important) than the posters and the oral presentations.

This lesson is more general than about the SLT project, but it has implications. If we’re going to get SLT off the ground as a research program, we need to get SLT into the hearts and minds of the researchers, not (just) into NeurIPS, ICML, and ICLR.

One other update on the SLT project. Dan invited me to work with himself, Susan, Jesse, and Liam on an experimental project attempting to explain the phase transition to in-context learning in transformers using SLT.

Goals for next week:

  1. Get home safe.
  2. Get to work!

Saturday 1 (take 2…)

Saturday, August 5th, 2023

I made a slow start this week. The first part of the week was spent travelling home, and the second catching up on some admin. I will probably not count this week either, and try to start again next week! Either way, now the project is finally in motion!

On the SLT literature review:

On the in-context learning project:

(On side projects:)

Saturday 1 (third time’s the charm!)

Saturday, August 12th, 2023

OK, I decided to make this past week the official first week of the 6-week project. That’s final. This worked out for the best because:

Anyway, since I was able to spend most of my attention on research this week, things really got into a swing!

For the SLT literature review:

For the in-context learning project:

This week I also experimented with working from various locations on different days.

On the topic of productivity, last week I recruited a few colleagues and a few of my peers from Master’s to join me for twice-weekly virtual ‘shut up and write’ sessions (wherein we join a voice call, but are mostly silently writing/working, using the pomodoro technique with a screen-shared virtual timer). I ran two such sessions during the week. Ben and Edmund also joined my end of the call in-person for (part of) both sessions.

The sessions were pretty effective! This is not such a surprise, since I was simply reviving a practice I had started with some of these peers during my own Master’s degree. At one point, we were doing these sessions once or twice daily, and I credit them with a substantial portion of the words in my thesis (not to mention the many more words ultimately cut from earlier drafts of my thesis). I think this is a pretty powerful system I have here, and I should try to find a way to keep it as part of my work life going forward.

(On side projects:)

Saturday 2

Saturday, August 19th, 2023

This week, as Edmund worked towards the AAAI submission deadline for another paper, I mainly focussed on the in-context learning project:

The SLT literature review was basically on hiatus. However, I did have one meeting with Ben to go through the reading list and identify areas where it needs expansion. Next week we’ll pick this back up with roughly the same goals as last week.

(On side projects:)

Saturday 3

Saturday, August 26th, 2023

Today marks the project half way point! There is still so much to do. But I’m also positive about the start I’ve made.

On the in-context learning project:

On the SLT literature review project:

(On side projects:)

Speaking of my Cambridge visit, I received my visa to work in the UK this week! In the end, it took around 10 business days—even less than the 3 week optimistic estimate, and of course far short of the 6 weeks I feared. A great outcome, that helps me plan my travel, you know, not right before the travel. Anyway, I’m one step closer to Cambridge! Of course, that also means I’m one step closer to the end of this grant—another reminder to keep up the pace!

Saturday 4

Saturday, September 2nd, 2023

Sorry, I don’t have time to write a full detailed update this week. Here’s a brief update.

The literature review project is going okay:

The in-context learning project seems to be progressing smoothly, at least as far as the first milestone—replicating the phase transition—goes. We have conducted a few training runs and appear to be getting similar results to the plot in the main figure in the task diversity paper so far. It remains to conduct a full set of training runs and compare our results. After that, of course, we have to start the SLT-based analysis of the phase transition.

Until next week!

Saturday 5

Saturday, September 9th, 2023

TODO.

Saturday 6

Saturday, September 15th, 2023

TODO.

Epilogue

TODO.