#5 - The most watched econ seminar in history?
Dear reader,
This is definitely not a newsletter article I imagined I would write when I relaunched LOGOecon in June. But it really feels at home with the kind of topics I want to cover.
Unless you live under a rock, you can't have missed an important presentation that happened last month at the NBER Summer Institute: Florian Ederer presented a working paper he co-authored with Paul Goldsmith-Pinkham and Kyle Jensen. They were able to identify the IP addresses of people posting on EJMR, the toxic cesspool where a non-trivial number of economists share racist, misogynistic, homophobic and transphobic delicacies about their colleagues. These IP addresses show that people at all institutions, including so-called “elite” institutions, happily wade in this cesspool.
The presentation was live-streamed on YouTube (a replay is available here). Thanks to a lucky turn of events (more on that below), I was able to record the live number of “Viewers” during the presentation, discussion included. As you can see in the plot below, the number of “Viewers” peaked at more than 1900.
I don't have data for other well-known economics presentations, but I wouldn't be surprised if Florian's presentation was one of the most watched econ seminars in history, if not the most watched.
Before I dive into how I got the data, a quick observation: thanks to modern technology, scientific presentations can now reach an unprecedented number of people. Not all scientific presentations are destined to have thousands of viewers. But the fact that some can reach such numbers is truly amazing to me. Can you imagine a seminar with up to 1900 people in the room eating popcorn, sorry, listening? I wouldn't be surprised if these tools help to reduce (to a certain extent) the negative effects of institutional barriers and geographical distance in academia.
Now, the data. How did I get it? Well, unlike Florian, Paul and Kyle, I had no need to develop a complex (and frankly, impressive) strategy to exploit a hash algorithm used so poorly that you're left wondering how many other massive privacy loopholes still exist on EJMR. What happened is a lot more mundane.
I started watching the presentation on my iPad. I quickly realized that the YouTube app has a “Viewers” item in its interface (during a live stream, you can access it by tapping on the description of the video, where “… more” is located). I took multiple screenshots during the presentation and extracted the number of “Viewers” afterwards.
I needed two pieces of data from the screenshots: the timecode and the number of viewers. As I have 37 screenshots, it made sense to automate part of this work.
I used R (obviously) to extract the timecode. Each screenshot taken on iOS and iPadOS comes with EXIF metadata, including the exact time it was taken (down to the second). The exifr package makes it possible to extract and work with EXIF metadata in R. As EXIF gives both the date and the time when the image was taken, I used lubridate to remove the date and turn the time into an actual timecode, with 0 being the approximate beginning of the presentation. The presentation started at 22h30 my time (10:30 PM), which is 22 × 60 + 30 = 1350 minutes after midnight, so I subtracted 1350 minutes from the time.
mutate(
  # 22:30 local time is 1350 minutes after midnight
  time_code = hms::as_hms(time_code) - dminutes(1350)
)
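For readers who want to see the arithmetic end to end, here is a minimal sketch of the same idea in base R rather than with exifr, hms and lubridate. It is not the code I actually ran: the timestamps are made up, the `YYYY:MM:DD HH:MM:SS` string format is what I would expect from an EXIF date field, and the names `exif_times` and `to_timecode` are only there for illustration.

```r
# Hypothetical EXIF-style timestamps ("YYYY:MM:DD HH:MM:SS"); NOT the real data.
exif_times <- c("2000:01:01 22:30:00",
                "2000:01:01 22:41:30",
                "2000:01:01 23:05:10")

# Parse the strings. The time zone is pinned to UTC only so that the
# example gives the same result on every machine.
taken_at <- as.POSIXct(exif_times, format = "%Y:%m:%d %H:%M:%S", tz = "UTC")

# Drop the date: seconds elapsed since midnight on the day of each screenshot.
midnight <- as.POSIXct(format(taken_at, "%Y-%m-%d"), tz = "UTC")
secs_since_midnight <- as.numeric(difftime(taken_at, midnight, units = "secs"))

# The presentation started at 22:30, i.e. 22 * 60 + 30 = 1350 minutes after midnight.
start_offset_secs <- (22 * 60 + 30) * 60

# Timecode in seconds, with 0 being the approximate start of the presentation.
time_code_secs <- secs_since_midnight - start_offset_secs

# Helper (illustrative) to print seconds as H:MM:SS.
to_timecode <- function(s) {
  s <- as.integer(s)
  sprintf("%d:%02d:%02d", s %/% 3600L, (s %% 3600L) %/% 60L, s %% 60L)
}

time_codes <- to_timecode(time_code_secs)
print(time_codes)
```

Note that this simple version assumes every screenshot was taken before midnight on the same day; a screenshot taken after midnight would come out with a negative timecode and would need an extra 24 hours added.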
I exported an Excel file with the end result, and I manually recorded the number of viewers from each screenshot (I probably could have used some OCR to do that, but it felt like overkill).
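The export, fill in by hand, and plot loop could look roughly like the sketch below. Everything in it is an assumption made for illustration: a CSV file stands in for the Excel file, the file names, timecodes and viewer counts are made up (they are not the numbers from my screenshots), and the column names are hypothetical.

```r
# Hypothetical table: one row per screenshot (made-up names and timecodes).
shots <- data.frame(
  file = c("shot_01.png", "shot_02.png", "shot_03.png"),
  time_code_secs = c(0, 690, 2110)
)

# Export with an empty column to be filled in by hand.
# A CSV in a temporary folder stands in for the Excel file here.
shots$viewers <- NA_integer_
path <- file.path(tempdir(), "screenshots.csv")
write.csv(shots, path, row.names = FALSE)

# The counts are then typed in manually from each screenshot.
# That step is simulated here with made-up numbers.
filled <- read.csv(path)
filled$viewers <- c(100L, 250L, 180L)

# Row with the highest recorded number of viewers.
peak <- filled[which.max(filled$viewers), ]
print(peak)

# Viewers over time, with the timecode expressed in minutes.
plot(filled$time_code_secs / 60, filled$viewers, type = "b",
     xlab = "Minutes since the start of the presentation",
     ylab = "Viewers")
```

With only a few dozen screenshots, typing the counts by hand keeps the whole thing simple, and the plot only needs the two columns.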
Last, but not least, what's a YouTube “Viewer” exactly? It's not completely clear to me. Based on this (unclear) support page from Google, it seems that a viewer is basically one “device” that is streaming the live feed. The support page explains what the person streaming sees in the interface, not what the person watching sees. I'm assuming that “Viewer” has the same definition for both streamers and watchers.
If a person has two devices streaming the same live feed (for instance, a smartphone and a computer), I'm assuming it would count as two viewers (unless Google can correct the count if a given logged-in account is streaming the same feed twice?). Conversely, one “device” streaming the live feed can be watched by multiple people, so the number of “Viewers” is only an approximate measure of the actual number of people watching live.
Despite these caveats, the number of “Viewers” is probably a decent approximation of the actual number of people who watched the presentation live. (As the presentation is available on replay, the total number of “Viewers” is now above 1900, and will continue to increase for as long as the replay stays online.)
It was an interesting plot to make, even if the underlying paper comes with grim, albeit completely unsurprising, results. EJMR is a plague, and I hope that, collectively, we will be able to get rid of it. The sooner, the better.
Feel free to subscribe to my newsletter so you don't miss my upcoming articles.
Olivier