96

The Event Horizon Telescope (EHT) was used to create an image of Messier 87*, a supermassive black hole in the Messier 87 galaxy. This result has been heavily reported in the media leading up to its announcement 2019-04-10.

Recent postings on social media have argued over who should receive credit for this work. Some have credited Katie Bouman for the image, which others have contested as overstating her contributions to the overall project.

For example:

enter image description here

According to data provided publicly by GitHub, Bouman made 2,410 contributions to the over 900,000 lines of code required to create the first-of-its-kind black hole image, or 0.26 per cent. Bouman’s contributions also occurred toward the end of the work on the code.In contrast, contributor Andrew Chael wrote over 850,000 lines of code. While CNN attempted to give Bouman full credit, explaining “That’s where Bouman’s algorithm — along with several others — came in,” they slyly admitted that fellow researchers told CNN “‘(Bouman) was a major part of one of the imaging subteams,'” even after CNN incorrectly wrote on the previous line that she was on one of the “imaging teams,” not subteams.

Source: https://bigleaguepolitics.com/woman-who-media-claims-created-black-hole-image-contributed-0-26-of-code/

This analysis seems to disregard the way collaborative scientific research actually works; are the metrics being discussed sufficient to measure this kind of contribution, or the impact someone has on a project of this nature?

Sklivvz
  • 78,578
  • 29
  • 321
  • 428
SSimon
  • 1,726
  • 3
  • 12
  • 20
  • 19
    Reminder to prospective contributors: 1) do not answer in comments, we will nuke them aggressively without notice; 2) original research is not allowed on this site, please base any answers on reputable sources and not on your personal opinion of what should count as a contribution. – Sklivvz Apr 12 '19 at 09:48
  • It seems my earlier comment was deleted since it was part of a bit of a chat, but I'll reiterate my actually suggestion for improvement: Please more of what CNN said that Big League Politics described as attempting to give full credit. – Azor Ahai -him- Apr 12 '19 at 23:43
  • I did a preliminary edit to help with adding context. However, someone who knows more about precisely what people on social media are arguing about could probably elaborate a bit more on the significance/relevance of the meme. – Nat Apr 13 '19 at 01:40
  • 11
    @TimPost: your edit to the title has completely altered what is being asked, invalidating the existing answers. Please revert or give a solid explanation for your change? – Jack Aidley Apr 13 '19 at 06:12
  • 14
    The 5th revision completely changes the meaning of the question. The CNN quote is still complete nonsense, but the titular question is *completely* different. That is not a change that should have been made. – Polygnome Apr 13 '19 at 08:19
  • Off-topic... the controversy being stirred up reminds me of College Humor's [Internet Bridge Troll](http://www.collegehumor.com/video/5817726/internet-bridge-troll). Bouman and her teammates are professionals, and have repeatedly said it is a team effort. The trolls refuse to listen to them. –  Apr 15 '19 at 00:07
  • You're misinterpreting the tweet you added to the question, it doesn't mean that her algorithm wasn't used at all. The edit also contains too much editiorializing, so I'm removing it again. Please do not roll back again. – Mad Scientist Apr 15 '19 at 09:47
  • @MadScientist do you have any evidence for that? As a scientist, I understood it is NOT used it is in the interest of new question to include that tweet, which is really important! The new question is if credit was misattributed, either roll back to the first original question or leave the important reference from a member of the group. – SSimon Apr 15 '19 at 10:20
  • @JackAidley The original title was _extremely_ confusing and click-baitey. I'm 100% fine with my edit to address that being improved, but I'm not fine with rolling it back to what it was. If you want to poke around an edit and the mods feel like it's an improvement, don't let the fact that it's my edit stop you :) I don't understand some of your criticism of the change, but I don't need to if you've got something you think would work better because it should be obvious then. – Tim Post Apr 16 '19 at 12:00
  • @TimPost: The original title asked about a specific claim that could be addressed from readily available sources. As such, I think it was a better question than what has followed, although it could perhaps have been elaborated on to be clearer. Since then the question and title have mutated through a series of more general claims regarding Bouman's role in creating the image. So, I'm afraid I simply don't agree with your claim that it was click-baity and unclear, and since we don't agree on a problem with the original I don't see that we can agree on a new version. – Jack Aidley Apr 16 '19 at 12:13
  • 2
    Please DO NOT change the claims or add new claims to a question that already has good answers. – DJClayworth Apr 16 '19 at 14:06
  • @DJClayworth It is new question and data. otherwise the answer have no sense – SSimon Apr 16 '19 at 14:09
  • 2
    If it's a new question, please use the "New Question" button. There was a question about lines of code and it was answered. Asking about the use of the algorithm is a new question. – DJClayworth Apr 16 '19 at 14:21
  • @DJClayworth it wasn't answered, someone in a comment posted a tweet of Dr Kate Group member, apparently that group member claims her algorithm was not used in the visualization of Black hole. – SSimon Apr 20 '19 at 08:53
  • If the tweet raises a new question, you can ask with the Ask Question button. But you should probably be aware that the claim is refuted by the tweeter a few tweets later. – DJClayworth Apr 20 '19 at 15:21

3 Answers3

260

The metric does not measure what it is claimed it does, and even if it did it would be meaningless for assessing the role of Dr. Kate Bouman in creating the image. I'll go on to why, but I first want to draw particularly attention to the fact that Dr. Bouman has explicitly rejected the idea that she deserves sole credit:

But Dr Bouman, now an assistant professor of computing and mathematical sciences at the California Institute of Technology, insisted the team that helped her deserves equal credit.

The effort to capture the image, using telescopes in locations ranging from Antarctica to Chile, involved a team of more than 200 scientists.

"No one of us could've done it alone," she told CNN. "It came together because of lots of different people from many different backgrounds."

(source)

The primary reason the metric is meaningless is that Dr. Bouman is credited with developing an algorithm not with typing lines of code, so any metric measuring code production is simply not measuring the thing she is credited with doing. She could have typed not a single character and still designed the algorithm that played a key role. It's like trying to measure the input of an architect by how many bricks they laid in a building.

Additionally, the project is broader in scope than simply implementing the algorithm credited to Dr. Bouman. Large amounts of code are involved in simply loading and co-ordinating files, displaying and saving images, and the like. All of which is necessary to the project at large but not specific to the algorithm used.

Finally and least importantly, the statistics in GitHub are not even measuring Lines of Code written - as claimed in the source - they are measuring lines changed in submission. Those lines can be code, or a change to a line, or a line copied between branches, even blank lines. In fact, (and, hat tip @Polygnome) the count includes lines which aren't even in the code at all, as there is also data and documentation included.

Jack Aidley
  • 2,524
  • 2
  • 17
  • 20
  • Comments are not for extended discussion; this conversation has been [moved to chat](https://chat.stackexchange.com/rooms/92415/discussion-on-answer-by-jack-aidley-was-credit-for-the-black-hole-image-misattri). – Sklivvz Apr 14 '19 at 08:06
  • 8
    Here's an hour-long presentation about the algorithms behind the image, well worth watching. Presented by Dr. Bouman herself! https://www.youtube.com/watch?v=UGL_OL3OrCE – John Dvorak Apr 14 '19 at 16:34
  • 4
    Also, some of the best changes to a repository is when one can replace 100 lines of code with 10. – Per Alexandersson Apr 14 '19 at 16:57
  • 12
    This answer fails to mention CHIRP, the [actual algorithm that Dr Bouman was credited with developing](https://news.mit.edu/2016/method-image-black-holes-0606) years ago, or how it relates to the black hole image, or how it relates to the code base in question. I believe the answer is "it was essential" and "it has next to nothing to do with that code base", but I am uncertain; if right, however, it means that we are measuring how much a sprinter who happens to be a licensed pilot helped win medals by how long he was in the airplane cockpit of the plane that flew them to the olympics. – Yakk Apr 14 '19 at 17:13
  • 1
    @Yakk: yeah, the question was changed completely after I answered. I was addressing the more specific claim highlighted in the original version. – Jack Aidley Apr 14 '19 at 17:30
  • 3
    The whiteboard is mightier than the github commit. – mckenzm Apr 15 '19 at 05:19
133

Technically, that is the percentage of the code she contributed, 2410 lines in 90 commits.

From https://github.com/achael/eht-imaging/graphs/contributors

enter image description here

But that tells us nothing about what the code does. Andrew Chael, the man who is credited with doing the work in that article, has spoken up against the rhetoric used against her.

From https://www.independent.co.uk/life-style/gadgets-and-tech/news/katie-bouman-black-hole-image-first-telescope-a8866536.html

"While I wrote much of the code for one of these pipelines, Katie was a huge contributor to the software; it would have never worked without her contributions and the work of many others who wrote code, debugged, and figured out how to use the code on challenging EHT data.

"With a few others, Katie also developed the imaging framework that rigorously tested all three codes and shaped the entire paper.

"As a result, this is probably the most vetted image in the history of radio interferometry. I'm thrilled Katie is getting recognition for her work and that she's inspiring people as an example of women's leadership in STEM.

Jerome Viveiros
  • 2,639
  • 2
  • 16
  • 26
  • @JeromeViveiros can you edit the answer to reflect the new question? – SSimon Apr 13 '19 at 10:44
  • 18
    "that is the _percentage_ of the code she contributed" (emphasis mine) - this isn't correct, since most of those lines are not actually "code" i.e. while at a quick glance it looks like `achael` wrote most of the python in the repo, they did not write 850,000 lines (unsurprisingly). `find . -name '*.py' | xargs wc -l` shows there are about 36K of what a reasonable person would call "lines of code" – jberryman Apr 13 '19 at 23:08
  • 3
    @jberryman They **did** write 850k lines of code. The fact that most of these lines were overwriting other lines does not mean they did not write them. A painter can easily say that they created a drawing with 850k strokes, even though the outermost layer of paint that is visible at the end of the process only contains 36k strokes. – Bakuriu Apr 14 '19 at 20:41
  • 9
    @Bakuriu: Chael said *no*. The majority of that 850k is data, not *code* anybody wrote. «Also I did not write "850,000 lines of code" -- many of those "lines" tracked by github are in model files. There are about 68,000 lines in the current software, and I don't care how many of those I personally authored.» See: https://twitter.com/thisgreyspirit/status/1116518550297096194 – Gábor Apr 14 '19 at 20:57
  • 4
    @Bakuriu no, most of those lines are data of some sort, e.g. what look like trained model parameters. (I realize I wasn't clear about that in my comment). I spent about 3 minutes looking at the repo, so I can't give you a more detailed analysis, but just enough to know that the way the data was being interpreted is nonsense, and people are terrible – jberryman Apr 14 '19 at 20:57
  • It's furthermore completly implausible for a image rendering library to hold 850.000 meaningful lines of code. – Magisch Apr 15 '19 at 09:08
  • Also, *Technically, that is the percentage of the code she contributed, 2410 lines in 90 commits* using the word percentage here is unprecise since code is not meaningfully or regularly measured by lines of the whole. – Magisch Apr 15 '19 at 09:20
41

The old title asked "Did researcher Katie Bouman only contribute 0.26% of code that created Black Hole image," and the existing answers do a good job explaining why it isn't true and why lines of code aren't a useful metric. The new title, however, asks "Was credit for the black hole image misappropriated?" and the correct answer should appear rather differently.

On the one hand, we know that Bouman deserves a large share of credit.

From https://www.cnn.com/2019/04/10/us/katie-bouman-mit-black-hole-algorithm-sci-trnd/index.html

"(Bouman) was a major part of one of the imaging subteams," said Vincent Fish, a research scientist at MIT's Haystack Observatory.

For the past few years, Bouman directed the verification of images and selection of imaging parameters.

"We developed ways to generate synthetic data and used different algorithms and tested blindly to see if we can recover an image," she told CNN. "We didn't want to just develop one algorithm. We wanted to develop many different algorithms that all have different assumptions built into them. If all of them recover the same general structure, then that builds your confidence."

That's where Bouman's algorithm -- along with several others -- came in. Using imaging algorithms like Bouman's, researchers created three scripted code pipelines to piece together the picture.

However, Bouman does not deserve all of the credit. So, for the claim to be true, we would need to see that journalists claimed that she deserves all of the credit.

CNN's headline is sufficiently reserved and the article goes far in explaining Bouman's work.

That image of a black hole you saw everywhere? Thank this grad student for making it possible

However, BBC News was rather more sensational.

Katie Bouman: The woman behind the first black hole image

Furthermore, it claims:

The black hole image, captured by the Event Horizon Telescope (EHT) - a network of eight linked telescopes - was rendered by Dr Bouman's algorithm.

However, as CNN explained above, multiple algorithms were used.

Another source, phys.org went with the factually inaccurate claim in its headline:

Scientist superstar Katie Bouman designed algorithm for black hole image

(it is arguable what is being claimed due to a missing article, although in such a headline "the" might be presumed)

So, I can see evidence for the claim that credit was misappropriated by some publications. Off topic: I find it rather unfortunate, too, since clickbait headlines and claims drive societal schisms.

  • 5
    OP has since added the text of a tweet that they claim supports a view that Dr. Bouman deserves *zero* credit. So it seems like OP is coming from the standpoint that, if she has been given *any* credit is is therefore misappropriated or misapplied. You might want to address that as well. – GalacticCowboy Apr 13 '19 at 20:53
  • The phys.org piece seems to be a syndicated AFP text. – E. P. Apr 15 '19 at 07:28