Hi there,
I'm a researcher studying software evolution. As part of my current research, I'm studying the implications of open-sourcing a proprietary software, for instance, if the project succeed in attracting newcomers. However, I observed that some projects, like _reason_, deleted their software history.
https://github.com/facebook/reason/commit/1037c7dd36dfe94d7f03a7cb4a7c073f5d3cb840
Knowing that software history is indispensable for developers (e.g., developers need to refer to history several times a day), I would like to ask reason developers the following four brief questions:
Thanks in advance for your collaboration,
Gustavo Pinto, PhD
http://www.gustavopinto.org
Hey! I'll answer this one, but note that my views are strictly my own and do not necessarily represent the views of my employer blablabla...
facebook/ contains projects actively used by Facebook. This is good for reputation building as well imo.You could argue that we can still sync out the history; The more important aspect of this is that, because it's used internally, PRs/commits/discussions contain security sensitive information. Open-sourcing a code dump already needs to go through some security/privacy checks. Going through potentially thousands of commits, realistically, means no one will take on the heroic task of even open-sourcing the product. When React was open-sourced, Pete Hunt had to shake it off of internal dependencies. Had he not been there, the few contributors React had would probably have done it but later.
Also, remember that often something started as one person's random weekend project. Keeping a pristine history might not have been a priority.
Does this answer your question? If so, feel free to close this issue =).
thank you!
Hi @chenglou,
thanks once again for answering our research inquiries. We were able to collect 35 responses and we drafted a research paper with the results. The paper was submitted and accepted for the 14th International Conference on Open Source Systems (http://oss2018.org/). You can find the paper here. Hope you enjoy reading the paper!
Thanks again,
Gustavo
Most helpful comment
Hey! I'll answer this one, but note that my views are strictly my own and do not necessarily represent the views of my employer blablabla...
facebook/contains projects actively used by Facebook. This is good for reputation building as well imo.You could argue that we can still sync out the history; The more important aspect of this is that, because it's used internally, PRs/commits/discussions contain security sensitive information. Open-sourcing a code dump already needs to go through some security/privacy checks. Going through potentially thousands of commits, realistically, means no one will take on the heroic task of even open-sourcing the product. When React was open-sourced, Pete Hunt had to shake it off of internal dependencies. Had he not been there, the few contributors React had would probably have done it but later.
Also, remember that often something started as one person's random weekend project. Keeping a pristine history might not have been a priority.
Does this answer your question? If so, feel free to close this issue =).