Four key themes emerged: conception and meaning of data sharing, reward system for data-sharing, researchers’ autonomy and data control, and institutional data sharing governance
1. Conception and meaning of data-sharing
The understanding, meaning and implications of data-sharing varied among the participants. While most researchers demonstrated a comprehensive grasp of the concept, some expressed limited clarity. A professor holding an administrative position, IDIR01 defined data-sharing as, “making data accessible to other people who could potentially use that data.” Data-sharing’ was thus understood as both a means and outcome of making data accessible to other people who could potentially use it. It was seen as a process involving the transfer of ownership, ceding of power, loss of control, and expectations associated with allowing others (often unofficial) to use the data. At the same time, other participants illustrated limited knowledge, for instance one mentioned, “I must admit, I’m not very familiar with it, and I’m not entirely confident in my understanding.” (IDIR12, a PhD student)
Some researchers viewed data sharing as a means to foster research collaborations. They observed that making datasets available for sharing draws potential collaborators and promotes collaboration. As Professor IDIR20, with large portfolio of collaborations, expressed “The more you open up your data for others to use, the more you can attract collaborators to work with you.” Others however considered the intersection between collaboration, data sharing, and data ownership. They questioned whether using the same dataset within a collaborative project constituted data-sharing or data co-ownership. A professor argued that data-sharing involves providing data to external parties, not collaborators, stating,
"...if you are involved in a collaboration, and your collaborators are utilizing the data you generated together, I believe that could be considered co-ownership. However, in the strictest sense, data-sharing involves providing my data to someone anonymous" (IDIR22).
2. Unfair Reward System for Data-Sharing
Incentives
Researchers highlighted the general unfairness of the current reward system, noting that data-sharing thrives when supported by appropriate incentives. They stressed the importance of mechanisms to reward those who invest time and resources in creating valuable datasets. While formal acknowledgment in publications was seen as a positive step, many researchers believe that co-authorship on publications that utilize their data is a more equitable and meaningful incentive.
As a PhD fellow remarked, "If they acknowledged my contribution, I wouldn't have any problem with it. Because that is what research is all about” (IDIR02). On the same note, a junior researcher suggested, "People who access secondary data need to take an extra step, not merely acknowledgement but reaching out to those who share data and exploring the possibility of co-authorship…” (IDIR13).
Participants also observed that the current research assessment system, heavily focuses on publications and grants, and often overlooks data-sharing contributions. Participants proposed incorporating data-sharing metrics into research evaluations. As IDIR13 a junior researcher noted,
"…if we implement a scoring system that values data-sharing leading to publications, it not only elevates the institution's profile but also warrants recognition and rewards for individuals within the promotion criteria…"
The concept of a tiered reward system based on researchers' financial investment in data generation was also suggested but this provoked debate. Some especially those who invested personal funds believed that such a system would incentivize data sharing. As a recently graduated PhD student stated,
“Data collection often involves significant costs, and I incurred some of those out-of-pocket expenses for my research. I believe researchers who contribute to data collection should be compensated. Perhaps a percentage of funding could be allocated for data collection contributions, or a separate fund to support those who invest their resources in research.” (IDIR12)
However, others like IDIR20 a junior staff experienced in research regulation, argued that the inherent benefits of research, such as career advancement and professional development, already compensate self-funded researchers. He noted that,
Even if researchers invest personal funds, they have gained valuable experience by completing their research, answering their research question, and potentially publishing the findings. This, in itself, serves as a form of compensation.
Risks and Benefits
Researchers in this study opined that incentives alone might not be sufficient to encourage data sharing. They emphasized the importance of carefully considering risks and benefits before sharing data, even with incentives in place. Several benefit and risks data-sharing were highlighted including; increased efficiency in resources utilization, fosters collaboration, and bolstering research integrity.
Researchers mentioned that sharing data eliminates the unnecessary duplication of research activities thus saving resources and reducing the burden to both researchers and participants, as a very experienced researcher stated, “Using existing datasets can save a lot of time and finances as well as being safe to participants, and less cumbersome to researchers.” (IDIR06)
It also enhances data utility and increases research output. As a junior researcher IDIR13 aptly stated, "We can utilize the same data set to investigate many questions." Failing to share data can lead to a waste of resources, as further noted by IDIR13, “...sometimes you hold on to data thinking, oh, I'll use it, and then you actually don't really use it. And you know, it's wasted resources.”
Additionally, sharing data enables researchers to gain a more comprehensive understanding of research problems from diverse perspectives, facilitating quicker resolution of societal issues. As a researcher noted, “Data-sharing enables us to tackle research problems from multiple angles, enriching our understanding and ultimately contributing to addressing societal challenges." (IDIR13)
Researchers also considered data-sharing as a form of peer review that promotes data integrity, as IDIR02, a PhD student articulated: "Data sharing allows scrutiny of data quality and the interaction between researchers can further enhance data integrity."
While data sharing offers significant benefits, researchers also recognize potential risks. These include data misuse, privacy breaches, loss of intellectual property, exploitation, unwarranted scrutiny, unauthorized use, insufficient attribution, and loss of control over their data. Balancing these risks against the benefits is crucial when deciding to share data.
Researchers expressed concern that data sharing could invite undue scrutiny, potentially damaging their reputation. As IDIR02 noted, “Someone accessing data with the intent to find flaws could lead to criticism of the research and cast doubt on findings, harming the researcher’s reputation.”
Data sharing also reduces one’s control over data access, raising concerns about unauthorized use. As IDIR12, a recent PhD graduate narrated her experience, “I shared data with a colleague, who then shared it with others without my permission.”
Researchers also expressed concerns over the possibility of exploitation, where individuals might access data and use it without proper attribution or equitable sharing of benefits. As a researcher IDIR21 shared an experience, “People fear data theft. Accessible data can be used to write a paper before you do. I presented data at a conference and later saw a published paper with my data but without my name.”
3. Researchers’ Autonomy and Data Control
The study examined the factors that influence researchers' involvement in making data-sharing decisions. Findings revealed that researchers' autonomy is often constrained by various factors, such as the source of the research idea, funding mechanisms, economic status, and analytical capabilities. These factors can affect data ownership, access control, and subsequent reuse.
Research Funding Mechanism and Research Ideas Origination
Researchers reported that their autonomy in data-sharing decisions is often influenced by external factors, especially where external funding or research concept are involved. Funding agencies and individuals who conceive research ideas frequently shape data-sharing practices. The analysis revealed that
When researchers independently conceive and pursue their research projects, they often seek external funding to support their work. Such funding often comes with conditions, Including data ownership and access requirements. As Professor involved in international collaborative research participant remarked
“Data-sharing has become a prerequisite for both obtaining funding and publishing research. " (IDIR20).
However, the extent to which funders influence researchers' data-sharing decisions varied significantly. Some funders grant researchers ownership and data control rights while others don’t. For example, a PhD student recounted, "After we collected the data and had the dataset, someone expressed interest in using it in their publication. However, we declined, claiming that the data could not be released until our research was published.” (IDIR02)
When funders initiate the research idea, conceptualize the research and contract researchers to carry it out, data ownership typically resides with the funder. As one professor in the late 60s (IDIR18) explained, " Right from the start, they make it clear that the data belongs to the funder. They allow us to contribute to the writing, but the data remains under the funder's control."
Conversely, when researchers conceive the research idea and implement the project using personal savings, they generally have greater control over data sharing decisions.
Economic Status
This study found that researchers' economic status significantly influenced their ability to share data effectively. Those with limited funding often resorted to less secure methods, such as storing data on personal laptops or in locked cabinets. This not only hindered long-term preservation but also made data difficult to share and access. As IDIR13, a researcher having self-funded projects explained,
"I have a dedicated cabinet funded by the project for data storage, and I also keep some qualitative data in electronic format on my computer."
While these methods were feasible for individual researchers, they present significant barriers to long-term data preservation and shareability, requiring researchers’ physical presence to access data. Paper records were particularly vulnerable to threats like theft, fire, and even abandonment. IDIR13 further recounted a tragic experience: "We had data on paper that a colleague had kept, but when I needed it, some of it had already been destroyed." In another incident, the same researcher's laptop crashed, preventing him from sharing data with people who requested it after reading his published paper.
Conversely, well-funded researchers often stored data electronically on servers. This approach offered numerous advantages, including long-term preservation, easy access and facilitated collaboration among research partners. A professor involved in several collaborations, IDIR06 emphasized the importance of this method, stating,
We store electronic data on servers… We ensure both our partners and ourselves have copies.
Therefore, when funding agencies provided better data storage technologies, they gained more influence over data-sharing decisions. Such technology allows funders to dictate the terms and conditions for data sharing while limiting researcher input.
Analytical Capabilities
The disparity in analytical skills and access to advanced data analysis tools created an uneven playing field in research, favoring researchers with better capabilities and resources. In data-sharing collaborations, researchers with superior analytical capacities were often able to process data more quickly and efficiently, giving them a significant advantage over those with limited funding. As one researcher (IDIR21) expressed: "Sharing data is risky. Someone could easily write a paper with your data before you. Raw data offers no ownership claim. It's happened to me. We discussed data, and then a paper emerged from my data without my name."
This disparity often made researchers with limited resources hesitant to collaborate and share data fearing exploitation.
4. Institutional Data Sharing Governance
Institutional data governance, comprising of policies, cultural norms, and customs, significantly influenced the ethical landscape of data sharing. These factors define roles, responsibilities, and decision-making processes related to data-sharing. While policies provide formal guidelines, cultural norms and customs shape the informal expectations and behaviours that impact data-sharing practices. Together, they significantly influence researchers' willingness and approaches to data sharing within institutions, often prioritizing sole ownership of research data and discouraging data-sharing.
Institutional culture
Researchers reported that the prevailing cultural norms view research as a linear process involving idea conception, data collection, and exclusive researcher use of the data. This mindset is often instilled during academic training, as one influential professor noted,
"Since our undergraduate years, we've believed that researchers conceive their work, gather data, and use it exclusively. The notion of sharing was absent from this equation. The culture at our university, including the MakCHS, has long embraced this perspective." (IDIR06)
This stereotype within some academic circles can stigmatize researchers who leverage existing data, portraying them as lazy or lacking essential research skills. As an experienced researcher pointed out:
"When someone relies solely on existing data, questions arise about whether they acquire comprehensive research skills, extending beyond data analysis. This encompasses ethical research conduct and community involvement. Sharing data opens the door for anyone, including researchers perceived as lazy." (IDIR06)
Sociality in data-sharing
Researchers reported that the quality of social relations influence data-sharing. It was easier for the researchers to shared data with persons they had good quality relationships with, such as trusted colleagues, students they supervise, or close friends. However, this trend, restricted data sharing to confined tightly knitted groups, creating "silos" of information. A recent PhD graduate highlighted this issue, stating, "The way research is conducted currently... seems to be in silos" (IDIR14). This sentiment was corroborated by others, notably those who shared data with their students. One professor, emphasized that for every research project, he collaborates with at least one PhD candidate and three master’s degree students by providing them with data (IDIR06).
Participants described this as a discriminatory and isolating culture, limiting access to key datasets for researchers outside these social networks. As IDIR14 a middle level researcher eloquently put it, "You cannot share what you don’t know whether it exists or it doesn't exist."
Even on rare occasions when researchers attempted to access data, they encountered resistance and significant delays. One professor recounted a frustrating experience:
“I sought access to a colleague’s dataset after he published two papers. Unfortunately, permission to share it was granted three years later, rendering my intended analysis impossible.” (IDIR13)
Institutional policies and Guidelines
Researchers reported a lack of clear national and institutional guidelines or policies governing data sharing. This created a chaotic and unpredictable environment for researchers, hindering collaboration and potentially compromising the ethical data-sharing. This fostered the perception that data sharing is an optional, altruistic act rather than an ethical responsibility. As middle career researcher stated,
"There's no clear policy to share data, and many institutions don't support it. This creates an impression that data sharing is a benevolent act and not an ethical obligation." (IDIR14)
Without a standardized framework, researchers faced challenges in negotiating data-sharing agreements with partners, even when involving legal departments. Researchers felt vulnerable to exploitation and susceptible to the demands of funders and collaborators. As explained by a professor in an administrative position,
"By solely relying on external funders, we become overly vulnerable. Without clear national and institutional policies, researchers lack the necessary safeguards. This dependence on external actors hinders our ability to protect our interests and resources." (IDIR19)
The absence of clear policies has contributed to recent disputes over research outcomes and benefits, highlighting the urgent need for national and institutional guidelines to govern data-sharing practices.
"Given the increased investment of public funds in research, clear guidelines are imperative. Recent instances of substantial research funding with limited public benefit underscore the urgent need for clear policies and stakeholder engagement." (IDIR19)