Skip to main content
eScience in Action

Building Open Qualitative Science with Open Curriculum


Abstract

Open science has been gradually embraced over recent decades both in libraries and across a wide range of disciplinary communities. It promises paths toward transparency, reproducibility, and evidence synthesis — all in service of the ultimate goals of trustworthy and high-impact scientific research.

Qualitative researchers, however, have been slow to accept open science goals as unqualified goods and adopt open science practices, such as data and process sharing. Their hesitancy comes for a variety of reasons, some as prosaic as software and data limitations and others embedded in epistemological assumptions.

This paper describes how recent developments in infrastructure, technology, and methodology have begun to reduce this adoption gap between quantitative and qualitative researchers. Capitalizing on these opportunities, however, requires those training new qualitative researchers (including an increasing number of instructional librarians and faculty) have the technical and methodological skills to integrate open science throughout qualitative workshops and courses.

We then detail how we developed and pilot tested a pair of open qualitative research curricula for novice learners and instructors who may have limited qualitative research training themselves. The curricula use entirely open resources, including the Qualitative Data Repository, two open-source qualitative data analysis software packages (Taguette and QualCoder), the open REFI-QDA standard for the exchange of coded qualitative data, and the Carpentries Workbench curriculum infrastructure. This allows wide adoption of the curriculum with the ability to re-mix and finetune lessons and data to the target audience and without the need to maintain licenses for any services.

Keywords: qualitative, mixed methods, open education, data education, open science

How to Cite:

Porter, Nathaniel D., and Sebastian Karcher. 2026. “Building Open Qualitative Science with Open Curriculum.” Journal of eScience Librarianship 15 (1): e1172. https://doi.org/10.7191/jeslib.1172

Rights:

Copyright © 2026 The Author(s). This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC-BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Funding

Name
Institute of Museum and Library Services
FundRef ID
https://doi.org/10.13039/100000208
Funding ID
LG-250128-OLS-21
Funding Statement

Curriculum development was funded in part through IMLS National Leadership Grant LG-250128-OLS-21, in association with Syracuse University and the Qualitative Data Repository.

Name
Institute of Museum and Library Services
FundRef ID
https://doi.org/10.13039/100000208
Funding ID
RE-252335-OLS-22
Funding Statement

Curriculum development was also funded in part through IMLS Laura Bush 21st Century Librarian Program Grant RE-252335-OLS-22 in association with UCLA and Library Carpentry.

90 Views

31 Downloads

Published on
2026-01-30

Peer Reviewed

5da457f7-a26e-4b1e-9d58-8b0affd5f3ce

Introduction

For decades, the open science movement has been building among quantitative researchers, aiming to share data and analysis as completely as possible. In part, this is meant to improve scientific transparency, so findings can be checked and verified (Open Science Collaboration 2015), but it also serves to provide a basis for evidence synthesis and meta-analysis (Palmer et al. 2016; Perrino et al. 2013), developing future studies (Bader and Finke 2017), and methodological training (Sobal 1982; Standing 2016).

Qualitative researchers have been much slower to adopt data and analysis sharing, for a few different reasons, including both technical and normative constraints. On the technical side, qualitative data has had limited standards, and analysis has often been conducted manually or with specialized software with limited compatibility with other software (Evers et al. 2020).

Perhaps more importantly, qualitative researchers tend to emphasize the role of both the context of data collection and the positionality of the researcher in the research process — in contrast to quantitative methods that often seek precisely to minimize the subjectivity of their methods and draw wide-ranging conclusions about entire populations (Karcher et al 2021, Mauthner and Parry 2009).

Moreover, qualitative data often presents challenges for protecting privacy when sharing data, because interviews, focus groups, and other qualitative methods provide rich detail that can make it easier to identify participants even if direct identifiers are removed, as well as often dealing with sensitive issues (Kirilova and Karcher 2017).

Despite these challenges, however, there is a small but growing movement to share and reuse qualitative and mixed methods data and analysis, helped along by recent technical developments. In the United Kingdom, archiving and sharing of qualitative data started in the late 1990s with the pioneering work of QualiBank, now part of the UK Data Archive (Corti 2000); in the United States, more recent work by ICPSR and the Qualitative Data Repository (QDR) (spurred, not unlike in the UK, by pressure from funding agencies to make funded data products widely available) has led to broader availability of appropriate infrastructure for sharing qualitative data (DuBois et al. 2023). 

Developing Open Qualitative Curriculum

Libraries often find themselves the only campus units positioned to offer training in qualitative methods, data sharing and reuse, and qualitative data analysis software (QDAS), at least beyond courses in individual departments that are intended primarily for their own majors or graduate students. Given the range of responsibilities of data librarians, many of whom support data-driven research as only a small portion of their duties, there is a need for low- or no-cost, readily accessible and adaptable curriculum for this type of training.

Data repositories fill some of this need by offering training materials such as QDR’s Teaching Resources page (QDR 2021), but these are frequently limited to guides for data management, sharing, and reuse, with limited, if any, training in specific QDAS. Some software distributors offer structured training, in addition to manuals and user guides, but these tend to be recorded video demonstrations (and often require additional fees, such as the NVivo modules in Lumivero Academy; see Lumivero 2025). Video training is available freely through some professional organizations, such as the IASSIST Qualitative Social Science & Humanities Data Interest Group (iassistdata 2025). Such training, while valuable, lacks two key features of effective workshops for technology training - interactivity and responsiveness to participants’ interests and interventions. Moreover, many available tutorials focus on the most popular paid QDAS. Many universities offer interactive and evolving workshops through their libraries or other units, but these tend to be developed more or less independently, leading to a great deal of duplicate effort and gaps based on existing expertise of library teams.

A Fertile Ground

Over the last 5-10 years, several concurrent developments have enabled a dramatic change in how we teach computer-assisted qualitative methods. In addition to the increase in qualitative data archives discussed above, the REFI-QDA standard was launched in 2019 (qdasoftware.org; see also Evers et al. 2020), making it simple to move entire QDAS projects between a variety of packages for the first time. At the same time, several free, libre and open source (FLOSS) QDAS packages have debuted, most notably Taguette (Rampin and Rampin 2021) and QualCoder (Curtain 2025), with easier-to-use feature sets suited to a variety of project types, maintained well after their initial releases.

An opportunity to dedicate time to creating training resources with some of these new tools arose when the University of California, Los Angeles (UCLA) selected Open Qualitative Science as part of its new Library Carpentry lesson development cohort (UCLA Libraries 2025; for Library Carpentry see The Carpentries 2025), funded by the Institute of Museum and Library Services (IMLS). We selected Taguette to use for the initial lesson because it combined stable cross-platform compatibility with a clean and easy-to-learn interface. The Taguette lesson was developed to provide basic training in qualitative data reuse, qualitative data analysis and reasoning, and data sharing (Porter 2024).

Taguette also has downsides, however, three of which proved significant enough to consider developing a parallel lesson for a different QDAS: Taguette only works with a limited number of text-based datafile types, it has limited (codebook-only) compatibility with REFI-QDA data, and it provides no tools for mixed methods analysis of qualitative data through queries. As part of a larger IMLS-funded project to expand QDR’s infrastructure to take full advantage of REFI-QDA, the opportunity arose to develop such a lesson using QualCoder (Porter and Karcher 2025), a less intuitive but more full-featured open-source QDAS, once QualCoder reached development maturity (less bugs, out of the box support for all major operating systems, etc.).

Both lessons, as developed, follow three interconnected guiding purposes:

  1. Provide OA training for FLOSS QDAS that is widely discoverable and has support for ongoing development and maintenance

  2. Integrate QDR data and REFI-QDA standards deeply into QDAS training where possible, not as separate offerings (which are inevitably less attractive to busy students and faculty looking to learn they minimum they need to complete their analysis)

  3. Use the live-coding/live-demonstration philosophy to integrate experiential learning with the tool, repository, and standard that equips learners for their own projects

Pilot Workshops and Outcomes

Once initial development of the curricula was complete, a series of pilot workshops were taught between Fall 2024 and Summer 2025 to test and refine the lessons for a variety of audiences. Each curriculum was taught twice as a three-hour hybrid or virtual workshop through a research-intensive university’s professional development network, with a multi-disciplinary audience of primarily graduate students and faculty, though they were also open to undergraduates, staff, and community members. In addition, QualCoder was offered at two professional conferences for library and archive professionals, the Research Data Access and Preservation Summit (RDAP) 2025 (virtual) and the annual meeting of the International Association of Social Science Information Service & Technology (IASSIST) 2025 (in-person).

All pilot workshops were delivered using a participatory live coding approach (Nederbragt et al. 2020). In live coding, a team of 2 or more co-instructors takes turns demonstrating skills in real-time, with participants performing the same or similar actions, supporting practice-based active learning (Brown et al. 2023). They do so by using their own computers and software, thus ensuring the learning environment closely matches their likely research environment compared to working in custom web or containerized (i.e. Docker) environments. Instructors not actively teaching make themselves available to respond to learner questions or errors in real-time, whether in the classroom or via Zoom chat. Pilot workshops differed from traditional Carpentries workshops in two ways: there were no helpers other than the co-instructors and they were delivered individually rather than as part of a more comprehensive 2-day workshop covering multiple software packages.

The two initial university pilots of the Taguette and QualCoder workshops were each attended by approximately 30 participants and taught by the lesson developers (individually or in collaboration). Few participants responded to post-workshop survey invitations, but informal feedback indicated that the workshops were helpful in choosing software, learning philosophies and methods of qualitative coding, and finding and using secondary data. At least five participants requested follow-up consultations to support Taguette or QualCoder implementation in new or ongoing research projects.

The RDAP workshop was attended by around 50 virtual participants. The workshop was co-taught by the lesson developers. An additional RDAP facilitator was present throughout the session, monitoring the chat and available to provide Zoom support. RDAP solicits standardized feedback for all workshops, which was completed by 13 participants. 11 out of 13 participants rated the workshop as 4 or 5 on a 5-point scale for “I found the workshop engaging and useful.” Positive qualitative feedback emphasized the “hands on” nature of the workshop as well as the “examples and activities.” Several participants emphasized that they valued that the software introduced (QualCoder) was open source. Constructive comments focused particularly on pacing, highlighting both the general difficulty in providing differentiated instruction in large workshops and, more specifically to this workshop, a challenge on whether to emphasize the qualitative coding or the software-related elements of the workshop.

The workshop at IASSIST was taught by one lesson developer together with an expert instructor not involved in the course creation to 8 in-person participants. Given both lower attendance and one hour of additional time, facilitators added two components not included in the published course: 1) frequent pauses to share tips and experiences for teaching QualCoder with the audience (mainly data librarians) and 2) a short discussion and demonstration of the AI features included in QualCoder since version 3.7. While there is significant disagreement about AI use in qualitative research (e.g., Davison et al. 2024), workshop participants expressed curiosity about the functionality, indicating it may be wise to consider including some discussion of AI in future workshops. Due to the lengthy initial set-up and required API keys for AI services, we would recommend a brief demonstration over hands-on activities for most contexts.

Overall, the workshop was found to be very effective in its current form. To allow personalized instruction, the number of participants may be limited to about 20, with clear advertisement of the principal goal of the workshop (e.g., learning QualCoder or Taguette, learning computer-assisted qualitative coding). The data used in the course can also be substituted with relative ease if participants have a particular background or expertise, with the Carpentries Workbench infrastructure (Davey et al. 2025a, 2025b, 2025c) providing user-friendly tools for adapting lesson length and content.

Ongoing Use and Development

Both lessons are designed to be teachable by any instructor with a basic understanding of qualitative research and Carpentries teaching methodologies (Brown et al 2023). The QualCoder lesson is now in beta status, having been taught multiple times by both the authors and other instructors. Taguette has still only been taught by the author and thus remains in alpha (The Carpentries 2025), but additional pilot testing is underway. We intend for both to eventually become a stable part of the Library Carpentry suite of lessons. The availability of open data, standards, and tools finally make it viable to develop and teach a Carpentries lesson on computer assisted qualitative data analysis. As all Library Carpentry lessons, this lesson is licensed under a creative commons attribution license and can be freely used, copied, and remixed as long as credit is given to its creators — as part of official Carpentries instruction or in self-organized workshops. Both QualCoder and the data at QDR are also free to use, so integrating the lesson into a library’s curriculum requires no licenses or other financial commitments.

As part of QDR’s work on QDAS data, a wider variety of REFI-QDA formatted data is expected to become available at QDR and beyond over the next 12 months.

Conclusion

Collaboratively developing lessons on qualitative research and software using The Carpentries framework and infrastructure has exposed two key challenges of adapting approaches primarily intended for training in quantitative tools to more diverse approaches.

First, the live coding framework (Nederbragt et al 2020), while adaptable to non-coding live demonstration, relies in part on the ability of learners with different experience levels and computing speed to continue viewing code after it is run, something which is generally impossible for mouse- and menu-driven commands. Given this limitation, successfully running inclusive trainings that leave no learners behind may require more co-instructors or instructional assistants per learner than traditional live coding workshop to help learners resolve errors or catch up.

Second, the lessons’ guiding purposes proved ambitious in seeking to simultaneously train novice users on software, qualitative research methodology, OA and data reuse. Particularly in the case of QualCoder, keeping the workshop short enough to be taught during a term when most potential learners are teaching or enrolled in classes that can create schedule conflicts, necessitated simplifying or omitting major areas of functionality. This is, however, consistent with Carpentries philosophy, which emphasizes building literacy in core tool functionality over complete training for a single research workflow. In this way, learners build knowledge, practice skills, and internalize positive affect that enable them to continue learning on their own. In the future, the authors intend to develop optional add-on modules for teaching mixed methods analysis and embedded AI tools with QualCoder for situations where more time is available.

It is of course our hope that these lessons will be widely adopted within and beyond libraries for qualitative research software training. Beyond that, however, we believe these lessons and this paper provide two additional springboards for future improvement. They provide the first openly licensed curriculum using both FLOSS software and open data for maximum transparency and flexibility. Finally, the insights gleaned from observation and feedback as they are delivered can be used by both librarians and educators in diverse disciplines to teach qualitative research skills more effectively, regardless of the software and data sources they use.

References

Bader, Christopher D., and Roger Finke. 2017. “Evaluating Survey Measures Using the ARDA’s Measurement Wizard.” In Faithful Measures: New Methods in the Measurement of Religion, edited by Christopher D. Bader and Roger Finke. NYU Press. https://doi.org/10.18574/nyu/9781479875214.003.0006

Brown, Sarah M., Karen Word, Maneesha Sane, et al. 2023. carpentries/instructor-training: The Carpentries Instructor Training Februrary 2023. V. v2023.02.06. Zenodo, released February. https://doi.org/10.5281/zenodo.7612756.

Corti, Louise. 2000. “Progress and Problems of Preserving and Providing Access to Qualitative Data for Social Research—The International Picture of an Emerging Culture.” Forum Qualitative Sozialforschung / Forum: Qualitative Social Research 1 (3). https://doi.org/10.17169/fqs-1.3.1019

Curtain, Colin. 2025. QualCoder: Qualitative Data Analysis for Text, Images, Audio, Video. Python. V. 3.7. Released. https://github.com/ccbogel/QualCoder

Davey Robert, Erin Becker, and Zhian Kamvar. 2025. pegboard: Explore and Manipulate Markdown Curricula from the Carpentries. R package version 0.7.9. https://carpentries.github.io/pegboard.

Davey Robert, Erin Becker, and Zhian Kamvar. 2025.varnish: Front-end for The Carpentries Lesson Infrastructure. R package version 1.0.7. https://carpentries.github.io/varnish.

Davey Robert, Erin Becker, Zhian Kamvar, and Joel Nitta. 2025. sandpaper: Create and Curate Carpentries Lessons. R package version 0.17.1. https://carpentries.github.io/sandpaper.

Davison, Robert M., Hameed Chughtai, Petter Nielsen, et al. 2024. “The Ethics of Using Generative AI for Qualitative Data Analysis.” Information Systems Journal 34 (5): 1433–1439. https://doi.org/10.1111/isj.12504

DuBois, James M., Jessica Mozersky, Meredith Parsons, Heidi A. Walsh, Annie Friedrich, and Amy Pienta. 2023. “Exchanging Words: Engaging the Challenges of Sharing Qualitative Research Data.” Proceedings of the National Academy of Sciences 120 (43): e2206981120. https://doi.org/10.1073/pnas.2206981120

Evers, Jeanine, Mauro Ugo Caprioli, Stefan Nöst, and Gregor Wiedemann. 2020. “What Is the REFI-QDA Standard: Experimenting With the Transfer of Analyzed Research Projects Between QDA Software.” Forum Qualitative Sozialforschung / Forum: Qualitative Social Research 21 (2): 2. https://doi.org/10.17169/fqs-21.2.3439

iassistdata. 2025. “Official YouTube Channel of the International Association for Social Science Information Services and Technology.” https://www.youtube.com/channel/UC315efmsReDcFbWHpWBmb9g

Karcher, Sebastian, Dessislava Kirilova, Christiane Pagé, and Nic Weber. 2021. “How Data Curation Enables Epistemically Responsible Reuse of Qualitative Data.” The Qualitative Report 26 (6): 1996–2010. https://doi.org/10.46743/2160-3715/2021.5012.

Kirilova, Dessi, and Sebastian Karcher. 2017. “Rethinking Data Sharing and Human Participant Protection in Social Science Research: Applications from the Qualitative Realm.” Data Science Journal 16 (September): 43. https://doi.org/10.5334/dsj-2017-043.

Lumivero. 2025. “Lumivero Academy.” https://lumivero.com/resources/academy

Mauthner, Natasha S., and Odette Parry. 2009. “Qualitative Data Preservation and Sharing in the Social Sciences: On Whose Philosophical Terms?” Australian Journal of Social Issues 44 (3): 3. https://doi.org/10.1002/j.1839-4655.2009.tb00147.x

Nederbragt, Alexander, Rayna Michelle Harris, Alison Presmanes Hill, and Greg Wilson. 2020. “Ten Quick Tips for Teaching with Participatory Live Coding.” PLOS Computational Biology 16 (9): e1008090. https://doi.org/10.1371/journal.pcbi.1008090

Open Science Collaboration. 2015. “Estimating the Reproducibility of Psychological Science.” Science 349 (6251): aac4716. https://doi.org/10.1126/science.aac4716

Palmer, Margaret A., Jonathan G. Kramer, James Boyd, and David Hawthorne. 2016. “Practices for Facilitating Interdisciplinary Synthetic Research: The National Socio-Environmental Synthesis Center (SESYNC).” Current Opinion in Environmental Sustainability 19 (April): 111–122. https://doi.org/10.1016/j.cosust.2016.01.002

Perrino, Tatiana, George Howe, Anne Sperling, et al. 2013. “Advancing Science Through Collaborative Data Sharing and Synthesis.” Perspectives on Psychological Science 8 (4): 433–444. https://doi.org/10.1177/1745691613491579

Porter, Nathaniel. 2024. “Open Qualitative Research with Taguette.” https://librarycarpentry.github.io/lc-qualitative-taguette.

Porter, Nathaniel, and Sebastian Karcher. 2025. “Open Qualitative Research with QualCoder.” https://librarycarpentry.github.io/lc-qualitative-qualcoder.

QDR. 2021. “Teaching Resources.” https://qdr.syr.edu/guidance/teaching.

Rampin, Rémi, and Vicky Rampin. 2021. “Taguette: Open-Source Qualitative Data Analysis.” Journal of Open Source Software 6 (68): 3522. https://doi.org/10.21105/joss.03522

Sobal, Jeff. 1982. “The Role of Secondary Data Analysis in Teaching the Social Sciences.” Library Trends 30: 479–488. https://www.ideals.illinois.edu/items/7175/bitstreams/25887/data.pdf.

Standing, Lionel G. 2016. “How to Use Replication Team Projects in a Research Methods Course.” In Essays from E-xcellence in Teaching 15 (pp. 26-36). Edited by W. Altman, L. Stein, & J. E. Westfall. Retrieved from https://www.researchgate.net/profile/Jacqueline-Espinoza/publication/327190241_E-xcellence_in_writing_Reducing_undergraduate_writing_apprehension_with_the_basic_psychological_needs/links/5b7f18fca6fdcc5f8b6370c5/E-xcellence-in-writing-Reducing-undergraduate-writing-apprehension-with-the-basic-psychological-needs.pdf#page=31.

The Carpentries. 2025. “Library Carpentry.” https://librarycarpentry.org

The Carpentries. 2025. “The Life Cycle of Lessons.” The Carpentries Handbook. https://docs.carpentries.org/resources/curriculum/lesson-life-cycle.html.

UCLA Libraries. 2025. “Open Science for Librarians.” https://ucla-imls-open-sci.info.