Published November 18, 2025 | Version v1
Dataset Open

Dataset for Incorporation of the NIH Sex as a Biological Variable Policy by R01 Awardees

Contributors

Writing role:

Description

This dataset contains deidentified records derived from NIH RePORTER, a publicly accessible database of NIH-funded research projects. The data include information on R01 awards funded in fiscal years 2017 and 2018 that reported project outcomes, totaling 7,671 projects. From these, a randomized subset of 1,000 awards was screened to identify the most recent peer-reviewed publication associated with each project. For 574 articles meeting inclusion criteria, detailed metadata were manually extracted, including PubMed ID, journal, publication year, author names, subject type (human, non-human, or both), and sex-related reporting practices. Additional coding captured whether studies reported sex-based analyses or provided rationales for single-sex designs. Principal investigator and author gender were inferred using GenderAPI based on first names, enabling categorization of authorship dyads. The dataset supports analyses of sex inclusion, sex-based reporting, and gender representation in NIH-funded research.

Files

Files (2.6 MB)

Name Size Download all
md5:523094aa4707c5b2add3787a31653876
2.6 MB Download

Additional details

Identifiers

Other
NA

Related works

Documents
Dataset: NA (Other)

Dates

Valid
2025-11-18

References

  • NA