We characterized the PubMed articles that mention “case series” in the title or abstract (published 01/01/1987 - 12/31/2023, written in English). We removed articles which discuss (rather than report the results of) case series studies, as well as those better indexed as other standard publication types. A random sample of these articles was evaluated by two annotators who confirmed that the great majority satisfy a formal definition of “case series”. The endpoint is a corpus of case series studies, listed by their PMIDs, that is suitable to use as a training set for automated machine learning indexing methods.
A manuscript describing the corpus in detail is forthcoming soon.