rxiv module

class pygetpapers.repository.rxiv.Rxiv(api='biorxiv')

Bases: RepositoryInterface

Biorxiv and Medrxiv repositories

At present (2022-03) the API appears only to support date searches. The rxivist system is layered on top and supports fuller queries

apipaperdownload(query_namespace)

Takes in the query_namespace object as the parameter and runs the query search for given search parameters.

Parameters: query_namespace (dict) – pygetpaper’s namespace object containing the queries from argparse

download_and_save_results(query, cutoff_size, source, update=False, makecsv=False, makexml=False, makehtml=False)

make_request_add_papers(interval, cursor_mark, source, total_number_of_results, total_papers_list)

make_request_url_for_rxiv(cursor_mark, interval, source)

make_xml_for_rxiv(dict_of_papers, xml_identifier, paper_id_identifier, filename)

noexecute(query_namespace)

Takes in the query_namespace object as the parameter and runs the query search for given search parameters but only prints the output and not write to disk.

Parameters: query_namespace (dict) – pygetpaper’s namespace object containing the queries from argparse

rxiv(query, cutoff_size, source='biorxiv', update=None, makecsv=False, makehtml=False)

rxiv_update(interval, cutoff_size, source='biorxiv', update=None, makecsv=False, makexml=False, makehtml=False)

update(query_namespace)

If there is a previously existing corpus, this function reads in the ‘cursor mark’ from the previous run, increments in, and adds new papers for the given parameters to the existing corpus.

Parameters: query_namespace (dict) – pygetpaper’s namespace object containing the queries from argparse