Skip to contents

_R1 – common from illumina sequencer SRR****_1.fastq.gz – typical from SRA

Usage

DetectRawFileMeta(rawRoot, verbose = FALSE)

Arguments

rawRoot

Path to folder with FASTQ files

Value

A data frame with metadata for the raw input files

Details

TODO Would be convenient to handle multiple samples, as sample1/xxx; in this case, should prepend the sample name to the barcodes.

issue: when shardifying, good to keep info about what to merge. this reduces the work plenty! could keep a list of which shards belong for the next step