From 45df9546c91a6f7a15b0aa9612d310ec8007e891 Mon Sep 17 00:00:00 2001 From: ElinorSterner <86856150+ElinorSterner@users.noreply.github.com> Date: Wed, 5 Apr 2023 16:01:05 -0400 Subject: [PATCH] Update README.md --- Utilities/README.md | 3 +++ 1 file changed, 3 insertions(+) diff --git a/Utilities/README.md b/Utilities/README.md index 0915631..49cb5d1 100644 --- a/Utilities/README.md +++ b/Utilities/README.md @@ -2,6 +2,9 @@ # Utilities > This folder contains many useful tools for analyzing sequence data. +# For taxonomy dir: +> Query_SRA_egs.py: input list of taxonomic names and output all SRAs or GCA since 2020 (can be adjusted by modifying script). For SRAs, the script also gives sequsncing technology used (pacbio, miseq, etc) and experiment type. It excludes all SRAs that include the word 'amplicon'. Input is folder 'unique_taxon_lists' with files of keywords by major clade (separated by new lines). Put -t (transcriptome, SRA db) or -g (genome, assembly db) in the command line to specify data type. +> >Example command line: python Query_SRA_egs.py -t OR -g ### Katz lab