Updating header in 5b_SummaryStats.py

This commit is contained in:
Auden Cote-L'Heureux 2024-01-26 11:40:30 -05:00 committed by GitHub
parent 427f9f7125
commit f3116ff874
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194

View File

@ -1,3 +1,14 @@
# Last updated Sept 2023
# Author: Auden Cote-L'Heureux
# This script produces both taxon- and sequence-level statistics to describe the ReadyToGo files
# output by PhyloToL Part 1, as well as some OG-level information from the Hook (OG reference)
# database. It relies on the utility script CUB.py to calculate composition statistics (GC content,
# Effective Number of Codons, etc.). Both sequence level and taxon-level stats are summarized in tab-separated
# outputs written to the Output folder. This script requires that the OG reference database is available as an
# amino acid fasta file in the Databases/db_OG folder with the same file name as the .dmnd file used in script 4.
# This script is intended to be run as part of the PhyloToL 6 Part 1 pipeline using the script wrapper.py.
import os, sys import os, sys
import argparse import argparse
from Bio import SeqIO from Bio import SeqIO