gc_data.plx
This is a script for calculating the GC content of settled length around target site at a box length continuously (from the first base to the last base of the settled region with the box length as counting unit);
- desired site infomation file contains chromosome and site infomation in the first and second column,no headings;
- desired sequence file is in .fa format, with chromosome id after >;
- desired region should be smaller than the length of the sequence;
- GC box length is the length of the region for calculating GC content;
- base 'N' is excluded from calculation.
seq_extract.plx
This is a script for fetching the sequence of settled length around target site from fasta file;
- desired site infomation file contains identifier and site infomation in the first and second column,no headings;
- desired sequence file is in .fa format, with identifier after >;
- desired region should be smaller than the length of the sequences.