Category : Bioinformatics

15

Mar

Simple Bash command line to reduce the length of the FASTA header lines

 
Simple Bash command line to reduce the length of the FASTA header lines Introduction Hi there, how many times we have a FASTA file that contains huge FASTA headers like this: >gi|600513|gb|M21306.1|DROTRPC Drosophila melanogaster photoreceptor...
15

Mar

Handling large FASTA sequence datasets in R: Shuffle and retrieve “n” number of sequences of fixed length from the whole FASTA file and export them in a new FASTA file

 
Handling large FASTA sequence datasets in R: Shuffle and retrieve "n" number of sequences of fixed length from the whole FASTA file and export them in a new FASTA file Introduction When you are working with large FASTA datasets is likely to find out that the sequences are in sort of a mixed...
14

Mar

Extracting upstream regions of a RefSeq human gene list in R using Bioconductor

 
Extracting upstream regions of a RefSeq human gene list in R using Bioconductor Introduction Suppose that you want to do local mapping of upstream regions of a given RefSeq IDs in a particular genome in R using Bioconductor. Download the script here. In this case, you may take a look at...