Previous: , Up: Introduction   [Contents]

1.3 General options

Synopsis:

input.file=mydata.maf.gz
input.file.compression=gzip
input.format=Maf
output.log=mydata.maffilter.log
...

or

input.file=mydata.fasta
input.file.compression=none
input.format=Fasta(zero_based=yes)
output.log=mydata.maffilter.log
...

Detailed description of commands:

input.file={path}

The MAF file to parse

input.file.compression=[none|zip|gzip|bzip2]

The compression format of the input file.

input.format={Maf|Fasta}

MafFilter takes by default a MAF file. It is, however, possible to use a fasta file as input, which will lead MafFilter to run in "single sequence" mode. In this mode, each sequence of the original file is considered as a distinct block. This mode might be useful to extract annotations and estimate simple statistics. Fasta files can be compressed in the same way as MAF files. The sequence names are expected to be of the form Species:Chromosome:start:strand:length, with possible empty fields. Coordinates are expected to be 0-based (that is, the first nucleotide is position 0). However, passing the zero_based=no argument to the format name will assume that the first nucleotide is position 1.

input.dots={error|as_gaps|as_unresolved}

If set to as_gaps, dotted characters in sequences will be converted to gaps and treated as such in any further step of the analysis, including output to files. If set to as_unresolved, dots will be converted to ’N’.

input.check_sequence_size={boolean}

Tell if sequence sizes are consistent (default: true). An error will be returned in case of inconsistency. Desabling size check can be useful in combination with input.dots=as_gaps. In case of inconsistency, a warning will be displayed unless verbose is set to false.

output.log={path}

The file where to write log messages.

maf.filter={Filter1, Filter2, ...}

The main command, specifying the list of filter commands to apply, in the given order (that is, the input of filter 2 will be the output of filter 1). Filters take a series of arguments, specified within parentheses: Filter(arg1=opt1, arg2=opt2, etc). The next section present all available filters and their corresponding arguments.