SignalX: Search for a common signal in given set of sequences
The Server implements a MEME like algorithm.
Features of the server:
- Type of expected site symmetry may be defined.
- Not all sequences may contain the site.
- The program tunes site width and number of the sites
using Rank statistics evaluation
(see Theory.)
- The server presents a p-value for every solution.
Algorithm
- Scan all sequences
- Select word and create a profile from a single word.
- Iterate:
- Find the best hit of current profile in every sequence.
- Sort hits (word) by score.
- Using rank statistics define threshold and select significant
subset of words.
- Using rank statistics select positions that should be
included in the profile (tune word length)
- Using selected words create new profile.
Presentation of the results
The program gives a number of possible results
(training sets and profile).
Some word in the sequences (site) may be presented in different
results (training sets may overlap).
The results are presented as a table:
- Columns = word sets and profile
- Rows = words (defined as sequence name and position)
- Cell = Z-score for the profile on the word.
For every profile user can see the site set, graphical logo and
position weight matrix.
Author
Andrey Mironov
Department of Department of Bioengineering and Bioinformatics, MSU, Moscow, Russia
Mail to Andrey Mironov