
Home > Problems > Chapter 5
7. Analyze the following ten DNA sequences by the Gibbs sampling algorithm.
seq1 C CAG A
seq2 G TTA A
seq3 G TAC C
seq4 T TAT T
seq5 C AGA T
seq6 T TTT G
seq7 A TAC T
seq8 C TAT G
seq9 A GCT C
seq10 G TAG A
 Assuming that the background base frequencies are 0.25, calculate a log odds matrix for the central three positions.
 Assuming that another sequence G TTT G is the leftout sequence, slide the log odds matrix along the leftout sequence and find the log odds score at each of three possible positions.
 Change each log odds score to an odds score and sum the odds scores. Calculate the probability of a match at each position in the leftout sequence. (Odds score = 2 raised to the power of the log odds score.)
 How do we choose a possible location for the motif in the leftout sequence?


© 2004 by Cold Spring Harbor Laboratory Press. All rights reserved. 

No part of these pages, either text or image, may be used for any purpose other than personal use. Therefore, reproduction, modification, storage in a retrieval system, or retransmission, in any form or by any means, electronic, mechanical, or otherwise, for reasons other than personal use, is strictly prohibited without prior written permission. 


