Assigned: 3/2/2002
Due: 3/11/2002
a) Discuss the probability that Yankee wins the next time situation A happens.
b) You are curious about this association between the color of a specific bead and the result of the game, but you are not sure whether it is real, so you decide to observe a few more games. In the next 2 games, situation A happened and Yankee won both times. Discuss the probability that Yankee wins the next time situation A happens, given these additional observations.
c) Discuss the problem of dimensionality and overfitting in bioinformatics, and suggest a few methods to deal with it.
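One way to reason about the updating in part b) is a Beta-Bernoulli model, sketched below. This is only an illustration under stated assumptions, not the intended answer: the uniform Beta(1, 1) prior, the function name `posterior_win_probability`, and the earlier game counts `earlier_wins` and `earlier_losses` are all hypothetical, since the number of games observed before part b) is not given here.

```python
from fractions import Fraction


def posterior_win_probability(wins, losses, prior_a=1, prior_b=1):
    """Posterior mean of P(win | situation A) under a Beta(prior_a, prior_b) prior.

    With the default Beta(1, 1) prior this is Laplace's rule of succession:
    (wins + 1) / (wins + losses + 2).
    """
    return Fraction(wins + prior_a, wins + losses + prior_a + prior_b)


# HYPOTHETICAL counts: placeholders for the games seen before part b).
earlier_wins, earlier_losses = 1, 0

# Estimate before the extra observations.
before = posterior_win_probability(earlier_wins, earlier_losses)

# Part b): two more games with situation A, both won.
after = posterior_win_probability(earlier_wins + 2, earlier_losses)

print("before:", before, "after:", after)
```

With so few games the estimate depends strongly on the prior, which is one small-sample face of the overfitting issue raised in part c).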
(Copied/modified from Genetics in Medicine by Thompson, McInnes, and Willard)
ID | SNP1 | SNP2
1 | A | G
2 | A | G
3 | A | G
4 | T | T
5 | A | G
6 | T | G
7 | A | G
8 | A | G
9 | T | T
10 | T | T
11 | A | G
12 | T | G
13 | A | G
14 | A | T
15 | A | G
16 | T | T
17 | A | G
18 | A | G
19 | T | G
20 | T | T
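As a quick check on the table above, the allele counts at each SNP and the counts of each SNP1/SNP2 combination can be tallied as in the sketch below. The variable names are illustrative, and the choice to summarize the data by simple counts is an assumption about what is useful here, not part of the assignment.

```python
from collections import Counter

# SNP1 and SNP2 alleles for IDs 1..20, transcribed in order from the table.
pairs = "AG AG AG TT AG TG AG AG TT TT AG TG AG AT AG TT AG AG TG TT".split()

# Allele counts at each SNP.
snp1_counts = Counter(p[0] for p in pairs)
snp2_counts = Counter(p[1] for p in pairs)

# Counts of each SNP1/SNP2 combination.
pair_counts = Counter(pairs)

print("SNP1:", dict(snp1_counts))
print("SNP2:", dict(snp2_counts))
print("pairs:", dict(pair_counts))
```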