- Jun 4, 2013
- 10,132
- 996
- Country
- United States
- Gender
- Male
- Faith
- Non-Denom
- Marital Status
- Widowed
- Politics
- US-Others
Phylogenetic algorithms don't care whether the DNA sequences used are the same species are not.
https://en.wikipedia.org/wiki/Computational_phylogenetics
"The goal is to assemble a phylogenetic tree representing a hypothesis about the evolutionary ancestry of a set of genes, species, or other taxa."
So the program is already written to support a hypothesis.....
"The problem of character coding is very different in molecular analyses, as the characters in biological sequence data are immediate and discretely defined - distinct nucleotides in DNA or RNA sequences and distinct amino acids in protein sequences. However, defining homology can be challenging due to the inherent difficulties of multiple sequence alignment. For a given gapped MSA, several rooted phylogenetic trees can be constructed that vary in their interpretations of which changes are "mutations" versus ancestral characters, and which events are insertion mutations or deletion mutations. For example, given only a pairwise alignment with a gap region, it is impossible to determine whether one sequence bears an insertion mutation or the other carries a deletion. The problem is magnified in MSAs with unaligned and nonoverlapping gaps. In practice, sizable regions of a calculated alignment may be discounted in phylogenetic tree construction to avoid integrating noisy data into the tree calculation."
So we agree, when you snip out all the code that doesn't fit, and align only the code that does, while ignoring the gaps you snipped out to make them align, you get weighted results. But only the interpretation the researcher wants to publish is heard.... the rest are discarded.
Last edited:
Upvote
0