|
FIG. 1. Phylogenetic classification of mucins. Muc2, Muc5, Muc6, Muc19, ovomucin, otogelin, and VWF protein sequences were aligned with
Clustal Omega (Sievers and Higgins 2014) and a neighbor-joining tree with 100 bootstrap replicates was obtained with ClustalW (Thompson et al.
2002). Highlighted in color are Muc5 (red), ovomucin (brown), Muc2 (green), Muc19/spiggin (orange), Muc6 (cyan), and invertebrate mucin-like
proteins (blue). Classification of Muc5b and Muc5ac is also possible although all bird/reptile Muc5 are in one group and all mammalian Muc5 in
another. Invertebrate mucin-like sequences seem to be most similar to the SCO-spondins, proteins previously known to be present in vertebrates
only.
|
|
FIG. 2. The mucin Muc6 was present early in vertebrate evolution. Tree was constructed with MrBayes using an alignment of the N-terminal parts
of mucins that include the three VWDâC8âTIL units. Homologs of Muc19, Muc6, Muc5, ovomucin, and Muc2 are shown. Critical nodes are
labeled with green circles. The four Muc6 homologs of C. milii, L. chalumnae, and Le. oculatus clearly belong to the group of Muc6.
|
|
FIG. 3. Genomic organization of X. tropicalis mucins. Genomic region from assembly version 7, accession KB021653.1, region 53,600,000â
57,000,000 is shown. Genes are shown for Muc5 paralogs (green) and Muc2 paralogs (orange). Naming of genes was based on their location
from right to left. Location of VWD domains, PTS domains, and CysD domains (red, blue, and green vertical bars, respectively) are based on the
analysis of the genomic sequence with genewise and an in-house Perl script. Genes where a CysD domain is predicted are indicated with cyan
triangles.
|
|
FIG. 4. X. tropicalis mucin gene transcription. A total of 26 different mucin genes were analyzed with respect to gene expression. Expression levels
(RPKM) were estimated from public RNA-Seq data as listed in supplementary dataset S2, Supplementary Material online, and as described in more
detail in Materials and Methods section. (A) Expression in different tissues. (B) Expression at different developmental stages.
|
|
FIG. 5. A conserved structure of mucin genes in vertebrates. Arrows reflect the strand polarity. In the case of X. tropicalis only a part of a larger
cluster of mucin genes is shown (see also fig. 3).
|
|
FIG. 6. Gel-forming mucins evolved early during metazoan evolution. Phylogenetic distribution of proteins in selected basal Metazoa with a domain structure characteristic of gel-forming mucins. VWD, C8, TIL, and PTS domains are shown in orange, yellow, red, and blue, respectiv
|
|
FIG. 7
Domain structures of selected proteins with a domain characteristic of FCGBP. In vertebrates, FCGBP proteins have a large number of VWDâC8âTIL units, in addition to the FCGBP_N domain. In the invertebrate B. floridae, there is a large number of proteins with the FCGBP_N domain. All domains that are not FCGBP_N, VWD, C8, TIL, TILa, or PTS domains are explained below each representation of protein domain architecture. For a complete collection of all proteins with the FCGBP domain, see supplementary figure S5, Supplementary Material online.
|
|
FIG. 8 Summary of mucin and FCGBP protein evolution. The VWD domain seems to occur in a restricted number of protists, but not in combination with the C8 or TIL domains. The combination of VWD/C8 appears in choanoflagellates and the structure characteristic of gel-forming mucins as well as the FCGBP_N-containing proteins appear in Ctenophora. The F5_F8 and CysD domains occur in Bilateria. A protein with FCGBP_N and multiple VWDâC8âTIL domains appears later in evolution as it is observed only in cephalochordates and vertebrates.
|
|
Fig. 1. Phylogenetic classification of mucins. Muc2, Muc5, Muc6, Muc19, ovomucin, otogelin, and VWF protein sequences were aligned with Clustal Omega (Sievers and Higgins 2014) and a neighbor-joining tree with 100 bootstrap replicates was obtained with ClustalW (Thompson et al. 2002). Highlighted in color are Muc5 (red), ovomucin (brown), Muc2 (green), Muc19/spiggin (orange), Muc6 (cyan), and invertebrate mucin-like proteins (blue). Classification of Muc5b and Muc5ac is also possible although all bird/reptile Muc5 are in one group and all mammalian Muc5 in another. Invertebrate mucin-like sequences seem to be most similar to the SCO-spondins, proteins previously known to be present in vertebrates only.
|
|
Fig. 2. The mucin Muc6 was present early in vertebrate evolution. Tree was constructed with MrBayes using an alignment of the N-terminal parts of mucins that include the three VWDâC8âTIL units. Homologs of Muc19, Muc6, Muc5, ovomucin, and Muc2 are shown. Critical nodes are labeled with green circles. The four Muc6 homologs of C. milii, L. chalumnae, and Le. oculatus clearly belong to the group of Muc6.
|
|
Fig. 3. Genomic organization of X. tropicalis mucins. Genomic region from assembly version 7, accession KB021653.1, region 53,600,000â57,000,000 is shown. Genes are shown for Muc5 paralogs (green) and Muc2 paralogs (orange). Naming of genes was based on their location from right to left. Location of VWD domains, PTS domains, and CysD domains (red, blue, and green vertical bars, respectively) are based on the analysis of the genomic sequence with genewise and an in-house Perl script. Genes where a CysD domain is predicted are indicated with cyan triangles.
|
|
Fig. 4. X. tropicalis mucin gene transcription. A total of 26 different mucin genes were analyzed with respect to gene expression. Expression levels (RPKM) were estimated from public RNA-Seq data as listed in supplementary dataset S2, Supplementary Material online, and as described in more detail in Materials and Methods section. (A) Expression in different tissues. (B) Expression at different developmental stages.
|
|
Fig. 5. A conserved structure of mucin genes in vertebrates. Arrows reflect the strand polarity. In the case of X. tropicalis only a part of a larger cluster of mucin genes is shown (see also fig. 3).
|
|
Fig. 6. Gel-forming mucins evolved early during metazoan evolution. Phylogenetic distribution of proteins in selected basal Metazoa with a domain structure characteristic of gel-forming mucins. VWD, C8, TIL, and PTS domains are shown in orange, yellow, red, and blue, respectively.
|
|
Fig. 7. Domain structures of selected proteins with a domain characteristic of FCGBP. In vertebrates, FCGBP proteins have a large number of VWDâC8âTIL units, in addition to the FCGBP_N domain. In the invertebrate B. floridae, there is a large number of proteins with the FCGBP_N domain. All domains that are not FCGBP_N, VWD, C8, TIL, TILa, or PTS domains are explained below each representation of protein domain architecture. For a complete collection of all proteins with the FCGBP domain, see supplementary figure S5, Supplementary Material online.
|
|
Fig. 8. Summary of mucin and FCGBP protein evolution. The VWD domain seems to occur in a restricted number of protists, but not in combination with the C8 or TIL domains. The combination of VWD/C8 appears in choanoflagellates and the structure characteristic of gel-forming mucins as well as the FCGBP_N-containing proteins appear in Ctenophora. The F5_F8 and CysD domains occur in Bilateria. A protein with FCGBP_N and multiple VWDâC8âTIL domains appears later in evolution as it is observed only in cephalochordates and vertebrates.
|