Protein PY04858 (PlasmoDB link) - New domain Collagen (Pfam link)

Sequence and domain localizations (colored) at: 2307...2364

   1 MNKKFVLALSYLVLCVHFIIASTKGNQDEFDGGKDVRDSFLEKSISKRQSPPCVGDECFC   60
  61 QSYYDLTLILDESGSIKKKHWVKYVVPFTEQIVKGLKVGENDIHVGILLFALKNRDYITF  120
 121 DNDIRYKKTELLKKVNDLNDDYRSGSDTYILEALKYSMNKYSMSKNARDDAPKVTILFTD  180
 181 GNDRHASKSEFHKMYSEYQEKHIKLLVLGVSAAEEKKLKVIAGCEEHSSCPSAMKAEWET  240
 241 INNITNKLTNKICDTESESNIIPEPEPEPEPETSQPIPCIGDDCFCKDYYDLTLILDDSG  300
 301 SITLNKWEKDVIPFSEKLINNLNIGKDNVHVGIMRFSKDIVIDVDYSQDTRYIKNELANI  360
 361 VEGLSKKYRYGSRTDIVDALDYSLKNFTRHPNSRTDAPKVTILFTDGNDTSKTLAEERNM  420
 421 GILYRSEQIRLILVGVGQASSIDLYALADCDHGKHCPQVIECKWNDLTGITSVITDKICD  480
 481 IESGENSNPSETPESSGVTNPTETPESSGVTNPTETPESSGVTNPTETPESSGVTNPTET  540
 541 PESSGVTNPTETPESSGVTNPTETPESSGVTNPTNPDNVVSCQNDEDCYCKDFYDVTLIL  600
 601 DESASIGESRWVLEVIPFAKDIINHLNIDYDSVHVGVLLFSHYALDLVPFSDEARYNKIS  660
 661 LLKKIDSLKTNYGNGYESFIVKTLKYALYNYIKDSGRSNAPKITMLFTDGNDSSESDMDM  720
 721 YNIGSLYRTERVKLLVIGVSMASENKLKQLVGCAQNVPCPFVIKTEWGTLDALSKVFVDK  780
 781 ICDTGSILPPESNKPETEVPTPQCIGNDCFCHDIYDLTVILDESGSIGSYNWKNQVYPFT  840
 841 EQFINNLEISEDKVHVGIMLFAQFNRDFVMFSDKESYDKEHMMKLIKGLKDSYKSGGYTY  900
 901 IIEALNYGLENYTHHKDSRSDVPKVTMLFTDGNNTNPGDKLLSDASLLYKEENVKLLVVG  960
 961 VGASTMANLRLLAGCHKTDGDCPLATKTEWNNLQDISKLMADKICNAETPEIEEPESTCL 1020
1021 GDECICGDYFDLTLITVPITTLDYNHRTGFTRYARNIINMFNIGKKNIHASISMYLGVRS 1080
1081 VNIDFDDDLTHDKKGLLMALDQMNHYFAEEETNITEALEIGLKQIFGKGNREKAPKIALL 1140
1141 LTDSNNDAYEKSKLENISKEYTDKGVKLLVMGRVELSKEILFVAGGCDINNDTCPNVLIY 1200
1201 NSFINDNTAEKFLEENICDNSNGNGNGNGNGNGNGNGNGNGNGNGNGNGNGNGNGNGNGN 1260
1261 GNGNGNGNGNGNGNGNGNGNGNGNGTGNGNGNGTGNGNGNGTGNGNGNGTGNGTGNGTGN 1320
1321 GTGNGTGNGTGNGNGNGNGNGTGNGNGNGNGNGTGSGNGNGNGTGNGTGNGNGNGTDNGN 1380
1381 GTGNGNGNGTGNGNGNGTGNGTGNGNGNGNGTDNGNGNGTGNGNGNGTGNGTGNGNGNGN 1440
1441 GNGNGTDNGNGNGTGNGNGNGTGNGNGNGTGNGTGNGNGNGTGNGTGNGNGNGTDNGNGT 1500
1501 GNGNGNGTGNGNGTGNGNGTDNGNGNGTGNGNGTGNGNGNGTGNGNGTDNGNGNGNGTGN 1560
1561 GNGNGNGNGNGTGNGNGNGNGNGTGNGNGNGNGNGNGTGNGNGTGNGNGNGNGNGNGNGN 1620
1621 GNGNGNGVGNGTDNGNGNGSGESGGSPLPPSIQCDDEFCEECDDDVCDNNPTCKKAMDIV 1680
1681 IALDQSRGITNLQWTTYVKPFMVHTVKENYLSQNRSHVTIVKMRANKGKEQWGLYRKLSY 1740
1741 KKNRILNKIDKLQMSYSNIINLADNLKYIRTKTFKRTPAHKKKLIVMLVEGKSNTDLNEL 1800
1801 RREIELLKLNKITLYVYAIDNIDEKEYKILGDCEGPSSICENIVKVSWENLLSSVEIHNK 1860
1861 FICNKYPEDAECSEWGEWSPCPQLSCDSAVSRRERKKPYYTLKEEGYSGTEYGDSCMDLG 1920
1921 SIEYRSCPVKDECNDICGDFGEWSQCSTSCGDGIRMRTRNASPDNSMCQTFNKTEIEPCN 1980
1981 IQSCGSTEICEDIGDWSEWSSCSKTCGYSIRERKFTIFPESIDEHSYCEHFEKIETEVCS 2040
2041 VPKCENEECFDWEDWSEWSAPCGPRKRVQRARLHKNKSGNPSTIPSPNNGQNDKCEDFYQ 2100
2101 DKIEYDEESSCPDNTCGSWSEWSECDRPCNAGMRIRNFITNIVSLNGENDDECLETYNKI 2160
2161 ENEPCLDLPVCNSGECNDWETWVDCKTDKDSYSCHMPNKRILTRKLDLLKNPKSDTAEAC 2220
2221 NDYSLFREEDCPIGNTPCVDALCNEWDEWGSCSETCGLDSFRIRKRKEPLELIPASSDID 2280
2281 GNIGLTCEEQNVRIEEKEACNVPACVPPVLDGSNTNTGEDGEGGSSGEGFGTGEKISMAA 2340
2341 GIIGLVGLAAGGLIYGYNTLNGGETPHNSNMEFENVENNDGIIEEENEDFEVIDANDPMWN 2401

Alignement of domain consensus (first line) on the sequence (colored line).

Each position reports the amino acid with highest probability; capital letters mean highly conserved residues (i.e. with probability > 50%).

Occurence 2307...2364
   1 ppGppGppGppGppGppGppGpaGapGppppGepGpPGppGppGppGppGapGapGpp   58
2307 PPVLDGSNTNTGEDGEGGSSGEGFGTGEKISMAAGIIGLVGLAAGGLIYGYNTLNGGE 2364