Protein PVX_084325 (PlasmoDB link) - New domain Atrophin-1 (Pfam link)

Sequence and domain localizations (colored) at: 1265...2144

   1 MDIVKGGGTFKMLQLKENKNYLEGNILNCPYLLNLNEGNNILYGKVHSLTRIGNSKCRSR   60
  61 HKRAPFCDEETAGDQEFTLNEQKTPTPGEEELEAGGVVANDVDAHGVDAQGDLDTDIEKC  120
 121 PKRFNQQTSAYQEGEGSKEDIIPCSCNTTVSLNSVNEKKDEVKRFDRDELSLYSIETHCH  180
 181 HQVAYEQTYDKENLKNEPERGSHFLQNGNFTNEYHCRSPLGDESRMPLSSSPPVSSNPPV  240
 241 GSVDESHRVCSARGYNPLGRREVTLGGGGDSGEAYGGEEAHSGEAHNGEVHNGEAHSGEA  300
 301 HSGDSGKRESGDGGSGHGGQGGGGDQSGGGDQGGGGDQNGKDDQSGSGSGKDDQSGRGDR  360
 361 SGRGDRSGRGDRSGKENYKMYFDKISELKSNRFISKYKNSEKYMKKKIFNYLRKNNLIGD  420
 421 LDIEQLFVYVKNIYFLIIMNEKLIFFFIYNTKDAMNAIIHFEKMHNISYKDTDFLLFLLS  480
 481 NDVDMYEKLLSQINFKFEKEELIFKLKSLRRKVKDLKFEEERNLFDECMQQLHLMKTDRR  540
 541 FTRVAYKTGSYCASTGSGPENSSSCYANSSSRAYSVGRAGSSARANTSDYANCSAYTNEN  600
 601 ENENENDNDNGDTSESNALLKISLKNLIPKLYNCKDFKGTSNLIHNIGKSITKRSTVVNV  660
 661 VEKKKKKKKKNPTYSNVSYTKMAYSKMAFSKVPYAKGAYSKTTPPMMNVPHASGAYPNMA  720
 721 HSRGGTFADVPPPSAAYPNIPQANGAYMNVPPQSVAYQNMPPGAGAAPYANMPPANSPYA  780
 781 NASPTSGPYANTSPASASYASMSPTTGAYSNVPHPKGGYSNVTHPSAAYTNVPNQRSAYR  840
 841 SVPANSGAYTNVANQNGAYANVANQNGAYANVANQNGAYMNVSNHPNVSGHHNETYPNGA  900
 901 YAKGPPLPSGASGFYANNFEGAHGENYAGNYAPSFAANYTASYPPNQGTNCAPNFVPNYE  960
 961 GNYPLGCGNDYGKGRPNDHPVGHPAGHPVSHPVSHPVSHPSEHPIGHPSDHPSDHPNEHS 1020
1021 HEYGSECANDYVNDYMHDYMNEYLNDYGMDEPSNANHLDDNKEAKKKTIRGITSFTLFAR 1080
1081 EKRKELLNQKVFLGSSLTEQTTAVAKIWNSLTDEKKKEWAAKASKINEENYLSQKKKKKR 1140
1141 KFTAFSIFAREKRKEHKEKNIDMGLTLAQQNSYVSKLWKQLSIEEKNKYKVLSNNVNAEV 1200
1201 AKTSKSDVNYVSGEKLSGFKGRIKALDNMMNQQFSGNALPYSGGVNDMGYVSGMGYINGV 1260
1261 GVGSAPMGVVPSGGVIPPGGIVPQGGVPPPNRMSGVPPQNGLPHSPTQANRTSGMNVPNE 1320
1321 SIVPNEPIVPNASSDINNLGYMDYVNYMNTVLSHDNPSYMNKIMSGGNENYMSMMNSLTN 1380
1381 EMASGMTQDLSGSLPSGPSTADSKKEKKSAKLKKRKNKKDNMYITEQDQLMVKNGSYKMG 1440
1441 KDAPPPYETNNNMIGVFPAEAAAATGVNLYDGGMPSVPNMAHVANSGTYPPSSFHKSGVH 1500
1501 KNGVHTNNVHTNSGPVNAPAPNDGNTSVDMAYANNLLNYYASQENPNVDDDSMNNVGSVP 1560
1561 NVGSVPNVGSLTNLTNLPTDGAHRPNCKAHPPGNINGSGSPAAAMATPSIGVDKKQPMMN 1620
1621 EQPNSNLGVMTDASMFNNPSKSTANMINNVPYGGMPMGTPLSCNNTALNRVPFNNVPFNT 1680
1681 VPMYNIPPNGAPQGGTGLPSGSFTPTEAHLNNPLINPADGMDMRRGPNGMNQQPFNNPYV 1740
1741 GNMPNVHMPNAEANQLAINDSLKAARKTKSKDKMEEMMRKKMKKDEKQKLKEKKLMLKMY 1800
1801 DKKRKQLLKAEKQMEKNKKNKNKNGVNTIEDGSAGMYYHPHHPAPPNLMSTNVSGMPFDA 1860
1861 PNALVNKPMVGNFGPYGGMDKVDAKDMHVKNAAMVGGGMASGMTSGGMAANGPPIIRGGN 1920
1921 IPERGVDSRAGPLNFQPVNNMNSALTSMRNNQPAPYPFVNNYPSDMPDMRSYLPSGEMKS 1980
1981 YDDASRGLQDKQVPAAMPTTPNEMTTYPSVNLNDKSKKKKKANRMDANLVVDPRVGNLTD 2040
2041 GKNIPLMNEKICYNSGFTNVEYRESYYKDMRVTDFAAFQQTDSSKGGEVDKLPAYHMASF 2100
2101 EVKMGKEKEEGHQKNLHQMNQRSSAFYGPEQMYGGSLASGGANGKHHEGVVNVGSLLKVG 2160
2161 SLPAGTGIPAGATILGDGFYDMNRSNSQNLKSQHSYGYSGKHGAHNVAGNIGGGVVGPPK 2220
2221 EEYYEQVPRIPGEEREGEEATYKTYGSVAKGMGKSADDVGRAADYMSKPGEYLNKPGDFL 2280
2281 NKPGNYLPAMPSEDKSFYTPKGMNLMMGKVPTGLDKANAPFDKQFFPNENKPAEHKAKTY 2340
2341 YSQSEKSKSKSNSTLYGEESLSGSYNHFNSTTTVAAQQQATYNKTLNEDLNVDLNFANVS 2400
2401 YVHADGCNEESQNNSTGYFYPHGSGISGGKKENEKMNNPGAYVSHEMDGVTNANNIMYYP 2460
2461 DQMLPENKMNKNLVYFDTDPNELPQMTDDTELEAPKEKQFFSNASHYAEFVSKKVRRKKT 2520
2521 PCPSNGAGAVAVADADADIGANDNGGTIWDLVQKKGRKKLKVNAAHQGEAESENVPVEDQ 2580
2581 MYANRSDSIFMGSNNEWNNARGGVNSTHEASKEEADDIVNSIFRSLEMEESVSHPGGGAA 2640
2641 MTQSGDNVIGGEGTPQNRLVNNDAEEGTNCENDNPGEQNDGDDIISLLDGPPQSYANKLN 2700
2701 ITESGEVIYEDKKKYNSNIIMSENSIQNNRMNMFMSKRGKFNANMNRSQIETYNNAYKKT 2760
2761 ALFKWTEEETDKFYEAIEMFGVDLMMVRAFLPNFSDKQIRDKYKKERRNNPLRIEEAIKK 2820
2821 NKEIDLDAYENENGRIETSGDLNSDASSFSGDDDTGVKRKTSVASETDGNILSIFEGKED 2880
2881 VDYSMNQYYDNQEDPDFNVLSLF 2903

Alignement of domain consensus (first line) on the sequence (colored line).

Each position reports the amino acid with highest probability; capital letters mean highly conserved residues (i.e. with probability > 50%).

Occurence 1265...2144
   1 gslStLRSGrkkqta..SPdgrtspsnedlrssGrsSpSaaStsssKaEsvkKsaKqKiK   59
1265 APMGVVPSGGVIPPGgiVPQGGVPPPNRMSGVPPQNGLPHSPTQANRTSGMNVPNESIVP 1324
  61 EEasSPkvAsdteEpervsaKkaKTQlsrPdsPSeGeGeGEGEgEesSSdsRSvneegSS  119
1325 NEPIVPNASSDINNLGYMDYVNYMNTVLSHDNPSYMNKIMSGGNENYMSMMNSLTNEMAS 1384
 121 DIDQDNRSsSPSIPSPqdNESDSDSSAqqlqqlqQqqllqsqppplaaalaetpAsssaP  179
1385 GMTQDLSGSLPSGPSTADSKKEKKSAKLKKRKNKKDNMYITEQDQLMVKNGSYKMGKDAP 1444
 181 Pg.ttQlPtiaaqPaPSAsvpPqqSPtaasadvPqqplaqsapvvassiqaasalHpqrp  239
1445 PPyETNNNMIGVFPAEAAAATGVNLYDGGMPSVPNMAHVANSGTYPPSSFHKSGVHKNGV 1504
 241 PphsplslfPPslpPaPsvaqPsLqgpPsPPllqhPhsqpPqnFsllaQasvgqlP.lgt  299
1505 HTNNVHTNSGPVNAPAPNDGNTSVDMAYANNLLNYYASQENPNVDDDSMNNVGSVPnVGS 1564
 301 ssaarshayPqpLPpAPlamPHIKPPPTTPIpQLaaPqsHKhppHlssPspfPqMpsQpa  359
1565 VPNVGSLTNLTNLPTDGAHRPNCKAHPPGNINGSGSPAAAMATPSIGVDKKQPMMNEQPN 1624
 361 ssapvlTQSQsLPspvaslhqvasapPlasHPl...vpsaasaisplPslppsFaTstfp  419
1625 SNLGVMTDASMFNNPSKSTANMINNVPYGGMPMgtpLSCNNTALNRVPFNNVPFNTVPMY 1684
 421 APPpSsaaasaGvppsasssPt.aelpqiqIEpLDeaEEpESPPPPpRSPSPEPTVVntp  479
1685 NIPPNGAPQGGTGLPSGSFTPTeAHLNNPLINPADGMDMRRGPNGMNQQPFNNPYVGNMP 1744
 481 sHASQSARFNSCARtDsSKLAk......KREEAv.EKakREAEQKAREEkEREkErEkEr  539
1745 NVHMPNAEANQLAINDSLKAARktkskdKMEEMMrKKMKKDEKQKLKEKKLMLKMYDKKR 1804
 541 ERERERErEAeRaAclPnhcfcPPPsqkaaSSSsHesrmsepqLaGPahmRpsFeqPPT.  599
1805 KQLLKAEKQMEKNKKNKNKNGVNTIEDGSAGMYYHPHHPAPPNLMSTNVSGMPFDAPNAl 1864
 601 ....TiAAVpPYIGPDTPALRTLHvMSPTsLnPaLAYHMP..GLYnadPsliREReiRER  659
1865 vnkpMVGNFGPYGGMDKVDAKDMHVKNAAMVGGGMASGMTsgGMAANGPPIIRGGNIPER 1924
 661 ELRERMKPFEVKPaNPMEgAit.lPpiPagPHpFAsfHPGLNpLERERLAAGEl.sYpeA  719
1925 GVDSRAGPLNFQPVNNMNSALTsMRNNQPAPYPFVNNYPSDMPDMRSYLPSGEMkSYDDA 1984
 721 ERMAsvasDPlAfNVTPQHSHIHSHLHLHQQDPLHQGSssPvHPLaVDP....LaaGPhL  779
1985 SRGLQDKQVPAAMPTTPNEMTTYPSVNLNDKSKKKKKANRMDANLVVDPrvgnLTDGKNI 2044
 781 PLLGqpPHEHEMLRHPvFGaayPReLq....GAIPqPMSA....AHQLQAMHAQSAELrL  839
2045 PLMNEKICYNSGFTNVEYRESYYKDMRvtdfAAFQQTDSSkggeVDKLPAYHMASFEVKM 2104
 841 amE.....QQWLHGHhhlhgpLPSQEDYYsrlikesdkql  880
2105 GKEkeeghQKNLHQMNQRSSAFYGPEQMYGGSLASGGANG 2144