The pacifastin inhibitor family web site




Created and maintained by Zoltán Gáspári
E-mail: szpari@para.chem.elte.hu

Last modified: 10.02.2004


Sequences

Nomenclature

Nomenclature of the sequences follows the rules described in Simonet et al. (2002), i.e. 'Genus species pacifastin-related inhibitor n' or 'Genus species pacifastin-related domain n', according to whether or not the polypeptide is processed into individual monomers. Experimental evidence shows that the locust (Locusta migratoria and Schistocerca gregaria) inhibitors are processed, whereas the pacifastin light chain, which contains 9 inhibitory domains, remains intact. In all other cases, we assumed that the polypeptide chains are not processed. See also, however, the discussion of processing in Simonet, G., Claeys, I., Franssens, V., De Loof, A., Vanden Broeck, J. (2003) Genomics, evolution and biological functions of the pacifastin peptide family: a conserved serine protease inhibitor family in arthropods. Peptides 24, 1633-1644.
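The naming rule can be illustrated with a minimal Python sketch. The function names are hypothetical, and the short-form pattern (genus initial, species initial, then PI or PD and the number) is only an assumption inferred from abbreviations used on this page such as LMPI-6, SGPI-1, BMPD-1 and AGPD-1; older names such as PLD1 or LMCI-1 do not follow it.

```python
def pacifastin_name(genus: str, species: str, n: int, processed: bool) -> str:
    """Full name: 'Genus species pacifastin-related inhibitor n' if the
    polypeptide is processed into monomers, '... domain n' otherwise."""
    kind = "inhibitor" if processed else "domain"
    return f"{genus} {species} pacifastin-related {kind} {n}"


def pacifastin_abbreviation(genus: str, species: str, n: int, processed: bool) -> str:
    """Short form such as 'LMPI-6' or 'BMPD-1'.

    Assumption: the pattern is inferred from the abbreviations on this page,
    not taken from a formal rule.
    """
    suffix = "PI" if processed else "PD"
    return f"{genus[0].upper()}{species[0].upper()}{suffix}-{n}"


# Locust inhibitors are processed; the Bombyx mori chain is assumed not to be.
print(pacifastin_name("Locusta", "migratoria", 6, processed=True))
print(pacifastin_abbreviation("Locusta", "migratoria", 6, processed=True))
print(pacifastin_abbreviation("Bombyx", "mori", 1, processed=False))
```

The `processed` flag carries the only decision the rule depends on, so a change in the experimental evidence for a species would alter just that argument.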

Module organization
a)
mrcliaicfivlarhcesgalkcspgtegpcaaeqeskdkpsqittddasavqemqs[BMPD1]ifte[BMPD2]stpkk
feliq[BMPD3]iiep[BMPD4]ntpekseliq[BMPD5]inep[BMPD6]ntpkkseliq[BMPD7]idep[BMPD8]n
nfgsgvtekletkgteipelrssqnttk[BMPD9]lpedgdedpenlnpkpat[BMPD10]eeknhtlsrkvrasqqetvkt

b)
mkallilvmtvaahgasleqpdptpasdlpd[PLD1]ykaaqg[PLD2]lakgmfasqte[PLD3]ingiglan[PLD4]v
qspsvpavafrsggrtgkcrpdahddslpd[PLD5]epkpg[PLD6]pdgslpd[PLD7]ykpqpg[PLD8][PLD9]pt

c)
mnvavsvlalllvavgcsa[LMPI6]rr[LMPI7]rr[LMPI8]kr[LMPI9]

d) 
mrafvglvvtlivaasaahasestdvewvfvdklsg[OSPD1]kpraaidknvpvvydwtf[OSPD2]vvfgyvq[OSPD3]g
esvnavdhpvdsvhvvdqppssvhvvdhpvdsvhvvdhpsdsvqvadlprsavqvddsnipv?dgyiqgnqmssfsfhryl

e)
yskpggtimkilmlivatcvvg*alc[AGPD1]elsddsqvrldvqngeslssadekdeihvqtn[A*GPD2]akrsep
ap[AGPD3]gffdqqklkqkrsvpaddlpqsaiapgap[AGPD4]ew
Examples of the module organization of proteins with PLD-like domains. (a) Proposed module organization of a Bombyx mori protein containing pacifastin light chain domains, based on EST data. The overlapping region of the ESTs used (accession numbers AU003615, AU005858, AU003467) is underlined. The highly similar module pairs BMPD3-BMPD4, BMPD5-BMPD6 and BMPD7-BMPD8 are separated by segments showing a clear relationship to one another. (b) Module organization of the pacifastin light chain. (c) Modules on the Locusta migratoria PP3 gene (accession AJ419778), separated by dibasic cleavage sites. (d) PLD-like modules on the Oryza sativa EST CA766257. (e) Modules in the Anopheles gambiae genomic sequence. The positions of introns are indicated by asterisks (*). The first intron (phase 1) is located before the first module; the second (phase 0) interrupts the second PLD-like domain after the first conserved cysteine.
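The bracket notation used in the panels above can be split into module names and the residues between them with a few lines of Python. This is only a sketch: the function name is hypothetical, and the intron markers (*) and the unknown residue (?) in panels (d) and (e) are not treated specially. Applied to panel (c), it recovers the dibasic linkers between the four locust modules.

```python
import re

# A module is written as a name in square brackets, e.g. [LMPI6].
MODULE_RE = re.compile(r"\[([^\]]+)\]")


def parse_module_organization(notation: str):
    """Split bracket notation into (modules, segments).

    segments[i] is the plain sequence preceding modules[i]; the last element
    of segments is whatever follows the final module (possibly empty).
    """
    # re.split with one capturing group alternates plain text and module names.
    parts = MODULE_RE.split(notation.replace("\n", ""))
    segments = parts[0::2]
    modules = parts[1::2]
    return modules, segments


panel_c = "mnvavsvlalllvavgcsa[LMPI6]rr[LMPI7]rr[LMPI8]kr[LMPI9]"
modules, segments = parse_module_organization(panel_c)
linkers = segments[1:-1]  # residues strictly between consecutive modules

print(modules)   # ['LMPI6', 'LMPI7', 'LMPI8', 'LMPI9']
print(linkers)   # ['rr', 'rr', 'kr']
# Dibasic here means two residues, each lysine (k) or arginine (r).
print(all(len(l) == 2 and set(l) <= set("kr") for l in linkers))  # True
```

Adjacent modules with no residues between them, such as [PLD8][PLD9] in panel (b), yield an empty string in `segments`.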

Gene structure

Only one sequence corresponding to a genomic region of the malaria mosquito Anopheles gambiae was found with similarity to pacifastin-related inhibitors (accession AAAB01008816). The predicted encoded protein corresponds to multiple ESTs (e.g. accessions BM586029, BM605854). Two introns interrupt the coding sequence: the first (phase 1) lies before the first PLD-like domain, while the second (phase 0) lies in the second module after the first conserved cysteine.
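For readers unfamiliar with the term, intron phase follows the usual convention: a phase 0 intron lies between two codons, a phase 1 intron after the first base of a codon, and a phase 2 intron after the second. A minimal Python sketch of that convention is given below; the function name is hypothetical and the offsets in the usage lines are made-up illustrations, not coordinates from the Anopheles gambiae sequence.

```python
def intron_phase(coding_bases_before_intron: int) -> int:
    """Return the intron phase (0, 1 or 2).

    The phase is the number of bases of the interrupted codon that lie
    upstream of the intron, i.e. the count of coding bases before the
    intron modulo 3.
    """
    if coding_bases_before_intron < 0:
        raise ValueError("number of coding bases must be non-negative")
    return coding_bases_before_intron % 3


# Hypothetical offsets, for illustration only.
print(intron_phase(30))  # 0: the intron falls between two complete codons
print(intron_phase(31))  # 1: the intron follows the first base of a codon
```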

Sequence files

Sequence entries are available both as local files and as links to the NCBI GenBank entries. (Of the very similar Anopheles clones, only representative ones are currently listed.)

| Species | Accession | Type | Local link | Entry @ GenBank | SwissProt entry | Members coded |
|---|---|---|---|---|---|---|
| Anopheles gambiae | EAA05233 | protein (translation of the corresponding genomic sequence) | EAA05233.psq | EAA05233 | | AGPD-1, AGPD-2, AGPD-3 |
| Anopheles gambiae | AAAB01008816 | DNA (genomic) | - | GI 19611762 | | AGPD-1, AGPD-2, AGPD-3 |
| Anopheles gambiae | BM609388 | mRNA | BM609388.seq | GI 18907492 | | AGPD-1 |
| Anopheles gambiae | BM650327 | mRNA | BM650327.seq | GI 18949838 | | AGPD-1, AGPD-2 |
| Anopheles gambiae | BM605854 | mRNA | BM605854.seq | GI 18903958 | | AGPD-3, AGPD-4 |
| Anopheles gambiae | BM640240 | mRNA | BM640240.seq | GI 18939763 | | AGPD-5 |
| Anopheles gambiae | BM623459 | mRNA | BM623459.seq | GI 18922970 | | AGPD-6 |
| Anopheles gambiae | BM622696 | mRNA | BM622696.seq | GI 18922207 | | AGPD-7 |
| Bombyx mori | AV398490 | mRNA | AV398490.seq | GI 6902142 | | BMPD-1, BMPD-2, BMPD-3n |
| Bombyx mori | AU006451 | mRNA | AU006451.seq | GI 4163836 | | BMPD-1, BMPD-2, BMPD-3, BMPD-4p |
| Bombyx mori | AU004115 | mRNA | AU004115.seq | GI 4161486 | | BMPD-1, BMPD-2, BMPD-3, BMPD-4t |
| Bombyx mori | AU004146 | mRNA | AU004146.seq | GI 4161517 | | BMPD-1, BMPD-2, BMPD-3, BMPD-4t |
| Bombyx mori | AU004998 | mRNA | AU004998.seq | GI 4162369 | | BMPD-1, BMPD-2, BMPD-3, BMPD-4t |
| Bombyx mori | AU003615 | mRNA | AU003615.seq | GI 4160986 | | BMPD-1, BMPD-2, BMPD-3, BMPD-4t |
| Bombyx mori | AU004114 | mRNA | AU004114.seq | GI 4161485 | | BMPD-1, BMPD-2, BMPD-3, BMPD-4t |
| Bombyx mori | AU004695 | mRNA | AU004695.seq | GI 4162066 | | BMPD-1, BMPD-2, BMPD-3, BMPD-4t |
| Bombyx mori | AV401970 | mRNA | AV401970.seq | GI 6905622 | | BMPD-1, BMPD-2, BMPD-3, BMPD-4t |
| Bombyx mori | AU005858 | mRNA | AU005858.seq | GI 4163242 | | BMPD-5, BMPD-6 |
| Bombyx mori | AU003467 | mRNA | AU003467.seq | GI 4160838 | | BMPD-3, BMPD-7, BMPD-8 |
| Bombyx mori | AU003467 | mRNA | AU003467.seq | GI 6902142 | | BMPD-9, BMPD-10 |
| Bombyx mori | AU004035 | mRNA | AU004035.seq | GI 4161406 | | BMPD-11, BMPD-12 |
| Cicindela campestris | BQ475192 | mRNA | BQ475192.seq | GI 25957466 | | CCPD-1 |
| Ctenocephalides felis | BM058348 | mRNA | BM058348.seq | GI 16900157 | | CFPD-1 |
| Ctenocephalides felis | BF779781 | mRNA | BF779781.seq | GI 22039887 | | CFPD-2, CFPD-3 |
| Locusta migratoria | Z22805 | mRNA | Z22805.seq | GI 397613 | P80060 | LMCI-1, LMCI-2 |
| Locusta migratoria migratorioides | AJ419777 | mRNA | AJ419777.seq | GI 18496016 | Q8WQ22 | LMPI-3, LMPI-4, LMPI-5 |
| Locusta migratoria migratorioides | AJ419778 | mRNA | AJ419778.seq | GI 18496018 | Q8WQ21 | LMPI-6, LMPI-7, LMPI-8, LMPI-9 |
| Manduca sexta | AI234474 | mRNA | AI234474.seq | GI 3827992 | | MSPD-1 |
| Meladema coriacea | BQ477255 | mRNA | BQ477255.seq | GI 25959529 | | MCPD-1, MCPD-2 |
| Oryza sativa (indica cultivar-group) | CA766257 | mRNA | CA766257.seq | GI 27548261 | | OSPD-1, OSPD-2, OSPD-3 |
| Pacifastacus leniusculus | U81825 | mRNA | U81825.ig0 | GI 1764108 | P91776 | PLD1, PLD2, PLD3, PLD4, PLD5, PLD6, PLD7, PLD8, PLD9 |
| Pyrocoelia rufa | AT003791 | mRNA | AT003791.seq | GI 12025791 | | PRPD-1 |
| Schistocerca gregaria | Y09605 | mRNA | Y09605.seq | GI 2765005 | O46162 | SGPI-1 (SGTI), SGPI-2 (SGCI) |
| Schistocerca gregaria | Y09606 | mRNA | Y09606.seq | GI 2765007 | O46163 | SGPI-3 |
| Schistocerca gregaria | AJ437311 | mRNA | AJ437311.seq | GI 21217995 | Q8MYK3 | SGPI-4A, SGPI-4B, SGPI-4Cp |
| Schistocerca gregaria | AJ437310 | mRNA | AJ437310.seq | GI 21217993 | Q8MYK4 | SGPI-4A, SGPI-4B, SGPI-4Ca |
| Schistocerca gregaria | AJ299736 | mRNA | AJ299736.seq | GI 15864589 | | SGPI-5A, SGPI-5B |