Group III - Collectins

Joint human/mouse sequence alignment
Human sequence alignment
Introduction
Mammalian proteins containing CTLDs
Introduction  Domain organization  Sequence alignments  Human database  Mouse database  Joint human/mouse database

 

If alignments appear scrambled please maximize the width of your browser window.

 
SIGNAL SEQUENCE and CYSTEINE-RICH DOMAIN
 
human MBP-A                     PSFPVLLLSVVTASCS-----KTKACADTQKTC-SMITCGIPVTN
mouse MBP-A                        MLLLPLLPVLLCVVSVSSGSQTCEDTLKTC-SVIAC
human MBP-C                 MSLFPSLPLLLLSMVAASYS-----ETVTCEDAQKTCPAVIACSSP
mouse MBP-C                  MSIFTSFLLLCVVTVVYA-------ETLTEGVQNSCPVV-TCSSP
human SP-A1                MWLCPLALNLILMAASGAA------------------CEVKDVCV
human SP-A2                MWLCPLALNLILMAASGAA------------------CEVKDVCV
mouse SP-A                 MSLGSLAFTLFLTVVAGIK------------------CNGTEVCA
human SP-D                  MLLFLLSALVLLTQPLGYLE--AEMKTYSHRTMPSAC-TLVMCSSVES
mouse SP-D                   MLPFLSMLVLLVQPLGNLG--AEMKSLSQRSVPNTC-TLVMCSPTEN
human COL-L1        MNGFASLLRRNQFILLVLFLLQIQSLG------LDIDSRPTAEVC-ATHTISP
mouse COL-L1        MNGFRVLLRSNLSMLLLLALLHFQSLG------LDVDSRSAAEVC-ATHTISP
human COL-K1        MRGNLALVGVLISLAFLSLLPSGHPQP------------AGDDAC-SVQILVP
mouse COL-K1       MMMRDLALAGMLISLAFLSLLPSGCPQQ------------TTEDAC-SVQILVP 

 

COLLAGENOUS DOMAIN
 
human MBP-A       GTPGRDGRDRPKGEKGEPGQGLRGLQGPPGKMGPPGNTGTSGIPGPRGQKGDRGDN
mouse MBP-A       GRDGRDGPKGEKGEPGQGLRGLQGPPGKLGPPGSVGSPGSPGPKGQKGDHGDN
 
human MBP-C       GINGFPGKDGRDGTKGEKGEPGQGLRGLQGPPGKLGPPGNPGPSGSPGPKGQKGDPGKS
mouse MBP-C       GLNGFPGKDGRDGAKGEKGEPGQGLRGLQGPPGKVGPTGPPGNPGLKGAVGPKGDRGDR
 
human SP-A1       GSPGIPGTPGSHGLPGRDGRDGLKGDPGPPGPMGPPGEMPCPPGNDGLPGAPGIPGECGEKGEPGERGPPGLP
human SP-A2       GSPGIPGTPGSHGLPGRDGRDGVKGDPGPPGPMGPPGETPCPPGNNGLPGAPGVPGERGEKGEAGERGPPGLP
mouse SP-A        GSPGIPGTPGNHGLPGRDGRDGIKGDPGPPGPMGPPGGMPGLPGRDGLPGAPGAPGEHGDKGEPGERGLPGFP
 
human SP-D        GLPGRDGRDGREGPRGEKGDPGLPGAAGQAGMPGQAGPVGPKGDNGSVGEPGPKGDTGPSGPPGPPGVPGPAGREGALGKQGNIGPQGKP
                     GPKGEAGPKGEVGAPGMQGSAGARGLAGPKGERGVPGERGVPGNTGAAGSAGAMGPQGSPGARGPPGLKGDKGIPGDKGAKGESGLP
mouse SP-D        GLPGRDGRDGREGPRGEKGDPGLPGPMGLSGLQGPTGPVGPKGENGSAGEPGPKGERGLSGPPGLPGIPGPAGKEGPSGKQGNIGPQGKP
                     GPKGEAGPKGEVGAPGMQGSTGAKGSTGPKGERGAPGVQGAPGNAGAAGPAGPAGPQGAPGSRGPPGLKGDRGVPGDRGIKGESGLP
 
human COL-L1      GPKGDDGEKGDPGEEGKHGKVGRMGPKGIKGELGDMGDRGNIGKTGPIGKKGDKGEKGLLGIPGEKGKAGTV
mouse COL-L1      GPKGDDGERGDTGEEGKDGKVGRQGPKGVKGELGDMGAQGNIGKSGPIGKKGDKGEKGLLGIPGEKGKAGTI
 
human COL-K1      GLKGDAGEKGDKGAPGRPGRVGPTGEKGDMGDKGQKGSVGRHGKIGPIGSKGEKGDSGDIGPPGPNGEPGLP
mouse COL-K1      GLKGDAGEKGDKGAPGRPGRVGPTGEKGDMGDKGQKGTVGRHGKIGPIGAKGEKGDSGDIGPPGPSGEPGIP 
 
Back to top
 

COILED-COIL NECK 

human MBP-A           SVAEAKLANLERKL*SLRSELDHTKKN*AFSLGKM
mouse MBP-A           RAIEEKLANMEAEIRILKSKLQLTNKLHAFSMGKK
 
human MBP-C          PDGDSSLAASERKALQTEMARIKKWLTFSLGKQ
mouse MBP-C          AEFDTSEIDSEIAALRSELRALRNWVLFSLSEK
 
human SP-A1              AHLDEELQATLHDFRHQILQTRGALSLQGSIMT
human SP-A2              AHLDEELQATLHDFRHQILQTRGALSLQGSIMT
mouse SP-A               AYLDEELQTASYEIKHQILQTMGVLSLQGSMLS
 
human SP-D                 DVASLRQQVEALQGQVQHLQAAFSQYKKVELFPNGQS
mouse SP-D                 DSAALRQQMEALKGKLQRLEVAFSHYQKAALFPDGRS
 
human COL-L1       CDCGRYRKFVGQLDISIARLKTSMKFVKNVIAGIRE
mouse COL-L1       CDCGRYRKFVGQLDISIARLKTSMKFVKNVIAGIRE
 
human COL-K1       CECSQLRKAIGEMDNQVSQLTSELKFIKNAVAGVRE
mouse COL-K1       CECSQLRKAIGEMDNQVTQLTTELKFIKNAVAGVRE

HEPTAD REPEATS

Back to top
 
 
CTLD - N-TERMINAL
 
human MBP-A        SGKKLFVTNGERMPFSKVKALCAGLQATVAAPKNAEENKAIQD----VAKDTAFLGITDEATEG
mouse MBP-A        SGKKLFVTNHEKMPFSKVKSLCTELQGTVAIPRNAEENKAIQE----VATGIAFLGITDEATEG
human MBP-C        VGNKFFLTNGEIMTFEKVKALCVKFQASVATPRNAAENGAIQN----LIKEEAFLGITDEKTEG
mouse MBP-C        VGKKYFVSSVKKMSLDRVKALCSEFQGSVATPRNAEENSAIQK----VAKDIAYLGITDVRVEG
human SP-A1        VGEKVFSSNGQSITFDAIQEACARAGGRIAVPRNPEENEAIAS-FVKKYNTYAYVGLTEGPSPG
human SP-A2        VGEKVFSSNGQSITFDAIQEACARAGGRIAVPRNPEENEAIAS-FVKKYNTYEYVGLTEGPSPG
mouse SP-A         VGDKVFSTNGQSVNFDTIREMCTRAGGHIAAPRNPEENEAIAS-ITKKYNTYPYLGVIEGQTPG
human SP-D         VGEKIFKTAGFVKPFTEAQLLCTQAGGQLASPRSAAENAALQQ-LVVAKNEAAFLSMTDSKTEG
mouse SP-D         VGDKIFRTADSEKPFEDAQEMCKQAGGQLASPRSATENAAIQQ-LITAHNKAAFLSMTDVGTEG
human COL-L1       TEEKFYYIVQEEKNYRESLTHCRIRGGMLAMPKDEAANTLIADYVAKSGFFRVFIGVNDLEREG
mouse COL-L1       TEEKFYYIVQEEKNYRESLTHCRIRGGMLAMPKDEVVNTLIADYVAKSGFFRVFIGVNDLEREG
human COL-K1       TESKIYLLVKEEKRYADAQLSCQGRGGTLSMPKDEAANGLMAAYLAQAGLARVFIGINDLEKEG
mouse COL-K1       TESKIYLLVKEEKRYADAQLSCQARGGTLSMPKDEAANGLMASYLAQAGLARVFIGINDLEKEG
 
 
CTLD - C-TERMINAL
 
human MBP-A        QFMYLT-GGRLTYSNWKKDEPNDHGSGEDCVILLNNGLWNGISCTSSFIAICEFPA
mouse MBP-A        QFMYVT-GGRLTYSNWKKDEPNNHGSGEDCVIILDNGLWNDISCQASFKAVCEFPA
human MBP-C        QFVDLT-GNRLTYTNWNEGEPNNAGSDEDCVLLLKNGQWNDVPCSTSHLAVCEFPI
mouse MBP-C        SFEDLT-GNRVRYTNWNDGEPNNTGDGEDCVVILGNGKWNDVPCSDSFLAICEFSD
human SP-A1        DFRYSD-GTPVNYTNWYRGEPAGRG-KEQCVEMYTDGQWNDRNCLYSRLTICEF
human SP-A2        DFRYSD-GTPVNYTNWYRGEPAGRG-KEQCVEMYTDGQWNDRNCLYSRLTICEF
mouse SP-A         DFHYLD-GASVNYTNWYPGEPRGRG-KEKCVEMYTDGKWNDKGCLQYRLAICEF
human SP-D         KFTYPT-GESLVYSNWAPGEPNDDGGSEDCVEIFTNGKWNDRACGEKRLVVCEF
mouse SP-D         KFTYPT-GEPLVYSNWAPGEPNNNGGAENCVEIFTNGQWNDKACGEQRLVICEF
human COL-L1       QYMFTDNTPLQNYSNWNEGEPSDPYGHEDCVEMLSSGRWNDTECHT-MYFVCEFIKKKK
mouse COL-L1       QYVFTDNTPLQNYSNWKEEEPSDPSGHEDCVEMLSSGRWNDTECHLTMYFVCEFVKKKK
human COL-K1       AFVYSDHSPMRTFNKWRSGEPNNAYDEEDCVEMVASGGWNDVACHTTMYFMCEFDKENM
mouse COL-K1       AFVYSDRSPMQTFNKWRSGEPNNAYDEEDCVEMVASGGWNDVACHITMYFMCEFDKENL
 
 
CONSERVED CORE RESIDUES     Ca SITE 1 LIGANDS     Ca SITE 2 LIGANDS (SUGAR-BINDING SITE)
 
Back to top

________________________________________________________________________________________________________________________

This page last updated:
Wednesday, 18 October 2006
Animal lectins home
Contact information: This site is supported by:
 
Kurt Drickamer
Division of Molecular Biosciences
Faculty of Natural Sciences
Imperial College London
 
Email: k.drickamer@imperial.ac.uk