Chitinase-like lectins - sequence alignment

Introduction to chitinase-like lectins
Alignment of chitinase-like (glycoside hydrolase family 18) domains in human and mouse proteins.  Regions of secondary structure are indicated above the alignment and coloured to correspond with the structure of Ym1, left.  Alpha helices in the TIM-like domain (a1-8) and the small alpha/beta domain (aC) are higlighted in red.  Beta sheets in the TIM-like domain (b1-8) are highlighted in blue and those in the small alpha/beta domain (bA-B, bD-F) in yellow
Residues that are identical in over 75% of the aligned sequences are highlighted in green.  Residues that are similar in over 75% of the sequences are highlighted in cyan.  Cysteine residues which form disulfide bonds are highlighted in yellow.  with disulfide pairings numbered 1 and 2.  Residues from the active site motif DXXDXDXE, which includes the catalytic glutamate residue, are highlighted in magenta where identical to the motif and in pink where similar. 
Side chains in human YKL-40 which make hydrogen bonds to bound chito-oligosaccharides, and the conserved equivalents in other proteins, are highlighted in red.  Hydrophobic residue of the substrate binding cleft, and the conserved equivalents in other proteins, are highlighted in orange. Residues making both hydrogen bonds and hydrophobic interactions are marked x.  Conservation of hydrophobic residues in YKL-40 and active chitinases, but not in Ym1, may account for differences in ligand binding specificity.
 
Structure of murine Ym1

Beta-strands in the TIM domain are shown in light blue and those in the small insertion domain in yellow.  The modelled glucosamine molecule is shown in dark blue.  Protein Data Bank structure ID: 1E9L.

                      leader           b1       a1             a1      b2         b2        a2          
                                         1                        1                                       
HS LOC149620                                                                                       ...RNSQL
HS AMCase       MTKLILLTGLVLILNLQLGSAYQLTCYFTNWAQYRPGLGRFMPDNIDPCLCTHLIYAFAGRQNNEITT-IEWNDVTLYQAFNGLKNKNSQL
MM AMCase       MAKLLLVTGLALLLNAQLGSAYNLICYFTNWAQYRPGLGSFKPDDINPCLCTHLIYAFAGMQNNEITT-IEWNDVTLYKAFNDLKNRNSKL
HS CHIT1        MVRSVAWAGFMVLLMIPWGSAAKLVCYFTNWAQYRQGEARFLPKDLDPSLCTHLIYAFAGMTNHQLST-TEWNDETLYQEFNGLKKMNPKL
MM CHIT1        MVQSLAWAGVMTLLMVQWGSAAKLVCYLTNWSQYRTEAVRFFPRDVDPNLCTHVIFAFAGMDNHQLST-VEHNDELLYQELNSLKTKNPKL
MM Ym1          MAKLILVTGLAILLNVQLGSSYQLMCYYTSWAKDRPIEGSFKPGNIDPCLCTHLIYAFAGMQNNEITY-THEQDLRDYEALNGLKDKNTEL
MM Ym2          MAKLILVTGLAILLNVQLGSSYQLMCYYTSWAKDRPTEGSFKPGNIDPCLCTHLIYAFAGMKNNEITY-LSEQDLRDYEALNGLKDRNTEL
MM LOC229688 ...ISKLVFIMGLNLLLNAQMGSAYQLMCYFNNWPQHQPDVRDIKHEDIDPCLCTHLIYSFAGIWENNFTM-TKRKELDDYKGFNDLKKRNNKL
HS YKL39        MDQKSLWAGVVVLLLLQGGSAYKLVCYFTNWSQDRQEPGKFTPENIDPFLCSHLIYSFASIENNKVII-KDKSEVMLYQTINSLKTKNPKL
HS YKL40        MGVKASQTGFVVLVLLQCCSAYKLVCYYTSWSQYREGDGSCFPDALDRFLCTHIIYSFANISNDHI-DTWEWNDVTLYGMLNTLKNRNPNL
MM YKL40        MGMRAALTGFAVLMLLQSCSAYKLVCYFTSWSQYREGVGSFLPDAIQPFLCTHIIYSFANISSDNMLSTWEWNDESNYDKLNKLKTRNTNL
HS OvGP                        ...GAAHKLVCYFTNWAHSRPGPASILPHDLDPFLCTHLIFAFASMNNNQIVAKDLQDEKILYPEFNKLKERNREL
MM OvGP         MGRLLLLAGLVLLMKHSDGTAYKLVCYFTNWAHSRPGPASIMPHDLDPFLCTHLIFAFASMSNNQIVAKNLQDENVLYPEFNKLKERNREL
HS DIAC       MSRPQLRRWRLVSSPPSGVPGLALLALLALLALRLAAGTDCPCPEPELCRPIRHHPDFEVFVFDVGQKTWKSYDWSQITTVATFGKYDSELMC
MM DIAC                       MALCGLPEFTLLLLPLLARLSAGDCPCSEAALCQPIRHRPDFEVFVFDVGQKTWKSYDWSQITTVAAFGKYDPELMC
HS CHID1      ...SVVLEHRSYCSAKARDRHFAGDVLGYVTPWNSHGYDVTKVFGSKFTQISPVWLQLKRRGREMFEVTGLHDVDQGWMRAVRKHAKGLHIV-
MM CHID1      ...DVVLEHRSYCSSRARERNFAGEVLGYVTPWNSHGYDVAKVFGSKFTQISPVWLQLKRRGREMFEITGLHDVDQGWMRAVKKHAKGVRIV-
 
 
                  b3           a3            a3               b4                     a4                b5 
                        x                                                                                    
HS LOC149620    KTLLAIGGWNFGTAPFTAMVSTPENHQTFINSVIKFLRQYEFDGLDFDWEYPGSRVSPPQDKHLFTVLVQEMREAFEQEAKHINKPRLMVT
HS AMCase       KTLLAIGGWNFGTAPFTAMVSTPENRQTFITSVIKFLRQYEFDGLDFDWEYPGSRGSPPQDKHLFTVLVQEMREAFEQEAKQINKPRLMVT
MM AMCase       KTLLAIGGWNFGTAPFTTMVSTSQNRQTFITSVIKFLRQYGFDGLDLDWEYPGSRGSPPQDKHLFTVLVKEMREAFEQEAIESNRPRLMVT
HS CHIT1        KTLLAIGGWNFGTQKFTDMVATANNRQTFVNSAIRFLRKYSFDGLDLDWEYPGSQGSPAVDKERFTTLVQDLANAFQQEAQTSGKERLLLS
MM CHIT1        KTLLAVGGWTFGTQKFTDMVATASNRQTFVKSALSFLRTQGFDGLDLDWEFPGGRGSPTVDKERFTALIQDLAKAFQEEAQSSGKERLLLT
MM Ym1          KTLLAIGGWKFGPASFSAMVSTPQNRQIFIQSVIRFLRQYNFDGLNLDWQYPGSRGSPPKDKHLFSVLVKEMRKAFEEESVEKDIPRLLLT
MM Ym2          KTLLAIGGWKFGPAPFSSMVSTPQNRQTFIKSVIRFLRQYNFDGLNLDWQYPGSRGSPPKDKHLFSVLVQEMRKAFEEESTLNHIPRLLLT
MM BCLP1/2                        MVSTPHNQQTFINSAIKFLRQYGFDGLNLDWQFPGSRGSPSRDKHLFTVLVQ--------------LPRL...
MM LOC229688    KTLLSIGCWNFGDGSFITMVSTPENRHSFITSIIKFLRKYGFDGLNLAWQYPGCYGSPPRDKHLFTILMHEIRKAFEKEVSKNKKPRLMVT
HS YKL39        KILLSIGGYLFGSKGFHPMVDSSTSRLEFINSIILFLRNHNFDGLDVSWIYPDQK-----ENTHFTVLIHELAEAFQKDFTKSTKERLLLT
HS YKL40        KTLLSVGGWNFGSQRFSKIASNTQSRRTFIKSVPPFLRTHGFDGLDLAWLYPGRR-----DKQHFTTLIKEMKAEFIKEAQPGKK-QLLLS
MM YKL40        KTLLSVGGWKFGEKRFSEIASNTERRTAFVRSVAPFLRSYGFDGLDLAWLYPRLR-----DKQYFSTLIKELNAEFTKEVQPGRE-KLLLS
HS OvGP         KTLLSIGGWNFGTSRFTTMLSTFANREKFIASVISLLRTHDFDGLDLFFLYPGLRGSPMHDRWTFLFLIEELLFAFRKEALLTMRPRLLLS
MM OvGP         KTLLSIGGWNFGTSRFTAMLSTLANREKFIDSVISFLRIHGFDGLDLFFLYPGLRGSPPHDRWNFLFLIEELQFAFEREALLTQHPRLLLS
HS DIAC         YAHSKGARVVLKGDVSLKDIIDPAFRASWIAQKLNLAKTQYMDGINIDIEQEVNCLSPEYDA--LTALVKETTDSFHREIEGSQVTFDV
MM DIAC         YAHSKGARVVLKGDISLKNIIDPTFRASWIAQKVDLAKAQYMDGINIDIEQEVNCSSPEYEA--LTALVKETTESFQREIEGSQVTFDV
HS CHID1        PRLL-FEDWTY--DDFRNVLDSEDEIEELSKTVVQVAKNQHFDGFVVEVWNQLLSQKRVGLIHMLTHLAEALHQA--RLLALLVIPPAITP
MM CHID1        PRLL-FEDWTY--DDFRNVLDSEDEIEELSKTVAQVAKNQHFDGFVVEVWSQLLSQKHVGLIHMLTHLAEALHQA--RLLVILVIPPAVTP
 
                 b5         a5             b6                             a6       a6             b7     bA
                                                                                                      
HS LOC149620    AAVAAGISN-IQSGYEIPQLSQYPDYIHVMTYDLHGSWEG--YTGENSPLYKYPTDTGSNAYLNVDYVMNYWKDNRAPAEKL...
HS AMCase       AAVAAGISN-IQSGYEIPQLSQYLDYIHVMTYDLHGSWEG--YTGENSPLYKYPTDTGSNAYLNVDYVMNYWKDNGAPAEKLIVGFPTYGH
MM AMCase       AAVAGGISN-IQAGYEIPELSKYLDFIHVMTYDLHGSWEG--YTGENSPLYKYPTETGSNAYLNVDYVMNYWKNNGAPAEKLIVGFPEYGH
HS CHIT1        AAVPAGQTY-VDAGYEVDKIAQNLDFVNLMAYDFHGSWEK--VTGHNSPLYKRQEESGAAASLNVDAAVQQWLQKGTPASKLILGMPTYGR
MM CHIT1        AAVPSDRGL-VDAGYEVDKIAQSLDFINLMAYDFHSSLEK--TTGHNSPLYKRQGESGAAAEQNVDAAVTLWLQKGTPASKLILGMPTYGR
MM Ym1          STGAGIIDV-IKSGYKIPELSQSLDYIQVMTYDLHDPKDG--YTGENSPLYKSPYDIGKSADLNVDSIISYWKDHGAASEKLIVGFPAYGH
MM Ym2          STGAGFIDV-IKSGYKIPELSQSLDYIQVMTYDLHDPKNG--YTGENSPLYKSPYDIGKSADLNVDSIITYWKDHGAASEKLIVGFPAYGH
MM BCLP1/2      .............................MTYNLHGSQDG--YTGENSPLYKSLNDTGINTLLNVDYIMTYWNENGAAPEKLIVGFPAYGQ
MM LOC229688    AAVAGVIST-IQFGYEIPQLSQSLDYIQVMTYDLHGSWDG--YTGENSPLYKSPIETGVKAFHNIKYIMDNWKKKGASPEKLIVGFPAYGH
HS YKL39        AGVSAGRQM-IDNSYQVEKLAKDLDFINLLSFDFHGSWEKPLITGHNSPLSKGWQDRGPSSYYNVEYAVGYWIHKGMPSEKVVMGIPTYGH
HS YKL40        AALSAGKVT-IDSSYDIAKISQHLDFISIMTYDFHGAWRG--TTGHHSPLFRGQEDASPDRFSNTDYAVGYMLRLGAPASKLVMGIPTFGR
MM YKL40        AALSAGKVA-IDTGYDIAQIAQHLDFINLMTYDFHGVWRQ--ITGHHSPLFQGQKDTRFDRYSNVNYAVQYMIRLGAQASKLLMGIPTFGK
HS OvGP         AAVSGVPHI-VQTSYDVRFLGRLLDFINVLSYDLHGSWER--FTGHNSPLFSLP-EDPKSSA----YAMNYWRKLGAPSEKLIMGIPTYGR
MM OvGP         AAVSGIPSI-IHTSYDALLLGRRLDFINVLSYDLHGSWEK--FTGHNSPLFSLP-EDSKSSA----YAMNYWRKLGTPADKLIMGFPTYGR
HS DIAC         -AWS--PKNIDRRCYNYTGIADACDFLFVMSYDEQSQIWSECIAAANAPY-NQTL-TGYNDYI----------KMSINPKKLVMGVPWYGY
MM DIAC         -AWS--PKRIDKRCYNYTGIADACDFLFVMSYDEQSQIWSECIAAANAPY-NQTL-TGYIDYI----------KMGISPKKLVMGVPWYGY
HS CHID1        ----GTDQLGMFTHKEFEQLAPVLDGFSLMTYDYSTAHQP----GPNAPL-----SWVRACVQVLDPK-SKWRS------KILLGLNFYGM
MM CHID1        ----GTDQLGMFTHKEFEQLAPILDGFSLMTYDYSTSQQP----GPNAPL-----SWIRACVQVLDPK-SQWRS------KILLGLNFYGM
 
                bA            bB             bB     aC      bD        bE     bF        a7            b8   
                                                     2                                                    x 
HS AMCase       NFILSNPSNTGIGAPTSGAGPAGPYAKESGIWAYYEICTFLKNGATQGWDAPQEVPYAYQGNVWVGYDNIKSFDIKAQWLKHNKFGGAMVW
MM AMCase       TFILRNPSDNGIGAPTSGDGPAGAYTRQAGFWAYYEICTFLRSGATEVWDASQEVPYAYKANEWLGYDNIKSFSVKAQWLKQNNFGGAMIW
HS CHIT1        SFTLASSSDTRVGAPATGSGTPGPFTKEGGMLAYYEVCSW-KGATKQRI-QDQKVPYIFRDNQWVGFDDVESFKTKVSYLKQKGLGGAMVW
MM CHIT1        SFTLASSSDNGVGAPATGPGAPGPYTKDKGVLAYYEACSW-KE--RHRI-EDQKVPYAFQDNQWVSFDDVESFKAKAAYLKQKGLGGAMVW
MM Ym1          TFILSDPSKTGIGAPTISTGPPGKYTDESGLLAYYEVCTFLNEGATEVWDAPQEVPYAYQGNEWVGYDNVRSFKLKAQWLKDNNLGGAVVW
MM Ym2          TFILSDPSKNGIGDPTVSAGPPGKYTNEQGLLAYFEICTFLNEGATEIFDATQEVPYAYLGNEWVGYDNVRSFKLKAQWLKDNNLGGAVVW
MM BCLP1/2      TFTLSDPSNNGISAPTASAGTLGPYTEESGTWAYYEICSFLNDGATEAWDSAQEVPYAYQGNKWVGYDNVKSFRIK...
MM LOC229688    TFILSDSTKTEIGAPSNRGGHPGPHTKQTGFWAYYEICTFLKNGAIQVWNAAQQVPYAFHGNEWVGYDNIKSFHIKAQWLKRNNYGGAMIW
HS YKL39        SFTLAS-AETTVGAPASGPGAAGPITESSGFLAYYEICQFLKGAKITRL-QDQQVPYAVKGNQWVGYDDVKSMETKVQFLKNLNLGGAMIW
HS YKL40        SFTLAS-SETGVGAPISGPGIPGRFTKEAGTLAYYEICDFLRGATVHRT-LGQQVPYATKGNQWVGYDDQESVKSKVQYLKDRQLAGAMVW
MM YKL40        SFTLAS-SENQLGAPISGEGLPGRFTKEAGTLAYYEICDFLKGAEVHRL-SNEKVPFATKGNQWVGYEHKESVKNKVGFLKEKKLAGAMVW
HS OvGP         TFRLLKASKNGLQARAIGPASPGKYTKQEGFLAYFEICSFVWGAKKHWID-YQYVPYANKGKEWVGYDNAISFSYKAWFIRREHFGGAMVW
MM OvGP         NFYLLKESKNGLQTASMGPASPGKYTKQAGFLAYYEVCSFVQRAKKHWID-YQYVPYAFKGKEWLGYDDTISFSYKAMYVKREHFGGAMVW
HS DIAC  DYTCLNLSEDHVCTIAKVPFRGAPCSDAAGRQVPYKTIMKQINSSISGNLWDKDQRAPYYNYKDPAGHFHQVWYDNPQSISLKATYIQNYRLRGIGMW
MM DIAC  DYICLNLSKDDICTITKVPFRGAPCSDAAGHQVPYKVIMKQVNGSVSGSQWNKDQQAPYYNYKDPAGRFHQVWYDNPQSISLKAAYVKNYGLRGIGMW
HS CHID1        ---------------DYATSKDAREPVVGARYIQTLKDHRPRMVWDSQVSEHFFEYKKSRSGRHVVFYPTLKSLQVRLELARELGVGVSIW
MM CHID1        ---------------DYAASKDAREPVIGARYVQTLKDHRPRVVRDSQAAEHFFEYKKNRGGRHVVFYPTLKSLQVRLELARELGVGVSIW

                                             a8         
                           2        
HS AMCase       AIDLDDFTGTFCNQG-KFPLISTLKKALGLQSASCTAP
MM AMCase       AIDLDDFTGSFCDQG-KFPLTSTLNKALGISTEGCTAP
HS CHIT1        ALDLDDFAGFSCNQG-RYPLIQTLRQELSLPYLPSGTP
MM CHIT1        VLDLDDFKGSFCNQG-PYPLIRTLRQELNLPSETPRSP
MM Ym1          PLDMDDFSGSFCHQR-HFPLTSTLKGDLNIHSASCKGPY
MM Ym2          PLDMDDFSGSFCHQG-RFPLTTTLKRDLNVHSASCKASYRGEL
MM LOC229688    TIDMDDYTGSFCGQG-TFPLTSILKKTLKVHSASCNVTVLSANVTVSRNSSSGALEPVLQLSSK
HS YKL39        SIDMDDFTGKSCNQG-PYPLVQAVKRSLGSL
HS YKL40        ALDLDDFQGSFCGQDLRFPLTNAIKDALAAT
MM YKL40        ALDLDDFQGT-CQPKEFFPLTNAIKDALA
HS OvGP         TLDMDDVRGTFCGTG-PFPLVYVLNDIL...
MM OvGP         TLDMDDVRGTFCGNG-PFPLVHILNELL...
HS DIAC         NANELCLDYSGDAVAKQQTEEMWEVLKPKLLQR
MM DIAC         NAN--CLDYSDDALAREQTQEMWGALKPRL
HS CHID1        ---GQGLDYFYDLL
MM CHID1        -ELGQGLDYFYDLL

 

CHITIN-BINDING DOMAIN (ACTIVE CHITINASES ONLY)
                               
HS AMCase       AQPIEPITAAPSGSGNGSGSSSSGGSSGGSGFCAVRANGLYPVANNRNAFWHCVNGVTYQQNCQAGLVFDTSCDCCNWA 
MM AMCase       DVPSEPVTTPP-GSGSGGGS--SGGSSGGSGFCADKADGLYPVADDRNAFWQCINGITYQQHCQAGLVFDTSCNCCNWP
HS CHIT1        ELEVPKPGQPSEPEHGPSPGQDT--------FCQGKADGLYPNPRERSSFYSCAAGRLFQQSCPTGLVFSNSCKCCTWN
MM CHIT1        EQIIPEPRPSSMPEQGPSPGLDN--------FCQGKADGVYPNPGDESTYYNCGGGRLFQQSCPPGLVFRASCKCCTWS
 

_______________________________________________________________________________________________________________________

This page last updated:
Tuesday, 19 September 2006
Animal lectins home
Contact information: This site is supported by:
 
Kurt Drickamer
Division of Molecular Biosciences
Faculty of Natural Sciences
Imperial College London
 
Email: k.drickamer@imperial.ac.uk