Galectins - sequence alignment

Introduction to galectins
Interpro entry: Galectin, galactose-binding lectin

Alignment of galectin sequences in the region of the sugar-binding site. Where two galectin CRDs are present in the same polypeptide, the N- and C-terminal domains are denoted a and b respectively.  Regions of secondary structure are indicated above the alignment and coloured to correspond with the structure of galectin-3, left.  Beta-strands in mammalian are indicated S4, 5, 6a and 6b and highlighted in blue.  Conserved residues involved in ligand binding are highlighted in red (polar) and yellow (hydrophobic).

NB There is no galectin-5, -6 or -11 in humans.  Galectin-5 is present only in rat and galectin-6 only in mouse.  Galectin-11 has been used as an alternative name for two non-human galectin-like proteins.

Accession numbers

Human (HS): 1 P09382 2 P05162 3 P17931 4 P56470 7 P47929 8 O00214 9 O00182 10 Q05315 12 Q96DT0 13 Q9UHV8 14 Q8TCE9.

C elegans (CE): 1 P36573 bO45904 2a Q20684 b Q9GNP7 3 Q09581 bQ95ZI4 cQ8I4A5 dQ8I4A4 4 Q9GNP5 Q18625 5 Q23426 6 Q9N384 Q21057 7 Q09605 8 Q09610 9 Q94169 10 Q23065 11 Q9GNP8 Q94215 F46A8.3 O01329 F46A8.4 O01330 F46A8.5 O01331 F49F1.9 Q9GZE0 F49F1.10 Q9GZD9 F49F1.11 Q9GZD8 C27C7.5 Q9XVD0 W04B5.3a Q9UA61 bQ8WSM4 cQ8ITW2 PQN-84 Q9U2I7 F40H6.5 Q09277 T02G6.7 O62353 F41D3.6 O45510 C53D6.7 Q18813 DC2.3 Q9UAW1 bQ5ZR27.

Drosophila (DM): CG5335 Q9V8J9 Q95SA3 CG11372 Q9VPI6 CG11374 Q9VPI7 CG13950 Q9VPU6 CG14879 Q9VEW5 Q8SXU5 CG32226 Q9VWB0.

If alignments appear scrambled please maximize the width of your browser window.

 

Structure of human galectin-3 with bound LacNAc

Beta-strands in one sheet are coloured yellow and those in the opposing sheet, blue.  The Gal moiety of LacNAc is shown in yellow and the GlcNAc portion in dark blue.  Protein Data Bank structure ID: 1KJL.

 

 

                  S4                    S5        S6a    S6b

                 

HS Galectin-1    NLCLHFNPRFNAHGD------ANTIVCNSKD-GGAWG--TEQRE

HS Galectin-2    KLNLHFNPRFS----------ESTIVCNSLD-GSNWG--QEQRE

HS Galectin-3    DVAFHFNPRFNENN-------RRVIVCNTKL-DNNWG--REERQ

HS Galectin-4a   DVAFHFNPRFDG---------WDKVVFNTLQ-GGKWG--SEERK

             b   DIALHINPRMG----------NGTVVRNSLL-NGSWG--SEEKK

HS Galectin-7    DAALHFNPRLD----------TSEVVFNSKE-QGSWG--REERG

HS Galectin-8a   DVAFHFNPRFKR---------AGCIVCNTLI-NEKWG--REEIT

             b   DIALHLNPRLN----------IKAFVRNSFL-QESWG--EEERN

HS Galectin-9a   DIAFHFNPRFED---------GGYVVCNTRQ-NGSWG--PEERK

             b   HIAFHLNPRFD----------ENAVVRNTQI-DNSWG--SEERS

HS Galectin-10   DIVFHFQVCF-----------GRRVVMNSRE-YGAWK-QVESKN

HS Galectin-12a  DIAFHFNPRFHT--------TKPHVICNTLH-GGRW--QREARW

              b  DQAAHAPVTLR----------ASFADRTLAW-ISRWG-QKKL                

HS Galectin-13   DIAFRFRVHFG-----------NHVVMNRRE-FGIWM--LEETT

HS Galectin-14   DIAFQFRLHFG-----------HPAIMNSRV-FGIWR--YEEKC

RN Galectin-5    DIAFHLNPRFD----------ENAVVRNTQI-NNSWG--PEERS

MM Galectin-6a   DVAFHFNPRFDG---------WDKVVFNTKQ-SGRWG--KEEEK

             b   DIALHINPRI-----------GDCLVRNSYM-NGSWG--TEERM

 

CE Galectin-1a   DVPLHVSVRFD----------EGKIVLNSFS-NGEWG--KEERK

             b   DISFHFNPRFD----------EKHVIRNSLA-ANEWG--NEERE

CE Galectin 2a   DVPLHVSVRFD----------EGKLVFNTFS-KGEWG--KEERK

             b   DIALHLNARFD----------EKHVVRNSLI-NSAWG--NEERE

CE Galectin 3a   DVPLHLSIRFD----------EGKIVYNAYT-KGTWG--KEERA

             b   DIALHFNPRFD----------EKSVVRNSLV-NGEWG--NEERE

CE Galectin 4a   QVILHLKLNFK----------EKKVILNSYE-NGVWG--KEERE

             b   DILFHFNPRIK----------DKAVVRNSHR-NGFWD--TEERE

CE Galectin-5a   DLPLHFSIRFNEGLLG-----SGKLIYNTFE-NGNWS-EDEQRI

             b   EYALQVSIRYN----------EGAVVRNAMT-SNVWG--KEERE

CE Galectin 6    DIVLHFNARFD----------EGAVVNNSTS-GGGWQ--SEDRH

CE Galectin-7    NIVLHVKFEFNG---------EHSIVLNSLI-NGEWG--PQLRH

CE Galectin-8    HIVLHVNFRFEH---------DHIVAMNTCT-NGAWG--AEIRH

CE Galectin 9    NIALHVNFRFEK---------QKIVAINAKI-DNAWG--NEISH

CE Galectin 10   NVVLHVNFRFHH---------EHVVVMNSQF-SGMWG--PEIRH

CE Galectin-11   GVALHISVRMGSYG-------QNVIVFNHLQ-RGRWH--REEHH

CE F46A8.3       TWVFHFACEPT----------KGLVARTRHT-NGAWE-VGETYS

CE F46A8.4       TWVFHFASEPT----------KGLVARTRFT-NGAWE-VGETYG

CE F46A8.5       VWLFRFASQPD----------LGRVVRTRHQ-NGQFL-QGDTYG

CE F46A8.8       TWLFHFASQPD----------LGHVVRTREQ-NGQFQ-TPDTYG

CE F49F1.9       EWLFHFASQPD----------LRRVVRTRHL-NGKFL-LGDTYG

CE F49F1.10      VWVFHFAVEPS----------KGLVARTRHL-NGKWQ-VGETYG

CE F49F1.11      VWVFHFASEPT----------KGLVARTRHT-NGAWE-VGETYG

CE C27C7.5       EVLFHFNPRPE----------VNEVVRT-HR-NGVWE-MYESSG

CE PQN-84        PLSLHIRVQYL----------RNSIIYNSW--FGSWG--GEQFS

CE F40H6.5a      YISIHINMRWA----------TDGIVYNYF--WGMWNYQFESYS

          b      AVPVHISQRWS---------SSARIIYNNF--WGQWG--PEAFS

CE T02G6.7       DLLFHFNPRPN----------AKFVVRATLL-DGVWG-ASEEDG

CE F41D3.6       NILFHFNPRPE----------ENVVVRSTSI-GGVWQ-AAERDG

CE C53D6.7a      LDDRLLALRVDLA--------KGKVFLNACI-NGQWGNFSEHVF

          b      DIPFQLCASFV----------QKKVLRSSEI-SGTKS-TPETNF

CE DC2.3         NKALHFNPRFDAKTGWFSGPGDDKLVINSFV-SGRWG--NEERF

CE W04B5.3       DQALRVELVLVQ---------PSKLKIDCTVSQKKST--AVET

 

DM CG5335 a      DIGLRFSCYFR----------NDVIVRNSRI-NGAWG--EEESH

          b      REELHFSVRFDE----------KAVVRNSMNKNFEFG--SEERH

DM CG11372a      DVALHINPRLP----------QNYIVRNTKV-QDIWG--NEEVS

          b      VIAFHLNPRFSKASSGAIG--KAVVCRNAWL-NGAWA--QEERS

DM CG11374a      DVLLQIGSRLP----------QNYITRNSRL-KGKWG--PEENS

          b      TVSFYFSPCFLRSRHDKIG--TAIITRRAYL-NGDWVNCTVSRL

DM CG13950a      TVFLKIEANFR----------EGQIIRSMFQPGGGWQ--QEEIS

          b      RTVLRFHVNFD----------RTTVSRSYQREDNSFALSDEETE

DM CG14879       NITYHFKVVIP----------TQTIIENYKR-NGEWSSLHQEKY

DM CG32226       VIALHLNPRFN----------ERTTVLNSMK-ESEWL--DEIRN

 

 

Return to top  

___________________________________________________________________________________

 

This page last updated:
Tuesday, 19 September 2006
Animal lectins home
Contact information: This site is supported by:
 
Kurt Drickamer
Division of Molecular Biosciences
Faculty of Natural Sciences
Imperial College London
 
Email: k.drickamer@imperial.ac.uk