Galectins - sequence alignment |
|
|
Alignment of galectin sequences in the region of the sugar-binding site. Where two galectin CRDs are present in the same polypeptide, the N- and C-terminal domains are denoted a and b respectively. Regions of secondary structure are indicated above the alignment and coloured to correspond with the structure of galectin-3, left. Beta-strands in mammalian are indicated S4, 5, 6a and 6b and highlighted in blue. Conserved residues involved in ligand binding are highlighted in red (polar) and yellow (hydrophobic). NB There is no galectin-5, -6 or -11 in humans. Galectin-5 is present only in rat and galectin-6 only in mouse. Galectin-11 has been used as an alternative name for two non-human galectin-like proteins. Accession numbers Human (HS): 1 P09382 2 P05162 3 P17931 4 P56470 7 P47929 8 O00214 9 O00182 10 Q05315 12 Q96DT0 13 Q9UHV8 14 Q8TCE9. C elegans (CE): 1 P36573 bO45904 2a Q20684 b Q9GNP7 3 Q09581 bQ95ZI4 cQ8I4A5 dQ8I4A4 4 Q9GNP5 Q18625 5 Q23426 6 Q9N384 Q21057 7 Q09605 8 Q09610 9 Q94169 10 Q23065 11 Q9GNP8 Q94215 F46A8.3 O01329 F46A8.4 O01330 F46A8.5 O01331 F49F1.9 Q9GZE0 F49F1.10 Q9GZD9 F49F1.11 Q9GZD8 C27C7.5 Q9XVD0 W04B5.3a Q9UA61 bQ8WSM4 cQ8ITW2 PQN-84 Q9U2I7 F40H6.5 Q09277 T02G6.7 O62353 F41D3.6 O45510 C53D6.7 Q18813 DC2.3 Q9UAW1 bQ5ZR27. Drosophila (DM): CG5335 Q9V8J9 Q95SA3 CG11372 Q9VPI6 CG11374 Q9VPI7 CG13950 Q9VPU6 CG14879 Q9VEW5 Q8SXU5 CG32226 Q9VWB0. If alignments appear scrambled please maximize the width of your browser window.
|
|
| Structure of
human galectin-3 with bound LacNAc
Beta-strands in one sheet are coloured yellow and those in the opposing sheet, blue. The Gal moiety of LacNAc is shown in yellow and the GlcNAc portion in dark blue. Protein Data Bank structure ID: 1KJL. |
|
S4 S5 S6a S6b
HS
Galectin-1
NLCLHFNPRFNAHGD------ANTIVCNSKD-GGAWG--TEQRE
HS
Galectin-2 KLNLHFNPRFS----------ESTIVCNSLD-GSNWG--QEQRE
HS
Galectin-3
DVAFHFNPRFNENN-------RRVIVCNTKL-DNNWG--REERQ
HS
Galectin-4a DVAFHFNPRFDG---------WDKVVFNTLQ-GGKWG--SEERK
b DIALHINPRMG----------NGTVVRNSLL-NGSWG--SEEKK
HS
Galectin-7
DAALHFNPRLD----------TSEVVFNSKE-QGSWG--REERG
HS
Galectin-8a DVAFHFNPRFKR---------AGCIVCNTLI-NEKWG--REEIT
b DIALHLNPRLN----------IKAFVRNSFL-QESWG--EEERN
HS
Galectin-9a DIAFHFNPRFED---------GGYVVCNTRQ-NGSWG--PEERK
b HIAFHLNPRFD----------ENAVVRNTQI-DNSWG--SEERS
HS Galectin-10 DIVFHFQVCF-----------GRRVVMNSRE-YGAWK-QVESKN
HS Galectin-12a DIAFHFNPRFHT--------TKPHVICNTLH-GGRW--QREARW
b DQAAHAPVTLR----------ASFADRTLAW-ISRWG-QKKL
HS Galectin-13 DIAFRFRVHFG-----------NHVVMNRRE-FGIWM--LEETT
HS Galectin-14 DIAFQFRLHFG-----------HPAIMNSRV-FGIWR--YEEKC
RN
Galectin-5
DIAFHLNPRFD----------ENAVVRNTQI-NNSWG--PEERS
MM
Galectin-6a DVAFHFNPRFDG---------WDKVVFNTKQ-SGRWG--KEEEK
b DIALHINPRI-----------GDCLVRNSYM-NGSWG--TEERM
CE Galectin-1a DVPLHVSVRFD----------EGKIVLNSFS-NGEWG--KEERK
b DISFHFNPRFD----------EKHVIRNSLA-ANEWG--NEERE
CE
Galectin 2a DVPLHVSVRFD----------EGKLVFNTFS-KGEWG--KEERK
b DIALHLNARFD----------EKHVVRNSLI-NSAWG--NEERE
CE
Galectin 3a DVPLHLSIRFD----------EGKIVYNAYT-KGTWG--KEERA
b DIALHFNPRFD----------EKSVVRNSLV-NGEWG--NEERE
CE
Galectin 4a QVILHLKLNFK----------EKKVILNSYE-NGVWG--KEERE
b DILFHFNPRIK----------DKAVVRNSHR-NGFWD--TEERE
CE Galectin-5a DLPLHFSIRFNEGLLG-----SGKLIYNTFE-NGNWS-EDEQRI
b EYALQVSIRYN----------EGAVVRNAMT-SNVWG--KEERE
CE
Galectin 6 DIVLHFNARFD----------EGAVVNNSTS-GGGWQ--SEDRH
CE
Galectin-7 NIVLHVKFEFNG---------EHSIVLNSLI-NGEWG--PQLRH
CE Galectin-8 HIVLHVNFRFEH---------DHIVAMNTCT-NGAWG--AEIRH
CE
Galectin 9 NIALHVNFRFEK---------QKIVAINAKI-DNAWG--NEISH
CE Galectin
10
NVVLHVNFRFHH---------EHVVVMNSQF-SGMWG--PEIRH
CE
Galectin-11 GVALHISVRMGSYG-------QNVIVFNHLQ-RGRWH--REEHH
CE
F46A8.3
TWVFHFACEPT----------KGLVARTRHT-NGAWE-VGETYS
CE F46A8.4 TWVFHFASEPT----------KGLVARTRFT-NGAWE-VGETYG
CE F46A8.5 VWLFRFASQPD----------LGRVVRTRHQ-NGQFL-QGDTYG
CE F46A8.8 TWLFHFASQPD----------LGHVVRTREQ-NGQFQ-TPDTYG
CE F49F1.9 EWLFHFASQPD----------LRRVVRTRHL-NGKFL-LGDTYG
CE F49F1.10 VWVFHFAVEPS----------KGLVARTRHL-NGKWQ-VGETYG
CE F49F1.11 VWVFHFASEPT----------KGLVARTRHT-NGAWE-VGETYG
CE C27C7.5 EVLFHFNPRPE----------VNEVVRT-HR-NGVWE-MYESSG
CE PQN-84 PLSLHIRVQYL----------RNSIIYNSW--FGSWG--GEQFS
CE F40H6.5a YISIHINMRWA----------TDGIVYNYF--WGMWNYQFESYS
b AVPVHISQRWS---------SSARIIYNNF--WGQWG--PEAFS
CE
T02G6.7
DLLFHFNPRPN----------AKFVVRATLL-DGVWG-ASEEDG
CE
F41D3.6
NILFHFNPRPE----------ENVVVRSTSI-GGVWQ-AAERDG
CE
C53D6.7a
LDDRLLALRVDLA--------KGKVFLNACI-NGQWGNFSEHVF
b
DIPFQLCASFV----------QKKVLRSSEI-SGTKS-TPETNF
CE DC2.3 NKALHFNPRFDAKTGWFSGPGDDKLVINSFV-SGRWG--NEERF
CE W04B5.3 DQALRVELVLVQ---------PSKLKIDCTVSQKKST--AVET
DM
CG5335 a
DIGLRFSCYFR----------NDVIVRNSRI-NGAWG--EEESH
b
REELHFSVRFDE----------KAVVRNSMNKNFEFG--SEERH
DM
CG11372a
DVALHINPRLP----------QNYIVRNTKV-QDIWG--NEEVS
b VIAFHLNPRFSKASSGAIG--KAVVCRNAWL-NGAWA--QEERS
DM
CG11374a
DVLLQIGSRLP----------QNYITRNSRL-KGKWG--PEENS
b TVSFYFSPCFLRSRHDKIG--TAIITRRAYL-NGDWVNCTVSRL
DM CG13950a TVFLKIEANFR----------EGQIIRSMFQPGGGWQ--QEEIS
b RTVLRFHVNFD----------RTTVSRSYQREDNSFALSDEETE
DM CG14879 NITYHFKVVIP----------TQTIIENYKR-NGEWSSLHQEKY
DM
CG32226
VIALHLNPRFN----------ERTTVLNSMK-ESEWL--DEIRN
___________________________________________________________________________________
|
|
|
| Contact information: | This site is supported by: | |
|
|
|
![]() |
||