F-type lectins - sequence alignment

Regions of secondary structure are indicated above the alignment and coloured to correspond with the structure of AAA, left.  310 helices (a1-a4) are highlighted in red.  Beta-strands in the large beta-sheet (b2-4, 6, 7, 9, 10) are highlighted in blue and those in the small beta-sheet (b5, 8, 11) in yellow.  The loops which encircle the ligand-binding site (1-5) are highlighted in grey.  Cysteine residues are highlighted in yellow with the disulphide pairings (1-3) indicated above the alignment.  Selected residues that are conserved in all or nearly all F-type domains are highlighted in green where identical and in cyan where the character of the residue is preserved.  Residues contributing main chain or side chain oxygen atoms to coordinate Ca2+ in AAA and SP2159, and the conserved equivalents in other proteins, are highlighted in orange.  Residues involved in fucose binding in AAA and SP2159, and the conserved equivalents in other proteins, are highlighted in magenta (hydrogen bonds to the ring hydroxyl groups or ring oxygen) and pink (forming the hydrophobic pocket binding the methyl group).  Individual F-type domains within a single polypeptide are indicated a, b, c etc.

Vertebrates
Anguilla anguilla (AA) Agglutinin Q7SIC1 Morone saxatilis (MS) FBP32 Q2LK81 FBP32II Q2LK88 Oncorhynchus mykiss (OM) FBPL4 Q2LK91 Fundulus heteroclitus (FH) fucolectin Q654Q0  Xenopus laevis (XL) PXN-FBPL Q2LK77 (domain e = PXN1 P49263) II-FBPL Q2LK76 X-epilectin Q64GD1 Monodelphis domestica (MD) 24867 17963
 
Invertebrates
Strongylocentrotus pupuratus (SP) CRL Q6RSH4 788755 XP_788755 783458 XP_783458 Drosophila melanogaster (DM) furrowed P91658 CG9095 Q9VXX7 Caenorhabditis elegans (CE) C54G4.4 Q18849
 
Prokaryotes
Acidiphilium cryptum JF-5 (AC) Q2DD02 Streptococcus pneumoniae TIGR4 (SP) SP_2159 Q97N96 SP_0833 Q97RI2
Saccharophagus degradans 2-40 (SD) Sde_3709 Q21EB5 Solibacter usitatus Ellin 6076 (SU) 6955 Q449Z0
Gluconobacter oxydans 621H (GO) GOX0967 Q5FSA7 GOX0982 Q5FS92
Structure of Anguilla anguilla agglutinin with bound fucose

Beta-strands in the three-stranded sheet are coloured yellow and those in the five-stranded sheet are coloured blue.   310 helices are coloured red.  The fucose ligand is shown in purple and the Ca2+ ion in blue.  Protein Data Bank structure ID: 1K12.

                     a1  b2       1     a2   a3      2  a4         3         
                                                              1                              
AA Agglutinin     NVAVRGKATQSAQLRGEHA---ANSEASNAIDGNRDSNFYHG-SCTHSSGQANPW
MS FBP32    a     NVALRGKATQSARYLHTH------GAAYNAIDGNRNSDFEAG-SCTHTIEQTNPW
            b     NLALRGKATQSSLFES--------GIAYNAIDGNQANNWEMA-SCTHTKNTMNPW
MS FBP32II  a     NVALRGKATQSARYVHTF------GAAYNAIDGNSESDFHAG-SCTHTAEQTNPW
            b     NLALQGKATQSSLYGL--------GIAYNAIDGNRASSWNQP-SCTHTNNDINPW
OM FBPL4    a     NVALRGKAAQSSTSYG--------GTAKRAIDGIWNPTYEYL-SCSHTSGETSPW
            b     NVALKKTTRQSSQYSHM-------GGSNNAVDGRRLSMYKDK-SCSRTKSQVNPW
            c     NAATGGIANQSSQWDMF-------GDANNAIDLSWSNRYLEG-SCSHTKAEVDPW
            d     NLVSGGIEVHSSQYDSH-------GAASNATDRKRNPLYHAG-SCSHTEAETNPW
FH Fucolectin     NVALRGKATQAQRYKGD---WDVFGAASNAIDGNTNPNFKDG-SCSHTASQTNPW
XL PXN-FBPL a     NVALDGITSQSSTMAYY-------GNSRHANDGSLANNYLRS-QCSYTKKEADPW
            b     NLAVKGIAQQSSLYNMY-------GEPKNANDGSLASNYFFL-ECASTSEQEDPW
            c     NLAFRGISSQSSTYDNL-------GKAENAIDGSTSTKYMST-HCSHTDLDIEPW
            d     NVALHGAAYQSSTAGE--------ANAKNAVDGKLQNQNPAK-QCAQTTVETDPW
            e     NVAPQGIPYQSSYYGQK-------EQAKRVIDGSLASNYMEG-DCCHTEKQMHPW
XL II-FBPL  a     NVARLGIASQSSTYVHD-----PMPGPERALDGNNKVNAMIH-PCSHTYNDFEPW
            b     NLARLGDATQSSTYRPE-------YNAGAAIDGNKVTNMMLG-SCSHTNNDNPAW
            c     NLARLGDATQSSTYRPE-------YNAGAAIDGNKGTNMMLG-SCSHTNNDNPAW
            d     NVARWGSVSQSSTYRPE-------YSAETAIDGDKETNIFMH-PCAHTNPDNPAW
XL epilectin      NLARSGGVKQSSTYAPQ-------YTVDKAIDGIKNTNTFVQ-ACAITGYDKNAW
MD 24867    a              QSSVFIPS-------QFPEKPMDGSIQSG-----HCVQTLQETDPW
            b     MLSKGKPAFQSSISNPL-------GSPERAVDGSLLADFEKG-TCIQTQREVNPW
            c     MVSRGKPAFQSSVYSAL-------GSPERAVDGSLLSDFNKG-SCIQTYPENDPW
MD 17963    a                                           FGKS-SCIQTLQESNPW
            b     SSPLPGSISQSSIFDPS-------GSPERAVDGSLESDFWRG-SCIQTHQETGPW
            c     LLSQGMNVFQSSTSSLL-------GSPERAIDGCLAGNFSKG-SCIQTLSEYEPW
SP CRL         ...NIILDSHTNVSQTPPKRN--QALIGNAWLARDGNTDSAPQY---CSRTKVTNNPW
SP 788755      ...NKLLDPHTQTSQTPTKNKNNRAVVSNAWLARDGNTDSAPQF---CSRTPLTYHPW
SP 783458      ...PVSRGKPAFQSSTHTSG----GIMANAGLAVDGNPNPSFDQG-SCSRTQSRANSW
DM furrowed    ...NVAYRKPVNQSSYTRSG-------PASY-ANDGKPGNKNPDGQECSETQKEPSPW
DM CG9095      ...NVAAGKAPMQISTDGAG-------APQK-AIDGSTSAFFTPE-TCSLTKAERSPW
CE C54G4.4        DVAYNKPVTQSSGNVAM----------------SLGGT-----MCTMTNDESKSF
AC             ...LISNHCRAIQSTTSEWS-----CVRDLEQDASGALNGVIDGR-AKFHTDLEDAPW
SP 2159     a  ...NIAYAKPTTQSSVDYNG--------DPNRAVDGNRNGNFNSG-SVTHTRADNPSW
            b     VVSTNKVATQSSTNYEG--------VAALAVDGNKDGDYGHH-SVTHTKADSNAW
            c     NIALTKETRQSSTDYNG--------FSRLAVDGNKNGDYGHH-SVTHTKEDSPSW
SP 0833        ...NIASGKQVTQSSTAFGG--------DARRAVDGKVDGNYGHN-SVTHTNFQSKPW
SD 3709        ...NIALGKATSQSSTGYEG--------VSSRAVDGNTNGNWNQG-SITHTNNEYQPW
SU 6955        ...NLASGKQASQSSTFNAG-------SGAEKAIDGNVDGNSADG-STTQTNSEANAW
GO 0967        ...PPSRWGNISQGMTATQISTVHAGTVEEDARRVLSGNLCGK---CQNHTALEHNPW
GO 0982        ...LISDEGKADQSSVCEYSFEKHNTAVEAQRAVTEPLQMA-----PAFHTGFEKNPW
 
                   b3     b4   b5         4         b6              b7               
                                        22                          3          
AA Agglutinin     WRVDLLQVYTITSVTITNRG-DCCGER--ISGAEINIGQHLASNGVNNPECSVI---GSMATGETK
MS FBP32    a     WRVDLLEPYIVTSITITNRG-DCCPER--LNGVEIHIGNSIQENGVANPRVGVI---SHIPAGISH
            b     WRMDLSKTHRVFSVKVTNR--DSFEKR--INGAEIRIGDSLDNNGNNNPRCAVI---TSIPAGAST
MS FBP32II  a     WRVDLLEPYIVTSITITNRG-DCCAER--LDGLQIHIGNSLQNNSLENPMVGTI---AEIGAAKSF
            b     WRLSLPKTHRVFSVKVTNR--DEVEER--INGAEIRIGDCLDNNGNNNPRCAVI---PSIPASATA
OM FBPL4    a     WRVDLLETYQVTSVTITNR--DTLAER--INGAEIRIGNSLENDGNSNPRCALI---SSIPAGGST
            b     WRVDLQRAYNVTSITVTNIE-DVDPEM--IDGAEIHIGNSLQNNGNNNPLCAVI---SSYPAWEVM
            c     WRVDLSKTHNVTYVTITNRG-DCCSDR--ISGAEIHVGDSLFNNGNSNPLCARI---PYIPAGQSR
            d     WRVDLLDTYQVTFVTITNRG-DCCLHK--INGAEIRIGNSLENNGTTNPLCAVI---SEMREGQPM
FH Fucolectin     WRVDLLDSYTITHIIITNRG-DCCHDR--INGATIHIGNSLTLNGAANPSVAAI---SEIPSATS-
XL PXN-FBPL a     WMVDLQKPYQILSVAVTNRVLECCKER--LFNAEIHIGNDPKQGGKLNPRCGVI---SSIESGETL
            b     WMVDLKASHRVYTVAVTNRG-DCCAEK--INNAEIRIGDSNDAGGQQNPVCGII---KSMANGETL
            c     WKVDLINTYNVTEVQITNRG-DCCNNR--INGAEIRIGTAPEKGGTKNPRCAKI---ATMALGESA
            d     WTVDLTSIHKVFSIAVTNRG-DCCSEG--LDGAEIHLGDSAFSW-KKNPVCGTV---SKIGPGETF
            e     WQLDMKSKMRVHSVAITNRG-DCCRER--INGAEIRIGNSKKEGGLNSTRCGVV---FKMNYEETL
XL II-FBPL  a     WRVDLKKTYAVNSVVIVNRM-DCCSER--LEGAQVRIGNSAD---NNNPICGTI---SDAS-QATI
            b     WRLDLKKRYKVDKVVIVNRG-DCCAER--LLGAQIHIGNSAN---NNNPICGGI---NSVS-EATI
            c     WRLDLKKRYKVDKVVIVNRG-DCCDER--LLGAQILIGNSAN---NNNPICGGI---DSVS-EATI
            d     WQLDLKTAYMIESVVIVNRG-DCCSER--LLGAQIRVGNSPF---HNNPVCGTI---TDVS-ETTI
XL epilectin      WQVDLKNSYKVGSVVIVNRG-DCCADR--LKGAQIRVGNSAD---NNNPVCATV---TDVS-QLTI
MD 24867    a     WMVDLGSTQTIGSVSITNRK-DCCQEQ--IKGTMILVGDSPY----QGGKCASI---PFLDLGIKH
            b     WMVDLGCTQTVKSVAITPRK-DCCTEE--LRGALILVGDSPDGGGTLNARCAVI---GFLDREKTE
            c     WMVDLGSPYTVDSVSITSRR-DCCSDL--MNGAMILIGDSQYMNGRMNPRCAMV---PAMRPGKTE
MD 17963    a     WMVDLGSISSVALVSLTNRN-DCCQEQ--INGAEILVGDSSFQGGKFNPKCAII---SSMPPRKTV
            b     WMIDLESTQPVATVAITNRR-DCCHEK--INGAEILVGDSFDKGGKSNPRCARI---FALGPGKTE
            c     WMVDLDS-YLVDYVAITNRK-DCCHEQL--NGAVILVGDSGISGGKFNPRCATI---SSLDAGKTE
SP CRL            WKAVMKDIYNFTDVKIYNRL-DREERRYDLEGAEIRVGLNDN--YTTNLLCGEPVTRRQIESSTRANHGI
SP 788755         WKAVLKNIYNITLVKIYNRL-DREDHRFDLVGAEIRVGVNED--YTTNLLCGEPVTRRQIENSTEANHGI
SP 783458         WYVDLEAAYDVTTVVIVNK--DVQGERLR--GAMVTIGSSATDPSTRTQ-CGRITRTMTNAASRTPHQRI
DM furrowed       WRVDLLTPQAVHVVRITTRG--CCGHQP-LQDLEIRVGNSSADL-QRNPLCAWYPGTLDEGVVKTF
DM CG9095         WYVNLLEPYMVQLVRLDF-GKSCCGNKPATI--VVRVGNNRPDL-GTNPICNRFTGLLEAGQPLFL
CE C54G4.4        WEVDLLGDYSIRSISMRL-GTKSS---PIVSVEAIETG-------GAVHQCIVD--SSLFTINTTT
AC                WRVDLGVVHGIREVRLFNRM-DQPAVAERANRIAIDIGFDPEHFIEVFRRESDEPFGGVDGNPLVF
SP 2159     a     WEVDLKKMDKVGLVKIYNRT-DAETQRLSNFDVILYDNNRNEVAKKHVNNLSGE----SVSLDFKE
            b     WQVDLGEEFTVSKVDIYNRT-DAEPQRLSNFDVIFLSSSGEEVFRRHFDKVVDG----LLSLKVPS
            c     WEIDLAQTEELEKLIIYNRT-DAEIQRLSNFDIIIYDSNDYEVFTQHIDSLESN----NLSIDLKG
SP 0833           WQVDLAKEETIRQINIYNRT-DTAQDRLANFDVILLDSSGKEIE.
SD 3709           WQVDLGSVRSIDQVNLWNRT-NCCSSRLSAFYVLVSDVPFTSQTLSGALSQAGVSAYYFNDTAGSP
SU DUF11          WQVDLGASATVSSITIWNRT-DCCGSRLGDYWVFLSDTPFAAGDTPATLQSRVGTWSSHQIVAPNP
GO 0967           WEIDLESICLVHEMRLFNRL-DGVPERVANFVLQGSLDNQEWFVMTRKNDGIIYGGADGHPYIWID...
GO 0982           WSLELKERAHIKQIVIFNRC-DVEEYARRFNKFSILKSEDGLSWEEFYEKTNTNYVGGEFGEPLQIE
 
                  b8         b9   b10    5             b11           
                     3                             1                    
AA Agglutinin     TFHCP----APMIGRYVVTYLPTSES----LHLCEVEVNVDKPAAA 
MS FBP32    a     TISFT----ERVEGRYVTVLLPGTNK---VLTLCEVEVHGYRAPTGE
            b     EFQCN-----GMDGRYVNIVIPGREE---YLTLCEVEVYGSVLD
MS FBP32II  a     NLPLS----DRPEGRYVTLVLPGSKR---ILTLCEVEVYGYRAPTSE
            b     EFQCN-----GMDGRYINIVIPGRRE---YLTLCEVEVYGSVLD
OM FBPL4    a     TFQCH-----GMRGQYVNVFLRGYMQ---YLTLCEVEVNAHPAPIEMGPTQASELNLIVPVTD
            b     TFQCS-----GIEGRYVNVFLPGCNK---HLSLCEVEVNVGSRPGEEQTLYDMDEHLSSDRRVQDMKRGQDILCPDHKTRYE
            c     TFPCG-----GISGRYVNILLPGKEK---YLTLCEVEVQASTFQAGLPSTAHKTAVTAPNRNLAWLWDLLIPLGRTAIRKEETYNQN(...)
            d     DIPCN------MEGHYVTIVLPGREK---YLALCEVEVYGGK
FH Fucolectin     HRIDI---SDPKEGRYVTIMIPGSDK---ILTLCEVEVYGYRTPTGENLSLQGKATQSSLFEFGFAYNAIDGNRNNEWSKAPC
XL PXN-FBPL a     SFSCQ-----GMVGQYVTITLPGKEE---HLILCEVQVFGLPVSSSDDVEVTAPKYLTTPNGAP
            b     SFECN-----GMQGQYVTVFIPGNKT---SLTICEVQVFGLSSEAPDYTGIYVVSKDDSFHLADIFINFFGLWSSDQEYDYDLPTVATRTDE
            c     TFSC------GMVGRYVTVTIPGRAA---YLTLCEVKAFGHEISGNYTNNPSSPDSEEIEEQQAATELRNILKHSDAAS
            d     SFECN-----GMEGRYVTIVLLGNEK---SLTLCEVQVFGLTVETPNGERNGDFEQQKENHGAK
            e     SFNCK-----ELEGRYVTVTIPDRME---YLTLCEVQVFADPL...
XL II-FBPL  a     TLFCN-----GMVGRYLSVVISGRQE---FLTLCEVEVYGQESDDKD
            b     TLSCH-----GMEGQYVSVVIPGRAE---NLQLCEVEVYGQEVKCVAD
            c     TLSCH-----GMEGQYVSVVIPGRAE---NLQLCEVEVYGQEVKAITAV
            d     TLSCH-----KMEGRYVSVVIPGRAE---YLHICEVEVYGVKI
XL epilectin      NMCCK-----GMVGQYVSVVIPGRNE---YLQLCEVEVYGEENKPEEKPEEKQLCW
MD 24867    a     VVSCE-----GMKGRYVTIIHPGREK---VLSFCDVKVFGKIWNSDKVFPDELELS
            b     TFRCG-----LIHGRYVSVLNPGKEK---MISLCEVRVFGKASKFPPLDNLPI
            c     SFSCG-----SMKGRYVTITIPGRGK---LLTMCEVQVFGKAF
MD 17963    a     FFDCG-----VLKGRYVTIIIPGRRK---TLTLCKVQILGSDPKASL
            b     SFNCG-----SLRGRYVTITIPGRGK---LLTFCEVQILSEMTQSLPAP
            c     TFFCG-----SMNGRYVTVSIPGKNT---FLTFCEVQVFGKL
SP CRL            PIRCVGEGVTGIRGNVISVHIPTITNKKRELSLCEVEVYQQGL...
SP 788755         TIICEREGITGIRGNIISVHIPKI-NERRELSLCEVEVYQQGE...
SP 783458         RIDCP----SPIRGQYVHIGSPRKD----YLSFCEVEVYGRPSSPPTPAPEPTVVCTAAEFECASGRWV
DM furrowed       --TCA----RPLVGQYVAIQLVGVEG---SLSLCEVETFTNDE...
DM CG9095         --PCN----PPMPGAFVSVHLENSTPN--PLSICEAFVYTDQA...
CE C54G4.4        SISCS-------YDNISRLRITATR----RLHLCQVNVYAVNA...
AC                --APS----IPIPGRFVRVRLLERN----YLHLDQVEVYGDVLAQFG
SP 2159     a     -----------KGARYIKVKLLTSGV---PLSLAEVEVFRESDGKQSEEDIDKITEDK
            b     -----------VGAKLVKIELKSAAI---PLSLAEVEVYGSKRTPKKLS
            c     -----------LKGKKVRISLRSAGI---PLSLAEVEVYTYK
SD 3709           TEINI-----DRTGRYVRVQL-SGTN---PLSLAEVEVIEGS...
SU DUF11          STMIS----GAGQGRYLRIQL-SGTD---YLSLAEVQVAGNW...
GO 0982           TDIIA---------RYIKIVL-NGTS---FLHLSKVNIYGDYV

 

____________________________________________________________________________________________________________

This page last updated:
Wednesday, 04 October 2006
Animal lectins home
Contact information: This site is supported by:
 
Kurt Drickamer
Division of Molecular Biosciences
Faculty of Natural Sciences
Imperial College London
 
Email: k.drickamer@imperial.ac.uk