Sugar-binding F-box proteins - sequence alignment

InterPro entries: (1) F-box; (2) F-box associated domain
Sequence alignment of Fbs-type F-box associated domains in F-box proteins.  The domain is also found in some polypeptides without F-boxes, but those domains are not included in the alignment.  Proteins containing an F-box together with an Fbs-type F-box associated domain are found in a range of vertebrates.  Although elements of the glycosylation and protein targeting machinery are conserved in lower eukaryotes, the F-box associated domain is not present in proteins from yeast, slime mold or fruit fly.  Surprisingly, one protein with an F-box domain in the relevant architectural context is present in the invertebrate C. elegans.  The alignment includes all human and murine proteins, plus a selection of other vertebrate proteins and the example from C. elegans.
 
Regions of secondary structure are indicated above the alignment and coloured to correspond with the structure of murine Fbs1/Fbx2 (Protein Data Bank ID 1UMI).  Alpha-helices (a1, a2) are highlighted in red, beta-strands in one beta-sheet (b1, b4, b6, b7, b9) in yellow, and beta-strands in the opposite beta-sheet (b2, b3, b5, b8, b10) in blue.  Residues that are identical in over 75% of the aligned sequences are highlighted in green.  Residues that are similar in over 75% of the sequences are highlighted in cyan.  Residues that interact with chitobiose in the Fbs1 structure are highlighted in magenta.  The equivalent residues in other proteins are highlighted in magenta if identical and in pink if the character of the residue is conserved.
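The 75% identity rule described above can be illustrated with a short Python sketch.  This is not the procedure used to prepare the alignment on this page; it is a minimal example under stated assumptions.  The function name conserved_columns, the treatment of gap characters, and the choice to count gaps against the threshold are all assumptions made for illustration.  The example rows are the first ten residues of five sequences from the final alignment block (HFbx2, HFbx17, HFbx6, HFbx44, FFbx).

```python
from collections import Counter


def conserved_columns(rows, threshold=0.75, gap_chars="-. "):
    """Return {column index: residue} for columns in which a single
    residue occurs in more than `threshold` of all aligned rows.

    Assumption: gap characters never count as a residue, but rows with
    a gap still count in the denominator, so gaps lower the fraction.
    """
    width = max(len(row) for row in rows)
    # Pad short rows so that every row has the same number of columns.
    padded = [row.ljust(width) for row in rows]
    conserved = {}
    for col in range(width):
        counts = Counter(
            row[col] for row in padded if row[col] not in gap_chars
        )
        if not counts:
            continue  # column contains only gaps
        residue, n = counts.most_common(1)[0]
        if n / len(rows) > threshold:
            conserved[col] = residue
    return conserved


# First ten residues of five rows from the last alignment block.
example_rows = [
    "DYGPGVRFVR",  # HFbx2
    "NFGKGIRYVS",  # HFbx17
    "DYPRGVRYIL",  # HFbx6
    "NYPPGVRYIW",  # HFbx44
    "NYGPGARYIR",  # FFbx
]

if __name__ == "__main__":
    for col, residue in sorted(conserved_columns(example_rows).items()):
        print(col, residue)
```

With only five rows, a residue must appear in at least four of them to exceed 75%.  The "similar" (cyan) category would need a residue grouping scheme, which this page does not specify, so it is left out of the sketch.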

Accession numbers
 
Fbx2 (Fbs1): human Q9UK22 Q5TGY0, mouse Q80UW2, X. tropicalis 17406, G. gallus 04657.
Fbx6b (Fbs2): human Q9NRD1, mouse Q9QZN4.
Fbx17 (Fbx26 Fbg4): human Q96EF6, mouse Q9QZM8, X. tropicalis 05327.
Fbx27 (Fbg5): human Q8NI29, mouse Q6DIA9.
Fbx44 (Fbx30 Fbx6a Fbg3): human Q9H4M3, mouse Q8BK26, X. tropicalis 27482, 25469 (novel), G. gallus 04646.
Fugu 153187; Tetraodon Q4S5T2; C. elegans C14B1.3 Q17962.
 
 
Structure of murine Fbs1/Fbx2 with bound GlcNAc disaccharide

The sugar ligand is shown in dark blue.  Protein Data Bank structure ID: 1UMI.

 
           a1                     b1        b2                      b3         b4        
 
HFbx2    FYFLSTRRRNLLRNPCGEEDLEGWCDVEHGGDGWRVEEL----PGDSGVEFTHDDSVKKYFASSFEWC-RKAQVIDLQAEGYW
MFbx2    FYFLSKRRRNLLRNPCGEEDLEGWSDVEHGGDGWKVEEL----PGDNGVEFTQDDSVKKYFASSFEWC-RKAQVIDLQAEGYW
GFbx2    FYFLSKRRRNLIKNPCGEEDLQHWGEVENGGDGWKIEEL----PGDFGKEF-PSEEVHKYFVTSYEWC-RKSQVIDLRAEGYW
XFbx2    VYFLNKRKRNLLKNNSGEEEFDYWEDLNYGGDGWKIEDL----PGDNGNDF-PFEGIKKYFATSFELCL-TAACPDLLPV-FW
XFbx17   LYLKKPFLRNLIRNPCGTEGLQHW-ESTDGGDGWKVEDNHFPLEVADSQTS---------FVTSFRWCK-KTQDVDLLKEGLW
HFbx17   YCLRAPFGRNLIFNSCGEQGFRGW-EVEHGGNGWAIEKNLT-VPGAPSQTC---------FVTSFEWCSPKRQLVDLVMEGVW
MFbx17   FCLRAPFGRNLIHNSCGEQGFRGW-EVEHGGNGWAVEKNLTLVPGAPSQTC---------FVTSFEWCS-KRQLVDLVKEGVW
HFbx27   FCARRPIGRNLIRNPCGQEGLRKW-MVQHGGDGWVVEENRTTVPGAPSQTC---------FVTSFSWCC-KKQVLDLEEEGLW
MFbx27   FCALRPLGRNLISNPCGQ-GLRKW-MVRHGGDGWVVEKNRKPVPGAPSQTC---------FVTSFSWC-RKKQVVDLVEKGLW
HFbx6    FYFLRSLHRNLLRNPCAEEDMFAWQIDFNGGDRWKVESL----PGAHGTDF-PDPKVKKYFVTSFEMCL-KSQLVDLVAEGYW
MFbx6    FYILCSLQRNLLRNPCAEENLSSWRIDSNGGDRWKVETL----PGSCGTSF-PDNKVKKYFVTSFEMCL-KSQMVDLKAEGYC
HFbx44   FYFLRSLHRNLLHNPCAEEGFEFWSLDVNGGDEWKVEDL----SRDQRKEF-PNDQVKKYFVTSYYTCL-KSQVVDLKAEGYW
MFbx44   FYFLRSLQRNLLHNPCAEEGFEFWSLDVNGGDEWKVEDL----SKDQRKEF-PNDQVKKYFVTSYYTCL-KSQVVDLKAEGYW
GFbx44   FYLLCKLKRNLIKNPRAEESFKHWKLDQNEGDKWKIEDL----PGPLGKEL-PDSEVRKYFVTSFGPCF-KSQLITLKKEGYW
XFbx44   FYHISSLKRNLLQNPQAEDSFKSWKIEQNGGDRWNIEDL----PGDCGQPF-PDEHIKKYFVTSYAECK-KSQLIQLKKMGYQ
XFbxN    FYHISSLKRNLLQNPQAEDSFKSWKIEQNGGDRWNIEDL----PGDCGQPF-PDEHIKKYFVTSYAFC---------------
FFbx     FYILSKKRRNLIKNPRGDNEMKFWEIIANGGDRWSAEGLLFPH---------PNEKIQKNFVTSYQRCL-KSQLIDLAEEGYS
TFbx     FYFLCKKRRNLIKNPRGDNAMKFWEIIANGGDRWNPEGLLFAH---------PNEEVKNNFVTSYQKCV-KSQLIDLVNEGYS
C14B1.3                                                   ...PHPDVSKCFVFSFTE-SSISVFIDLVNSGID

 
           a2        b5               b6         b7                                    b8 
 
HFbx2    EELLDTTQPAIVVKDWYSGRSDAGCLYELTVKLLSEHENVLA-EF--------------SSGQVAVPQDSDGGGWMEISHTFT
MFbx2    EELLDTTQPAIVVKDWYSGRTDAGSLYELTVRLLSENEDVLA-EF--------------ATGQVAVP--ED-GSWMEISHTFI
GFbx2    EELMDTTQPKVVVKDWYAGRSDAGCLYELCVKLLSENEDVLA-EY--------------KTETIAIPQDND-ASWTEISYTFS
XFbx2    ILPLPAVLHSVCIFVVYAARSDSGCLYELCVQLLSDNKDIIT-EY--------------KSEIITIPQFSD-ASWNQINHTFS
XFbx17   EDLLDNQQPPICISDWYAGRCDCGCVYEIKVQLLSKDRKHCIDEF--------------TASPDPIPQWNN-GIYHQVSHVFH
HFbx17   QELLDSAQIEICVADWWGARENCGCVYQLRVRLLDVYEKEVV-KF--------------SASPDPVLQWTE-RGCRQVSHVFT
MFbx17   QELLDSGQIEICIADWWGARENCGCIYRLRVRLLDEYENEVV-KF--------------SASPNPVLQWTE-RSCRQVSHVFT
HFbx27   PELLDSGRIEICVSDWWGARHDSGCMYRLLVQLLDANQTV-LDKF--------------SAVPDPIPQWNN-NACLHVTHVFS
MFbx27   PELLDSGGVEIAVSDWWGARHDSGCKYRLFVTLLDAHQNV-IDKF--------------SAVPDPIEQWNN-DIYLQVTHVFS
HFbx6    EELLDTFRPDIVVKDWFAARADCGCTYQLKVQLASADYFVLA-SF--------------EPPPVTIQQWNN-ATWTEVSYTFS
MFbx6    EELMDTFRPDIVVKDWVAPRADCGCTYQLRVQLASADYIVLA-SF--------------EPPPVTFQQWND-AKWQEISHTFS
HFbx44   EELMDTTRPDIEVKDWFAARPDCGSKYQLCVQLLSSAHAPLG-TF--------------QPDPATIQQKSD-AKWREVSHTFS
MFbx44   EELMDTTRPDIEVKDWFAARPDCGSKYQLCVQLLSSAHAPLG-TF--------------QPDPVMIQQKSD-AKWSEVSHTFS
GFbx44   NELMDEKRPEIVVKDWYAARFDCGCRYELRVRLLSENYIVLD-EF--------------CPEPVVIEQWSD-AMWREISHTFS
XFbx44   DKLMDTVQPDIVIEDWYARRWDCGSTYEIVVQLLSKHKKVL-KEF--------------RPNLVRMESHSE-TYWQQMKHIFY
XFbxN    ---------WFFLKDKY-----------------TLHKKFN--QR--------------TASLIYLNSHSGHLKVNKMKHIFY
FFbx     PSLMDTFQPDIRVSDWYAPRWDCGSEYEIRVQLLDGKSNPMK-TF--------------APRQIYFEQWND-QKWHQITHVFQ
TFbx     PSFLDDFQPDVRVSDWYAPRWDCGSEYEINVQLLNENRHPIQ-TF--------------APSKIYFEQWND-QKWQQITHVFQ
C14B1.3  PWILDHVRPRIRITQKVNHRNDCAARLSFAAQLNYHETQWIE-RFGHVQTMSNTDHKRYKSVNKEWAQWTG-QPWEDWTIEFD

 

                     b9              b10     
 
HFbx2    DYGPGVRFVRFEHGGQDSVYWKGWFGARVTNSSVWVEP
MFbx2    DYGPGVRFVRFEHGGQDSVYWKGWFGARVTNSSVWVEP
GFbx2    DYGPGVRFVRFEHGGQDTLFWKGWYGVRVTNSSVTVEP
XFbx2    GYGPGVRFIRFQHGGQDSVFWKGWYGVRVTNSSVTIQP
XFbx17   GYGPGVRFVRFFHMGKDTQFWKGWYGSRITNSSVIVRIKKY...
HFbx17   NFGKGIRYVSFEQYGRDVSSWVGHYGALVTHSSVRVRIRLS
MFbx17   NFGKGIRYVSFEQYGRDTRSWVGHYGALVTHSSVRVRIRLS
HFbx27   NIKMGVRFVSFEHRGQDTQFWAGHYGARVTNSSVIVRVRLS
MFbx27   GIRRGIRFVSFEHWGQDTQFWAGHYGARVTNSSVIIRVCQS
HFbx6    DYPRGVRYILFQHGGRDTQYWAGWYGPRVTNSSIVVSPKMT...
MFbx6    DYPPGVRHILFQHGGQDTQFWKGWYGPRVTNSSIIISHRTA...
HFbx44   NYPPGVRYIWFQHGGVDTHYWAGWYGPRVTNSSITIRPPLP
MFbx44   NYPPGVRYIWFQHGGVDTHYWAGWYGPRVTNSSITIGPPLP
GFbx44   NYPAGVRYIWFQHGGQDTQFWAGWYGIRVTNSSITI
XFbx44   NYGPGVRYI
XFbxN    NYGPGVRYIYFQHSGQDTKYWGGWYGVRVTNSSVTIEPGNLD
FFbx     NYGPGARYIRFTHGGKDTQFWAGWYGIRVTDSCVEICPE
TFbx     NYGPGARYVQFTHGGKDRQFWAGWYGIRVTDSRVEIC
C14B1.3  DYPSGIRHLTILNEGEDRQFWRGFYGPKVANIQV...
 

___________________________________________________________________________________________________________________________
This page last updated:
Tuesday, 19 September 2006
Contact information:
 
Kurt Drickamer
Division of Molecular Biosciences
Faculty of Natural Sciences
Imperial College London
 
Email: k.drickamer@imperial.ac.uk