Group XIV - Endosialin family

Human sequence alignment
Joint human/mouse sequence alignment
Introduction
Mammalian proteins containing CTLDs
Introduction  Domain organization  Sequence alignments  Human database  Mouse database  Joint human/mouse database

If alignments appear scrambled please maximize the width of your browser window.

 
 
SIGNAL SEQUENCE
   
Endosialin        MLLRLLLAWAAAGPTLG
Thrombo.          MLGVLVLGALALAGLGFPAPA
C1qRp             MATSMGLLLLLLLLLTQPGAG
Clec14A           MRPAFALCLLWQALWPGPGGG

 

CTLD

Endosialin        QDPWAAEPRAAC-GPSSCYALFPRRRT----FLEAWRACRELGGDLATPRTPEEAQRVDSLV--------GAGPASRLLWIGLQRQARQCQLQR---PLRG
Thrombo.             EPQPGGSQC-VEHDCFALYPGPAT----FLNASQICDGLRGHLMTVRSSVAADVISLLL----NGDGGVG--RRRLWIGLQLPPG-CGDPKRLGPLRG
C1qRp               TGADTEAVVC-VGTACYTAHSGKLS----AAEAQNHCNQNGGNLATVKSKEEAQHVQRVLAQLLRREAALTARMSKFWIGLQREKGKCLDPS-L-PLKG
Clec14A             EHPTADRAGCSASGACYSLHHATMK----RQAAEEACILRGGALSTVRAGAELRAVLALLRAGPGP--GGGSKDLLFWVALERRRSHCTLENE--PLRG
 
Endosialin        FTWTTGDQ---DTAFTNWAQPASGG-----PCPAQRCVALE--------ASGEHRWLEGSCTLA------VDGYLCQF
Thrombo.          FQWVTGDN---NTSYSRWARLDLNG---APLCG-PLCVAVS---AAEATVPSEPIWEEQQCEVK------ADGFLCEF
C1qRp             FSWVGGGE---DTPYSNWHKELRNS------CISKRCVSLLLDLSQPLLPSRLPKWSEGPCGSPGSPGSNIEGFVCKF
Clec14A           FSWLSSDPGGLESDTLQWV-EEPQR-----SCTARRCAVLQ-----ATGGVEPAGWKEMRCHLR------ANGYLCKY
 
CONSERVED CORE RESIDUES    Ca SITE 1 RESIDUES    POTENTIAL Ca SITE 2 RESIDUES

 

SUSHI-LIKE DOMAIN
 
Endosialin        GFEGACPALQDEAGQAGPA----VYTTPFHLVSTEFEWLPFGSVAAVQCQAG-----RGASLLCVKQPEGGVGWSRAGPLCLGTG
Thrombo.          HFPATCRPLAVEP-GAAAAAVSITYGTPFAARGADFQALPVGSSAAVA--------PLGLQLMCTAPPGAVQGHWAREAPGA
C1qRp             SFKGMCRPLALGGPGQ------VTYTTPFQTTSSSLEAVPFASAANVACGEGDKDETQSHYFLCKEKAPDVFDWGSSGPLCVSPK
Clec14A           QFEVLCPAPRPGAASN------LSYRAPFQLHSAALDFSPPGTEVSALC-----RGQLPISVTCIADEIGARWDKLSGDVLCPC

Back to top

 

EGF DOMAINS
 
Endosialin EGF1   --CSPDNGG-CE--HECVE--EVDGHVSCRCTEGFRLAADGR-SCE----
Endosialin EGF2   DPCAQ--AP-CE--QQCEP--GGPQGYSCHCRLGFRPAEDDPHRCVDT--
Endosialin EGF3   DECQI--AGVCQ--QMCV---NYVGGFECYCSEGHELEADG-ISCSPAG-
 
Thrombo.   EGF1   WDCSVENGG-CE--HACN---AIPGAPRCQCPAGAALQADGR-SCTASAT
Thrombo.   EGF2   QSCND----LCE--HFCVPNPDQPGSYSCMCETGYRLAADQ-HRCEDV--
Thrombo.   EGF3   DDCILEPSP-CP--QRCV---NTQGGFECHCYPNYDLVDG---ECVEPV-
Thrombo.   EGF4   DPCFR--AN-CE--YQCQP--LNQTSYLCVCAEGFAPIPHEPHRCQ----
Thrombo.   EGF5   MFCNQ--T-ACP--ADCD---P-NTQASCECPEGYILDDG--FICTDI--
Thrombo.   EGF6   DECEN--GGFCS--GVCH---NLPGTFECICGPDSALARHIGTDCD----
 
C1qRp      EGF1   YGCNFNNGG-CH--QDCFE--GGDGSFLCGCRPGFRLLDDL-VTCASR--
C1qRp      EGF2   NPCSS--SP-CRGGATCVLG-PHGKNYTCRCPQGYQLDSSQ-LDCVDV--
C1qRp      EGF3   DECQD--SP-CA--QECV---NTPGGFRCECWVGYEPGGPGEGACQDV--
C1qRp      EGF4   DECALGRSP-CA--QGCT---NTDGSFHCSCEEGYVLAGEDGTQCQDV--
C1qRp      EGF5   DECVGPGGPLCD--SLCF---NTQGSFHCGCLPGWVLAPNG-VSC-----
 
Clec14A    EGF1   -PGRYLRAGKCAELPNCL---DDLGGFACECATGFEL-GKDGRSCV----
 
CONSERVED RESIDUES

Back to top

   

PROLINE/SERINE/THREONINE-RICH DOMAIN
 
Endosialin    AMGAQASQDLGDELLDDGEDEEDEDEAWKAFNGGWTEMPGILWME
              PTQPPDFALAYRPSFPEDREPQIPYPEPTWPPPLSAPRVPYHSSVLSVTRPVVVSATHPTLPSAHQPPVIPATHPALSR----------------DHQIPVIAANY
              PDLPSAYQPGILSVSHSAQPPAHQPPMISTKYPELFPAHQSPMFPDTRVAGTQTTTHLPGIPPNHAPLVTTLGAQLPPQAPDALVLRTQATQLPIIPTAQPSLTTT
              SRSPVSPAHQISVPAATQPAALPTL-LPSQSPTNQTSPISPTHPHSKAPQIPREDGPSPKLALWLPSPAPTAAPTALGEAGLAEHSQ
 
Thrombo.      TGKVDGGDTGTGEPPPTPTPGTTLTPPAV
 
C1qRp         TMGPVSLGPPSGPPDEEDKGEKEGSTVPRAATASPTRGPEGTPKATPTTSRPSLSSDAPITSAPLKMLAPSGSPGVWREPSIHHATAASGPQEPAGGDSSVATQNNDGT
 
Clec14A       TSGEGQPTLGGTGVPTRRPPATATSPVPQRTWPIRVDEKLGETPLVPEQDNSVTSIPEIPRWGSQSTMSTLQMSLQAESKATITPSGSVISKFNSTTSSATPQAFDSSSA

 

TRANSMEMBRANE DOMAIN and CYTOPLASMIC TAIL
 
Endosialin       RDDRWLLV--ALLVPTCVFLVVLLALGIVYCTRCGPHAPNKRITDCYRWVIHAGSKSPTEPMPPRGSLTGVQTCRTSV
Thrombo.         GLVHSGLL-IGISIASLCLVVALLALLCHLRKKQGAARAKMEYKCAAPSKEVVLQHVRTERTPQRL
C1qRp             DGQKLLLFYILGTVVAILLLLALALGLLVYRKRRAKREEKKEKKPQNAADSYSWVPERAESRAMENQYSPTPGTDC
Clec14A               VVFIFVSTAVVVLVILTMTVLGLVKLCFHESPSSQPRKESMGPPGLESDPEPAALGSSSAHCTNNGVKVGDCDLRDRAEGALLAESPLGSSDA

TRANSMEMBRANE DOMAIN

Back to top

_______________________________________________________________________________________________________________________________
This page last updated:
Thursday, 09 March 2006
Animal lectins home
Contact information: This site is supported by:
 
Kurt Drickamer
Division of Molecular Biosciences
Faculty of Natural Sciences
Imperial College London
 
Email: k.drickamer@imperial.ac.uk