Guide to the Human Genome

Search of human proteins with 55769533

BLASTP 2.2.11 [Jun-05-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= gi|55769533 PRKC, apoptosis, WT1, regulator [Homo sapiens]
         (340 letters)

Database: hs.faa 
           37,866 sequences; 18,247,518 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gi|55769533 PRKC, apoptosis, WT1, regulator [Homo sapiens]            675   0.0  
gi|239751012 PREDICTED: similar to hCG2041351 [Homo sapiens]           61   1e-09
gi|219842214 protein phosphatase 1, regulatory (inhibitor) subun...    58   1e-08
gi|219842212 protein phosphatase 1, regulatory (inhibitor) subun...    58   1e-08
gi|4505317 protein phosphatase 1, regulatory (inhibitor) subunit...    58   1e-08
gi|41349484 proline-rich protein BstNI subfamily 1 isoform 2 pre...    52   6e-07
gi|209571537 proline-rich protein BstNI subfamily 2 [Homo sapiens]     52   8e-07
gi|48762934 alpha 2 type I collagen [Homo sapiens]                     52   1e-06
gi|37537692 proline-rich protein BstNI subfamily 4 precursor [Ho...    52   1e-06
gi|89276751 alpha 1 type V collagen preproprotein [Homo sapiens]       51   1e-06
gi|41349482 proline-rich protein BstNI subfamily 1 isoform 1 pre...    50   4e-06
gi|239754955 PREDICTED: similar to COL22A1 protein [Homo sapiens]      49   5e-06
gi|55742678 glucocorticoid induced transcript 1 [Homo sapiens]         49   8e-06
gi|40805823 collagen, type XXII, alpha 1 [Homo sapiens]                49   8e-06
gi|117938759 GNAS complex locus XLas [Homo sapiens]                    48   1e-05
gi|91208420 bassoon protein [Homo sapiens]                             48   1e-05
gi|122937321 UNC homeobox [Homo sapiens]                               48   1e-05
gi|90819237 myeloid/lymphoid or mixed-lineage leukemia (trithora...    48   1e-05
gi|90819233 myeloid/lymphoid or mixed-lineage leukemia (trithora...    48   1e-05
gi|90819231 myeloid/lymphoid or mixed-lineage leukemia (trithora...    48   1e-05
gi|117306167 proline-rich protein BstNI subfamily 3 precursor [H...    48   1e-05
gi|4502951 collagen type III alpha 1 preproprotein [Homo sapiens]      47   2e-05
gi|148806928 hypothetical protein LOC57482 [Homo sapiens]              47   2e-05
gi|171543895 ataxin 2 [Homo sapiens]                                   47   2e-05
gi|110349772 alpha 1 type I collagen preproprotein [Homo sapiens]      47   3e-05
gi|32483397 dopamine receptor D4 [Homo sapiens]                        46   4e-05
gi|116256445 nuclear receptor co-repressor 2 isoform 2 [Homo sap...    46   4e-05
gi|56550039 myeloid/lymphoid or mixed-lineage leukemia protein [...    46   5e-05
gi|56549131 transmembrane anchor protein 1 isoform 1 [Homo sapiens]    46   5e-05
gi|111118976 collagen, type II, alpha 1 isoform 1 precursor [Hom...    45   7e-05
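
The summary table above has a regular layout: a gi identifier, a description (sometimes truncated with "..."), a bit score, and an E value. As a minimal sketch, assuming that layout holds for every row, the rows could be split into fields as follows. The names SUMMARY_RE and parse_summary_line are hypothetical helpers and are not part of BLAST.

```python
import re

# Hypothetical pattern for one row of the "Sequences producing significant
# alignments" table: gi number, description, bit score, E value.
SUMMARY_RE = re.compile(
    r"^gi\|(?P<gi>\d+)\s+(?P<desc>.*?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)\s*$"
)


def parse_summary_line(line):
    """Return a dict for a summary row, or None if the line is not a row."""
    m = SUMMARY_RE.match(line)
    if m is None:
        return None
    evalue = m.group("evalue")
    # Assumption: legacy reports may print very small E values as "e-100"
    # with no leading digit; prefix "1" so float() accepts them.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return {
        "gi": m.group("gi"),
        "description": m.group("desc"),
        "bits": float(m.group("bits")),
        "evalue": float(evalue),
    }


row = parse_summary_line(
    "gi|48762934 alpha 2 type I collagen [Homo sapiens]"
    "                     52   1e-06"
)
print(row["gi"], row["bits"], row["evalue"])
```

The description is matched lazily so that digits inside it (for example "alpha 2 type I collagen") are not mistaken for the score column.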

>gi|55769533 PRKC, apoptosis, WT1, regulator [Homo sapiens]
          Length = 340

 Score =  675 bits (1741), Expect = 0.0
 Identities = 340/340 (100%), Positives = 340/340 (100%)

Query: 1   MATGGYRTSSGLGGSTTDFLEEWKAKREKMRAKQNPPGPAPPGGGSSDAAGKPPAGALGT 60
           MATGGYRTSSGLGGSTTDFLEEWKAKREKMRAKQNPPGPAPPGGGSSDAAGKPPAGALGT
Sbjct: 1   MATGGYRTSSGLGGSTTDFLEEWKAKREKMRAKQNPPGPAPPGGGSSDAAGKPPAGALGT 60

Query: 61  PAAAAANELNNNLPGGAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAA 120
           PAAAAANELNNNLPGGAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAA
Sbjct: 61  PAAAAANELNNNLPGGAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAA 120

Query: 121 PPPQRDEEEPDGVPEKGKSSGPSARKGKGQIEKRKLREKRRSTGVVNIPAAECLDEYEDD 180
           PPPQRDEEEPDGVPEKGKSSGPSARKGKGQIEKRKLREKRRSTGVVNIPAAECLDEYEDD
Sbjct: 121 PPPQRDEEEPDGVPEKGKSSGPSARKGKGQIEKRKLREKRRSTGVVNIPAAECLDEYEDD 180

Query: 181 EAGQKERKREDAITQQNTIQNEAVNLLDPGSSYLLQEPPRTVSGRYKSTTSVSEEDVSSR 240
           EAGQKERKREDAITQQNTIQNEAVNLLDPGSSYLLQEPPRTVSGRYKSTTSVSEEDVSSR
Sbjct: 181 EAGQKERKREDAITQQNTIQNEAVNLLDPGSSYLLQEPPRTVSGRYKSTTSVSEEDVSSR 240

Query: 241 YSRTDRSGFPRYNRDANVSGTLVSSSTLEKKIEDLEKEVVRERQENLRLVRLMQDKEEMI 300
           YSRTDRSGFPRYNRDANVSGTLVSSSTLEKKIEDLEKEVVRERQENLRLVRLMQDKEEMI
Sbjct: 241 YSRTDRSGFPRYNRDANVSGTLVSSSTLEKKIEDLEKEVVRERQENLRLVRLMQDKEEMI 300

Query: 301 GKLKEEIDLLNRDLDDIEDENEQLKQENKTLLKVVGQLTR 340
           GKLKEEIDLLNRDLDDIEDENEQLKQENKTLLKVVGQLTR
Sbjct: 301 GKLKEEIDLLNRDLDDIEDENEQLKQENKTLLKVVGQLTR 340


>gi|239751012 PREDICTED: similar to hCG2041351 [Homo sapiens]
          Length = 186

 Score = 61.2 bits (147), Expect = 1e-09
 Identities = 54/165 (32%), Positives = 75/165 (45%), Gaps = 35/165 (21%)

Query: 18  DFLEEWKAKREKMRAKQNPPGPAPPG--------------GGSSDAAGKPPAGALGTPAA 63
           DF E+   +R + R +Q  PGP  PG               G   A+ +PP GAL +  A
Sbjct: 15  DFSEQ--RRRLERRRRQVEPGPRGPGMGQQPLQPGSPGRGAGRQRASRQPPCGALTSLQA 72

Query: 64  A---AANELNNNLPGGAPAAPAVPGPGGVNCAV-GSAMLTRAAPGPRRSEDEPPAASASA 119
           A        + +L G   A    P P GVNCAV         +PGP++ ++EP A +   
Sbjct: 73  APQQPPGSAHTSLQGSPLALHLPPPPRGVNCAVCRPGYADPGSPGPQQPDEEPRATARGY 132

Query: 120 APPPQRDEEEPDGVPEKGKSS--GP------SARKGKGQIEKRKL 156
                  E+E DG PEK KSS  GP       A  G+ ++EKR++
Sbjct: 133 -------EKEQDGAPEKCKSSELGPPCQERLGAEDGEMEMEKRQV 170
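
Each alignment record carries a line of the form "Identities = a/b (p%), Positives = ..., Gaps = ...". In the records shown in this report, the printed percentage matches the integer part of 100*a/b (for example 54/165 is printed as 32%, not 33%). The sketch below, with the hypothetical helper parse_hsp_stats, extracts the counts and recomputes the truncated percentage; that truncation is an observation from this output, not a documented guarantee.

```python
import re

# Matches "Identities = 54/165 (32%)" and the analogous Positives/Gaps fields.
PAIR_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_hsp_stats(line):
    """Parse one statistics line into {field: counts and percentages}."""
    stats = {}
    for name, num, den, pct in PAIR_RE.findall(line):
        num, den = int(num), int(den)
        stats[name.lower()] = {
            "count": num,
            "length": den,
            "reported_pct": int(pct),
            # Integer division truncates, mirroring what the report shows.
            "floor_pct": num * 100 // den,
        }
    return stats


line = " Identities = 54/165 (32%), Positives = 75/165 (45%), Gaps = 35/165 (21%)"
for field, values in parse_hsp_stats(line).items():
    print(field, values)
```

Lines without a Gaps field (ungapped alignments) simply yield a dictionary with two entries.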


>gi|219842214 protein phosphatase 1, regulatory (inhibitor) subunit
           12A isoform b [Homo sapiens]
          Length = 943

 Score = 58.2 bits (139), Expect = 1e-08
 Identities = 55/226 (24%), Positives = 100/226 (44%), Gaps = 24/226 (10%)

Query: 127 EEEPDGVPEKGKSSGPSARKGKGQIEKRKLREKRRSTGVVNIPAAECLDEYEDDEAGQKE 186
           E E +G   + +  G    + K   E+R+ REKRRSTGV      +  DE E ++    E
Sbjct: 730 ENEREGEKREEEKEGEDKSQPKSIRERRRPREKRRSTGVSF--WTQDSDENEQEQQSDTE 787

Query: 187 RKREDAITQQNTIQNEAVNLLDPGSSYLLQEPPRTVSGRYKSTTSVSE-EDVSSRYSRTD 245
                  TQ ++I     +    G  Y       ++ GR  S + + E +  SSR  + D
Sbjct: 788 EGSNKKETQTDSISRYETSSTSAGDRY------DSLLGRSGSYSYLEERKPYSSRLEKDD 841

Query: 246 RSGFPRYNRDANVSGTLVSSSTLEKKIE----DLEKEVVRERQENLRLVRLMQDK----- 296
            + F +           + +   +  +E     L+ E   +RQE      L++ +     
Sbjct: 842 STDFKKLYEQILAENEKLKAQLHDTNMELTDLKLQLEKATQRQERFADRSLLEMEKRERR 901

Query: 297 --EEMIGKLKEEIDLLNRDLDDIEDENEQLKQENKTLLKVVGQLTR 340
             E  I +++EE+ +    L D++ +N++LK EN  L++V+ +L++
Sbjct: 902 ALERRISEMEEELKM----LPDLKADNQRLKDENGALIRVISKLSK 943



 Score = 35.4 bits (80), Expect = 0.074
 Identities = 39/190 (20%), Positives = 69/190 (36%), Gaps = 34/190 (17%)

Query: 78  PAAPAVP-GPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEK 136
           P A  +P  P  VN A  +  LT    G   S  E      S   P + +E E      +
Sbjct: 540 PTAVTIPVAPTVVNAAASTTTLTTTTAGTVSSTTEVRERRRSYLTPVRDEESE-----SQ 594

Query: 137 GKSSGPSARKGKGQIEKRKLREKRRSTGVVNIPAAECLDEYEDDEAGQKERKREDAITQQ 196
            K+    AR+ +   +   L + + +   +    +    E E++E  ++E++++D   Q+
Sbjct: 595 RKARSRQARQSRRSTQGVTLTDLQEAEKTIGRSRSTRTREQENEEKEKEEKEKQDKEKQE 654

Query: 197 NTIQNEAVNLLDPGSSYLLQEPPRTVSGRYKSTTSVSEEDVSSRYSRTDRSGFPRYNRDA 256
              ++E                               E++   +YSRT    + RY   +
Sbjct: 655 EKKESETSR----------------------------EDEYKQKYSRTYDETYQRYRPVS 686

Query: 257 NVSGTLVSSS 266
             S T  SSS
Sbjct: 687 TSSSTTPSSS 696


>gi|219842212 protein phosphatase 1, regulatory (inhibitor) subunit
            12A isoform a [Homo sapiens]
          Length = 1030

 Score = 58.2 bits (139), Expect = 1e-08
 Identities = 55/226 (24%), Positives = 100/226 (44%), Gaps = 24/226 (10%)

Query: 127  EEEPDGVPEKGKSSGPSARKGKGQIEKRKLREKRRSTGVVNIPAAECLDEYEDDEAGQKE 186
            E E +G   + +  G    + K   E+R+ REKRRSTGV      +  DE E ++    E
Sbjct: 817  ENEREGEKREEEKEGEDKSQPKSIRERRRPREKRRSTGVSF--WTQDSDENEQEQQSDTE 874

Query: 187  RKREDAITQQNTIQNEAVNLLDPGSSYLLQEPPRTVSGRYKSTTSVSE-EDVSSRYSRTD 245
                   TQ ++I     +    G  Y       ++ GR  S + + E +  SSR  + D
Sbjct: 875  EGSNKKETQTDSISRYETSSTSAGDRY------DSLLGRSGSYSYLEERKPYSSRLEKDD 928

Query: 246  RSGFPRYNRDANVSGTLVSSSTLEKKIE----DLEKEVVRERQENLRLVRLMQDK----- 296
             + F +           + +   +  +E     L+ E   +RQE      L++ +     
Sbjct: 929  STDFKKLYEQILAENEKLKAQLHDTNMELTDLKLQLEKATQRQERFADRSLLEMEKRERR 988

Query: 297  --EEMIGKLKEEIDLLNRDLDDIEDENEQLKQENKTLLKVVGQLTR 340
              E  I +++EE+ +    L D++ +N++LK EN  L++V+ +L++
Sbjct: 989  ALERRISEMEEELKM----LPDLKADNQRLKDENGALIRVISKLSK 1030



 Score = 35.4 bits (80), Expect = 0.074
 Identities = 39/190 (20%), Positives = 69/190 (36%), Gaps = 34/190 (17%)

Query: 78  PAAPAVP-GPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEK 136
           P A  +P  P  VN A  +  LT    G   S  E      S   P + +E E      +
Sbjct: 627 PTAVTIPVAPTVVNAAASTTTLTTTTAGTVSSTTEVRERRRSYLTPVRDEESE-----SQ 681

Query: 137 GKSSGPSARKGKGQIEKRKLREKRRSTGVVNIPAAECLDEYEDDEAGQKERKREDAITQQ 196
            K+    AR+ +   +   L + + +   +    +    E E++E  ++E++++D   Q+
Sbjct: 682 RKARSRQARQSRRSTQGVTLTDLQEAEKTIGRSRSTRTREQENEEKEKEEKEKQDKEKQE 741

Query: 197 NTIQNEAVNLLDPGSSYLLQEPPRTVSGRYKSTTSVSEEDVSSRYSRTDRSGFPRYNRDA 256
              ++E                               E++   +YSRT    + RY   +
Sbjct: 742 EKKESETSR----------------------------EDEYKQKYSRTYDETYQRYRPVS 773

Query: 257 NVSGTLVSSS 266
             S T  SSS
Sbjct: 774 TSSSTTPSSS 783


>gi|4505317 protein phosphatase 1, regulatory (inhibitor) subunit 12A
            isoform a [Homo sapiens]
          Length = 1030

 Score = 58.2 bits (139), Expect = 1e-08
 Identities = 55/226 (24%), Positives = 100/226 (44%), Gaps = 24/226 (10%)

Query: 127  EEEPDGVPEKGKSSGPSARKGKGQIEKRKLREKRRSTGVVNIPAAECLDEYEDDEAGQKE 186
            E E +G   + +  G    + K   E+R+ REKRRSTGV      +  DE E ++    E
Sbjct: 817  ENEREGEKREEEKEGEDKSQPKSIRERRRPREKRRSTGVSF--WTQDSDENEQEQQSDTE 874

Query: 187  RKREDAITQQNTIQNEAVNLLDPGSSYLLQEPPRTVSGRYKSTTSVSE-EDVSSRYSRTD 245
                   TQ ++I     +    G  Y       ++ GR  S + + E +  SSR  + D
Sbjct: 875  EGSNKKETQTDSISRYETSSTSAGDRY------DSLLGRSGSYSYLEERKPYSSRLEKDD 928

Query: 246  RSGFPRYNRDANVSGTLVSSSTLEKKIE----DLEKEVVRERQENLRLVRLMQDK----- 296
             + F +           + +   +  +E     L+ E   +RQE      L++ +     
Sbjct: 929  STDFKKLYEQILAENEKLKAQLHDTNMELTDLKLQLEKATQRQERFADRSLLEMEKRERR 988

Query: 297  --EEMIGKLKEEIDLLNRDLDDIEDENEQLKQENKTLLKVVGQLTR 340
              E  I +++EE+ +    L D++ +N++LK EN  L++V+ +L++
Sbjct: 989  ALERRISEMEEELKM----LPDLKADNQRLKDENGALIRVISKLSK 1030



 Score = 35.4 bits (80), Expect = 0.074
 Identities = 39/190 (20%), Positives = 69/190 (36%), Gaps = 34/190 (17%)

Query: 78  PAAPAVP-GPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEK 136
           P A  +P  P  VN A  +  LT    G   S  E      S   P + +E E      +
Sbjct: 627 PTAVTIPVAPTVVNAAASTTTLTTTTAGTVSSTTEVRERRRSYLTPVRDEESE-----SQ 681

Query: 137 GKSSGPSARKGKGQIEKRKLREKRRSTGVVNIPAAECLDEYEDDEAGQKERKREDAITQQ 196
            K+    AR+ +   +   L + + +   +    +    E E++E  ++E++++D   Q+
Sbjct: 682 RKARSRQARQSRRSTQGVTLTDLQEAEKTIGRSRSTRTREQENEEKEKEEKEKQDKEKQE 741

Query: 197 NTIQNEAVNLLDPGSSYLLQEPPRTVSGRYKSTTSVSEEDVSSRYSRTDRSGFPRYNRDA 256
              ++E                               E++   +YSRT    + RY   +
Sbjct: 742 EKKESETSR----------------------------EDEYKQKYSRTYDETYQRYRPVS 773

Query: 257 NVSGTLVSSS 266
             S T  SSS
Sbjct: 774 TSSSTTPSSS 783


>gi|41349484 proline-rich protein BstNI subfamily 1 isoform 2
           preproprotein [Homo sapiens]
          Length = 198

 Score = 52.4 bits (124), Expect = 6e-07
 Identities = 36/114 (31%), Positives = 43/114 (37%), Gaps = 14/114 (12%)

Query: 35  NPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPG-PGGVNCAV 93
           NP GP+P GG        PP    G P             G  P  P  PG P G     
Sbjct: 35  NPQGPSPQGGNKPQGPPPPPGKPQGPPPQG----------GNKPQGPPPPGKPQGPPPQG 84

Query: 94  GSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKG 147
             +   R+ PG  + +  PP       PPPQ   + P G P  GK  GP A+ G
Sbjct: 85  DKSRSPRSPPG--KPQGPPPQGGKPQGPPPQGGNK-PQGPPPPGKPQGPPAQGG 135



 Score = 37.4 bits (85), Expect = 0.019
 Identities = 30/107 (28%), Positives = 38/107 (35%), Gaps = 17/107 (15%)

Query: 36  PPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAVGS 95
           P GP PP GG+      PP    G PA   +   +   P G P  P  P   G N     
Sbjct: 107 PQGP-PPQGGNKPQGPPPPGKPQGPPAQGGSKSQSARSPPGKPQGP--PQQEGNN----- 158

Query: 96  AMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGP 142
               +  P P     + P A      PP    + P   P+ G+ S P
Sbjct: 159 ---PQGPPPPAGGNPQQPQA------PPAGQPQGPPRPPQGGRPSRP 196



 Score = 31.2 bits (69), Expect = 1.4
 Identities = 12/22 (54%), Positives = 13/22 (59%)

Query: 35  NPPGPAPPGGGSSDAAGKPPAG 56
           NP GP PP GG+      PPAG
Sbjct: 158 NPQGPPPPAGGNPQQPQAPPAG 179



 Score = 30.4 bits (67), Expect = 2.4
 Identities = 15/47 (31%), Positives = 20/47 (42%), Gaps = 1/47 (2%)

Query: 104 GPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKGKGQ 150
           G  + +  PP       PPPQ   + P G P  GK  GP  +  K +
Sbjct: 43  GGNKPQGPPPPPGKPQGPPPQGGNK-PQGPPPPGKPQGPPPQGDKSR 88


>gi|209571537 proline-rich protein BstNI subfamily 2 [Homo sapiens]
          Length = 416

 Score = 52.0 bits (123), Expect = 8e-07
 Identities = 36/124 (29%), Positives = 46/124 (37%), Gaps = 6/124 (4%)

Query: 29  KMRAKQNPPGPA---PPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPG 85
           K R+ ++PPG     PP GG+      PP G    P     N+     P G P  P   G
Sbjct: 169 KSRSSRSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQGGNKPQGPPPPGKPQGPPPQG 228

Query: 86  PGGVNCAVGSAMLTRAAP--GPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPS 143
                 A       +  P  G  + +  PP       PPPQ    +P G P  GK  GP 
Sbjct: 229 DNKSQSARSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQ-GGNKPQGPPPPGKPQGPP 287

Query: 144 ARKG 147
            + G
Sbjct: 288 PQGG 291



 Score = 50.1 bits (118), Expect = 3e-06
 Identities = 34/120 (28%), Positives = 45/120 (37%), Gaps = 6/120 (5%)

Query: 28  EKMRAKQNPPGPA---PPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVP 84
           +K R+ ++PPG     PP GG+      PP G    P     N+     P G P  P   
Sbjct: 106 DKSRSPRSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQGGNKPQGPPPPGKPQGPPPQ 165

Query: 85  GPGGVNCAVGSAMLTRAAP--GPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGP 142
           G      +       +  P  G  + +  PP       PPPQ    +P G P  GK  GP
Sbjct: 166 GDNKSRSSRSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQ-GGNKPQGPPPPGKPQGP 224



 Score = 48.5 bits (114), Expect = 8e-06
 Identities = 34/124 (27%), Positives = 46/124 (37%), Gaps = 6/124 (4%)

Query: 29  KMRAKQNPPGPA---PPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPG 85
           K ++ ++PPG     PP GG+      PP G    P     N+     P G P  P   G
Sbjct: 231 KSQSARSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQGGNKPQGPPPPGKPQGPPPQG 290

Query: 86  PGGVNCAVGSAMLTRAAP--GPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPS 143
                 +       +  P  G  + +  PP       PPPQ    +P G P  GK  GP 
Sbjct: 291 GSKSRSSRSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQ-GGNKPQGPPPPGKPQGPP 349

Query: 144 ARKG 147
            + G
Sbjct: 350 PQGG 353



 Score = 45.8 bits (107), Expect = 5e-05
 Identities = 31/117 (26%), Positives = 43/117 (36%), Gaps = 10/117 (8%)

Query: 29  KMRAKQNPPGPA---PPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPG 85
           K R+ ++PPG     PP GG+      PP G    P     N+     P G P  P  P 
Sbjct: 293 KSRSSRSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQGGNKPQGPPPPGKPQGP--PP 350

Query: 86  PGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGP 142
            GG       +   R+ PG  +   +    +    PPP     +    P  G+  GP
Sbjct: 351 QGG-----SKSRSARSPPGKPQGPPQQEGNNPQGPPPPAGGNPQQPQAPPAGQPQGP 402



 Score = 42.7 bits (99), Expect = 5e-04
 Identities = 36/126 (28%), Positives = 42/126 (33%), Gaps = 26/126 (20%)

Query: 35  NPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAV---PGPGGVNC 91
           NP G  P GG        PP    G P          N P G P  P     P P G N 
Sbjct: 35  NPQGAPPQGGNKPQGPPSPPGKPQGPPPQ------GGNQPQGPPPPPGKPQGPPPQGGNK 88

Query: 92  AVGSAMLTRAAPGPRRSEDEPPAASASAAP---------PPQRDEEEPDG-VPEKGKSSG 141
             G        P P + +  PP    S +P         PP +   +P G  P  GK  G
Sbjct: 89  PQG-------PPPPGKPQGPPPQGDKSRSPRSPPGKPQGPPPQGGNQPQGPPPPPGKPQG 141

Query: 142 PSARKG 147
           P  + G
Sbjct: 142 PPPQGG 147



 Score = 42.4 bits (98), Expect = 6e-04
 Identities = 33/116 (28%), Positives = 37/116 (31%), Gaps = 9/116 (7%)

Query: 36  PPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAVGS 95
           P GP PP GG+      PP    G P         +  P   P  P  P P G N   G 
Sbjct: 78  PQGP-PPQGGNKPQGPPPPGKPQGPPPQGD----KSRSPRSPPGKPQGPPPQGGNQPQGP 132

Query: 96  AMLTRAAPGPR----RSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKG 147
                   GP          PP       PPPQ D +        GK  GP  + G
Sbjct: 133 PPPPGKPQGPPPQGGNKPQGPPPPGKPQGPPPQGDNKSRSSRSPPGKPQGPPPQGG 188



 Score = 42.0 bits (97), Expect = 8e-04
 Identities = 31/113 (27%), Positives = 36/113 (31%), Gaps = 22/113 (19%)

Query: 36  PPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVP-GPGGVNCAVG 94
           P GP P GG  S ++  PP    G P          N P G P  P  P GP        
Sbjct: 283 PQGPPPQGGSKSRSSRSPPGKPQGPPPQ------GGNQPQGPPPPPGKPQGP-------- 328

Query: 95  SAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKG 147
                   P        PP       PPPQ   +        GK  GP  ++G
Sbjct: 329 -------PPQGGNKPQGPPPPGKPQGPPPQGGSKSRSARSPPGKPQGPPQQEG 374



 Score = 33.5 bits (75), Expect = 0.28
 Identities = 27/98 (27%), Positives = 34/98 (34%), Gaps = 18/98 (18%)

Query: 54  PAGALGTPAAAAANELNNNLPGGAPAAPAVP-GPGGVNCAVGSAMLTRAAPGPRRSEDEP 112
           P+   G P  A       N P G P+ P  P GP                 G  + +  P
Sbjct: 29  PSLIAGNPQGAPPQ--GGNKPQGPPSPPGKPQGP--------------PPQGGNQPQGPP 72

Query: 113 PAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKGKGQ 150
           P       PPPQ   + P G P  GK  GP  +  K +
Sbjct: 73  PPPGKPQGPPPQGGNK-PQGPPPPGKPQGPPPQGDKSR 109


>gi|48762934 alpha 2 type I collagen [Homo sapiens]
          Length = 1366

 Score = 51.6 bits (122), Expect = 1e-06
 Identities = 47/150 (31%), Positives = 61/150 (40%), Gaps = 17/150 (11%)

Query: 38  GPAPPGGGSSDAAGKPPAGALGTPAAAAANELNN-----NLPGGAPAAPAVPGPGGVNCA 92
           GPA P G   +      +G +G P    AN L        LPG A  AP +PGP G+   
Sbjct: 274 GPAGPAGPRGEVGLPGLSGPVGPPGNPGANGLTGAKGAAGLPGVA-GAPGLPGPRGIPGP 332

Query: 93  VGSAMLTRA-----APGPRRSEDE-----PPAASASAAPPPQRDEEEPDGVPEKGKSSGP 142
           VG+A  T A      PGP  S+ E      P ++    PP    EE   G   +  S+GP
Sbjct: 333 VGAAGATGARGLVGEPGPAGSKGESGNKGEPGSAGPQGPPGPSGEEGKRGPNGEAGSAGP 392

Query: 143 SARKG-KGQIEKRKLREKRRSTGVVNIPAA 171
               G +G    R L       GV+  P +
Sbjct: 393 PGPPGLRGSPGSRGLPGADGRAGVMGPPGS 422



 Score = 40.8 bits (94), Expect = 0.002
 Identities = 34/117 (29%), Positives = 39/117 (33%), Gaps = 26/117 (22%)

Query: 34  QNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAV 93
           Q PPGP    GG  +     P G  G P            P G       PG  G++   
Sbjct: 531 QGPPGPQGVQGGKGEQGPPGPPGFQGLPG-----------PSGPAGEVGKPGERGLHGEF 579

Query: 94  GSAMLTRAAPGPR--RSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKGK 148
           G        PGP   R E  PP  S +A P        P G P      GP   KG+
Sbjct: 580 G-------LPGPAGPRGERGPPGESGAAGPTGPIGSRGPSGPP------GPDGNKGE 623



 Score = 39.3 bits (90), Expect = 0.005
 Identities = 29/92 (31%), Positives = 33/92 (35%), Gaps = 16/92 (17%)

Query: 31  RAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVN 90
           R +   PGPA   G         PAG +G+               G P  P  PGP G  
Sbjct: 222 RGRVGAPGPAGARGSDGSVGPVGPAGPIGS--------------AGPPGFPGAPGPKGEI 267

Query: 91  CAVGSAMLTRAAPGPRRSEDEPPAASASAAPP 122
            AVG+A    A P   R E   P  S    PP
Sbjct: 268 GAVGNA--GPAGPAGPRGEVGLPGLSGPVGPP 297



 Score = 37.4 bits (85), Expect = 0.019
 Identities = 32/117 (27%), Positives = 40/117 (34%), Gaps = 24/117 (20%)

Query: 31  RAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVN 90
           R ++ PPGP  PG    D    PP                     G P  P  PG GG  
Sbjct: 41  RGERGPPGP--PGRDGEDGPTGPP---------------------GPPGPPGPPGLGGNF 77

Query: 91  CAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKG 147
            A          PGP           A+ AP PQ   + P G P +   +GP+  +G
Sbjct: 78  AAQYDGKGVGLGPGPMGLMGPRGPPGAAGAPGPQ-GFQGPAGEPGEPGQTGPAGARG 133



 Score = 37.0 bits (84), Expect = 0.025
 Identities = 35/124 (28%), Positives = 43/124 (34%), Gaps = 19/124 (15%)

Query: 36  PPGPAPPGGGSSDAAGKPPAGALG--TPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAV 93
           P GPA P G   +     PAG  G   PA AA          GA       GP G N  V
Sbjct: 701 PAGPAGPRGSPGERGEVGPAGPNGFAGPAGAAGQP-------GAKGERGAKGPKGENGVV 753

Query: 94  GSAMLTRAA--------PGP--RRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPS 143
           G      AA        PGP   R +  PP  +       +     P G+       GP+
Sbjct: 754 GPTGPVGAAGPAGPNGPPGPAGSRGDGGPPGMTGFPGAAGRTGPPGPSGISGPPGPPGPA 813

Query: 144 ARKG 147
            ++G
Sbjct: 814 GKEG 817



 Score = 36.6 bits (83), Expect = 0.033
 Identities = 38/146 (26%), Positives = 49/146 (33%), Gaps = 32/146 (21%)

Query: 31  RAKQNPPGPA-----PPGGGSSDAAGKP----------------PAGALGTP----AAAA 65
           + +Q PPGP      P   G +   GKP                P G  G P    AA  
Sbjct: 543 KGEQGPPGPPGFQGLPGPSGPAGEVGKPGERGLHGEFGLPGPAGPRGERGPPGESGAAGP 602

Query: 66  ANELNNNLPGGAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQR 125
              + +  P G P      G  GV  AVG+A       GP      P    A+  P  + 
Sbjct: 603 TGPIGSRGPSGPPGPDGNKGEPGVVGAVGTA-------GPSGPSGLPGERGAAGIPGGKG 655

Query: 126 DEEEPDGVPEKGKSSGPSARKGKGQI 151
           ++ EP    E G      AR   G +
Sbjct: 656 EKGEPGLRGEIGNPGRDGARGAPGAV 681



 Score = 35.0 bits (79), Expect = 0.096
 Identities = 31/98 (31%), Positives = 35/98 (35%), Gaps = 13/98 (13%)

Query: 36  PPGP-APPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCA-- 92
           P GP  PPG   S   G PP G  G P AA         P G    P  PGP G      
Sbjct: 764 PAGPNGPPGPAGSRGDGGPP-GMTGFPGAAGRTGPPG--PSGISGPPGPPGPAGKEGLRG 820

Query: 93  -------VGSAMLTRAAPGPRRSEDEPPAASASAAPPP 123
                  VG      A   P  + ++ P+  A  A PP
Sbjct: 821 PRGDQGPVGRTGEVGAVGPPGFAGEKGPSGEAGTAGPP 858



 Score = 32.7 bits (73), Expect = 0.48
 Identities = 20/56 (35%), Positives = 22/56 (39%), Gaps = 1/56 (1%)

Query: 36   PPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNL-PGGAPAAPAVPGPGGVN 90
            P GP  P G S  A      G  GT   A       +  P G P  P  PGP GV+
Sbjct: 1049 PAGPRGPAGPSGPAGKDGRTGHPGTVGPAGIRGPQGHQGPAGPPGPPGPPGPPGVS 1104



 Score = 31.6 bits (70), Expect = 1.1
 Identities = 36/116 (31%), Positives = 42/116 (36%), Gaps = 22/116 (18%)

Query: 45  GSSDAAGKP-PAGALGTPA-AAAANELNNNLPGGAPAAPAVPGPGGVNCAVGSAMLT--- 99
           G+  A G P PAGA G    A AA       P G+P      GP G N   G A      
Sbjct: 676 GAPGAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPAGPNGFAGPAGAAGQP 735

Query: 100 -----RAAPGPRRSED-EPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKGKG 149
                R A GP+       P     AA P       P+G P      GP+  +G G
Sbjct: 736 GAKGERGAKGPKGENGVVGPTGPVGAAGP-----AGPNGPP------GPAGSRGDG 780



 Score = 31.2 bits (69), Expect = 1.4
 Identities = 32/106 (30%), Positives = 39/106 (36%), Gaps = 6/106 (5%)

Query: 45  GSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVP-GPGGVNCAVGSAMLTRAAP 103
           G   A G P  GA+G P  A A           PA PA P G  G    VG A     A 
Sbjct: 670 GRDGARGAP--GAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPAGPNGFA- 726

Query: 104 GPRRSEDEPPAASASAAPPPQRDE--EEPDGVPEKGKSSGPSARKG 147
           GP  +  +P A     A  P+ +     P G       +GP+   G
Sbjct: 727 GPAGAAGQPGAKGERGAKGPKGENGVVGPTGPVGAAGPAGPNGPPG 772



 Score = 29.6 bits (65), Expect = 4.0
 Identities = 39/138 (28%), Positives = 47/138 (34%), Gaps = 14/138 (10%)

Query: 36  PPGPAPPGG--GSSDAAGKPPA-GALGTPAAAAA----NELNNNLPGGAPAAPAVPGPGG 88
           PPG   P G  G+    G P + G  G P  A A      L    P GA   P   G  G
Sbjct: 857 PPGTPGPQGLLGAPGILGLPGSRGERGLPGVAGAVGEPGPLGIAGPPGARGPPGAVGSPG 916

Query: 89  VNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKGK 148
           VN A G A      PG     D PP       P  + +   P  +   G +  P      
Sbjct: 917 VNGAPGEAG-RDGNPG----NDGPPGRDGQ--PGHKGERGYPGNIGPVGAAGAPGPHGPV 969

Query: 149 GQIEKRKLREKRRSTGVV 166
           G   K   R +   +G V
Sbjct: 970 GPAGKHGNRGETGPSGPV 987


>gi|37537692 proline-rich protein BstNI subfamily 4 precursor [Homo
           sapiens]
          Length = 247

 Score = 51.6 bits (122), Expect = 1e-06
 Identities = 40/130 (30%), Positives = 51/130 (39%), Gaps = 23/130 (17%)

Query: 36  PPGPA-----PPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVN 90
           PP P      PP GG+      PP G    P     N+  ++ P   P  P  P P G N
Sbjct: 93  PPHPGKPERPPPQGGNQSQGTPPPPGKPERPPPQGGNQ--SHRPPPPPGKPERPPPQGGN 150

Query: 91  CAVGSAMLTRAAPGPRRSEDEPP-----AASASAAP-----PPQRDEEEPDGVPEKGKSS 140
            + G        P P + E  PP     + SA + P     PPQ++  +P G P  GK  
Sbjct: 151 QSQGPP------PHPGKPEGPPPQEGNKSRSARSPPGKPQGPPQQEGNKPQGPPPPGKPQ 204

Query: 141 GPSARKGKGQ 150
           GP    G  Q
Sbjct: 205 GPPPAGGNPQ 214



 Score = 48.5 bits (114), Expect = 8e-06
 Identities = 35/117 (29%), Positives = 41/117 (35%), Gaps = 8/117 (6%)

Query: 36  PPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAVGS 95
           P GP P GG  S     PP    G P            P   P  P  P P G N + G+
Sbjct: 57  PQGPPPQGGNQSQGPPPPPGKPEGRPPQGGNQSQG---PPPHPGKPERPPPQGGNQSQGT 113

Query: 96  ----AMLTRAAP-GPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKG 147
                   R  P G  +S   PP       PPPQ   +     P  GK  GP  ++G
Sbjct: 114 PPPPGKPERPPPQGGNQSHRPPPPPGKPERPPPQGGNQSQGPPPHPGKPEGPPPQEG 170



 Score = 39.3 bits (90), Expect = 0.005
 Identities = 31/114 (27%), Positives = 37/114 (32%), Gaps = 9/114 (7%)

Query: 36  PPGPA---PPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCA 92
           PPG     PP GG+      PP G    P     N+     P   P  P  P P   N +
Sbjct: 116 PPGKPERPPPQGGNQSHRPPPPPGKPERPPPQGGNQSQG--PPPHPGKPEGPPPQEGNKS 173

Query: 93  VGSAMLTRAAPGPRRSEDE----PPAASASAAPPPQRDEEEPDGVPEKGKSSGP 142
             +        GP + E      PP       PPP     +    P  GK  GP
Sbjct: 174 RSARSPPGKPQGPPQQEGNKPQGPPPPGKPQGPPPAGGNPQQPQAPPAGKPQGP 227



 Score = 33.1 bits (74), Expect = 0.37
 Identities = 21/63 (33%), Positives = 25/63 (39%), Gaps = 3/63 (4%)

Query: 29  KMRAKQNPPGP--APPGGGSSDAAGKPPAGALGTPAAAAANELNNNLP-GGAPAAPAVPG 85
           K R+ ++PPG    PP    +   G PP G    P  A  N      P  G P  P  P 
Sbjct: 172 KSRSARSPPGKPQGPPQQEGNKPQGPPPPGKPQGPPPAGGNPQQPQAPPAGKPQGPPPPP 231

Query: 86  PGG 88
            GG
Sbjct: 232 QGG 234



 Score = 31.2 bits (69), Expect = 1.4
 Identities = 23/95 (24%), Positives = 34/95 (35%), Gaps = 10/95 (10%)

Query: 36  PPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAVGS 95
           P GP PP  G+   + + P G    P     N+     P G P  P   G        G+
Sbjct: 162 PEGP-PPQEGNKSRSARSPPGKPQGPPQQEGNKPQGPPPPGKPQGPPPAG--------GN 212

Query: 96  AMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEP 130
               +A P   + +  PP       P P + ++ P
Sbjct: 213 PQQPQAPPA-GKPQGPPPPPQGGRPPRPAQGQQPP 246



 Score = 28.5 bits (62), Expect = 9.0
 Identities = 18/52 (34%), Positives = 25/52 (48%), Gaps = 5/52 (9%)

Query: 100 RAAPGPRRSEDEPP-AASASAAPPPQRDEEEPDGVPEKG--KSSGPSARKGK 148
           R  P P + +  PP   + S  PPP   +  P+G P +G  +S GP    GK
Sbjct: 49  RPPPPPGKPQGPPPQGGNQSQGPPPPPGK--PEGRPPQGGNQSQGPPPHPGK 98


>gi|89276751 alpha 1 type V collagen preproprotein [Homo sapiens]
          Length = 1838

 Score = 51.2 bits (121), Expect = 1e-06
 Identities = 36/115 (31%), Positives = 42/115 (36%), Gaps = 22/115 (19%)

Query: 33   KQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCA 92
            K+ PPGPA P G   +   K  AG  G P            P G   AP  PGP G+   
Sbjct: 1391 KRGPPGPAGPEGRQGEKGAKGEAGLEGPPGKTG--------PIGPQGAPGKPGPDGL--- 1439

Query: 93   VGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKG 147
                   R  PGP   +  P +      P P      P G+P     SGP   KG
Sbjct: 1440 -------RGIPGPVGEQGLPGSPGPDGPPGPM----GPPGLPGLKGDSGPKGEKG 1483



 Score = 42.0 bits (97), Expect = 8e-04
 Identities = 35/116 (30%), Positives = 42/116 (36%), Gaps = 6/116 (5%)

Query: 36   PPGPAPPGGGSSDAAGKPPAGALGTPAAAA-ANELNNNLPGGAPAAPAVPGPGGVNCAVG 94
            PPGP  P G       + P G +G P A     E       G P     PGP G     G
Sbjct: 1253 PPGPRGPSGAPGADGPQGPPGGIGNPGAVGEKGEPGEAGEPGLPGEGGPPGPKGERGEKG 1312

Query: 95   SAMLTRAA--PGPRRSE-DEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKG 147
             +  + AA  PGP+    D+ P  S      P   +  P G P      GP   KG
Sbjct: 1313 ESGPSGAAGPPGPKGPPGDDGPKGSPGPVGFP--GDPGPPGEPGPAGQDGPPGDKG 1366



 Score = 40.4 bits (93), Expect = 0.002
 Identities = 39/142 (27%), Positives = 50/142 (35%), Gaps = 28/142 (19%)

Query: 31   RAKQNPPGPAPPGGGSSDAAGKP-PAGALGTPAAAAANEL----NNNLPGGAPAAP---- 81
            + +Q PPGP  P G      G+P P+GA G P       L     +  P G P  P    
Sbjct: 1173 KGEQGPPGPTGPQG----PIGQPGPSGADGEPGPRGQQGLFGQKGDEGPRGFPGPPGPVG 1228

Query: 82   --AVPGPGGVNCAVGSA--MLTRAAPGPRRSEDEPPAASASAAP-----------PPQRD 126
               +PGP G     G    M     PGPR     P A      P             +  
Sbjct: 1229 LQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGAPGADGPQGPPGGIGNPGAVGEKGEPG 1288

Query: 127  EEEPDGVPEKGKSSGPSARKGK 148
            E    G+P +G   GP   +G+
Sbjct: 1289 EAGEPGLPGEGGPPGPKGERGE 1310



 Score = 40.0 bits (92), Expect = 0.003
 Identities = 32/118 (27%), Positives = 41/118 (34%), Gaps = 8/118 (6%)

Query: 36  PPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAVGS 95
           PPGP    G   D     P G  G P       L    P G P  P V G  G     G+
Sbjct: 653 PPGPPGDDGERGDDGEVGPRGLPGEPGPRGL--LGPKGPPGPPGPPGVTGMDGQPGPKGN 710

Query: 96  AMLTRAAPGPRRSEDEP-----PAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKGK 148
            +  +  PGP   +  P     P    +  PP ++      G+P    + GP    GK
Sbjct: 711 -VGPQGEPGPPGQQGNPGAQGLPGPQGAIGPPGEKGPLGKPGLPGMPGADGPPGHPGK 767



 Score = 38.1 bits (87), Expect = 0.011
 Identities = 37/156 (23%), Positives = 56/156 (35%), Gaps = 23/156 (14%)

Query: 2    ATGGYRTSSGLG--GSTTDFLEEWKAKREKMRAKQNPPGPAPPGGGSSDAAGKPPAGALG 59
            A G      G+G  G+  +  E  +A    +  +  PPGP    G   ++    P+GA G
Sbjct: 1265 ADGPQGPPGGIGNPGAVGEKGEPGEAGEPGLPGEGGPPGPKGERGEKGESG---PSGAAG 1321

Query: 60   TPAAAA--ANELNNNLPG-----GAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEP 112
             P       ++     PG     G P  P  PGP G +            P   + +D  
Sbjct: 1322 PPGPKGPPGDDGPKGSPGPVGFPGDPGPPGEPGPAGQD-----------GPPGDKGDDGE 1370

Query: 113  PAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKGK 148
            P  + S  P  +     P G       +GP  R+G+
Sbjct: 1371 PGQTGSPGPTGEPGPSGPPGKRGPPGPAGPEGRQGE 1406



 Score = 37.4 bits (85), Expect = 0.019
 Identities = 29/121 (23%), Positives = 40/121 (33%), Gaps = 19/121 (15%)

Query: 30   MRAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGV 89
            ++  + PPGP  P G   +      AG +G P              G P     PGP G 
Sbjct: 1073 LKGNEGPPGPPGPAGSPGERGPAGAAGPIGIP--------------GRPGPQGPPGPAGE 1118

Query: 90   NCAVGSAMLTRAAPGPRRSEDEP---PAASASAAPPPQRDEEEPDGVPEKGKSSGPSARK 146
              A G        P  R     P   P  +    PP +  ++   G P +  S G    +
Sbjct: 1119 KGAPGEK--GPQGPAGRDGLQGPVGLPGPAGPVGPPGEDGDKGEIGEPGQKGSKGDKGEQ 1176

Query: 147  G 147
            G
Sbjct: 1177 G 1177



 Score = 35.4 bits (80), Expect = 0.074
 Identities = 41/140 (29%), Positives = 51/140 (36%), Gaps = 14/140 (10%)

Query: 34  QNPPGP-APPGGGSSDAAGKP-PAGALGTPAAAAANELNNNLPG--GAPAAPAVPGPGGV 89
           + PPGP  PPG    D  G+P P G +G            N PG  G P      GP G 
Sbjct: 687 KGPPGPPGPPGVTGMD--GQPGPKGNVGPQGEPGPPGQQGN-PGAQGLPGPQGAIGPPGE 743

Query: 90  NCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKGKG 149
              +G   L    PG     D PP       PP ++  + P G   +G    P  R  KG
Sbjct: 744 KGPLGKPGLP-GMPGA----DGPPGHPGKEGPPGEKGGQGPPG--PQGPIGYPGPRGVKG 796

Query: 150 QIEKRKLREKRRSTGVVNIP 169
               R L+  +   G    P
Sbjct: 797 ADGIRGLKGTKGEKGEDGFP 816



 Score = 35.4 bits (80), Expect = 0.074
 Identities = 34/128 (26%), Positives = 41/128 (32%), Gaps = 17/128 (13%)

Query: 31  RAKQNPPGPAPPGGGSSDAAGKPP---AGALGTPAAAAANELNNNLPGGAPAAPAVPGPG 87
           R +  P GP   GG + D     P    G LG P            P G+   P  PG  
Sbjct: 837 RGEDGPEGPKGRGGPNGDPGPLGPPGEKGKLGVPGLPGYP--GRQGPKGSIGFPGFPGAN 894

Query: 88  GVNCAVGSAMLTRAAPGPR--------RSEDEPPAASASAAPPPQRDEEEPDGVPEKGKS 139
           G     G    T   PGPR        R E  P   +    P      + P G P +   
Sbjct: 895 GEKGGRG----TPGKPGPRGQRGPTGPRGERGPRGITGKPGPKGNSGGDGPAGPPGERGP 950

Query: 140 SGPSARKG 147
           +GP    G
Sbjct: 951 NGPQGPTG 958



 Score = 32.0 bits (71), Expect = 0.82
 Identities = 19/57 (33%), Positives = 22/57 (38%), Gaps = 9/57 (15%)

Query: 34  QNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPG--GAPAAPAVPGPGG 88
           + PPGP  P G         P G +G P            PG  G P A  +PGP G
Sbjct: 468 EGPPGPEGPAGLPGPPGTMGPTGQVGDPG-------ERGPPGRPGLPGADGLPGPPG 517



 Score = 32.0 bits (71), Expect = 0.82
 Identities = 31/131 (23%), Positives = 41/131 (31%), Gaps = 6/131 (4%)

Query: 25   AKREKMRAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAA-ANELNNNLPGGAPAAPAV 83
            A R+ ++     PGPA P G   +   K   G  G   +     E     P G       
Sbjct: 1131 AGRDGLQGPVGLPGPAGPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGPPGPTGPQGPIGQ 1190

Query: 84   PGPGGVNCAVG-----SAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGK 138
            PGP G +   G          +   GPR     P        P P  ++ E   V + G 
Sbjct: 1191 PGPSGADGEPGPRGQQGLFGQKGDEGPRGFPGPPGPVGLQGLPGPPGEKGETGDVGQMGP 1250

Query: 139  SSGPSARKGKG 149
               P  R   G
Sbjct: 1251 PGPPGPRGPSG 1261



 Score = 31.2 bits (69), Expect = 1.4
 Identities = 35/117 (29%), Positives = 39/117 (33%), Gaps = 13/117 (11%)

Query: 37  PGPA-PPGGGS-----SDAAGKPPAGALGTPAAAAANELNNNL-PGGAPAAPAVPGPGGV 89
           PGP  PPG G       D   + P G  G P  A            GA   P   GP G 
Sbjct: 570 PGPVGPPGSGGLKGEPGDVGPQGPRGVQGPPGPAGKPGRRGRAGSDGARGMPGQTGPKGD 629

Query: 90  NCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDG-VPEKGKSSGPSAR 145
               G A L    PG +    + P  S    PP    E   DG V  +G    P  R
Sbjct: 630 RGFDGLAGL----PGEKGHRGD-PGPSGPPGPPGDDGERGDDGEVGPRGLPGEPGPR 681



 Score = 29.6 bits (65), Expect = 4.0
 Identities = 31/115 (26%), Positives = 37/115 (32%), Gaps = 33/115 (28%)

Query: 36   PPGP-APPG-----GGSSDAAGKPPAGALGT--PAAAAANELNNNLPG------------ 75
            PPGP  PPG     G S     K   G +G   P      + +  LPG            
Sbjct: 1460 PPGPMGPPGLPGLKGDSGPKGEKGHPGLIGLIGPPGEQGEKGDRGLPGPQGSSGPKGEQG 1519

Query: 76   --------GAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPP 122
                    G P  P +PGP G   A GS     + P   + E   P       PP
Sbjct: 1520 ITGPSGPIGPPGPPGLPGPPGPKGAKGS-----SGPTGPKGEAGHPGPPGPPGPP 1569


>gi|41349482 proline-rich protein BstNI subfamily 1 isoform 1
           preproprotein [Homo sapiens]
          Length = 331

 Score = 49.7 bits (117), Expect = 4e-06
 Identities = 37/129 (28%), Positives = 51/129 (39%), Gaps = 9/129 (6%)

Query: 28  EKMRAKQNPPGPA---PPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVP 84
           +K R+ ++PPG     PP GG+      PP G    P     N      P G P  P  P
Sbjct: 85  DKSRSPRSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQGGNRPQGPPPPGKPQGP--P 142

Query: 85  GPGGVNCAVGSAMLTRAAPGPR---RSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSG 141
             G  + +  S       P P+   + +  PP       PPPQ   ++P G P  GK  G
Sbjct: 143 PQGDKSRSPRSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQ-GGKKPQGPPPPGKPQG 201

Query: 142 PSARKGKGQ 150
           P  +  K +
Sbjct: 202 PPPQGDKSR 210



 Score = 49.7 bits (117), Expect = 4e-06
 Identities = 37/126 (29%), Positives = 50/126 (39%), Gaps = 9/126 (7%)

Query: 28  EKMRAKQNPPGPA---PPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVP 84
           +K R+ ++PPG     PP GG+      PP G    P      +     P G P  P  P
Sbjct: 146 DKSRSPRSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQGGKKPQGPPPPGKPQGP--P 203

Query: 85  GPGGVNCAVGSAMLTRAAPGPR---RSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSG 141
             G  + +  S       P P+   + +  PP       PPPQ   + P G P  GK  G
Sbjct: 204 PQGDKSRSSQSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQGGNK-PQGPPPPGKPQG 262

Query: 142 PSARKG 147
           P A+ G
Sbjct: 263 PPAQGG 268



 Score = 49.3 bits (116), Expect = 5e-06
 Identities = 32/118 (27%), Positives = 44/118 (37%), Gaps = 10/118 (8%)

Query: 28  EKMRAKQNPPGPA---PPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVP 84
           +K R+ Q+PPG     PP GG+      PP G    P     N+     P G P  P  P
Sbjct: 207 DKSRSSQSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQGGNKPQGPPPPGKPQGP--P 264

Query: 85  GPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGP 142
             GG       +   R+ PG  +   +    +    PPP     +    P  G+  GP
Sbjct: 265 AQGG-----SKSQSARSPPGKPQGPPQQEGNNPQGPPPPAGGNPQQPQAPPAGQPQGP 317



 Score = 47.0 bits (110), Expect = 2e-05
 Identities = 35/119 (29%), Positives = 44/119 (36%), Gaps = 7/119 (5%)

Query: 35  NPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAVG 94
           NP GP+P GG        PP    G P     N+     P G P  P  P  G  + +  
Sbjct: 35  NPQGPSPQGGNKPQGPPPPPGKPQG-PPPQGGNKPQGPPPPGKPQGP--PPQGDKSRSPR 91

Query: 95  SAMLTRAAPGPR---RSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKGKGQ 150
           S       P P+   + +  PP       PPPQ     P G P  GK  GP  +  K +
Sbjct: 92  SPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQ-GGNRPQGPPPPGKPQGPPPQGDKSR 149


>gi|239754955 PREDICTED: similar to COL22A1 protein [Homo sapiens]
          Length = 1073

 Score = 49.3 bits (116), Expect = 5e-06
 Identities = 42/143 (29%), Positives = 51/143 (35%), Gaps = 14/143 (9%)

Query: 34  QNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAV 93
           Q  PG   P G +     K   GA G P AA           GAP     PGP G   +V
Sbjct: 367 QGRPGELGPQGPTGPPGAKGQEGAHGAPGAAG--------NPGAPGHVGAPGPSGPPGSV 418

Query: 94  GSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEP-----DGVPEKGKSSGPSARKGK 148
           G+  L R  PG      E  AA    +P P     +P      G P KGK   P  R   
Sbjct: 419 GAPGL-RGTPGKDGERGEKGAAGEEGSPGPVGPRGDPGAPGLPGPPGKGKDGEPGLRGSP 477

Query: 149 GQIEKRKLREKRRSTGVVNIPAA 171
           G       +  R + G+   P +
Sbjct: 478 GLPGPLGTKGDRGAPGIPGSPGS 500



 Score = 38.1 bits (87), Expect = 0.011
 Identities = 37/127 (29%), Positives = 49/127 (38%), Gaps = 22/127 (17%)

Query: 36  PPGPAPPGG--GSSDAAGKP-------PAGALGTPAAAAANELN-------NNLPGGAPA 79
           PPGP+ P G  GS  + G P       PAG  G  +  +  ++N       N+ P G P 
Sbjct: 511 PPGPSGPPGDKGSPGSRGLPGFPGPQGPAGRDGLSSLLSPGDINLLAKDVCNDCPPGPPG 570

Query: 80  APAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASAS---AAPPPQRDEEEPDG-VPE 135
            P +PG  G     G     R     ++ E  PP        A P   + E   DG V +
Sbjct: 571 LPGLPGFKGDKGVPGKP--GREGTEGKKGEAGPPGLPGPPGIAGPQGSQGERGADGEVGQ 628

Query: 136 KGKSSGP 142
           KG    P
Sbjct: 629 KGDQGHP 635



 Score = 36.2 bits (82), Expect = 0.043
 Identities = 36/133 (27%), Positives = 42/133 (31%), Gaps = 34/133 (25%)

Query: 33  KQNPPGPAPPGG--------GSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVP 84
           K+ PPGP  P G        G     GKP  G  G P              G    P +P
Sbjct: 670 KEGPPGPQGPSGLPGIPGEEGKEGRDGKP--GPPGEP--------------GKAGEPGLP 713

Query: 85  GPGGVNCAVGSAMLT--RAAPGPR--------RSEDEPPAASASAAPPPQRDEEEPDGVP 134
           GP G     G    T    APGPR          ++  P       P   +  + P G P
Sbjct: 714 GPEGARGPPGFKGHTGDSGAPGPRGESGAMGLPGQEGLPGKDGDTGPTGPQGPQGPRGPP 773

Query: 135 EKGKSSGPSARKG 147
            K  S G     G
Sbjct: 774 GKNGSPGSPGEPG 786



 Score = 34.7 bits (78), Expect = 0.13
 Identities = 31/114 (27%), Positives = 34/114 (29%), Gaps = 15/114 (13%)

Query: 36  PPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAVGS 95
           PPG   P G    A    P G  G+P            P G    P +PG  G     G 
Sbjct: 643 PPGNPGPPGADGIAGAAGPPGIQGSPGKEGPPG-----PQGPSGLPGIPGEEGKEGRDGK 697

Query: 96  AMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKGKG 149
                  PGP     EP  A     P P+     P      G S  P  R   G
Sbjct: 698 -------PGP---PGEPGKAGEPGLPGPEGARGPPGFKGHTGDSGAPGPRGESG 741



 Score = 31.6 bits (70), Expect = 1.1
 Identities = 32/119 (26%), Positives = 38/119 (31%), Gaps = 17/119 (14%)

Query: 30   MRAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGV 89
            M++ Q  PGP  P G         P G  G P             GG        GP G 
Sbjct: 939  MKSSQGRPGPPGPPGKDGLPGRAGPMGEPGRPG-----------QGGLEGPSGPIGPKGE 987

Query: 90   NCAVGSAMLTRAAPG-PRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKG 147
              A G       APG   R E  PP        P    +  P G+P     +GP+   G
Sbjct: 988  RGAKGDP----GAPGVGLRGEMGPPGIPGQPGEPGYAKDGLP-GIPGPQGETGPAGHPG 1041



 Score = 28.9 bits (63), Expect = 6.9
 Identities = 31/127 (24%), Positives = 39/127 (30%), Gaps = 35/127 (27%)

Query: 27  REKMRAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGP 86
           +E +  K    GP  P G         P G  G P              G+P +P  PGP
Sbjct: 748 QEGLPGKDGDTGPTGPQG---------PQGPRGPPGK-----------NGSPGSPGEPGP 787

Query: 87  GGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEK----GKSSGP 142
            G     GS           + E+  P       P     E    GVP K    GK   P
Sbjct: 788 SGTPGQKGS-----------KGENGSPGLPGFLGPRGPPGEPGEKGVPGKEGVPGKPGEP 836

Query: 143 SARKGKG 149
             +  +G
Sbjct: 837 GFKGERG 843


>gi|55742678 glucocorticoid induced transcript 1 [Homo sapiens]
          Length = 547

 Score = 48.5 bits (114), Expect = 8e-06
 Identities = 46/170 (27%), Positives = 69/170 (40%), Gaps = 17/170 (10%)

Query: 8   TSSGLGGSTTDFLEEWKAKREKMRAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAAN 67
           T+S    S++       ++R +  A  +PP  A  G G+    G    G +G   AA A 
Sbjct: 3   TASSSSSSSSSQTPHPPSQRMRRSAAGSPPAVAAAGSGNGAGGG----GGVGCAPAAGAG 58

Query: 68  ELNN--------NLPGGAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASA 119
            L           L  G+  +P  P       ++GS     AA GP  S   PPAA+A A
Sbjct: 59  RLLQPIRATVPYQLLRGSQHSPTRPPVAAAAASLGSLPGPGAARGPSPSSPTPPAAAAPA 118

Query: 120 APPPQRDEEEPDGVPEKGKSSGPSARKGKG----QIEKRKLREKRRSTGV 165
              P R +  P   PE  + S    R+  G    + +K K ++ R S+ +
Sbjct: 119 EQAP-RAKGRPRRSPESHRRSSSPERRSPGSPVCRADKAKSQQVRTSSTI 167


>gi|40805823 collagen, type XXII, alpha 1 [Homo sapiens]
          Length = 1626

 Score = 48.5 bits (114), Expect = 8e-06
 Identities = 39/121 (32%), Positives = 44/121 (36%), Gaps = 14/121 (11%)

Query: 34  QNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAV 93
           Q  PG   P G +     K   GA G P AA           GAP     PGP G   +V
Sbjct: 885 QGRPGELGPQGPTGPPGAKGQEGAHGAPGAAG--------NPGAPGHVGAPGPSGPPGSV 936

Query: 94  GSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEP-----DGVPEKGKSSGPSARKGK 148
           G+  L R  PG      E  AA    +P P     +P      G P KGK   P  R   
Sbjct: 937 GAPGL-RGTPGKDGERGEKGAAGEEGSPGPVGPRGDPGAPGLPGPPGKGKDGEPGLRGSP 995

Query: 149 G 149
           G
Sbjct: 996 G 996



 Score = 43.9 bits (102), Expect = 2e-04
 Identities = 44/180 (24%), Positives = 63/180 (35%), Gaps = 24/180 (13%)

Query: 26  KREKMRAKQNPPGP-------APPGG----GSSDAAGKP-------PAGALGTPAAAAAN 67
           ++E ++ +Q  PGP        PPG     G     G P         G +G P      
Sbjct: 651 QQEGLKGEQGAPGPRGHQGAPGPPGARGPIGPEGRDGPPGLQGLRGKKGDMGPPGIPGLL 710

Query: 68  ELNNNLPGGAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDE 127
            L    P G P  P  PGPGG     G        PGP      P     +  P P   +
Sbjct: 711 GLQG--PPGPPGVPGPPGPGGSPGLPGEIGFP-GKPGPPGPTGPPGKDGPNGPPGPPGTK 767

Query: 128 EEPDGVPEKGKSSGPSARKGKGQIEKRKLREKRRSTGVVNIPAAECLDEYEDDEAGQKER 187
            EP    E+G+   P     +G+I ++ L  +    G   +P A        ++  Q E+
Sbjct: 768 GEPG---ERGEDGLPGKPGLRGEIGEQGLAGRPGEKGEAGLPGAPGFPGVRGEKGDQGEK 824



 Score = 39.3 bits (90), Expect = 0.005
 Identities = 48/171 (28%), Positives = 58/171 (33%), Gaps = 19/171 (11%)

Query: 9   SSGLGGSTTDFLEEWKAKREKMRAKQNPPG-PAPPGG-GSSDAAGKPPAGALGTPAAAAA 66
           S G+ G   +  E        MR  Q PPG P PPG  G+    G+   G  GT      
Sbjct: 546 SKGMRGEPGELGEPGLPGEVGMRGPQGPPGLPGPPGRVGAPGLQGE--RGEKGTRGEKGE 603

Query: 67  NELNNNLPG--------GAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASAS 118
             L +  PG        G P    V GP G    VG A      PG   S  +       
Sbjct: 604 RGL-DGFPGKPGDTGQQGRPGPSGVAGPQGEKGDVGPA----GPPGVPGSVVQQEGLKGE 658

Query: 119 AAPPPQRDEEEPDGVPEKGKSSGPSARKGKGQIEKRKLREKRRSTGVVNIP 169
              P  R  +   G P      GP  R G   ++   LR K+   G   IP
Sbjct: 659 QGAPGPRGHQGAPGPPGARGPIGPEGRDGPPGLQ--GLRGKKGDMGPPGIP 707



 Score = 36.2 bits (82), Expect = 0.043
 Identities = 36/133 (27%), Positives = 42/133 (31%), Gaps = 34/133 (25%)

Query: 33   KQNPPGPAPPGG--------GSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVP 84
            K+ PPGP  P G        G     GKP  G  G P              G    P +P
Sbjct: 1223 KEGPPGPQGPSGLPGIPGEEGKEGRDGKP--GPPGEP--------------GKAGEPGLP 1266

Query: 85   GPGGVNCAVGSAMLT--RAAPGPR--------RSEDEPPAASASAAPPPQRDEEEPDGVP 134
            GP G     G    T    APGPR          ++  P       P   +  + P G P
Sbjct: 1267 GPEGARGPPGFKGHTGDSGAPGPRGESGAMGLPGQEGLPGKDGDTGPTGPQGPQGPRGPP 1326

Query: 135  EKGKSSGPSARKG 147
             K  S G     G
Sbjct: 1327 GKNGSPGSPGEPG 1339



 Score = 35.4 bits (80), Expect = 0.074
 Identities = 36/129 (27%), Positives = 43/129 (33%), Gaps = 13/129 (10%)

Query: 31   RAKQNPPG-PAPPGGGSSDAAGKPPA----GALGTPAAAAANELNNNLPGGAPA------ 79
            R     PG P PPG G     G   +    G LGT AA      + N   G         
Sbjct: 969  RGDPGAPGLPGPPGKGKDGEPGLRGSPGLPGPLGTKAACGKVRGSENCALGGQCVKGDRG 1028

Query: 80   APAVPGPGGVNCAVGSAMLTRAAP-GPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGK 138
            AP +PG  G     G  +     P GP   +  P +      P PQ      DG P    
Sbjct: 1029 APGIPGSPGSRGDPGIGVAGPPGPSGPPGDKGSPGSRGLPGFPGPQGPAGR-DGAPGNPG 1087

Query: 139  SSGPSARKG 147
              GP  + G
Sbjct: 1088 ERGPPGKPG 1096



 Score = 35.0 bits (79), Expect = 0.096
 Identities = 38/137 (27%), Positives = 48/137 (35%), Gaps = 36/137 (26%)

Query: 33   KQNPPGPAPPGG--GSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGP---- 86
            ++  PGP  P G  G+    G P  G  G P              G   +P +PGP    
Sbjct: 959  EEGSPGPVGPRGDPGAPGLPGPPGKGKDGEP--------------GLRGSPGLPGPLGTK 1004

Query: 87   -------GGVNCAVGSAML--TRAAPG-PRRSEDEPPAASASAAPP----PQRDEEEPD- 131
                   G  NCA+G   +   R APG P             A PP    P  D+  P  
Sbjct: 1005 AACGKVRGSENCALGGQCVKGDRGAPGIPGSPGSRGDPGIGVAGPPGPSGPPGDKGSPGS 1064

Query: 132  -GVPEKGKSSGPSARKG 147
             G+P      GP+ R G
Sbjct: 1065 RGLPGFPGPQGPAGRDG 1081



 Score = 35.0 bits (79), Expect = 0.096
 Identities = 34/120 (28%), Positives = 44/120 (36%), Gaps = 16/120 (13%)

Query: 37   PGPAPPGG--------GSSDAAGKPPAGALGTPAAA--AANELNNNLPGGAPAAPAVPGP 86
            PGP  P G        G     GKP   +L +P      A ++ N+ P G P  P +PG 
Sbjct: 1071 PGPQGPAGRDGAPGNPGERGPPGKPGLSSLLSPGDINLLAKDVCNDCPPGPPGLPGLPGF 1130

Query: 87   GGVNCAVGSAMLTRAAPGPRRSEDEPPAASAS---AAPPPQRDEEEPDG-VPEKGKSSGP 142
             G     G     R     ++ E  PP        A P   + E   DG V +KG    P
Sbjct: 1131 KGDKGVPGKP--GREGTEGKKGEAGPPGLPGPPGIAGPQGSQGERGADGEVGQKGDQGHP 1188



 Score = 34.7 bits (78), Expect = 0.13
 Identities = 31/114 (27%), Positives = 34/114 (29%), Gaps = 15/114 (13%)

Query: 36   PPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAVGS 95
            PPG   P G    A    P G  G+P            P G    P +PG  G     G 
Sbjct: 1196 PPGNPGPPGADGIAGAAGPPGIQGSPGKEGPPG-----PQGPSGLPGIPGEEGKEGRDGK 1250

Query: 96   AMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKGKG 149
                   PGP     EP  A     P P+     P      G S  P  R   G
Sbjct: 1251 -------PGP---PGEPGKAGEPGLPGPEGARGPPGFKGHTGDSGAPGPRGESG 1294



 Score = 32.3 bits (72), Expect = 0.62
 Identities = 30/115 (26%), Positives = 38/115 (33%), Gaps = 12/115 (10%)

Query: 33  KQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCA 92
           +Q  PGP+   G   +     PAG  G P +    E       G       PGP G   A
Sbjct: 618 QQGRPGPSGVAGPQGEKGDVGPAGPPGVPGSVVQQE-------GLKGEQGAPGPRGHQGA 670

Query: 93  VGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKG 147
            G     R   GP    D PP          ++ +  P G+P      GP    G
Sbjct: 671 PGPPG-ARGPIGP-EGRDGPPGLQGLRG---KKGDMGPPGIPGLLGLQGPPGPPG 720



 Score = 31.6 bits (70), Expect = 1.1
 Identities = 32/119 (26%), Positives = 38/119 (31%), Gaps = 17/119 (14%)

Query: 30   MRAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGV 89
            M++ Q  PGP  P G         P G  G P             GG        GP G 
Sbjct: 1492 MKSSQGRPGPPGPPGKDGLPGRAGPMGEPGRPG-----------QGGLEGPSGPIGPKGE 1540

Query: 90   NCAVGSAMLTRAAPG-PRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKG 147
              A G       APG   R E  PP        P    +  P G+P     +GP+   G
Sbjct: 1541 RGAKGDP----GAPGVGLRGEMGPPGIPGQPGEPGYAKDGLP-GIPGPQGETGPAGHPG 1594



 Score = 28.9 bits (63), Expect = 6.9
 Identities = 31/127 (24%), Positives = 39/127 (30%), Gaps = 35/127 (27%)

Query: 27   REKMRAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGP 86
            +E +  K    GP  P G         P G  G P              G+P +P  PGP
Sbjct: 1301 QEGLPGKDGDTGPTGPQG---------PQGPRGPPGK-----------NGSPGSPGEPGP 1340

Query: 87   GGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEK----GKSSGP 142
             G     GS           + E+  P       P     E    GVP K    GK   P
Sbjct: 1341 SGTPGQKGS-----------KGENGSPGLPGFLGPRGPPGEPGEKGVPGKEGVPGKPGEP 1389

Query: 143  SARKGKG 149
              +  +G
Sbjct: 1390 GFKGERG 1396


>gi|117938759 GNAS complex locus XLas [Homo sapiens]
          Length = 1037

 Score = 48.1 bits (113), Expect = 1e-05
 Identities = 61/200 (30%), Positives = 80/200 (40%), Gaps = 24/200 (12%)

Query: 36  PPGPAPPGGGSS-DAAGKPPAGAL-GTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAV 93
           P  PA P  G++ DA   P AGA    PAA AA E        A  APA P  G      
Sbjct: 447 PDAPADPDSGAAPDAPADPDAGAAPEAPAAPAAAETR-----AAHVAPAAPDAGAPTAPA 501

Query: 94  GSAMLTRAAPGPRRSEDEPPAASASA---APPPQRDEEEPDGVPEKGKSSGPSARKGKGQ 150
            SA  TRAA   RR+    PA+ A       PP  + +  D  P   + +  SA +GK  
Sbjct: 502 ASA--TRAAQ-VRRAASAAPASGARRKIHLRPPSPEIQAAD--PPTPRPTRASAWRGKS- 555

Query: 151 IEKRKLREKRRSTGVVNIPAAECLDEYEDDEAG-----QKERKREDAITQQNTIQNEAVN 205
            E  + R      GV +       DE +D  +G     Q  R R     Q+N ++N  V 
Sbjct: 556 -ESSRGRRVYYDEGVASSDDDSSGDESDDGTSGCLRWFQHRRNRRRRKPQRNLLRNFLVQ 614

Query: 206 LLDPGSSYLLQEPPRTVSGR 225
               G  +   E P+  + R
Sbjct: 615 AF--GGCFGRSESPQPKASR 632



 Score = 40.8 bits (94), Expect = 0.002
 Identities = 37/121 (30%), Positives = 50/121 (41%), Gaps = 7/121 (5%)

Query: 28  EKMRAKQNPP--GPAPPGGGSSDAA--GKPPAGALGTPAAAAANELNNNLPGGAPAAPAV 83
           +K    + PP    A    G++DAA  GK P+   G+PAA AA+   +     APAAPA 
Sbjct: 344 DKRERAERPPVEEEAAEMEGAADAAEGGKVPSPGYGSPAAGAASA--DTAARAAPAAPAD 401

Query: 84  PGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPS 143
           P  G       S     A P       +P + +A AAP        PD   +    + P 
Sbjct: 402 PDSGATPEDPDSG-TAPADPDSGAFAADPDSGAAPAAPADPDSGAAPDAPADPDSGAAPD 460

Query: 144 A 144
           A
Sbjct: 461 A 461


>gi|91208420 bassoon protein [Homo sapiens]
          Length = 3926

 Score = 48.1 bits (113), Expect = 1e-05
 Identities = 43/141 (30%), Positives = 58/141 (41%), Gaps = 32/141 (22%)

Query: 38  GPAPPGG-------GSSDAAGKPPAGALG---TPAAAAANELNNNLPGGAPAAPAVPGPG 87
           GP PPGG       G    AGKPP+   G    PAA AA          + A P VPGPG
Sbjct: 14  GPLPPGGAGPGPGPGPGPGAGKPPSAPAGGGQLPAAGAAR---------STAVPPVPGPG 64

Query: 88  -GVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDG--VPEKGKSSGPSA 144
            G     G    +R     R    EP     +A+P P++      G   P + ++ GP+ 
Sbjct: 65  PGPGPGPGPGSTSR-----RLDPKEPLGNQRAASPTPKQASATTPGHESPRETRAQGPAG 119

Query: 145 RKGKG-----QIEKRKLREKR 160
           ++  G     Q++ R  R  R
Sbjct: 120 QEADGPRRTLQVDSRTQRSGR 140



 Score = 33.1 bits (74), Expect = 0.37
 Identities = 37/140 (26%), Positives = 50/140 (35%), Gaps = 26/140 (18%)

Query: 25  AKREKMRAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVP 84
           A  E  R    PP P  P    S A  +PPAG     +A A       +P G  A     
Sbjct: 295 APPEVGRVSPQPPQPTKP----STAEPRPPAGEAPAKSATA-------VPAGLGATEQTQ 343

Query: 85  -GPGGVNCAVGSAMLTRAA--------------PGPRRSEDEPPAASASAAPPPQRDEEE 129
            G  G    +G+++LT+A+              P P +   +     AS    P+     
Sbjct: 344 EGLTGKLFGLGASLLTQASTLMSVQPEADTQGQPAPSKGTPKIVFNDASKEAGPKPLGSG 403

Query: 130 PDGVPEKGKSSGPSARKGKG 149
           P   P  G  + P AR G G
Sbjct: 404 PGPGPAPGAKTEPGARMGPG 423



 Score = 33.1 bits (74), Expect = 0.37
 Identities = 31/135 (22%), Positives = 51/135 (37%), Gaps = 10/135 (7%)

Query: 76   GAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEP-DGVP 134
            G PAA   PG GG +         R  P P  +      A+ + AP P   ++ P D  P
Sbjct: 2283 GKPAAAKAPGAGGPSRPEMPVGAAREEPLPTTTPAAIKEAAGAPAPAPLAGQKPPADAAP 2342

Query: 135  EKGKS--SGPSARKGKGQIEKRK-------LREKRRSTGVVNIPAAECLDEYEDDEAGQK 185
              G    S P   K +   E+R+       L+ +R    +  +      +E E +    +
Sbjct: 2343 GGGSGALSRPGFEKEEASQEERQRKQQEQLLQLERERVELEKLRQLRLQEELERERVELQ 2402

Query: 186  ERKREDAITQQNTIQ 200
              + E+ +  Q  +Q
Sbjct: 2403 RHREEEQLLVQRELQ 2417



 Score = 32.7 bits (73), Expect = 0.48
 Identities = 24/84 (28%), Positives = 28/84 (33%), Gaps = 11/84 (13%)

Query: 72  NLPGGAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPD 131
           +L GGA   P  PG  G              PGP     +PP+A A     P        
Sbjct: 6   SLEGGAGDGPLPPGGAGPG----------PGPGPGPGAGKPPSAPAGGGQLPAAGAARST 55

Query: 132 GVPE-KGKSSGPSARKGKGQIEKR 154
            VP   G   GP    G G   +R
Sbjct: 56  AVPPVPGPGPGPGPGPGPGSTSRR 79



 Score = 32.0 bits (71), Expect = 0.82
 Identities = 16/44 (36%), Positives = 19/44 (43%)

Query: 34   QNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGA 77
            Q  PGP P G  +    G  P    G P A   +  +  LPGGA
Sbjct: 3861 QPAPGPGPAGVKAGARPGGTPGAPAGQPGADGESVFSKILPGGA 3904



 Score = 31.6 bits (70), Expect = 1.1
 Identities = 32/136 (23%), Positives = 52/136 (38%), Gaps = 14/136 (10%)

Query: 50   AGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAVGSA---------MLTR 100
            A KP A  LG   A   ++     P  APA P    PG  + A  S            + 
Sbjct: 3666 AAKPHARDLGRHEARPHSQ-----PSSAPAMPKKGQPGYPSSAEYSQPSRASSAYHHASD 3720

Query: 101  AAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKGKGQIEKRKLREKR 160
            +  G R++   P A  + A P  Q   +     P   +S  PS+R+       R+ + ++
Sbjct: 3721 SKKGSRQAHSGPAALQSKAEPQAQPQLQGRQAAPGPQQSQSPSSRQIPSGAASRQPQTQQ 3780

Query: 161  RSTGVVNIPAAECLDE 176
            +  G+   P  + L +
Sbjct: 3781 QQQGLGLQPPQQALTQ 3796



 Score = 30.4 bits (67), Expect = 2.4
 Identities = 25/89 (28%), Positives = 33/89 (37%), Gaps = 3/89 (3%)

Query: 58  LGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAVGSAMLTRA--APGPRRSEDEPPAA 115
           L +PA + A+      P G P      GPGG       A   RA   PGP ++   P   
Sbjct: 242 LHSPALSPAHSPAKQ-PLGKPDQERSRGPGGPQPGSRQAETARATSVPGPAQAAAPPEVG 300

Query: 116 SASAAPPPQRDEEEPDGVPEKGKSSGPSA 144
             S  PP        +  P  G++   SA
Sbjct: 301 RVSPQPPQPTKPSTAEPRPPAGEAPAKSA 329



 Score = 28.5 bits (62), Expect = 9.0
 Identities = 23/68 (33%), Positives = 24/68 (35%), Gaps = 9/68 (13%)

Query: 32   AKQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLP----GGAPAAPAVPGPG 87
            A   P G   PG   S A G  PAG    P A   N           G AP A   PGPG
Sbjct: 3813 AASQPAGKPQPG--PSTATGPQPAGP---PRAEQTNGSKGTAKAPQQGRAPQAQPAPGPG 3867

Query: 88   GVNCAVGS 95
                  G+
Sbjct: 3868 PAGVKAGA 3875


>gi|122937321 UNC homeobox [Homo sapiens]
          Length = 531

 Score = 48.1 bits (113), Expect = 1e-05
 Identities = 43/131 (32%), Positives = 55/131 (41%), Gaps = 26/131 (19%)

Query: 33  KQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPA-VPGP----- 86
           K  PP PA P    + A+    +G  G P +A A    + +   +P APA  P P     
Sbjct: 407 KDAPPAPAVPPAPPAQASFGAFSGPGGAPDSAFARRSPDAV--ASPGAPAPAPAPFRDLA 464

Query: 87  -------GGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPD--GVPEKG 137
                  GG +CA        A P P      PPA S    P P    EEP   GVPE G
Sbjct: 465 SAAATEGGGGDCADAGT----AGPAP-----PPPAPSPRPGPRPPSPAEEPATCGVPEPG 515

Query: 138 KSSGPSARKGK 148
            ++GPS  +G+
Sbjct: 516 AAAGPSPPEGE 526



 Score = 32.0 bits (71), Expect = 0.82
 Identities = 36/103 (34%), Positives = 44/103 (42%), Gaps = 18/103 (17%)

Query: 36  PPGPAPPGGGSSDAAGKPPAGALGTPA------AAAANELNNNLPGGAPAAPAVP----- 84
           PP    PG  +S AAG  PA     PA      A   ++ +   P   P  PA       
Sbjct: 251 PPAAKGPGAHASGAAGTAPAPPGEPPAPGTCDPAFYPSQRSGAGPQPRPGRPADKDAASC 310

Query: 85  GPGGVNCAV--GSAMLTRAAPGPRRS--EDEPP---AASASAA 120
           GPG    AV  G+A L +A+P    S   D PP   AAS +AA
Sbjct: 311 GPGAAVAAVERGAAGLPKASPFSVESLLSDSPPRRKAASNAAA 353



 Score = 30.4 bits (67), Expect = 2.4
 Identities = 35/129 (27%), Positives = 46/129 (35%), Gaps = 27/129 (20%)

Query: 20  LEEWKAKREKMRAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPA 79
           +E+ K K EK   K        PGG S  +A    + + G   +    E         P 
Sbjct: 200 MEKKKRKHEKKLLKSQGRHLHSPGGLSLHSAPSSDSDSGGGGLSPEPPE---------PP 250

Query: 80  APAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKS 139
            PA  GPG    A G+A    A PG      EPPA              +P   P +   
Sbjct: 251 PPAAKGPGAH--ASGAAGTAPAPPG------EPPAPGTC----------DPAFYPSQRSG 292

Query: 140 SGPSARKGK 148
           +GP  R G+
Sbjct: 293 AGPQPRPGR 301


>gi|90819237 myeloid/lymphoid or mixed-lineage leukemia (trithorax
            homolog, Drosophila); translocated to, 4 isoform 1 [Homo
            sapiens]
          Length = 1834

 Score = 47.8 bits (112), Expect = 1e-05
 Identities = 71/318 (22%), Positives = 126/318 (39%), Gaps = 28/318 (8%)

Query: 7    RTSSGLGGSTTDFLEEWKAKREKMRAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAA 66
            ++ S +  + +  L+   + +E +        PA     S     K PA    TP A + 
Sbjct: 1298 KSDSDMWINQSSSLDSSTSSQEHLNHSSKSVTPASTLTKSGPGRWKTPAAIPATPVAVS- 1356

Query: 67   NELNNNLPGGAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRD 126
              +  +LP   P  P V   G  +   G +M     P P  ++   P+A  +AA   +R+
Sbjct: 1357 QPIRTDLPP-PPPPPPVHYAGDFD---GMSMDLPLPPPPSANQIGLPSAQVAAAERRKRE 1412

Query: 127  EEEPDGVPEKGKSSGPSARKGKGQIEKRKLREKRRSTGVVNIPAAECLDEYEDDEAGQKE 186
            E +     EK +           +  +RK RE+ R  G +     + L+         ++
Sbjct: 1413 EHQRWYEKEKARLE---------EERERKRREQERKLGQMR---TQSLNPAPFSPLTAQQ 1460

Query: 187  RKREDAITQQNTIQNEAVNLLDPGSSYLLQEPPRTVSGRYKSTTSVSEEDVSSRYSRTDR 246
             K E   T Q   Q   +  L P      Q+ PRT+  R     +VS+E++SS  S +  
Sbjct: 1461 MKPEKPSTLQRP-QETVIRELQP------QQQPRTIERRDLQYITVSKEELSSGDSLSPD 1513

Query: 247  SGFPRYNRDANVSGTLVSSSTLEKKIEDLEKEVVRERQENLRLVRLMQDKEEMIGKLKEE 306
                           +     L K+I++L+ +  R  +E+ RL +LM + +      K  
Sbjct: 1514 PWKRDAKEKLEKQQQMHIVDMLSKEIQELQSKPDRSAEESDRLRKLMLEWQFQ----KRL 1569

Query: 307  IDLLNRDLDDIEDENEQL 324
             +   +D DD E+E++ +
Sbjct: 1570 QESKQKDEDDEEEEDDDV 1587


>gi|90819233 myeloid/lymphoid or mixed-lineage leukemia (trithorax
            homolog, Drosophila); translocated to, 4 isoform 2 [Homo
            sapiens]
          Length = 1651

 Score = 47.8 bits (112), Expect = 1e-05
 Identities = 71/318 (22%), Positives = 126/318 (39%), Gaps = 28/318 (8%)

Query: 7    RTSSGLGGSTTDFLEEWKAKREKMRAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAA 66
            ++ S +  + +  L+   + +E +        PA     S     K PA    TP A + 
Sbjct: 1299 KSDSDMWINQSSSLDSSTSSQEHLNHSSKSVTPASTLTKSGPGRWKTPAAIPATPVAVS- 1357

Query: 67   NELNNNLPGGAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRD 126
              +  +LP   P  P V   G  +   G +M     P P  ++   P+A  +AA   +R+
Sbjct: 1358 QPIRTDLPP-PPPPPPVHYAGDFD---GMSMDLPLPPPPSANQIGLPSAQVAAAERRKRE 1413

Query: 127  EEEPDGVPEKGKSSGPSARKGKGQIEKRKLREKRRSTGVVNIPAAECLDEYEDDEAGQKE 186
            E +     EK +           +  +RK RE+ R  G +     + L+         ++
Sbjct: 1414 EHQRWYEKEKARLE---------EERERKRREQERKLGQMR---TQSLNPAPFSPLTAQQ 1461

Query: 187  RKREDAITQQNTIQNEAVNLLDPGSSYLLQEPPRTVSGRYKSTTSVSEEDVSSRYSRTDR 246
             K E   T Q   Q   +  L P      Q+ PRT+  R     +VS+E++SS  S +  
Sbjct: 1462 MKPEKPSTLQRP-QETVIRELQP------QQQPRTIERRDLQYITVSKEELSSGDSLSPD 1514

Query: 247  SGFPRYNRDANVSGTLVSSSTLEKKIEDLEKEVVRERQENLRLVRLMQDKEEMIGKLKEE 306
                           +     L K+I++L+ +  R  +E+ RL +LM + +      K  
Sbjct: 1515 PWKRDAKEKLEKQQQMHIVDMLSKEIQELQSKPDRSAEESDRLRKLMLEWQFQ----KRL 1570

Query: 307  IDLLNRDLDDIEDENEQL 324
             +   +D DD E+E++ +
Sbjct: 1571 QESKQKDEDDEEEEDDDV 1588


>gi|90819231 myeloid/lymphoid or mixed-lineage leukemia (trithorax
            homolog, Drosophila); translocated to, 4 isoform 3 [Homo
            sapiens]
          Length = 1612

 Score = 47.8 bits (112), Expect = 1e-05
 Identities = 71/318 (22%), Positives = 126/318 (39%), Gaps = 28/318 (8%)

Query: 7    RTSSGLGGSTTDFLEEWKAKREKMRAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAA 66
            ++ S +  + +  L+   + +E +        PA     S     K PA    TP A + 
Sbjct: 1283 KSDSDMWINQSSSLDSSTSSQEHLNHSSKSVTPASTLTKSGPGRWKTPAAIPATPVAVS- 1341

Query: 67   NELNNNLPGGAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRD 126
              +  +LP   P  P V   G  +   G +M     P P  ++   P+A  +AA   +R+
Sbjct: 1342 QPIRTDLPP-PPPPPPVHYAGDFD---GMSMDLPLPPPPSANQIGLPSAQVAAAERRKRE 1397

Query: 127  EEEPDGVPEKGKSSGPSARKGKGQIEKRKLREKRRSTGVVNIPAAECLDEYEDDEAGQKE 186
            E +     EK +           +  +RK RE+ R  G +     + L+         ++
Sbjct: 1398 EHQRWYEKEKARLE---------EERERKRREQERKLGQMR---TQSLNPAPFSPLTAQQ 1445

Query: 187  RKREDAITQQNTIQNEAVNLLDPGSSYLLQEPPRTVSGRYKSTTSVSEEDVSSRYSRTDR 246
             K E   T Q   Q   +  L P      Q+ PRT+  R     +VS+E++SS  S +  
Sbjct: 1446 MKPEKPSTLQRP-QETVIRELQP------QQQPRTIERRDLQYITVSKEELSSGDSLSPD 1498

Query: 247  SGFPRYNRDANVSGTLVSSSTLEKKIEDLEKEVVRERQENLRLVRLMQDKEEMIGKLKEE 306
                           +     L K+I++L+ +  R  +E+ RL +LM + +      K  
Sbjct: 1499 PWKRDAKEKLEKQQQMHIVDMLSKEIQELQSKPDRSAEESDRLRKLMLEWQFQ----KRL 1554

Query: 307  IDLLNRDLDDIEDENEQL 324
             +   +D DD E+E++ +
Sbjct: 1555 QESKQKDEDDEEEEDDDV 1572


>gi|117306167 proline-rich protein BstNI subfamily 3 precursor [Homo
           sapiens]
          Length = 309

 Score = 47.8 bits (112), Expect = 1e-05
 Identities = 39/116 (33%), Positives = 47/116 (40%), Gaps = 14/116 (12%)

Query: 36  PPGPAPPGGGSSDAAGKPPA-GALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAVG 94
           P GP P GG  S   G PP  G    P     N+     P   P  P  P P G N + G
Sbjct: 120 PEGPPPQGGNQSQ--GPPPRPGKPEGPPPQGGNQSQG--PPPRPGKPEGPPPQGGNQSQG 175

Query: 95  SAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKG--KSSGPSARKGK 148
                   P P + E  PP     +  PP R  + P+G P +G  +S GP  R GK
Sbjct: 176 PP------PHPGKPEGPPPQGGNQSQGPPPRPGK-PEGPPPQGGNQSQGPPPRPGK 224



 Score = 46.6 bits (109), Expect = 3e-05
 Identities = 35/115 (30%), Positives = 43/115 (37%), Gaps = 12/115 (10%)

Query: 36  PPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAVGS 95
           P G  P GG        PP    G P            P   P  P  P P G N + G 
Sbjct: 36  PEGRRPQGGNQPQRTPPPPGKPEGRPPQGGNQSQG---PPPRPGKPEGPPPQGGNQSQGP 92

Query: 96  AMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKG--KSSGPSARKGK 148
                  P P + E +PP     +  PP R  + P+G P +G  +S GP  R GK
Sbjct: 93  P------PRPGKPEGQPPQGGNQSQGPPPRPGK-PEGPPPQGGNQSQGPPPRPGK 140



 Score = 43.9 bits (102), Expect = 2e-04
 Identities = 35/118 (29%), Positives = 42/118 (35%), Gaps = 10/118 (8%)

Query: 36  PPGPAPPGGGSSDAAGKPPA-GALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAVG 94
           P GP P GG  S   G PP  G    P     N+     P   P  P  P P G N + G
Sbjct: 141 PEGPPPQGGNQSQ--GPPPRPGKPEGPPPQGGNQSQG--PPPHPGKPEGPPPQGGNQSQG 196

Query: 95  SAMLTRAAPGP-----RRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKG 147
                    GP      +S+  PP       PP Q   +     P  GK  GP  ++G
Sbjct: 197 PPPRPGKPEGPPPQGGNQSQGPPPRPGKPEGPPSQGGNKPQGPPPHPGKPQGPPPQEG 254



 Score = 43.5 bits (101), Expect = 3e-04
 Identities = 36/123 (29%), Positives = 47/123 (38%), Gaps = 16/123 (13%)

Query: 33  KQNPPGPA-----PPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPG 87
           ++ PP P      PP GG+      P  G    P     N+     P   P  P    P 
Sbjct: 48  QRTPPPPGKPEGRPPQGGNQSQGPPPRPGKPEGPPPQGGNQSQG--PPPRPGKPEGQPPQ 105

Query: 88  GVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKG--KSSGPSAR 145
           G N + G        P P + E  PP     +  PP R  + P+G P +G  +S GP  R
Sbjct: 106 GGNQSQGPP------PRPGKPEGPPPQGGNQSQGPPPRPGK-PEGPPPQGGNQSQGPPPR 158

Query: 146 KGK 148
            GK
Sbjct: 159 PGK 161



 Score = 43.1 bits (100), Expect = 4e-04
 Identities = 36/121 (29%), Positives = 43/121 (35%), Gaps = 11/121 (9%)

Query: 36  PPGPAPPGGGSSDAAGKPP-AGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAVG 94
           P GP P GG  S   G PP  G    P     N+     P   P  P  P P G N + G
Sbjct: 162 PEGPPPQGGNQSQ--GPPPHPGKPEGPPPQGGNQSQG--PPPRPGKPEGPPPQGGNQSQG 217

Query: 95  SAMLTRAAPGP-----RRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKGKG 149
                    GP      + +  PP       PPPQ +  +P   P  G+  GP    G  
Sbjct: 218 PPPRPGKPEGPPSQGGNKPQGPPPHPGKPQGPPPQ-EGNKPQRPPPPGRPQGPPPPGGNP 276

Query: 150 Q 150
           Q
Sbjct: 277 Q 277



 Score = 36.6 bits (83), Expect = 0.033
 Identities = 33/112 (29%), Positives = 35/112 (31%), Gaps = 9/112 (8%)

Query: 36  PPGPAPPGGGSSDAAGKPPA-GALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAVG 94
           P GP P GG  S   G PP  G    P     N+     P   P  P  P   G N   G
Sbjct: 183 PEGPPPQGGNQSQ--GPPPRPGKPEGPPPQGGNQSQG--PPPRPGKPEGPPSQGGNKPQG 238

Query: 95  SAMLTRAAPGPRRSE----DEPPAASASAAPPPQRDEEEPDGVPEKGKSSGP 142
                    GP   E      PP       PPP     +    P  GK  GP
Sbjct: 239 PPPHPGKPQGPPPQEGNKPQRPPPPGRPQGPPPPGGNPQQPLPPPAGKPQGP 290



 Score = 32.7 bits (73), Expect = 0.48
 Identities = 29/105 (27%), Positives = 30/105 (28%), Gaps = 22/105 (20%)

Query: 36  PPGPAPPGGGSSDAAGKP--PAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAV 93
           PP P  P G  S    KP  P    G P      E N       P  P  P P       
Sbjct: 219 PPRPGKPEGPPSQGGNKPQGPPPHPGKPQGPPPQEGNKPQRPPPPGRPQGPPP------- 271

Query: 94  GSAMLTRAAPGPRRSEDEPPAASASAAPPPQ----RDEEEPDGVP 134
                    PG    +  PP A     PPP     R    P G P
Sbjct: 272 ---------PGGNPQQPLPPPAGKPQGPPPPPQGGRPHRPPQGQP 307


>gi|4502951 collagen type III alpha 1 preproprotein [Homo sapiens]
          Length = 1466

 Score = 47.4 bits (111), Expect = 2e-05
 Identities = 46/153 (30%), Positives = 54/153 (35%), Gaps = 40/153 (26%)

Query: 34  QNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPG--------GAPAAP---- 81
           Q PPGP  PGG   D     P G  G P         N  PG        GAP AP    
Sbjct: 617 QGPPGPTGPGGDKGDTGPPGPQGLQGLPGTGGPPG-ENGKPGEPGPKGDAGAPGAPGGKG 675

Query: 82  --AVPGPGGVNCAVGSAMLTRAA--PGPRRSEDE-----PPAASAS-------------A 119
               PG  G     G+  L   A  PGP   +       PP A+ +              
Sbjct: 676 DAGAPGERGPPGLAGAPGLRGGAGPPGPEGGKGAAGPPGPPGAAGTPGLQGMPGERGGLG 735

Query: 120 APPPQRDEEEP-----DGVPEKGKSSGPSARKG 147
           +P P+ D+ EP     DGVP K    GP+   G
Sbjct: 736 SPGPKGDKGEPGGPGADGVPGKDGPRGPTGPIG 768



 Score = 44.3 bits (103), Expect = 2e-04
 Identities = 39/136 (28%), Positives = 46/136 (33%), Gaps = 38/136 (27%)

Query: 36   PPGPAPPGGGSSDAAGKPPAGALGTPA-----------------------AAAANELNNN 72
            PPGP  P G S D     PAG  G P                         AA  + +  
Sbjct: 1051 PPGPVGPAGKSGDRGESGPAGPAGAPGPAGSRGAPGPQGPRGDKGETGERGAAGIKGHRG 1110

Query: 73   LPGGAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDG 132
             PG  P AP  PGP G   A+GS       PGP       P      + PP +D     G
Sbjct: 1111 FPGN-PGAPGSPGPAGQQGAIGS-------PGP-----AGPRGPVGPSGPPGKD--GTSG 1155

Query: 133  VPEKGKSSGPSARKGK 148
             P      GP   +G+
Sbjct: 1156 HPGPIGPPGPRGNRGE 1171



 Score = 43.9 bits (102), Expect = 2e-04
 Identities = 33/130 (25%), Positives = 45/130 (34%), Gaps = 15/130 (11%)

Query: 36  PPGPAPPGGGSSDAAGKPPAGALGTPAA-AAANELNNNLPGGAPAAPAVPGPGGVNCAVG 94
           PPGP    G       K   G  G+P +  A  +     P G   A   PGP G+N + G
Sbjct: 334 PPGPPGTAGFPGSPGAKGEVGPAGSPGSNGAPGQRGEPGPQGHAGAQGPPGPPGINGSPG 393

Query: 95  SA--------------MLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSS 140
                           M  R  PGP  +   P     +  P     + EP    E+G++ 
Sbjct: 394 GKGEMGPAGIPGAPGLMGARGPPGPAGANGAPGLRGGAGEPGKNGAKGEPGPRGERGEAG 453

Query: 141 GPSARKGKGQ 150
            P     KG+
Sbjct: 454 IPGVPGAKGE 463



 Score = 43.9 bits (102), Expect = 2e-04
 Identities = 39/140 (27%), Positives = 53/140 (37%), Gaps = 26/140 (18%)

Query: 31   RAKQNPPG----PAPPGGGSSDAAGKPP-----AGALGTPAAAA----ANELNNNLPGGA 77
            R    PPG    P PPG   S     PP      GA G+P  +     A +       GA
Sbjct: 878  RGLPGPPGSNGNPGPPGPSGSPGKDGPPGPAGNTGAPGSPGVSGPKGDAGQPGEKGSPGA 937

Query: 78   PAAPAVPGPGGVNCAVGSAMLT--------RAAPGPR--RSEDEPPAASASAAPPPQRDE 127
               P  PGP G+    G+  L         R +PGP+  + E   P A+  +    +R  
Sbjct: 938  QGPPGAPGPLGIAGITGARGLAGPPGMPGPRGSPGPQGVKGESGKPGANGLSG---ERGP 994

Query: 128  EEPDGVPEKGKSSGPSARKG 147
              P G+P    ++G   R G
Sbjct: 995  PGPQGLPGLAGTAGEPGRDG 1014



 Score = 43.5 bits (101), Expect = 3e-04
 Identities = 37/124 (29%), Positives = 47/124 (37%), Gaps = 19/124 (15%)

Query: 31  RAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPG--GAPAAPAVPGPGG 88
           + +  PPG A P GGS  A    P G  G   +          PG  G P A  +PGP G
Sbjct: 833 KGEGGPPGVAGPPGGSGPAGPPGPQGVKGERGSPGG-------PGAAGFPGARGLPGPPG 885

Query: 89  VNCAVGSAMLTRAAPGPRRS--EDEPPAASASAAPPPQRDEEEPDG-VPEKGKSSGPSAR 145
            N   G        PGP  S  +D PP  + +   P       P G   + G+   P A+
Sbjct: 886 SNGNPG-------PPGPSGSPGKDGPPGPAGNTGAPGSPGVSGPKGDAGQPGEKGSPGAQ 938

Query: 146 KGKG 149
              G
Sbjct: 939 GPPG 942



 Score = 42.0 bits (97), Expect = 8e-04
 Identities = 34/120 (28%), Positives = 42/120 (35%), Gaps = 12/120 (10%)

Query: 33  KQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCA 92
           K  P GP   G    D   + P G +G P  A   +  +   GGAP  P + GP G    
Sbjct: 743 KGEPGGPGADGVPGKDGP-RGPTGPIGPPGPAG--QPGDKGEGGAPGLPGIAGPRGSPGE 799

Query: 93  VGSAMLTRAAPGPRRSEDEP-----PAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKG 147
            G        PGP      P     P        P ++ E  P GV      SGP+   G
Sbjct: 800 RGET----GPPGPAGFPGAPGQNGEPGGKGERGAPGEKGEGGPPGVAGPPGGSGPAGPPG 855



 Score = 42.0 bits (97), Expect = 8e-04
 Identities = 37/121 (30%), Positives = 42/121 (34%), Gaps = 20/121 (16%)

Query: 34  QNPPGPAPPGGGSSDAAGKPPAGALGTPAAA----AANELNNNLPGGAPAAPAVPGPGGV 89
           + P GP  P G +     K   GA G P  A    +  E     P G    P  PG  G 
Sbjct: 761 RGPTGPIGPPGPAGQPGDKGEGGAPGLPGIAGPRGSPGERGETGPPGPAGFPGAPGQNGE 820

Query: 90  NCAVGSAMLTRAAPGPRRSEDE-----PPAASASAAPPPQRDEEEPDGVP-EKGKSSGPS 143
               G     R APG +          PP  S  A PP       P GV  E+G   GP 
Sbjct: 821 PGGKGE----RGAPGEKGEGGPPGVAGPPGGSGPAGPP------GPQGVKGERGSPGGPG 870

Query: 144 A 144
           A
Sbjct: 871 A 871



 Score = 38.9 bits (89), Expect = 0.007
 Identities = 38/136 (27%), Positives = 51/136 (37%), Gaps = 13/136 (9%)

Query: 21   EEWKAKREKMRAKQNPPGPAP-PG----GGSSDAAGKPPAGAL----GTPAAAAANELNN 71
            E  K     +  ++ PPGP   PG     G     G P +  L    G+P      +   
Sbjct: 979  ESGKPGANGLSGERGPPGPQGLPGLAGTAGEPGRDGNPGSDGLPGRDGSPGGKG--DRGE 1036

Query: 72   NLPGGAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPD 131
            N   GAP AP  PGP G     G +   R   GP      P  A +  AP PQ    +  
Sbjct: 1037 NGSPGAPGAPGHPGPPGPVGPAGKSG-DRGESGPAGPAGAPGPAGSRGAPGPQGPRGDKG 1095

Query: 132  GVPEKGKSSGPSARKG 147
               E+G ++G    +G
Sbjct: 1096 ETGERG-AAGIKGHRG 1110



 Score = 37.7 bits (86), Expect = 0.015
 Identities = 35/142 (24%), Positives = 47/142 (33%), Gaps = 33/142 (23%)

Query: 39  PAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPG--GAPAAPAVPGPGGV--NCAVG 94
           P PP   +    G+ P G  G P        N + PG  G P +P  PGP G+  +C  G
Sbjct: 89  PQPPTAPTRPPNGQGPQGPKGDPGPPGIPGRNGD-PGIPGQPGSPGSPGPPGICESCPTG 147

Query: 95  SAMLTR--------------------------AAPGPRRSEDEP--PAASASAAPPPQRD 126
               +                             PGP  +   P  P +     PP +  
Sbjct: 148 PQNYSPQYDSYDVKSGVAVGGLAGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGPPGEPG 207

Query: 127 EEEPDGVPEKGKSSGPSARKGK 148
           +  P G P    + GPS   GK
Sbjct: 208 QAGPSGPPGPPGAIGPSGPAGK 229



 Score = 37.4 bits (85), Expect = 0.019
 Identities = 46/201 (22%), Positives = 65/201 (32%), Gaps = 36/201 (17%)

Query: 5   GYRTSSGLGGSTTDFLEEWKAKREKMRAKQNPPGPAPPGGGSSDAAGKP----PAGALGT 60
           G+R   G  G   +           +  +   PGP  P G   +  G+P     AGA G 
Sbjct: 264 GHRGFDGRNGEKGETGAPGLKGENGLPGENGAPGPMGPRGAPGER-GRPGLPGAAGARGN 322

Query: 61  PAAAAANELNNNLPGGAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAA 120
             A  ++      P G P     PG  G    VG A    +   P +  +  P   A A 
Sbjct: 323 DGARGSDGQPG--PPGPPGTAGFPGSPGAKGEVGPAGSPGSNGAPGQRGEPGPQGHAGAQ 380

Query: 121 PPP----------QRDEEEPDGVP-------------EKGKSSGPSARKGKGQIEKRKL- 156
            PP           + E  P G+P               G +  P  R G G+  K    
Sbjct: 381 GPPGPPGINGSPGGKGEMGPAGIPGAPGLMGARGPPGPAGANGAPGLRGGAGEPGKNGAK 440

Query: 157 -----REKRRSTGVVNIPAAE 172
                R +R   G+  +P A+
Sbjct: 441 GEPGPRGERGEAGIPGVPGAK 461



 Score = 37.4 bits (85), Expect = 0.019
 Identities = 37/135 (27%), Positives = 50/135 (37%), Gaps = 25/135 (18%)

Query: 33  KQNPPGPAPPGGGSSDAAGKPPAGALGTPAAA----AANELNNNLPGGAPAAPAVPG-PG 87
           ++  PG   P G +     K PAG  G P  A    AA E   +   G P    +PG PG
Sbjct: 484 ERGAPGFRGPAGPNGIPGEKGPAGERGAPGPAGPRGAAGEPGRDGVPGGPGMRGMPGSPG 543

Query: 88  GVNCAVGSAMLTRAAPGPRRSEDE-----PPAASASAAPP-------PQRDEEEPDGVPE 135
           G          +   PGP  S+ E     PP  S     P       P+ ++  P    E
Sbjct: 544 GPG--------SDGKPGPPGSQGESGRPGPPGPSGPRGQPGVMGFPGPKGNDGAPGKNGE 595

Query: 136 KGKSSGPSARKGKGQ 150
           +G   GP  +   G+
Sbjct: 596 RGGPGGPGPQGPPGK 610


>gi|148806928 hypothetical protein LOC57482 [Homo sapiens]
          Length = 1233

 Score = 47.4 bits (111), Expect = 2e-05
 Identities = 59/251 (23%), Positives = 97/251 (38%), Gaps = 24/251 (9%)

Query: 28   EKMRAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPG 87
            E  +A    P    P       +  PPA  L   +     EL +   G     P+ P   
Sbjct: 947  EHDKAANKMPLAQKPALAPKPTSQTPPASPLSKLSRPYLVELLSRRAGRPDPEPSEPSKE 1006

Query: 88   GVNCAVGSAMLTRAAPGP------RRSEDEPPAASASAAPP-PQRDEEEPDGVPEKGKSS 140
                   S     + PGP      +R E+E       A+PP P   +E+P   PE G+  
Sbjct: 1007 DQE---SSDRRPPSPPGPEERKGQKRDEEEEATERKPASPPLPATQQEKPSQTPEAGRKE 1063

Query: 141  GPSARKGKGQIEKRKLREKRRSTGVVNIP-AAECLDEYEDDEAGQKERKREDAITQQNTI 199
             P   + +  ++  KL EK  +   + I  A +    + + +A ++ERK+     Q   +
Sbjct: 1064 KPML-QSRHSLDGSKLTEKVETAQPLWITLALQKQKGFREQQATREERKQAREAKQAEKL 1122

Query: 200  QNEAVNL-LDPGSSYLLQEPPRTVSGRYKSTTSVSEEDVSSRYSRTDRSGFPRYNRDANV 258
              E V++ + PGSS + +         +KST    E+   +  SR +R            
Sbjct: 1123 SKENVSVSVQPGSSSVSR-----AGSLHKSTALPEEKRPETAVSRLER------REQLKK 1171

Query: 259  SGTLVSSSTLE 269
            + TL +S T+E
Sbjct: 1172 ANTLPTSVTVE 1182



 Score = 38.1 bits (87), Expect = 0.011
 Identities = 32/133 (24%), Positives = 55/133 (41%), Gaps = 21/133 (15%)

Query: 74   PGGAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQ--------- 124
            PG  PA+   P P     A    +  + A  P+ +   PPA+  S    P          
Sbjct: 934  PGPPPASSQTPAPEHDKAANKMPLAQKPALAPKPTSQTPPASPLSKLSRPYLVELLSRRA 993

Query: 125  -RDEEEPDGVPEKGKSS---------GPSARKGKGQIEKRKLREKRRSTGVVNIPAAECL 174
             R + EP    ++ + S         GP  RKG+ + E+ +  E++ ++    +PA +  
Sbjct: 994  GRPDPEPSEPSKEDQESSDRRPPSPPGPEERKGQKRDEEEEATERKPASPP--LPATQQE 1051

Query: 175  DEYEDDEAGQKER 187
               +  EAG+KE+
Sbjct: 1052 KPSQTPEAGRKEK 1064


>gi|171543895 ataxin 2 [Homo sapiens]
          Length = 1313

 Score = 47.0 bits (110), Expect = 2e-05
 Identities = 58/235 (24%), Positives = 83/235 (35%), Gaps = 30/235 (12%)

Query: 37  PGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGA--PAAPAVPGPGGVNCAVG 94
           PGP P         G PP+     P+A+     N N  GGA  P +  + G GG      
Sbjct: 47  PGPYPSAAPPPPGPGPPPSRQSSPPSASDCFGSNGN-GGGAFRPGSRRLLGLGGPPRPFV 105

Query: 95  SAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSG---PSARKGKGQI 151
             +L  A+PG       PPAA   A+P   R      GV     + G   P+     G +
Sbjct: 106 VLLLPLASPG------APPAAPTRASPLGARASPPRSGVSLARPAPGCPRPACEPVYGPL 159

Query: 152 EKRKLREKRRSTGVVNIPAAECLDEYEDDEAGQKERKREDAITQQNTIQNEAVNLLDPGS 211
                                   + +  +  Q++++++    QQ      A N+  PG 
Sbjct: 160 ------------------TMSLKPQQQQQQQQQQQQQQQQQQQQQQQPPPAAANVRKPGG 201

Query: 212 SYLLQEPPRTVSGRYKSTTSVSEEDVSSRYSRTDRSGFPRYNRDANVSGTLVSSS 266
           S LL  P    S    S +S S    SS  + T   G P   R  N +  L  S+
Sbjct: 202 SGLLASPAAAPSPSSSSVSSSSATAPSSVVAATSGGGRPGLGRGRNSNKGLPQST 256



 Score = 30.4 bits (67), Expect = 2.4
 Identities = 27/126 (21%), Positives = 42/126 (33%), Gaps = 1/126 (0%)

Query: 33  KQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCA 92
           +QN P    PG GS  +            + +    +N  +P  +P       P     +
Sbjct: 510 RQNSPRMGQPGSGSMPSRSTSHTSDFNPNSGSDQRVVNGGVPWPSPCPSPSSRPPSRYQS 569

Query: 93  VGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEP-DGVPEKGKSSGPSARKGKGQI 151
             +++  RAA   R     P   S   + P       P   +P++  S GP     K Q 
Sbjct: 570 GPNSLPPRAATPTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQR 629

Query: 152 EKRKLR 157
             R  R
Sbjct: 630 HPRNHR 635


>gi|110349772 alpha 1 type I collagen preproprotein [Homo sapiens]
          Length = 1464

 Score = 46.6 bits (109), Expect = 3e-05
 Identities = 40/120 (33%), Positives = 50/120 (41%), Gaps = 20/120 (16%)

Query: 30  MRAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPG--GAPAAPAVPGPG 87
           +R +  PPGPA    G++  AG P  GA G P A  A    N  PG  GAP  P   GP 
Sbjct: 369 VRGEPGPPGPA----GAAGPAGNP--GADGQPGAKGA----NGAPGIAGAPGFPGARGPS 418

Query: 88  GVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKG 147
           G     G        PGP+ +  E P A  S      + E  P GV      +G   ++G
Sbjct: 419 GPQGPGG-------PPGPKGNSGE-PGAPGSKGDTGAKGEPGPVGVQGPPGPAGEEGKRG 470



 Score = 46.6 bits (109), Expect = 3e-05
 Identities = 40/122 (32%), Positives = 46/122 (37%), Gaps = 14/122 (11%)

Query: 36   PPGPAPPGGGSSDAAGKPPAGALGTPA--AAAANELNNNLPG--GAPAAPAVPGPGGVNC 91
            PPGP  P G         PAGA GTP     A       LPG  G    P +PGP G   
Sbjct: 924  PPGPPGPAGEKGSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPG 983

Query: 92   AVG--SAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEE-EPDGVPEKGKSSGPSARKGK 148
              G   A   R  PGP      PP     A PP +   E  P      G+   P A+  +
Sbjct: 984  KQGPSGASGERGPPGPM----GPPGL---AGPPGESGREGAPGAEGSPGRDGSPGAKGDR 1036

Query: 149  GQ 150
            G+
Sbjct: 1037 GE 1038



 Score = 44.7 bits (104), Expect = 1e-04
 Identities = 41/139 (29%), Positives = 51/139 (36%), Gaps = 35/139 (25%)

Query: 33  KQNPPGPA----------PPGG-GSSDAAGKP-PAGALGTPAAAAANELNNNLPGGAPAA 80
           K  PPGPA          PPG  G +   G P P GA G P  A           G P  
Sbjct: 552 KTGPPGPAGQDGRPGPPGPPGARGQAGVMGFPGPKGAAGEPGKAGER--------GVPGP 603

Query: 81  PAVPGPGGVNCAVGSAMLTRAAPGP-----RRSEDEPPAA------SASAAPPPQRDEEE 129
           P   GP G +   G+    +  PGP      R E  P  +         A PP +  +  
Sbjct: 604 PGAVGPAGKDGEAGA----QGPPGPAGPAGERGEQGPAGSPGFQGLPGPAGPPGEAGKPG 659

Query: 130 PDGVPEKGKSSGPSARKGK 148
             GVP    + GPS  +G+
Sbjct: 660 EQGVPGDLGAPGPSGARGE 678



 Score = 43.5 bits (101), Expect = 3e-04
 Identities = 44/124 (35%), Positives = 50/124 (40%), Gaps = 16/124 (12%)

Query: 31  RAKQNPPGPA----PPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGA-PAAPAVPG 85
           R +  PPGPA    PPG     A G+P  GA G P  A A   +   PG A PA P  PG
Sbjct: 802 RGEPGPPGPAGFAGPPG-----ADGQP--GAKGEPGDAGAKG-DAGPPGPAGPAGP--PG 851

Query: 86  PGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSAR 145
           P G N     A   R + GP  +   P AA     P P  +   P      GK  G   R
Sbjct: 852 PIG-NVGAPGAKGARGSAGPPGATGFPGAAGRVGPPGPSGNAGPPGPPGPAGKEGGKGPR 910

Query: 146 KGKG 149
              G
Sbjct: 911 GETG 914



 Score = 42.4 bits (98), Expect = 6e-04
 Identities = 38/146 (26%), Positives = 50/146 (34%), Gaps = 16/146 (10%)

Query: 31  RAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVN 90
           R    PPG   P G   +A  + P G  G      A E     P G+P    +PGP G  
Sbjct: 598 RGVPGPPGAVGPAGKDGEAGAQGPPGPAG-----PAGERGEQGPAGSPGFQGLPGPAGPP 652

Query: 91  CAVG-----SAMLTRAAPGP--RRSEDEPPAASASAAPPPQRDEEEPDGVP----EKGKS 139
              G            APGP   R E   P       PP        +G P     KG +
Sbjct: 653 GEAGKPGEQGVPGDLGAPGPSGARGERGFPGERGVQGPPGPAGPRGANGAPGNDGAKGDA 712

Query: 140 SGPSARKGKGQIEKRKLREKRRSTGV 165
             P A   +G    + +  +R + G+
Sbjct: 713 GAPGAPGSQGAPGLQGMPGERGAAGL 738



 Score = 42.0 bits (97), Expect = 8e-04
 Identities = 38/120 (31%), Positives = 45/120 (37%), Gaps = 16/120 (13%)

Query: 31  RAKQNPPGPAPP---GGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPG 87
           R  Q PPGPA P    G   +   K  AGA G P +  A  L   +PG   AA  +PGP 
Sbjct: 685 RGVQGPPGPAGPRGANGAPGNDGAKGDAGAPGAPGSQGAPGL-QGMPGERGAA-GLPGPK 742

Query: 88  GVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKG 147
           G           R   GP+ ++  P          P      P G P     SGPS   G
Sbjct: 743 G----------DRGDAGPKGADGSPGKDGVRGLTGP-IGPPGPAGAPGDKGESGPSGPAG 791



 Score = 42.0 bits (97), Expect = 8e-04
 Identities = 37/124 (29%), Positives = 43/124 (34%), Gaps = 16/124 (12%)

Query: 36  PPGPAPPGG-----------GSSDAAGKP-PAGALGTPAAAAANELNNNLPGGAPAAPAV 83
           PPGPA P G           G+  A G   P GA G P AA    +    P G    P  
Sbjct: 840 PPGPAGPAGPPGPIGNVGAPGAKGARGSAGPPGATGFPGAAG--RVGPPGPSGNAGPPGP 897

Query: 84  PGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPS 143
           PGP G     G    T   P  R  E  PP     A        + P G P      G +
Sbjct: 898 PGPAGKEGGKGPRGET--GPAGRPGEVGPPGPPGPAGEKGSPGADGPAGAPGTPGPQGIA 955

Query: 144 ARKG 147
            ++G
Sbjct: 956 GQRG 959



 Score = 40.8 bits (94), Expect = 0.002
 Identities = 36/124 (29%), Positives = 46/124 (37%), Gaps = 17/124 (13%)

Query: 33  KQNPPGPAPPGGGSSDA-----AGKPPA-GALGTPAAAAAN-ELNNNLPGGAPAAPAVPG 85
           ++  PGPA P G   +A     AG P A G  G+P +   + +     P G    P  PG
Sbjct: 510 ERGSPGPAGPKGSPGEAGRPGEAGLPGAKGLTGSPGSPGPDGKTGPPGPAGQDGRPGPPG 569

Query: 86  PGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSAR 145
           P G     G        PGP+ +  EP  A     P P      P  V   GK     A+
Sbjct: 570 PPGARGQAG----VMGFPGPKGAAGEPGKAGERGVPGP------PGAVGPAGKDGEAGAQ 619

Query: 146 KGKG 149
              G
Sbjct: 620 GPPG 623



 Score = 40.4 bits (93), Expect = 0.002
 Identities = 45/156 (28%), Positives = 52/156 (33%), Gaps = 22/156 (14%)

Query: 5    GYRTSSGLGGSTTDFLEEWKAKREKMRAKQNPPGPAPPGG-----GSSDAAGKPPA---- 55
            G R   G  G      E  K        ++ PPGP  P G     G S   G P A    
Sbjct: 965  GQRGERGFPGLPGPSGEPGKQGPSGASGERGPPGPMGPPGLAGPPGESGREGAPGAEGSP 1024

Query: 56   GALGTPAAAA-ANELNNNLPGGAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPPA 114
            G  G+P A     E     P GAP AP  PGP G            A     R E  P  
Sbjct: 1025 GRDGSPGAKGDRGETGPAGPPGAPGAPGAPGPVG-----------PAGKSGDRGETGPAG 1073

Query: 115  ASASAAPPPQRDEEEPDGV-PEKGKSSGPSARKGKG 149
             +    P   R    P G   +KG++     R  KG
Sbjct: 1074 PAGPVGPVGARGPAGPQGPRGDKGETGEQGDRGIKG 1109



 Score = 38.5 bits (88), Expect = 0.009
 Identities = 34/105 (32%), Positives = 40/105 (38%), Gaps = 14/105 (13%)

Query: 34   QNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAA--NELNNNLPGGAPAAPAVPGPGGVNC 91
            Q PPGP    G    +    PAG  G P +A A   +  N LPG  P  P  PGP G   
Sbjct: 1117 QGPPGPPGSPGEQGPSGASGPAGPRGPPGSAGAPGKDGLNGLPG--PIGP--PGPRGRTG 1172

Query: 92   AVGSAMLTRAAPGPRRSEDEPPAASA----SAAPPPQRDEEEPDG 132
              G        PGP      P   SA    S  P P +++    G
Sbjct: 1173 DAGPV----GPPGPPGPPGPPGPPSAGFDFSFLPQPPQEKAHDGG 1213



 Score = 38.1 bits (87), Expect = 0.011
 Identities = 43/142 (30%), Positives = 51/142 (35%), Gaps = 30/142 (21%)

Query: 34  QNPPGPAPPGGGSSDAAGKP----------PAGALGTPAAAAA---------NELNNNLP 74
           + PPGP P   G    AGKP          P GA G P  A           + L+    
Sbjct: 220 RGPPGP-PGKNGDDGEAGKPGRPGERGPPGPQGARGLPGTAGLPGMKGHRGFSGLDGAKG 278

Query: 75  GGAPAAP-AVPGPGGVNCAVGSAMLTRAAPGPRRSEDEP-PA-------ASASAAPPPQR 125
              PA P   PG  G N A G  M  R  PG R     P PA       A+ +A PP   
Sbjct: 279 DAGPAGPKGEPGSPGENGAPGQ-MGPRGLPGERGRPGAPGPAGARGNDGATGAAGPPGPT 337

Query: 126 DEEEPDGVPEKGKSSGPSARKG 147
               P G P    + G +  +G
Sbjct: 338 GPAGPPGFPGAVGAKGEAGPQG 359



 Score = 37.0 bits (84), Expect = 0.025
 Identities = 34/117 (29%), Positives = 40/117 (34%), Gaps = 12/117 (10%)

Query: 36  PPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAVGS 95
           P GPA P G       K  AG  G   +          P G    P  PGP G     G+
Sbjct: 336 PTGPAGPPGFPGAVGAKGEAGPQGPRGSEG--------PQGVRGEPGPPGPAGAAGPAGN 387

Query: 96  AMLTRAAPGPRRSEDEPPAASASAAPPPQ--RDEEEPDGVP-EKGKSSGPSARKGKG 149
                  PG + +   P  A A   P  +     + P G P  KG S  P A   KG
Sbjct: 388 PG-ADGQPGAKGANGAPGIAGAPGFPGARGPSGPQGPGGPPGPKGNSGEPGAPGSKG 443



 Score = 36.6 bits (83), Expect = 0.033
 Identities = 30/123 (24%), Positives = 43/123 (34%), Gaps = 8/123 (6%)

Query: 27  REKMRAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGP 86
           ++ +R    P GP  P G   D     P+G  G   A  A    +    G P      GP
Sbjct: 759 KDGVRGLTGPIGPPGPAGAPGDKGESGPSGPAGPTGARGAP--GDRGEPGPPGPAGFAGP 816

Query: 87  GGVNCAVGSAMLTRAAPGPRRSEDE--PPAASASAAPPPQRDEEEPDGVPEKGKSSGPSA 144
            G +   G+    +  PG   ++ +  PP  +  A PP         G      S+GP  
Sbjct: 817 PGADGQPGA----KGEPGDAGAKGDAGPPGPAGPAGPPGPIGNVGAPGAKGARGSAGPPG 872

Query: 145 RKG 147
             G
Sbjct: 873 ATG 875



 Score = 36.2 bits (82), Expect = 0.043
 Identities = 42/151 (27%), Positives = 46/151 (30%), Gaps = 41/151 (27%)

Query: 34  QNPPGPA-------------------PPG-----------GGSSDAAGKPPAGALGTPAA 63
           Q PPGPA                   PPG           G    A  K PAG  G+P  
Sbjct: 457 QGPPGPAGEEGKRGARGEPGPTGLPGPPGERGGPGSRGFPGADGVAGPKGPAGERGSPGP 516

Query: 64  A--------AANELNNNLPG--GAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPP 113
           A        A       LPG  G   +P  PGP G     G A      PGP        
Sbjct: 517 AGPKGSPGEAGRPGEAGLPGAKGLTGSPGSPGPDGKTGPPGPAG-QDGRPGPPGPPGARG 575

Query: 114 AASASAAPPPQRDEEEPDGVPEKGKSSGPSA 144
            A     P P+    EP    E+G    P A
Sbjct: 576 QAGVMGFPGPKGAAGEPGKAGERGVPGPPGA 606



 Score = 33.5 bits (75), Expect = 0.28
 Identities = 29/119 (24%), Positives = 40/119 (33%), Gaps = 20/119 (16%)

Query: 36  PPGPAPPGGGSSDAAGKPPAG--ALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAV 93
           PPGP  P G   + A +   G     T   +    +  + P G P  P  PGP G     
Sbjct: 146 PPGPPGPPGLGGNFAPQLSYGYDEKSTGGISVPGPMGPSGPRGLPGPPGAPGPQGFQ--- 202

Query: 94  GSAMLTRAAPGPRRSEDEPPAAS-----ASAAPPPQRDEEEPDGVPEKGKSSGPSARKG 147
                     GP     EP A+          PP +  ++   G P +    GP   +G
Sbjct: 203 ----------GPPGEPGEPGASGPMGPRGPPGPPGKNGDDGEAGKPGRPGERGPPGPQG 251



 Score = 30.4 bits (67), Expect = 2.4
 Identities = 32/120 (26%), Positives = 39/120 (32%), Gaps = 9/120 (7%)

Query: 34  QNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAV 93
           Q   G   P G +     + PAG  G         L    P G P  P  PG GG     
Sbjct: 105 QETTGVEGPKGDTGPRGPRGPAGPPGRDGIPGQPGLPG--PPGPPGPPGPPGLGGNFAPQ 162

Query: 94  GSAMLTR------AAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKG 147
            S           + PGP              AP PQ   + P G P +  +SGP   +G
Sbjct: 163 LSYGYDEKSTGGISVPGPMGPSGPRGLPGPPGAPGPQ-GFQGPPGEPGEPGASGPMGPRG 221


>gi|32483397 dopamine receptor D4 [Homo sapiens]
          Length = 419

 Score = 46.2 bits (108), Expect = 4e-05
 Identities = 41/111 (36%), Positives = 53/111 (47%), Gaps = 11/111 (9%)

Query: 20  LEEWK-AKREKM--RAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNL--P 74
           L+ W+ A+R K+  RA + P GP PP    +  A + P    G   A  A  L      P
Sbjct: 219 LQRWEVARRAKLHGRAPRRPSGPGPPS--PTPPAPRLPQDPCGPDCAPPAPGLPRGPCGP 276

Query: 75  GGAPAAPAVP-GPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQ 124
             APAAP++P  P G +CA  +  L    P P  S   PP A  +AA PPQ
Sbjct: 277 DCAPAAPSLPQDPCGPDCAPPAPGLP---PDPCGSNCAPPDAVRAAALPPQ 324


>gi|116256445 nuclear receptor co-repressor 2 isoform 2 [Homo
           sapiens]
          Length = 2462

 Score = 46.2 bits (108), Expect = 4e-05
 Identities = 43/188 (22%), Positives = 70/188 (37%), Gaps = 24/188 (12%)

Query: 9   SSGLGGSTTDFLEEWKAKREKMRAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANE 68
           +SG+ G+  + +EE +A        ++ P P       +   G  P   LG         
Sbjct: 707 ASGVSGNEEEMVEEAEATVNNSSDTESIPSPHTEAAKDTGQNGPKPPATLGAD------- 759

Query: 69  LNNNLPGGAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEE 128
                  G P  P  P P  +         + A   P      PPA  + +APPP   +E
Sbjct: 760 -------GPPPGPPTPPPEDIPAPTEPTPASEATGAP----TPPPAPPSPSAPPPVVPKE 808

Query: 129 EPDGVPEKGKSSGPSARKGKGQIEKRKLREKRRSTGVVNIPA-AECLDEYEDDEAGQKER 187
           E     E+  ++ P   +G+ Q +     E    TG    P  +EC +E E+  A  K+ 
Sbjct: 809 E----KEEETAAAPPVEEGEEQ-KPPAAEELAVDTGKAEEPVKSECTEEAEEGPAKGKDA 863

Query: 188 KREDAITQ 195
           +  +A  +
Sbjct: 864 EAAEATAE 871



 Score = 30.0 bits (66), Expect = 3.1
 Identities = 22/79 (27%), Positives = 28/79 (35%), Gaps = 13/79 (16%)

Query: 77   APAAPAVPGPGGVNCAVGSAM----------LTRAAPGPRRSEDEPPAAS---ASAAPPP 123
            +P  PA   P   +C +G  +          +      PR +  E P A    A  A PP
Sbjct: 1896 SPVRPAATFPPATHCPLGGTLDGVYPTLMEPVLLPKEAPRVARPERPRADTGHAFLAKPP 1955

Query: 124  QRDEEEPDGVPEKGKSSGP 142
             R   EP   P KG    P
Sbjct: 1956 ARSGLEPASSPSKGSEPRP 1974



 Score = 28.5 bits (62), Expect = 9.0
 Identities = 32/102 (31%), Positives = 38/102 (37%), Gaps = 18/102 (17%)

Query: 33   KQNPPGPAPPGG--GSSDAAGKP---PAGALGTPAAAAANELNNNLPGGAPAAPAVPGPG 87
            K  PP P PP      SDA  +P   P G   +PA  A  E    +   A AA A   PG
Sbjct: 976  KPAPPAPPPPQNLQPESDAPQQPGSSPRGKSRSPAPPADKEAEKPVFFPAFAAEAQKLPG 1035

Query: 88   GVNCAVGSAMLTRAAPGP-------RRSEDEPPAASASAAPP 122
               C       T   P P       + S   P  ++ S APP
Sbjct: 1036 DPPC------WTSGLPFPVPPREVIKASPHAPDPSAFSYAPP 1071


>gi|56550039 myeloid/lymphoid or mixed-lineage leukemia protein
           [Homo sapiens]
          Length = 3969

 Score = 45.8 bits (107), Expect = 5e-05
 Identities = 56/171 (32%), Positives = 69/171 (40%), Gaps = 22/171 (12%)

Query: 4   GGYRTSSGLGGSTTDFLEEWKAKREKMRAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAA 63
           GG     GLGG          A R+++ A   PPGP P GGG   A   PPA A    AA
Sbjct: 19  GGGGGRRGLGG----------APRQRVPALLLPPGP-PVGGGGPGAPPSPPAVAA---AA 64

Query: 64  AAANELNNNLPGGAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEP---PAASASAA 120
           AAA      +PGGA AA A       + +  S+  + A+ GP      P    A   SAA
Sbjct: 65  AAAGSSGAGVPGGAAAASAA---SSSSASSSSSSSSSASSGPALLRVGPGFDAALQVSAA 121

Query: 121 PPPQ-RDEEEPDGVPEKGKSSGPSAR-KGKGQIEKRKLREKRRSTGVVNIP 169
                R      G    G  SG   +  G G  E+ ++R   RS  V   P
Sbjct: 122 IGTNLRRFRAVFGESGGGGGSGEDEQFLGFGSDEEVRVRSPTRSPSVKTSP 172



 Score = 31.2 bits (69), Expect = 1.4
 Identities = 19/60 (31%), Positives = 29/60 (48%), Gaps = 6/60 (10%)

Query: 103  PGPRRSEDEPPAASASAAPP--PQRDEEEPDGV----PEKGKSSGPSARKGKGQIEKRKL 156
            P P   ED  P  S+S  PP  P  ++ E   V    PE  +++ P++RK   Q+ +  L
Sbjct: 1244 PTPSAREDPAPKKSSSEPPPRKPVEEKSEEGNVSAPGPESKQATTPASRKSSKQVSQPAL 1303



 Score = 29.6 bits (65), Expect = 4.0
 Identities = 31/148 (20%), Positives = 57/148 (38%), Gaps = 29/148 (19%)

Query: 105 PRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSA------RKGKGQIEKRKLRE 158
           PR+    P    +S++P P      P    E+G++   +       R     +EK K RE
Sbjct: 816 PRKQTSAPAEPFSSSSPTPLFPWFTPGSQTERGRNKDKAPEELSKDRDADKSVEKDKSRE 875

Query: 159 KRRSTGVVNIPAAECLDEYEDDEAGQKERKREDAITQQNT-------IQNEAVNLLDPGS 211
           + R              E E+    +KE++++ +  Q ++       +  E V     G 
Sbjct: 876 RDRER------------EKENKRESRKEKRKKGSEIQSSSALYPVGRVSKEKV----VGE 919

Query: 212 SYLLQEPPRTVSGRYKSTTSVSEEDVSS 239
                   +  +GR KS++  S  D++S
Sbjct: 920 DVATSSSAKKATGRKKSSSHDSGTDITS 947


>gi|56549131 transmembrane anchor protein 1 isoform 1 [Homo sapiens]
          Length = 204

 Score = 45.8 bits (107), Expect = 5e-05
 Identities = 50/159 (31%), Positives = 66/159 (41%), Gaps = 34/159 (21%)

Query: 36  PPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPG-PGGVNCAVG 94
           PP PAPP    ++A G P       P+   A E     P  +PA P  PG P G+     
Sbjct: 41  PPEPAPP----AEATGAP------APSRPCAPE-----PAASPAGPEEPGEPAGL----- 80

Query: 95  SAMLTRAAPGPRRSEDEPPAASASAAPP--PQRDEEEPDGVPEKGKSS-GPSARKGKG-- 149
             +   A PG      +P AA A A       R EEE D   EKG SS GP    G+G  
Sbjct: 81  GELGEPAGPGEPEGPGDPAAAPAEAEEQAVEARQEEEQDLDGEKGPSSEGPEEEDGEGFS 140

Query: 150 -QIEKRKLREKRRSTGVVNIPAAECLDEYEDDEAGQKER 187
            +    KLR  +    +         +E E+++  QKE+
Sbjct: 141 FKYSPGKLRGNQYKKMMTK-------EELEEEQRVQKEQ 172



 Score = 35.8 bits (81), Expect = 0.056
 Identities = 29/100 (29%), Positives = 40/100 (40%), Gaps = 23/100 (23%)

Query: 101 AAPGPRRSEDEP-PAASASAAPPPQR--------------DEEEPDGVPEKGKSSGPSAR 145
           A+P P R+  EP P A A+ AP P R              +  EP G+ E G+ +GP   
Sbjct: 33  ASPEPARAPPEPAPPAEATGAPAPSRPCAPEPAASPAGPEEPGEPAGLGELGEPAGPGEP 92

Query: 146 KGKGQIEKRKLREKRRSTGVVNIPAAECLDEYEDDEAGQK 185
           +G G        +   +       A E   E E D  G+K
Sbjct: 93  EGPG--------DPAAAPAEAEEQAVEARQEEEQDLDGEK 124


>gi|111118976 collagen, type II, alpha 1 isoform 1 precursor [Homo
           sapiens]
          Length = 1487

 Score = 45.4 bits (106), Expect = 7e-05
 Identities = 45/164 (27%), Positives = 59/164 (35%), Gaps = 11/164 (6%)

Query: 11  GLGGSTTDFLE---EWKAKREKMRAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAAN 67
           GLGG+    +    + KA   ++   Q P GP  P G    A    P G  G P      
Sbjct: 174 GLGGNFAAQMAGGFDEKAGGAQLGVMQGPMGPMGPRGPPGPAGAPGPQGFQGNPGEPGEP 233

Query: 68  ELNNNL-PGGAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEP-----PAASASAAP 121
            ++  + P G P  P  PG  G     G A   R  PGP+ +   P     P        
Sbjct: 234 GVSGPMGPRGPPGPPGKPGDDGEAGKPGKAG-ERGPPGPQGARGFPGTPGLPGVKGHRGY 292

Query: 122 PPQRDEEEPDGVP-EKGKSSGPSARKGKGQIEKRKLREKRRSTG 164
           P     +   G P  KG+S  P      G +  R L  +R  TG
Sbjct: 293 PGLDGAKGEAGAPGVKGESGSPGENGSPGPMGPRGLPGERGRTG 336



 Score = 45.4 bits (106), Expect = 7e-05
 Identities = 38/122 (31%), Positives = 44/122 (36%), Gaps = 15/122 (12%)

Query: 31  RAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAAA----AANELNNNLPGGAPAAPAVPGP 86
           R     PGPA P G    A G    GA G    A    A        P G P  P  PGP
Sbjct: 344 RGNDGQPGPAGPPGPVGPAGGPGFPGAPGAKGEAGPTGARGPEGAQGPRGEPGTPGSPGP 403

Query: 87  GGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARK 146
            G +   G    T   PG + S   P  A A   P P+       G P    ++GP   K
Sbjct: 404 AGASGNPG----TDGIPGAKGSAGAPGIAGAPGFPGPR-------GPPGPQGATGPLGPK 452

Query: 147 GK 148
           G+
Sbjct: 453 GQ 454



 Score = 43.1 bits (100), Expect = 4e-04
 Identities = 37/129 (28%), Positives = 45/129 (34%), Gaps = 23/129 (17%)

Query: 36  PPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAVGS 95
           PPGPA P G   +     P+G  G P             GG P    VPG  G    VG 
Sbjct: 643 PPGPAGPAGERGEQGAPGPSGFQGLPGPPGPPG-----EGGKPGDQGVPGEAGAPGLVGP 697

Query: 96  AMLTRAAPGPRRS-----------------EDEPPAASASAAPPPQRDEEEPDGVPEKGK 138
               R  PG R S                  D P  AS  A PP  +      G+P +  
Sbjct: 698 RG-ERGFPGERGSPGAQGLQGPRGLPGTPGTDGPKGASGPAGPPGAQGPPGLQGMPGERG 756

Query: 139 SSGPSARKG 147
           ++G +  KG
Sbjct: 757 AAGIAGPKG 765



 Score = 41.2 bits (95), Expect = 0.001
 Identities = 40/142 (28%), Positives = 46/142 (32%), Gaps = 31/142 (21%)

Query: 36  PPGPAPPGG-----------GSSDAAGKP----------PAGALGTPAA----AAANELN 70
           PPGPA   G           GS+ A G P          PAG  G P A     A  E  
Sbjct: 793 PPGPAGANGEKGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQG 852

Query: 71  NNLPGGAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEP-----PAASASAAPPPQR 125
                G   AP   GP G     G   +T    G R ++  P     P A+    PP   
Sbjct: 853 EAGQKGDAGAPGPQGPSGAPGPQGPTGVT-GPKGARGAQGPPGATGFPGAAGRVGPPGSN 911

Query: 126 DEEEPDGVPEKGKSSGPSARKG 147
               P G P      GP   +G
Sbjct: 912 GNPGPPGPPGPSGKDGPKGARG 933



 Score = 40.8 bits (94), Expect = 0.002
 Identities = 39/130 (30%), Positives = 46/130 (35%), Gaps = 22/130 (16%)

Query: 31  RAKQNPPGPA----PPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGP 86
           R +  PPGPA    PPG      A K   G  G    A A       P G   AP   GP
Sbjct: 824 RGETGPPGPAGFAGPPGADGQPGA-KGEQGEAGQKGDAGAPG-----PQGPSGAPGPQGP 877

Query: 87  GGVNCAVGSAMLTRAAPGPRRSED--------EPPAASASAAPPPQRDEEEPDGVPEKGK 138
            GV    G+    R A GP  +           PP ++ +  PP        DG      
Sbjct: 878 TGVTGPKGA----RGAQGPPGATGFPGAAGRVGPPGSNGNPGPPGPPGPSGKDGPKGARG 933

Query: 139 SSGPSARKGK 148
            SGP  R G+
Sbjct: 934 DSGPPGRAGE 943



 Score = 38.5 bits (88), Expect = 0.009
 Identities = 43/153 (28%), Positives = 53/153 (34%), Gaps = 16/153 (10%)

Query: 5   GYRTSSGLGGSTTDFLEEWKAKREKMRAKQNPPGPAPPGG--GSSDAAGKP-PAGALGTP 61
           G   + GL G   D   + K        +   PGP  P G  G     G P P GA G P
Sbjct: 555 GLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQPGVMGFPGPKGANGEP 614

Query: 62  AAAAANELNNNLPGGAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAAS----- 116
             A        LP GAP    +PG  G   A G      A P   R E   P  S     
Sbjct: 615 GKAG----EKGLP-GAPGLRGLPGKDGETGAAGPP--GPAGPAGERGEQGAPGPSGFQGL 667

Query: 117 -ASAAPPPQRDEEEPDGVPEKGKSSGPSARKGK 148
                PP +  +    GVP +  + G    +G+
Sbjct: 668 PGPPGPPGEGGKPGDQGVPGEAGAPGLVGPRGE 700



 Score = 38.5 bits (88), Expect = 0.009
 Identities = 35/119 (29%), Positives = 43/119 (36%), Gaps = 12/119 (10%)

Query: 30  MRAKQNPPGPAPPGGGSSDAAGKPPAGALGTPAA-AAANELNNNLPGGAPAAPAVPGPGG 88
           M  ++   G A P G   D   K P GA G          +    P GA       GP G
Sbjct: 751 MPGERGAAGIAGPKGDRGDVGEKGPEGAPGKDGGRGLTGPIGPPGPAGANGEKGEVGPPG 810

Query: 89  VNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKG 147
              + G+    R APG  R E  PP  +  A PP        DG P      G + +KG
Sbjct: 811 PAGSAGA----RGAPG-ERGETGPPGPAGFAGPP------GADGQPGAKGEQGEAGQKG 858



 Score = 37.4 bits (85), Expect = 0.019
 Identities = 35/120 (29%), Positives = 39/120 (32%), Gaps = 27/120 (22%)

Query: 34   QNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAV 93
            Q  PGP  P G    +    P+G  G P            P G   A  +PGP G     
Sbjct: 1139 QGLPGPPGPSGDQGASGPAGPSGPRGPPGPVG--------PSGKDGANGIPGPIG----- 1185

Query: 94   GSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKGKGQIEK 153
                     PGPR    E    +  A PP       P G P  G     SA  G G  EK
Sbjct: 1186 --------PPGPRGRSGE----TGPAGPPGNPGPPGPPGPP--GPGIDMSAFAGLGPREK 1231



 Score = 37.0 bits (84), Expect = 0.025
 Identities = 37/130 (28%), Positives = 45/130 (34%), Gaps = 21/130 (16%)

Query: 34  QNPPGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPG--GAPAAPAVPGP----- 86
           + PPGP  P G       +   G  G P      +     PG  G P  P  PGP     
Sbjct: 119 KGPPGPQGPAGEQGPRGDRGDKGEKGAP-GPRGRDGEPGTPGNPGPPGPPGPPGPPGLGG 177

Query: 87  -------GGVNCAVGSAML--TRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKG 137
                  GG +   G A L   +   GP      P  A    AP PQ  +  P G P + 
Sbjct: 178 NFAAQMAGGFDEKAGGAQLGVMQGPMGPMGPRGPPGPA---GAPGPQGFQGNP-GEPGEP 233

Query: 138 KSSGPSARKG 147
             SGP   +G
Sbjct: 234 GVSGPMGPRG 243



 Score = 33.9 bits (76), Expect = 0.21
 Identities = 30/112 (26%), Positives = 35/112 (31%), Gaps = 13/112 (11%)

Query: 37  PGPAPPGGGSSDAAGKPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGVNCAVGSA 96
           PGP  P G         P G  G P  A           G       PGP G   A G A
Sbjct: 434 PGPRGPPGPQGATGPLGPKGQTGEPGIAGFK--------GEQGPKGEPGPAGPQGAPGPA 485

Query: 97  MLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPSARKGK 148
                  G R +  E P       PP +R      G P +   +GP    G+
Sbjct: 486 ----GEEGKRGARGE-PGGVGPIGPPGERGAPGNRGFPGQDGLAGPKGAPGE 532



 Score = 33.9 bits (76), Expect = 0.21
 Identities = 42/163 (25%), Positives = 60/163 (36%), Gaps = 14/163 (8%)

Query: 31  RAKQNP---PGPAPPGGGSSDAAGKPPAGALGTPAAAA--ANELNNNLPG--GAPAAPAV 83
           + +Q P   PGPA P G    A  +   GA G P              PG  G P    +
Sbjct: 464 KGEQGPKGEPGPAGPQGAPGPAGEEGKRGARGEPGGVGPIGPPGERGAPGNRGFPGQDGL 523

Query: 84  PGPGGVNCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVPEKGKSSGPS 143
            GP G     G + L     GP+ +  +P        P  +     P     +GK  GPS
Sbjct: 524 AGPKGAPGERGPSGLA----GPKGANGDPGRPGEPGLPGARGLTGRPGDAGPQGK-VGPS 578

Query: 144 ARKGK-GQIEKRKLREKRRSTGVVNIPAAECLDEYEDDEAGQK 185
              G+ G+      +  R   GV+  P  +  +  E  +AG+K
Sbjct: 579 GAPGEDGRPGPPGPQGARGQPGVMGFPGPKGANG-EPGKAGEK 620



 Score = 33.9 bits (76), Expect = 0.21
 Identities = 44/176 (25%), Positives = 58/176 (32%), Gaps = 31/176 (17%)

Query: 5    GYRTSSGLGGSTTDFLEEWKAKREKMRAKQNPPGPA-PPG--GGSSDAAGKPPAGALGTP 61
            G R   G  G      E  K         + PPGP  PPG  G + +   +   GA G P
Sbjct: 987  GQRGERGFPGLPGPSGEPGKQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPP 1046

Query: 62   AAAAANELNNNL-------PGGAPAAPAVPGPGGVNCAVGSAMLTRAAPGPRRSEDEPPA 114
                A  +  +          GAP  P  PGP G     G            R E     
Sbjct: 1047 GRDGAAGVKGDRGETGAVGAPGAPGPPGSPGPAGPTGKQGD-----------RGEAGAQG 1095

Query: 115  ASASAAPPPQRDEEEPDGVPEKGKSSGPSARKGK-GQIEKRKLREKRRSTGVVNIP 169
                + P   R  + P          GP   KG+ G+  +R L+  R  TG+  +P
Sbjct: 1096 PMGPSGPAGARGIQGP---------QGPRGDKGEAGEPGERGLKGHRGFTGLQGLP 1142



 Score = 33.5 bits (75), Expect = 0.28
 Identities = 39/124 (31%), Positives = 44/124 (35%), Gaps = 23/124 (18%)

Query: 36   PPG----PAPPGG-GSSDAAG-KPPAGALGTPAAAAANELNNNLPGGAPAAPAVPGPGGV 89
            PPG    P PPG  G S   G K   G  G P  A    L    P G P     PG  G 
Sbjct: 907  PPGSNGNPGPPGPPGPSGKDGPKGARGDSGPPGRAGEPGLQG--PAGPPGEKGEPGDDGP 964

Query: 90   NCAVGSAMLTRAAPGPRRSEDEPPAASASAAPPPQRDEEEPDGVP----EKGKSSGPSAR 145
            + A G        PGP+    +          P QR E    G+P    E GK   P A 
Sbjct: 965  SGAEG-------PPGPQGLAGQ----RGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGAS 1013

Query: 146  KGKG 149
              +G
Sbjct: 1014 GDRG 1017


  Database: hs.faa
    Posted date:  Aug 4, 2009  4:42 PM
  Number of letters in database: 18,247,518
  Number of sequences in database:  37,866
  
Lambda     K      H
   0.305    0.127    0.352 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 16,829,623
Number of Sequences: 37866
Number of extensions: 1128220
Number of successful extensions: 13708
Number of sequences better than 10.0: 30
Number of HSP's better than 10.0 without gapping: 379
Number of HSP's successfully gapped in prelim test: 1471
Number of HSP's that attempted gapping in prelim test: 8231
Number of HSP's gapped (non-prelim): 5602
length of query: 340
length of database: 18,247,518
effective HSP length: 103
effective length of query: 237
effective length of database: 14,347,320
effective search space: 3400314840
effective search space used: 3400314840
T: 11
A: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.9 bits)
S2: 62 (28.5 bits)
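
The effective lengths in the summary above follow from the query length, the database size, and the effective HSP length. The sketch below reproduces that arithmetic, assuming the usual length correction: subtract the effective HSP length once from the query and once per database sequence. The function and variable names are illustrative and are not part of BLAST.

```python
# Values copied from the BLAST summary above.
QUERY_LENGTH = 340
DB_LETTERS = 18_247_518
DB_SEQUENCES = 37_866
EFFECTIVE_HSP_LENGTH = 103


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective database length, search space).

    Assumption: the length correction removes hsp_len residues from the
    query and from every database sequence.
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


eff_query, eff_db, space = effective_search_space(
    QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
)
print(eff_query, eff_db, space)  # 237 14347320 3400314840
```

These three numbers agree with the "effective length of query", "effective length of database", and "effective search space" lines reported above.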

Search results were obtained with NCBI BLAST and RefSeq entries.
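
The bit scores and Expect values in the alignments can be approximated from the raw scores with the gapped Lambda and K printed in the statistics. The sketch below uses the standard Karlin-Altschul relations, bits = (Lambda * S - ln K) / ln 2 and E = search_space * 2^(-bits). Lambda and K are rounded in the printout, so agreement is expected only to about the printed precision. The function names are illustrative.

```python
import math

# Gapped statistical parameters and search space copied from the summary.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
SEARCH_SPACE = 3_400_314_840


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=SEARCH_SPACE):
    """Expected number of chance HSPs scoring at least raw_score."""
    return search_space * 2.0 ** (-bit_score(raw_score))


# Raw scores taken from the "Score = ... bits (raw)" lines above.
for raw in (95, 94, 88, 76):
    print(raw, round(bit_score(raw), 1), f"{expect_value(raw):.2g}")
```

For the raw score of 95, this gives about 41.2 bits and an Expect value slightly above 0.001, in line with the "Score = 41.2 bits (95), Expect = 0.001" entry.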



Guide to the Human Genome
Copyright © 2010 by Stewart Scherer. All rights reserved.

CSHL Press