Guide to the Human Genome
Home | Table of Contents | Search text | Search genes | Search sequences | Purchase | FAQ | Blog | Help

Search of human proteins with 239748716

BLASTP 2.2.11 [Jun-05-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= gi|239748716 PREDICTED: hypothetical protein XP_002346782 [Homo
sapiens]
         (267 letters)

Database: hs.faa 
           37,866 sequences; 18,247,518 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gi|239748716 PREDICTED: hypothetical protein XP_002346782 [Homo ...   549   e-157
gi|239742745 PREDICTED: hypothetical protein XP_002342616 [Homo ...   429   e-120
gi|158854042 hypothetical protein LOC222183 [Homo sapiens]             53   3e-07
gi|145309322 insulinoma-associated protein IA-6 [Homo sapiens]         52   4e-07
gi|41349482 proline-rich protein BstNI subfamily 1 isoform 1 pre...    52   7e-07
gi|4502961 alpha 1 type VII collagen precursor [Homo sapiens]          50   2e-06
gi|188536004 zinc finger protein 469 [Homo sapiens]                    50   2e-06
gi|209571537 proline-rich protein BstNI subfamily 2 [Homo sapiens]     49   4e-06
gi|58761548 tau tubulin kinase 1 [Homo sapiens]                        47   2e-05
gi|116256464 hypothetical protein LOC343990 [Homo sapiens]             47   2e-05
gi|117306167 proline-rich protein BstNI subfamily 3 precursor [H...    47   2e-05
gi|113416996 PREDICTED: ankyrin repeat domain 33B [Homo sapiens]       47   2e-05
gi|239508698 PREDICTED: similar to mucin [Homo sapiens]                47   2e-05
gi|37537692 proline-rich protein BstNI subfamily 4 precursor [Ho...    47   2e-05
gi|39930517 sterile alpha motif domain containing 1 [Homo sapiens]     46   3e-05
gi|91208420 bassoon protein [Homo sapiens]                             46   4e-05
gi|110832843 TBP-associated factor 4 [Homo sapiens]                    45   5e-05
gi|116256356 alpha 4 type IV collagen precursor [Homo sapiens]         45   5e-05
gi|33667117 MICAL-like 2 isoform 1 [Homo sapiens]                      45   5e-05
gi|239755873 PREDICTED: hypothetical protein [Homo sapiens]            45   9e-05
gi|224548936 hypothetical protein LOC100170229 [Homo sapiens]          44   1e-04
gi|33286446 opioid growth factor receptor [Homo sapiens]               44   1e-04
gi|239758013 PREDICTED: hypothetical protein [Homo sapiens]            44   1e-04
gi|237757308 synaptojanin 1 isoform b [Homo sapiens]                   44   1e-04
gi|110349772 alpha 1 type I collagen preproprotein [Homo sapiens]      44   1e-04
gi|4502951 collagen type III alpha 1 preproprotein [Homo sapiens]      44   1e-04
gi|239754474 PREDICTED: similar to mucin [Homo sapiens]                44   1e-04
gi|38150007 small nuclear ribonucleoprotein polypeptide B/B' iso...    44   1e-04
gi|207113162 Treacher Collins-Franceschetti syndrome 1 isoform e...    44   1e-04
gi|207113160 Treacher Collins-Franceschetti syndrome 1 isoform d...    44   1e-04

>gi|239748716 PREDICTED: hypothetical protein XP_002346782 [Homo
           sapiens]
          Length = 267

 Score =  549 bits (1415), Expect = e-157
 Identities = 267/267 (100%), Positives = 267/267 (100%)

Query: 1   MDRGSGPRGSPPGCKAALWSTLGGSMQQSAPTGQWGIRADSSPRRRTVRTAQPKERSPLR 60
           MDRGSGPRGSPPGCKAALWSTLGGSMQQSAPTGQWGIRADSSPRRRTVRTAQPKERSPLR
Sbjct: 1   MDRGSGPRGSPPGCKAALWSTLGGSMQQSAPTGQWGIRADSSPRRRTVRTAQPKERSPLR 60

Query: 61  RGSCARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHSSTSQ 120
           RGSCARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHSSTSQ
Sbjct: 61  RGSCARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHSSTSQ 120

Query: 121 KDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRARPGGSPFGHRPLLQ 180
           KDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRARPGGSPFGHRPLLQ
Sbjct: 121 KDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRARPGGSPFGHRPLLQ 180

Query: 181 PRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSPQSAPAESGELSIDRE 240
           PRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSPQSAPAESGELSIDRE
Sbjct: 181 PRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSPQSAPAESGELSIDRE 240

Query: 241 EGGPRQAPQGAGHALEGRGRVINGSVT 267
           EGGPRQAPQGAGHALEGRGRVINGSVT
Sbjct: 241 EGGPRQAPQGAGHALEGRGRVINGSVT 267


>gi|239742745 PREDICTED: hypothetical protein XP_002342616 [Homo
           sapiens]
          Length = 248

 Score =  429 bits (1102), Expect = e-120
 Identities = 209/209 (100%), Positives = 209/209 (100%)

Query: 59  LRRGSCARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHSST 118
           LRRGSCARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHSST
Sbjct: 40  LRRGSCARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHSST 99

Query: 119 SQKDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRARPGGSPFGHRPL 178
           SQKDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRARPGGSPFGHRPL
Sbjct: 100 SQKDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRARPGGSPFGHRPL 159

Query: 179 LQPRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSPQSAPAESGELSID 238
           LQPRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSPQSAPAESGELSID
Sbjct: 160 LQPRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSPQSAPAESGELSID 219

Query: 239 REEGGPRQAPQGAGHALEGRGRVINGSVT 267
           REEGGPRQAPQGAGHALEGRGRVINGSVT
Sbjct: 220 REEGGPRQAPQGAGHALEGRGRVINGSVT 248


>gi|158854042 hypothetical protein LOC222183 [Homo sapiens]
          Length = 653

 Score = 52.8 bits (125), Expect = 3e-07
 Identities = 64/222 (28%), Positives = 82/222 (36%), Gaps = 32/222 (14%)

Query: 56  RSPLRRGSCARGEKRPPGPTGGQWRWGV-SPRPSAASATGPPQDRAARGCE--LREGRAE 112
           RSP R     R E R    TG Q   G  SP PS  S  G PQ     G       GR  
Sbjct: 273 RSPSRLSPKHRDEGRK---TGSQRSSGSRSPSPSGGSGWGSPQRNGGSGQRSGAHGGRPG 329

Query: 113 AGHSSTSQKDTLS-----KGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRAR 167
           + HS   +  + S     K +AAA T  A GK    P          R   +  G G   
Sbjct: 330 SAHSPPDKPSSPSPRVRDKAAAAAPTPPARGKESPSP----------RSAPSSQGRGGRA 379

Query: 168 PGGSPFGHRPLLQPRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSPQS 227
            GG+  G R   + R+  +   A   + R ++RP P P   ++RS  RA+      S + 
Sbjct: 380 AGGA--GRRRRRRRRRRRSRSSASAPRRRGRRRPRPAPPRGSSRSLSRARSSSDSGSGRG 437

Query: 228 APAESGELSIDREEGG---------PRQAPQGAGHALEGRGR 260
           AP    E   +R  GG         PR  P     +    GR
Sbjct: 438 APGPGPEPGSERGHGGHGKRAKERPPRARPASTSPSPGAHGR 479



 Score = 40.8 bits (94), Expect = 0.001
 Identities = 70/274 (25%), Positives = 91/274 (33%), Gaps = 66/274 (24%)

Query: 3   RGSGPRGSPPGCKAALWSTLGGSMQQSAPT-----------GQWGIRADSSPRRRTVRTA 51
           R + PRGS      A  S+  GS  + AP            G  G RA   P R    + 
Sbjct: 412 RPAPPRGSSRSLSRARSSSDSGS-GRGAPGPGPEPGSERGHGGHGKRAKERPPRARPAST 470

Query: 52  QPKERSPLRRGSC-ARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGR 110
            P   +  RRG    +   R PGP    W    SP  S + +               E R
Sbjct: 471 SPSPGAHGRRGGPEGKSSSRSPGPHPRSWSSSRSPSKSRSRSA--------------EKR 516

Query: 111 AEAGHSSTSQKDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRARPGG 170
             +   S S K  LS+       A    +H     TR    ARRR  +  P   R R   
Sbjct: 517 PHSPSRSPSPKKPLSRDKDGEGRA----RHSEAEATR----ARRRSRSYSPIRKRRRDSP 568

Query: 171 SPFGHRPLLQPRQIFAPLQAITNQVRPQQRPEPPPAGS------AARSGRRAQRPG---- 220
           S       ++PR+I     +   +  P  RP P  + S      + RS  R+  PG    
Sbjct: 569 S------FMEPRRI----TSARKRPIPYYRPSPSSSSSCLSSDYSTRSHSRSPSPGHSHG 618

Query: 221 -----------RKRSPQSAPAESGELSIDREEGG 243
                      R RSP   P+ S       E GG
Sbjct: 619 SYSSRSHGTRSRTRSPSRTPSPSYHSRSSSESGG 652


>gi|145309322 insulinoma-associated protein IA-6 [Homo sapiens]
          Length = 566

 Score = 52.4 bits (124), Expect = 4e-07
 Identities = 65/261 (24%), Positives = 100/261 (38%), Gaps = 30/261 (11%)

Query: 6   GPRGSPPGCKAALWSTLGGSMQQSAPTGQWGIRADSSPRRRTVRTAQPKERSPLRRGSCA 65
           GP+G+PP  + A  ++L G+ + + PT       +   +  T   A+ +  SP R    +
Sbjct: 30  GPQGAPPFLEEAPSASLPGAERATPPT------REEPGKGLTAEAAREQSGSPCRAAGVS 83

Query: 66  RGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHSSTSQKDTLS 125
            G     G  G +WR G    P  + +  P   + A G ELR    E   SS    ++  
Sbjct: 84  PG---TGGREGAEWRAGGREGPGPSPSPSPSPAKPA-GAELRRAFLERCLSSPVSAESFP 139

Query: 126 KGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRARPGGSPFGHRPLLQ--PRQ 183
            G+AA A    +    A P                PG     P  +PF   P LQ  P  
Sbjct: 140 GGAAAVAAFSCSVAPAAAP---------------TPGEQFLLPLRAPF-PEPALQPDPAP 183

Query: 184 IFAPLQAITNQVRPQQRPEPPP--AGSAARSGRRAQRPGRKRSPQSAPAESGELSIDREE 241
           + A LQ++      ++R + P   A   A +G +  +  RK S       S  L +  +E
Sbjct: 184 LSAALQSLKRAAGGERRGKAPTDCASGPAAAGIKKPKAMRKLSFADEVTTSPVLGLKIKE 243

Query: 242 GGPRQAPQGAGHALEGRGRVI 262
             P    +G G +    G  I
Sbjct: 244 EEPGAPSRGLGGSRTPLGEFI 264


>gi|41349482 proline-rich protein BstNI subfamily 1 isoform 1
           preproprotein [Homo sapiens]
          Length = 331

 Score = 51.6 bits (122), Expect = 7e-07
 Identities = 69/265 (26%), Positives = 95/265 (35%), Gaps = 25/265 (9%)

Query: 4   GSGPRGSPP--GCKAALWSTLGGSMQQSAPTGQWGIRADSSPRRRTVRTAQPKERSPLRR 61
           G+ P+G PP  G         G   Q   P G+         + R+ R+   K + P  +
Sbjct: 44  GNKPQGPPPPPGKPQGPPPQGGNKPQGPPPPGKPQGPPPQGDKSRSPRSPPGKPQGPPPQ 103

Query: 62  G-SCARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHSSTSQ 120
           G +  +G   PPG   G    G + RP      G PQ    +G + R  R+  G     Q
Sbjct: 104 GGNQPQGPPPPPGKPQGPPPQGGN-RPQGPPPPGKPQGPPPQGDKSRSPRSPPGKP---Q 159

Query: 121 KDTLSKGSAAAATAVAAGKHLAVPETRG----GVPARRRETANPPGPGRARPGGSPFGHR 176
                 G+         GK    P   G    G P   +    PP   ++R   SP G +
Sbjct: 160 GPPPQGGNQPQGPPPPPGKPQGPPPQGGKKPQGPPPPGKPQGPPPQGDKSRSSQSPPG-K 218

Query: 177 PLLQPRQIFAPLQAITNQVRPQQRPEPP--PAGSAARSGRRAQRPGRKRSPQSAPAESGE 234
           P   P Q            +PQ  P PP  P G   + G + Q P     PQ  PA+ G 
Sbjct: 219 PQGPPPQ---------GGNQPQGPPPPPGKPQGPPPQGGNKPQGPPPPGKPQGPPAQGGS 269

Query: 235 LSIDREE--GGPRQAPQGAGHALEG 257
            S       G P+  PQ  G+  +G
Sbjct: 270 KSQSARSPPGKPQGPPQQEGNNPQG 294



 Score = 43.5 bits (101), Expect = 2e-04
 Identities = 54/205 (26%), Positives = 74/205 (36%), Gaps = 19/205 (9%)

Query: 53  PKERSPLRRGSCARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAE 112
           P+  SP + G+  +G   PPG   G    G + +P      G PQ    +G + R  R+ 
Sbjct: 36  PQGPSP-QGGNKPQGPPPPPGKPQGPPPQGGN-KPQGPPPPGKPQGPPPQGDKSRSPRSP 93

Query: 113 AGHSSTSQKDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRARPGGSP 172
            G     Q      G+         GK    P   G  P        PP PG+ + G  P
Sbjct: 94  PGKP---QGPPPQGGNQPQGPPPPPGKPQGPPPQGGNRPQ------GPPPPGKPQ-GPPP 143

Query: 173 FGHR---PLLQPRQIFAPLQAITNQVRPQQRPEPP--PAGSAARSGRRAQRPGRKRSPQS 227
            G +   P   P +   P     NQ  PQ  P PP  P G   + G++ Q P     PQ 
Sbjct: 144 QGDKSRSPRSPPGKPQGPPPQGGNQ--PQGPPPPPGKPQGPPPQGGKKPQGPPPPGKPQG 201

Query: 228 APAESGELSIDREEGGPRQAPQGAG 252
            P +  +    +   G  Q P   G
Sbjct: 202 PPPQGDKSRSSQSPPGKPQGPPPQG 226



 Score = 38.9 bits (89), Expect = 0.005
 Identities = 65/261 (24%), Positives = 89/261 (34%), Gaps = 50/261 (19%)

Query: 2   DRGSGPRGSPPGCKAALWSTLGGSMQQSAPTGQWGIRADSSPRRRTVRTAQP----KERS 57
           D+   PR SPPG K       GG+  Q  P      +    P +   R   P    K + 
Sbjct: 85  DKSRSPR-SPPG-KPQGPPPQGGNQPQGPPPPPG--KPQGPPPQGGNRPQGPPPPGKPQG 140

Query: 58  PLRRGSCARGEKRPPG------PTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRA 111
           P  +G  +R  + PPG      P GG    G  P P      G PQ    +G +  +G  
Sbjct: 141 PPPQGDKSRSPRSPPGKPQGPPPQGGNQPQGPPPPP------GKPQGPPPQGGKKPQGPP 194

Query: 112 EAGHSSTSQKDTLSKGSAAAATAVAAGKHLAVPETRG----GVPARRRETANPPGPGRAR 167
             G      +    +G  + ++    GK    P   G    G P    +   PP  G  +
Sbjct: 195 PPG----KPQGPPPQGDKSRSSQSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQGGNK 250

Query: 168 PGGSPFGHRPLLQPRQIFAPLQAITN-----QVRPQQ-----------------RPEPPP 205
           P G P   +P   P Q  +  Q+  +     Q  PQQ                 +P+ PP
Sbjct: 251 PQGPPPPGKPQGPPAQGGSKSQSARSPPGKPQGPPQQEGNNPQGPPPPAGGNPQQPQAPP 310

Query: 206 AGSAARSGRRAQRPGRKRSPQ 226
           AG      R  Q     R PQ
Sbjct: 311 AGQPQGPPRPPQGGRPSRPPQ 331


>gi|4502961 alpha 1 type VII collagen precursor [Homo sapiens]
          Length = 2944

 Score = 50.4 bits (119), Expect = 2e-06
 Identities = 71/270 (26%), Positives = 87/270 (32%), Gaps = 40/270 (14%)

Query: 5    SGPRGSPPGCKAALWSTLGGSMQQSAPTGQWGI------RADSSPRRRTVRTAQPKERSP 58
            SG +G PPG K A      GS     P G  G+      R +  PR +      P ER  
Sbjct: 2104 SGEQG-PPGLKGAKGEP--GSNGDQGPKGDRGVPGIKGDRGEPGPRGQDGNPGLPGERGM 2160

Query: 59   L--RRGSCARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHS 116
                     +G + PPGP GG    G    P  A   GP      +G        E G +
Sbjct: 2161 AGPEGKPGLQGPRGPPGPVGGHGDPGPPGAPGLAGPAGPQGPSGLKG--------EPGET 2212

Query: 117  STSQKDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPG----------RA 166
                +       A           L  P+   G+P +  ET  P  PG          R 
Sbjct: 2213 GPPGRGLTGPTGAVGLPGPPGPSGLVGPQGSPGLPGQVGETGKPGAPGRDGASGKDGDRG 2272

Query: 167  RPG-----GSPFGHRPLLQPRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGR 221
             PG     G P    P  +P    AP QA+     P  + E    G  A  G     PG 
Sbjct: 2273 SPGVPGSPGLPGPVGPKGEPGPTGAPGQAVVG--LPGAKGEKGAPGGLA--GDLVGEPGA 2328

Query: 222  K--RSPQSAPAESGELSIDREEGGPRQAPQ 249
            K  R       E GE     E G P +  Q
Sbjct: 2329 KGDRGLPGPRGEKGEAGRAGEPGDPGEDGQ 2358



 Score = 50.1 bits (118), Expect = 2e-06
 Identities = 72/265 (27%), Positives = 94/265 (35%), Gaps = 45/265 (16%)

Query: 6    GPRG--SPPGCKAALWSTLGGSMQQSAP--TGQWGIRADSSPRRRTVRTAQPKERS---P 58
            GP+G   PPG K        G  +  AP   GQ G   +  PR          +R    P
Sbjct: 1433 GPQGPVGPPGKKGEK-----GDSEDGAPGLPGQPGSPGEQGPRGPPGAIGPKGDRGFPGP 1487

Query: 59   LRRGSCARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHSST 118
            L      +GE+ PPGP G +   GV+ RP A    GPP     +G +   GR        
Sbjct: 1488 LGEAG-EKGERGPPGPAGSRGLPGVAGRPGAKGPEGPPGPTGRQGEKGEPGR-------- 1538

Query: 119  SQKDTLSKGSAAAATAVAAGK-HLAVPETRGGVPARRRETANPPG---PGRARPGGSPFG 174
               D    G A A      G    A P    GV   R     PPG   PG   P G P  
Sbjct: 1539 -PGDPAVVGPAVAGPKGEKGDVGPAGPRGATGVQGER----GPPGLVLPGDPGPKGDPGD 1593

Query: 175  HRPLLQPRQIFAPLQA-ITNQVRPQQRPEPP-PAGSAARSGRRAQR-----------PGR 221
              P+    +   P  +    +     RP PP P G   R G   ++           PG+
Sbjct: 1594 RGPIGLTGRAGPPGDSGPPGEKGDPGRPGPPGPVGPRGRDGEVGEKGDEGPPGDPGLPGK 1653

Query: 222  --KRSPQSAPAESGELSIDREEGGP 244
              +R  + AP   G +    ++G P
Sbjct: 1654 AGERGLRGAPGVRGPVGEKGDQGDP 1678



 Score = 36.2 bits (82), Expect = 0.031
 Identities = 66/259 (25%), Positives = 83/259 (32%), Gaps = 55/259 (21%)

Query: 6    GPRGSPPGCKAALWSTLGGSMQQSAPTGQWGIRADSSPRRRTVRTAQPKERSP------- 58
            GPRG P G   A    +GG  +     G+ G    S P         P  R P       
Sbjct: 1341 GPRG-PKGEPGAPGQVIGG--EGPGLPGRKGDPGPSGPPGPRGPLGDPGPRGPPGLPGTA 1397

Query: 59   LRRGSCARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHSST 118
            ++     RGE+ PPGP  G    G    P    + GP       G +  +G +E G    
Sbjct: 1398 MKGDKGDRGERGPPGPGEGGIAPGEPGLPGLPGSPGPQGPVGPPGKKGEKGDSEDG---- 1453

Query: 119  SQKDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRARPGGSPFGHRPL 178
                                          G+P +      P  PG   P G P    P 
Sbjct: 1454 ----------------------------APGLPGQ------PGSPGEQGPRGPPGAIGP- 1478

Query: 179  LQPRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSPQSAPAESGELSID 238
               R    PL     +    +R  P PAGS    G  A RPG K  P+  P  +G     
Sbjct: 1479 KGDRGFPGPLGEAGEK---GERGPPGPAGSRGLPG-VAGRPGAK-GPEGPPGPTGRQGEK 1533

Query: 239  REEGGPRQAPQGAGHALEG 257
             E G P   P   G A+ G
Sbjct: 1534 GEPGRPGD-PAVVGPAVAG 1551



 Score = 34.7 bits (78), Expect = 0.089
 Identities = 36/123 (29%), Positives = 45/123 (36%), Gaps = 21/123 (17%)

Query: 5    SGPRGS-----PPGCKAALWSTLGGSMQQSAPTGQWGIRADSSPRRRTVRTAQPKERS-- 57
            SGP+G      PPG    L  T  G+ ++  P G  G      P+        P ER   
Sbjct: 1690 SGPKGDRGEPGPPGPPGRLVDTGPGAREKGEP-GDRGQEGPRGPKGDPGLPGAPGERGIE 1748

Query: 58   -----------PLRRGSCARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCEL 106
                       P  RG    GEK   GP G   R G+  +P AA  +GP       G   
Sbjct: 1749 GFRGPPGPQGDPGVRGPA--GEKGDRGPPGLDGRSGLDGKPGAAGPSGPNGAAGKAGDPG 1806

Query: 107  REG 109
            R+G
Sbjct: 1807 RDG 1809



 Score = 31.6 bits (70), Expect = 0.76
 Identities = 61/229 (26%), Positives = 75/229 (32%), Gaps = 25/229 (10%)

Query: 2    DRGS-GPRGSPPGCKAALWSTLGGSMQQSAPTGQWGIRADSSPRRRTVRTAQPKERSPLR 60
            DRG  GP+G PPG         G S     P G+ GI              +P ER    
Sbjct: 2007 DRGDPGPQG-PPGLALGERGPPGPSGLAGEP-GKPGIPGLPGRAGGVGEAGRPGER---- 2060

Query: 61   RGSCARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHSSTSQ 120
                  GE+   G  G Q R G    P      GPP      G ++       G S    
Sbjct: 2061 ------GERGEKGERGEQGRDGPPGLPGTPGPPGPP------GPKVSVDEPGPGLSGEQG 2108

Query: 121  KDTL--SKGSAAAATAVAAGKHLAVPETRG--GVPARRRETANPPGPGRARPGGSPFGHR 176
               L  +KG   +           VP  +G  G P  R +  NP  PG     G P G  
Sbjct: 2109 PPGLKGAKGEPGSNGDQGPKGDRGVPGIKGDRGEPGPRGQDGNPGLPGERGMAG-PEGKP 2167

Query: 177  PLLQPRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSP 225
             L  PR    P+    +   P       PAG    SG + + PG    P
Sbjct: 2168 GLQGPRGPPGPVGGHGDPGPPGAPGLAGPAGPQGPSGLKGE-PGETGPP 2215



 Score = 31.2 bits (69), Expect = 0.99
 Identities = 55/212 (25%), Positives = 73/212 (34%), Gaps = 45/212 (21%)

Query: 50   TAQPK-ERSPLRRGSCARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELRE 108
            T QP+ E  P+    C +G+K  PG  G + + G    P     TG P  +   G    +
Sbjct: 1239 TTQPRPEPCPVY---CPKGQKGEPGEMGLRGQVGPPGDPGLPGRTGAPGPQGPPGSATAK 1295

Query: 109  GRAEAGHSSTSQKDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANP--PG---- 162
            G                +G   A             + R G P R      P  PG    
Sbjct: 1296 G---------------ERGFPGA-------------DGRPGSPGRAGNPGTPGAPGLKGS 1327

Query: 163  PGRARPGGSPFGHRPLLQPRQIFAPLQAITNQ--VRPQQRPEPPPAGSAARSGRRAQRPG 220
            PG   P G P    P     +  AP Q I  +    P ++ +P P+G     G     PG
Sbjct: 1328 PGLPGPRGDPGERGPRGPKGEPGAPGQVIGGEGPGLPGRKGDPGPSGPPGPRGPLGD-PG 1386

Query: 221  RKRSPQSAP--AESGELSIDREEGGPRQAPQG 250
              R P   P  A  G+   DR E GP    +G
Sbjct: 1387 -PRGPPGLPGTAMKGDKG-DRGERGPPGPGEG 1416



 Score = 31.2 bits (69), Expect = 0.99
 Identities = 67/262 (25%), Positives = 89/262 (33%), Gaps = 46/262 (17%)

Query: 28   QSAPTGQWGIRADSSPRRRTVRTAQP-KERSPLRRGSCARGEKRPPGPTGGQWRWGVSPR 86
            +S   G+ G    S P     +   P ++  P  RG   +G   P GP G   + G   +
Sbjct: 1780 RSGLDGKPGAAGPSGPNGAAGKAGDPGRDGLPGLRGE--QGLPGPSGPPGLPGKPGEDGK 1837

Query: 87   PSAASATGPPQDRAARGCELREG-RAEAGHSSTSQKDTLSKGSAAAATAVAAGKHLAVPE 145
            P      G P D    G   R+G + ++G S    +D   KG   A   +       +P 
Sbjct: 1838 PGLNGKNGEPGDPGEDG---RKGEKGDSGASGREGRDG-PKGERGAPGILGPQGPPGLPG 1893

Query: 146  TRG-------GVP------ARRRETANPPGPG-------RARPGGSPFGHRPLLQP---- 181
              G       GVP        R ET +    G       R  PG  P   R LL+     
Sbjct: 1894 PVGPPGQGFPGVPGGTGPKGDRGETGSKGEQGLPGERGLRGEPGSVPNVDR-LLETAGIK 1952

Query: 182  ----RQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSPQSAPAESGELSI 237
                R+I       +    P       P G +   G     P  K  P   P E G L  
Sbjct: 1953 ASALREIVETWDESSGSFLPVPERRRGPKGDSGEQG-----PPGKEGPIGFPGERG-LKG 2006

Query: 238  DREEGGPRQAPQGAGHALEGRG 259
            DR + GP+  P   G AL  RG
Sbjct: 2007 DRGDPGPQGPP---GLALGERG 2025



 Score = 28.5 bits (62), Expect = 6.4
 Identities = 59/262 (22%), Positives = 84/262 (32%), Gaps = 47/262 (17%)

Query: 6    GPRGSPPGCKAALWSTLGGSMQQSAPTGQWGIRADSSPRRRTVRTAQPKERSPL----RR 61
            GP+G P            G       TG+ G   DS P        +P    P+    R 
Sbjct: 1586 GPKGDP------------GDRGPIGLTGRAGPPGDSGPPGEKGDPGRPGPPGPVGPRGRD 1633

Query: 62   GSCA-RGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHSSTSQ 120
            G    +G++ PPG  G   + G      A    GP  ++  +G    +GR  +  SS  +
Sbjct: 1634 GEVGEKGDEGPPGDPGLPGKAGERGLRGAPGVRGPVGEKGDQGDPGEDGRNGSPGSSGPK 1693

Query: 121  KDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRARPGGSPFGHRPLLQ 180
             D    G          G    + +T  G     RE   P   G+  P G      P   
Sbjct: 1694 GDRGEPG--------PPGPPGRLVDTGPGA----REKGEPGDRGQEGPRG------PKGD 1735

Query: 181  PRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSPQSAPAESGELSIDRE 240
            P    AP +      R    P+  P           + P  ++  +  P   G   +D +
Sbjct: 1736 PGLPGAPGERGIEGFRGPPGPQGDPG---------VRGPAGEKGDRGPPGLDGRSGLDGK 1786

Query: 241  EG--GPRQAPQGAGHALEGRGR 260
             G  GP   P GA       GR
Sbjct: 1787 PGAAGP-SGPNGAAGKAGDPGR 1807


>gi|188536004 zinc finger protein 469 [Homo sapiens]
          Length = 3925

 Score = 50.4 bits (119), Expect = 2e-06
 Identities = 64/252 (25%), Positives = 94/252 (37%), Gaps = 39/252 (15%)

Query: 6    GPRGS--------PPGCKAALWSTLGGSMQQSA------PTGQWGIRADSSPRRRT---V 48
            GPRG+        P GC+++       S    A      P    G  A     R T   +
Sbjct: 3632 GPRGTFHKGSATKPAGCQSSSKDRSAASTPSKALKFPVHPRKAVGSLAPGELARGTENGM 3691

Query: 49   RTAQPKER-SPLRRGSCARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELR 107
            + A PK +  P  +GS   G  RP   TGG    G  P+P++           A+     
Sbjct: 3692 KPATPKAKPGPSSQGS---GSPRPGTKTGG----GSQPQPASGQLQSETATTPAKPSFPS 3744

Query: 108  EGRAEAGHSSTSQKDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRAR 167
               A     + +Q  + +KG   A      G H ++     G  + +R+    PGP R+ 
Sbjct: 3745 RSPAPERLPARAQAKSCTKGPREAGEQ---GPHGSLGPKEKGESSTKRKKGQVPGPARSE 3801

Query: 168  PGGSPFGH------RPLLQPRQIFAPLQAITNQVRP--QQRPEPPPAGSAARSGRRAQRP 219
              GS FG       +P   PR+   P + +  + +P  Q +P PPP+          QR 
Sbjct: 3802 SVGS-FGRAPSAPDKPPRTPRKQATPSRVLPTKPKPNSQNKPRPPPSEQRKAEPGHTQRK 3860

Query: 220  GR--KRSPQSAP 229
             R  K  PQ  P
Sbjct: 3861 DRLGKAFPQGRP 3872



 Score = 44.7 bits (104), Expect = 9e-05
 Identities = 63/250 (25%), Positives = 85/250 (34%), Gaps = 39/250 (15%)

Query: 4   GSGPRGSPPGCKAALWSTLGGSMQQSAPTGQWGIRA-----DSSPRRRTVR-------TA 51
           G  PRG+PP        T+ G +Q        G  +     D++P  RT +        A
Sbjct: 3   GERPRGAPP-------PTMTGDLQPRQVASSPGHPSQPPLEDNTPATRTTKGAREAGGQA 55

Query: 52  QPKERSPLRRGSCARGEKRPPG-----PTGGQWRWGVSPRPSAASATGPPQDRAARGCEL 106
           Q  E    +      GE +PP      P+    + G    P   S    P   A R    
Sbjct: 56  QAMELPEAQPRQARDGELKPPSLRGQAPSSTPGKRGSPQTPPGRSPLQAPSRLAGRAEGS 115

Query: 107 REGRAEAGHSSTSQKDTLSKGSAAAATAVAAGKHLAVPETRG-GVPARRRETANPPGPGR 165
              R   G +S+  K TL +         A    +  P+  G G P R       PG  R
Sbjct: 116 PPQRYILGIASSRTKPTLDETPENPQLEAAQLPEVDTPQGPGTGAPLR-------PGLPR 168

Query: 166 --ARPGGSPFG-HRPLLQPRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRK 222
             A+P     G HR   +P   F      TN   P   P PP  G     G    +PG  
Sbjct: 169 TEAQPAAEELGFHRCFQEPPSSFTS----TNYTSPSATPRPPAPGPPQSRGTSPLQPGSY 224

Query: 223 RSPQSAPAES 232
              Q++ A+S
Sbjct: 225 PEYQASGADS 234



 Score = 42.4 bits (98), Expect = 4e-04
 Identities = 57/242 (23%), Positives = 88/242 (36%), Gaps = 31/242 (12%)

Query: 23   GGSMQQSAPTG----QWGIRADSSPRRRTVRTAQPKERSPLRRGSCARGEKRPPGPTGGQ 78
            G  +QQ+ P G    + G R   +  +R       K R+P  RG CA    +        
Sbjct: 3560 GPLLQQALPLGASLPRPGARGQDAEGKRAPLVFSGKRRAPGARGRCAPDHFQED------ 3613

Query: 79   WRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHSSTSQKDTLSKGSAAAATAVAAG 138
                +  +    S++    +   RG   +    +     +S KD  +  + + A      
Sbjct: 3614 ---HLLQKEKEVSSSHMVSEGGPRGTFHKGSATKPAGCQSSSKDRSAASTPSKALKFPVH 3670

Query: 139  KHLAVPETRGGVPARRRETANPPGPGRARPGGSPFGH---RPLL------QPRQIFAPLQ 189
               AV     G  AR  E    P   +A+PG S  G    RP        QP+     LQ
Sbjct: 3671 PRKAVGSLAPGELARGTENGMKPATPKAKPGPSSQGSGSPRPGTKTGGGSQPQPASGQLQ 3730

Query: 190  A--ITNQVRP-----QQRPEPPPAGSAARSGRRAQRPGRKRSPQSA--PAESGELSIDRE 240
            +   T   +P        PE  PA + A+S  +  R   ++ P  +  P E GE S  R+
Sbjct: 3731 SETATTPAKPSFPSRSPAPERLPARAQAKSCTKGPREAGEQGPHGSLGPKEKGESSTKRK 3790

Query: 241  EG 242
            +G
Sbjct: 3791 KG 3792



 Score = 40.0 bits (92), Expect = 0.002
 Identities = 70/281 (24%), Positives = 84/281 (29%), Gaps = 62/281 (22%)

Query: 36   GIRADSSPRRRTVRTAQPKERSPLRRGSCARGEKRPPGPTGGQW---------------R 80
            G RAD +PR         + RS  RR    R + R     GG W               R
Sbjct: 1003 GSRADPAPRVPRAAALPEETRSSRRRRLPPRKDPRKRKARGGAWGKELILKIVQQKNRLR 1062

Query: 81   WGVSPRPSAASATGPPQDRAARGCELR-EGRAEAGHSSTSQKDTLSKGSAAAATAVAAGK 139
                   S      PP+    RG   R E R E   +   ++D   K   AA        
Sbjct: 1063 EYDFASESEEDEQPPPRGPGFRGRRGRGEKRKEVELTQGPREDEPQKPRKAARQEAGGDG 1122

Query: 140  HLAVPETRGGVPARRRETANPPGPGR-------ARPGGSPFGHRPLLQPRQIFAPLQAIT 192
              A PE  GG       +    GP R       AR GG     RP + P+    PLQ  T
Sbjct: 1123 APANPEEPGGSRPGPGRSPQARGPSRSLETGAAAREGGPKCADRPSVAPKD---PLQVPT 1179

Query: 193  NQVR----------PQQRPEP-------------------PPAGSAARSGRR-----AQR 218
            N             PQ+  EP                   PPA  A   G        ++
Sbjct: 1180 NTETSEETRPSLDFPQEAKEPETAEESAPDSTEFTEALRSPPAACAGEMGASPGLLIPEQ 1239

Query: 219  PGRKRSPQSAPAESGELSIDREEGGPRQAPQGAGHALEGRG 259
            P   R     P  SG L+     G       G G  L G G
Sbjct: 1240 PPPSRHDTGTPKPSGSLANTAPHGS--SPTPGVGSLLGGPG 1278



 Score = 29.3 bits (64), Expect = 3.8
 Identities = 27/91 (29%), Positives = 35/91 (38%), Gaps = 22/91 (24%)

Query: 84  SPRPSAASATGPPQDRAARGCELREGRAEAGHSSTSQKDTLSKGSAAAATAVAAGKHLAV 143
           +PRP A    GPPQ R                +S  Q  +  +  A+ A +       + 
Sbjct: 202 TPRPPAP---GPPQSRG---------------TSPLQPGSYPEYQASGADSWPPAAENSF 243

Query: 144 PETRGGVPARRRETANPPGPGRARPGGSPFG 174
           P    GVP    E    P P  +RPGGSP G
Sbjct: 244 PGANFGVPPAEPE----PIPKGSRPGGSPRG 270


>gi|209571537 proline-rich protein BstNI subfamily 2 [Homo sapiens]
          Length = 416

 Score = 49.3 bits (116), Expect = 4e-06
 Identities = 74/278 (26%), Positives = 95/278 (34%), Gaps = 46/278 (16%)

Query: 4   GSGPRG--SPPGCKAALWSTLGGSMQQSAPTGQWGIRADSSPRRRTVRTAQP---KERSP 58
           G+ P+G  SPPG K       GG+  Q  P    G      P+        P   K + P
Sbjct: 44  GNKPQGPPSPPG-KPQGPPPQGGNQPQGPPPPP-GKPQGPPPQGGNKPQGPPPPGKPQGP 101

Query: 59  LRRGSCARGEKRPPG------PTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAE 112
             +G  +R  + PPG      P GG    G  P P      G PQ    +G    +G   
Sbjct: 102 PPQGDKSRSPRSPPGKPQGPPPQGGNQPQGPPPPP------GKPQGPPPQGGNKPQGPPP 155

Query: 113 AG--HSSTSQKDTLSKGSAAAATAVAAGKHLAVPETRG----GVPARRRETANPPGPGRA 166
            G       Q D  S+ S +       GK    P   G    G P    +   PP  G  
Sbjct: 156 PGKPQGPPPQGDNKSRSSRSPP-----GKPQGPPPQGGNQPQGPPPPPGKPQGPPPQGGN 210

Query: 167 RPGGSPFGHRPLLQPRQIFAPLQAITNQV------------RPQQRPEPP--PAGSAARS 212
           +P G P   +P   P Q     Q+  +              +PQ  P PP  P G   + 
Sbjct: 211 KPQGPPPPGKPQGPPPQGDNKSQSARSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQG 270

Query: 213 GRRAQRPGRKRSPQSAPAESGELSIDREEGGPRQAPQG 250
           G + Q P     PQ  P + G  S  R    P   PQG
Sbjct: 271 GNKPQGPPPPGKPQGPPPQGGSKS--RSSRSPPGKPQG 306



 Score = 48.1 bits (113), Expect = 8e-06
 Identities = 69/285 (24%), Positives = 96/285 (33%), Gaps = 40/285 (14%)

Query: 2   DRGSGPRGSPPGCKAALWSTLGGSMQQS---------APTGQWGIRADSSPRRRTVRTAQ 52
           D+   PR SPPG K       GG+  Q           P  Q G +    P     +   
Sbjct: 106 DKSRSPR-SPPG-KPQGPPPQGGNQPQGPPPPPGKPQGPPPQGGNKPQGPPPPGKPQGPP 163

Query: 53  PKERSPLRRGSCARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAE 112
           P+  +  R      G+ + P P GG    G  P P      G PQ    +G    +G   
Sbjct: 164 PQGDNKSRSSRSPPGKPQGPPPQGGNQPQGPPPPP------GKPQGPPPQGGNKPQGPPP 217

Query: 113 AGHSSTSQKDTLSKGSAAAATAVAAGKHLAVPETRG----GVPARRRETANPPGPGRARP 168
            G          +K  +A +     GK    P   G    G P    +   PP  G  +P
Sbjct: 218 PGKPQGPPPQGDNKSQSARSPP---GKPQGPPPQGGNQPQGPPPPPGKPQGPPPQGGNKP 274

Query: 169 GGSPFGHRPLLQPRQIFAPLQAITNQV------------RPQQRPEPP--PAGSAARSGR 214
            G P   +P   P Q  +  ++  +              +PQ  P PP  P G   + G 
Sbjct: 275 QGPPPPGKPQGPPPQGGSKSRSSRSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQGGN 334

Query: 215 RAQRPGRKRSPQSAPAESGELSIDREE--GGPRQAPQGAGHALEG 257
           + Q P     PQ  P + G  S       G P+  PQ  G+  +G
Sbjct: 335 KPQGPPPPGKPQGPPPQGGSKSRSARSPPGKPQGPPQQEGNNPQG 379



 Score = 35.4 bits (80), Expect = 0.052
 Identities = 59/249 (23%), Positives = 79/249 (31%), Gaps = 46/249 (18%)

Query: 4   GSGPRGSPPGCKAALWSTLGGSMQQSAPTGQWGIRADSSPRRRTVRTAQPKERSPLRRGS 63
           G+ P+G PP           G  Q   P G  G +    P     +   P+  +  +   
Sbjct: 188 GNQPQGPPPP---------PGKPQGPPPQG--GNKPQGPPPPGKPQGPPPQGDNKSQSAR 236

Query: 64  CARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHSSTSQKDT 123
              G+ + P P GG    G  P P      G PQ    +G    +G    G     Q   
Sbjct: 237 SPPGKPQGPPPQGGNQPQGPPPPP------GKPQGPPPQGGNKPQGPPPPGKP---QGPP 287

Query: 124 LSKGSAAAATAVAAGKHLAVPETRG----GVPARRRETANPPGPGRARPGGSPFGHRPLL 179
              GS + ++    GK    P   G    G P    +   PP  G  +P G P   +P  
Sbjct: 288 PQGGSKSRSSRSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQGGNKPQGPPPPGKPQG 347

Query: 180 QPRQIFA---------------PLQAITNQVRPQ-------QRPEPPPAGSAARSGRRAQ 217
            P Q  +               P Q   N   P        Q+P+ PPAG      R  Q
Sbjct: 348 PPPQGGSKSRSARSPPGKPQGPPQQEGNNPQGPPPPAGGNPQQPQAPPAGQPQGPPRPPQ 407

Query: 218 RPGRKRSPQ 226
                R PQ
Sbjct: 408 GGRPSRPPQ 416



 Score = 34.3 bits (77), Expect = 0.12
 Identities = 43/170 (25%), Positives = 53/170 (31%), Gaps = 12/170 (7%)

Query: 83  VSPRPSAASATGPPQDRAARGCELREGRAEAGHSSTSQKDTLSKGSAAAATAVAAGKHLA 142
           VS   S +   G PQ    +G    +G          Q      G+         GK   
Sbjct: 23  VSQEESPSLIAGNPQGAPPQGGNKPQGPPSP--PGKPQGPPPQGGNQPQGPPPPPGKPQG 80

Query: 143 VPETRGGVPARRRETANPPGPGRARPGGSPFGHRPLLQPRQIFAPLQAITNQVRPQQRPE 202
            P   G  P    +   PPG  +  P        P   P +   P     NQ  PQ  P 
Sbjct: 81  PPPQGGNKP----QGPPPPGKPQGPPPQGDKSRSPRSPPGKPQGPPPQGGNQ--PQGPPP 134

Query: 203 PP--PAGSAARSGRRAQRPGRKRSPQSAPAESGELSIDREEGGPRQAPQG 250
           PP  P G   + G + Q P     PQ  P +    S  R    P   PQG
Sbjct: 135 PPGKPQGPPPQGGNKPQGPPPPGKPQGPPPQGDNKS--RSSRSPPGKPQG 182



 Score = 31.6 bits (70), Expect = 0.76
 Identities = 29/96 (30%), Positives = 34/96 (35%), Gaps = 7/96 (7%)

Query: 159 NPPGPGRARPGGSPFGHRPLLQPRQIFAPLQAITNQVRPQQRPEPP--PAGSAARSGRRA 216
           NP G   A P G      P   P +   P     NQ  PQ  P PP  P G   + G + 
Sbjct: 35  NPQG---APPQGGNKPQGPPSPPGKPQGPPPQGGNQ--PQGPPPPPGKPQGPPPQGGNKP 89

Query: 217 QRPGRKRSPQSAPAESGELSIDREEGGPRQAPQGAG 252
           Q P     PQ  P +  +    R   G  Q P   G
Sbjct: 90  QGPPPPGKPQGPPPQGDKSRSPRSPPGKPQGPPPQG 125


>gi|58761548 tau tubulin kinase 1 [Homo sapiens]
          Length = 1321

 Score = 47.0 bits (110), Expect = 2e-05
 Identities = 59/208 (28%), Positives = 81/208 (38%), Gaps = 38/208 (18%)

Query: 39   ADSSPRRRTVRTAQPKERSPLRRGSCARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQD 98
            A  SPRR  +  ++P+ R P+       G + P G    + RW    RP         QD
Sbjct: 1038 ATISPRRHAMPGSRPRSRIPVLLSEEDTGSE-PSGSLSAKERWSKRARPQ--------QD 1088

Query: 99   RAARGCELREGR-----AEAGHSSTSQK-----DTLSKGSAAAATAVAAGKHLAVPETRG 148
             A    E R+GR     A    SS+S++     +TLS G+ +     A+    A+P   G
Sbjct: 1089 LARLVMEKRQGRLLLRLASGASSSSSEEQRRASETLS-GTGSEEDTPASEPAAALPRKSG 1147

Query: 149  GVPARRRETANP-----PGPGRAR-PGGSPFGHRPLLQPRQIFAPLQAITNQVRPQQRPE 202
               A R     P     P P  A+ P     G  P L          AIT++++ Q    
Sbjct: 1148 RAAATRSRIPRPIGLRMPMPVAAQQPASRSHGAAPALDT--------AITSRLQLQT--- 1196

Query: 203  PPPAGSAARSGRRAQRPGRKRSPQSAPA 230
             PP  + A   R  Q PGR   P  A A
Sbjct: 1197 -PPGSATAADLRPKQPPGRGLGPGRAQA 1223



 Score = 34.3 bits (77), Expect = 0.12
 Identities = 54/214 (25%), Positives = 74/214 (34%), Gaps = 45/214 (21%)

Query: 51   AQPKERSPLRRGSCARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGR 110
            ++P    P + G  A    R P P G +      P P AA      Q  A+R      G 
Sbjct: 1136 SEPAAALPRKSGRAAATRSRIPRPIGLRM-----PMPVAA------QQPASRS----HGA 1180

Query: 111  AEAGHSSTSQKDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRAR-PG 169
            A A  ++ + +  L     +A  A    K    P  RG  P R +  A PP P   R P 
Sbjct: 1181 APALDTAITSRLQLQTPPGSATAADLRPKQ---PPGRGLGPGRAQAGARPPAPRSPRLPA 1237

Query: 170  GSPFGHRPLLQPRQIFAPLQAITNQVRP----QQRP-EPPPAGSAARSGRRAQRPGRKRS 224
             +         PR      Q+++ +  P    Q RP  PPP G           P  +  
Sbjct: 1238 STSAARNASASPRS-----QSLSRRESPSPSHQARPGVPPPRGV----------PPARAQ 1282

Query: 225  PQSAPAESGELSIDREEGGPRQAPQGAGHALEGR 258
            P   P+  G       + GPR   Q      +GR
Sbjct: 1283 PDGTPSPGG------SKKGPRGKLQAQRATTKGR 1310


>gi|116256464 hypothetical protein LOC343990 [Homo sapiens]
          Length = 962

 Score = 47.0 bits (110), Expect = 2e-05
 Identities = 70/261 (26%), Positives = 91/261 (34%), Gaps = 25/261 (9%)

Query: 2   DRGSGPRGS----PPGCKAALWSTLGGSMQQSAPTGQWGIRA-DSSPRRRTVRTAQPKER 56
           DRG  P       PPG   A  + L  S   +A   + G+   D S R  T   A+P+  
Sbjct: 288 DRGPEPGPPAPLPPPGGARARRARLQHSSALTASVEEGGVPGEDPSSRPATPELAEPESA 347

Query: 57  SPLRRGSCARGEKRP-PGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGH 115
             LR    +  E  P PGP GG+      P    A AT    D+A       E  A    
Sbjct: 348 PTLRVEPPSPPEGPPNPGPDGGKQDGEAPPAGPCAPAT----DKAEEVVCAPEDVASPFP 403

Query: 116 SSTSQKDTLSKGSAAAATAVAA----GKHLAVP-ETRGGVPARRRETANPPGPGRARPGG 170
           ++  + DT    +  AAT+ A     G   +VP E     P    E   PPGP       
Sbjct: 404 TAIPEGDTTPPETDPAATSEAPSARDGPERSVPKEAEPTPPVLPDEEKGPPGPAP----- 458

Query: 171 SPFGHRPLLQPRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSP----- 225
            P         R      + I  +      P PP   S  +    A   G   SP     
Sbjct: 459 EPEREAETEPERGAGTEPERIGTEPSTAPAPSPPAPKSCLKHRPAAASEGPAASPPLAAA 518

Query: 226 QSAPAESGELSIDREEGGPRQ 246
           +S P E G  S+D E   P +
Sbjct: 519 ESPPVEPGPGSLDAEAAAPER 539



 Score = 35.4 bits (80), Expect = 0.052
 Identities = 47/183 (25%), Positives = 62/183 (33%), Gaps = 32/183 (17%)

Query: 85  PRPSAASATGPPQDRAARGCELREGRAEAGHSSTSQKDTLSKGSAAA-ATAVAAGKHLAV 143
           P P   +   PP    AR       RA   HSS         G      ++  A   LA 
Sbjct: 291 PEPGPPAPLPPPGGARAR-------RARLQHSSALTASVEEGGVPGEDPSSRPATPELAE 343

Query: 144 PETRGGVPARRRETANPP-GPGRARP-GGSPFGHRP---------------LLQPRQIFA 186
           PE+    P  R E  +PP GP    P GG   G  P               +  P  + +
Sbjct: 344 PES---APTLRVEPPSPPEGPPNPGPDGGKQDGEAPPAGPCAPATDKAEEVVCAPEDVAS 400

Query: 187 PLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSPQSAPAESGELSIDREEGGPRQ 246
           P         P+    PP    AA S   + R G +RS       +  +  D E+G P  
Sbjct: 401 PFPTAI----PEGDTTPPETDPAATSEAPSARDGPERSVPKEAEPTPPVLPDEEKGPPGP 456

Query: 247 APQ 249
           AP+
Sbjct: 457 APE 459


>gi|117306167 proline-rich protein BstNI subfamily 3 precursor [Homo
           sapiens]
          Length = 309

 Score = 47.0 bits (110), Expect = 2e-05
 Identities = 63/261 (24%), Positives = 83/261 (31%), Gaps = 38/261 (14%)

Query: 7   PRGSPPGCKAALWSTLGGSMQQS---------APTGQWGIRADSSPRRRTVRTAQPKERS 57
           P G PP          GG+  Q           P  Q G ++   P R      QP +  
Sbjct: 57  PEGRPPQ---------GGNQSQGPPPRPGKPEGPPPQGGNQSQGPPPRPGKPEGQPPQGG 107

Query: 58  PLRRGSCAR-GEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHS 116
              +G   R G+   P P GG    G  PRP       P     ++G   R G+ E    
Sbjct: 108 NQSQGPPPRPGKPEGPPPQGGNQSQGPPPRPGKPEGPPPQGGNQSQGPPPRPGKPEG--- 164

Query: 117 STSQKDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRARPGGSPFGHR 176
              Q    S+G                 +++G  P   +    PP  G    G  P   +
Sbjct: 165 PPPQGGNQSQGPPPHPGKPEGPPPQGGNQSQGPPPRPGKPEGPPPQGGNQSQGPPPRPGK 224

Query: 177 PLLQPRQIFAPLQAITNQVRPQQRPEPP--PAGSAARSGRRAQRPGRKRSPQSAPAESG- 233
           P   P Q            +PQ  P  P  P G   + G + QRP     PQ  P   G 
Sbjct: 225 PEGPPSQ---------GGNKPQGPPPHPGKPQGPPPQEGNKPQRPPPPGRPQGPPPPGGN 275

Query: 234 -ELSIDREEG---GPRQAPQG 250
            +  +    G   GP   PQG
Sbjct: 276 PQQPLPPPAGKPQGPPPPPQG 296



 Score = 45.8 bits (107), Expect = 4e-05
 Identities = 60/238 (25%), Positives = 76/238 (31%), Gaps = 58/238 (24%)

Query: 3   RGSGPRGSPPGCKAALWSTLGGSMQQS---------APTGQWGIRADSSPRRRTVRTAQP 53
           R   P G PP          GG+  Q           P  Q G ++   P R       P
Sbjct: 116 RPGKPEGPPPQ---------GGNQSQGPPPRPGKPEGPPPQGGNQSQGPPPRPGKPEGPP 166

Query: 54  KERSPLRRGSCAR-GEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAE 112
            +     +G     G+   P P GG    G  PRP       P     ++G   R G+ E
Sbjct: 167 PQGGNQSQGPPPHPGKPEGPPPQGGNQSQGPPPRPGKPEGPPPQGGNQSQGPPPRPGKPE 226

Query: 113 AGHSSTSQKDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRAR----P 168
              S          G+         GK    P   G  P R      PP PGR +    P
Sbjct: 227 GPPS--------QGGNKPQGPPPHPGKPQGPPPQEGNKPQR------PPPPGRPQGPPPP 272

Query: 169 GGSPFGHRPLLQPRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSPQ 226
           GG+P   +PL  P              +PQ  P PP  G       R  RP + + PQ
Sbjct: 273 GGNP--QQPLPPPAG------------KPQGPPPPPQGG-------RPHRPPQGQPPQ 309



 Score = 42.0 bits (97), Expect = 6e-04
 Identities = 55/238 (23%), Positives = 78/238 (32%), Gaps = 26/238 (10%)

Query: 20  STLGGSMQQSAPTGQWGIRADSSPRRRTVRTAQPKERSPLRRGSCARGEKRP-----PGP 74
           S + G  +   P G       + P+R      +P+ R P           RP     P P
Sbjct: 30  SVISGKPEGRRPQG------GNQPQRTPPPPGKPEGRPPQGGNQSQGPPPRPGKPEGPPP 83

Query: 75  TGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHSSTSQKDTLSKGSAAAATA 134
            GG    G  PRP       P     ++G   R G+ E       Q    S+G       
Sbjct: 84  QGGNQSQGPPPRPGKPEGQPPQGGNQSQGPPPRPGKPE---GPPPQGGNQSQGPPPRPGK 140

Query: 135 VAAGKHLAVPETRGGVPARRRETANPPGPGRARPGGSPFGHRPLLQPRQIFAPLQAITNQ 194
                     +++G  P   +    PP  G    G  P   +P   P Q     Q+    
Sbjct: 141 PEGPPPQGGNQSQGPPPRPGKPEGPPPQGGNQSQGPPPHPGKPEGPPPQ--GGNQSQGPP 198

Query: 195 VRPQQRPEPPPAGSAARSGRRAQRPGRKRSPQSAPAESGELSIDREEGGPRQ--APQG 250
            RP +   PPP G     G     P R   P+  P++ G    ++ +G P     PQG
Sbjct: 199 PRPGKPEGPPPQGGNQSQG----PPPRPGKPEGPPSQGG----NKPQGPPPHPGKPQG 248


>gi|113416996 PREDICTED: ankyrin repeat domain 33B [Homo sapiens]
          Length = 1006

 Score = 46.6 bits (109), Expect = 2e-05
 Identities = 80/284 (28%), Positives = 106/284 (37%), Gaps = 59/284 (20%)

Query: 2   DRGSGPRGSPPGCKAALWST---LGGSMQQSAPTGQWGIRADSSPRRRTVRTAQPKERSP 58
           D GS P  S  GC     +T   + GS Q S           S  R R   T   ++  P
Sbjct: 263 DVGSAPSRSLRGCSLRFRNTPPPIPGSDQPS-----------SHGRPRFRETGDSEDCPP 311

Query: 59  LRRGSCARGEKRP----PGP-----TGGQWRWGVSPRPSAASATGPPQDRAARGCELREG 109
            R G+  +    P    PGP     TG +W   + PRP+   A  P    A++       
Sbjct: 312 FRAGTPPQQPTAPSSEWPGPQRSPGTGPRWTAPL-PRPAPLPAKRPGDSPASK------- 363

Query: 110 RAEAGHSSTSQKDTLSKGSAAAATAVAAGK----HLAVPETRGG--VPARRRETANP--- 160
           R   G  S   +D   KG     ++ AAG+     +  P+  GG  +  + RE   P   
Sbjct: 364 RPRPGGVSPEARDP-PKGENPLCSSEAAGETGPTQVRGPDALGGGRLQGKGREGKAPGEG 422

Query: 161 --PGPGRARPGGSPFGHRP--LLQPRQIFAPLQAITNQVRPQ-QRPEPPPAGSA-ARSGR 214
             P P   R G  P    P  L QP + F   ++    V  + +R     +G A ARSG 
Sbjct: 423 VSPRPAPPRLGKPPRSRPPQGLAQPPETFFFERSRLVPVSAELERAGHGFSGDAEARSGD 482

Query: 215 RAQRPGRKRSPQSAPAESGELSIDREEGGPRQAPQGAGHALEGR 258
              RP  + S  SAPA             P Q PQ A H   GR
Sbjct: 483 LGARPASRSSLPSAPAP------------PAQQPQAARHGAAGR 514


>gi|239508698 PREDICTED: similar to mucin [Homo sapiens]
          Length = 265

 Score = 46.6 bits (109), Expect = 2e-05
 Identities = 56/233 (24%), Positives = 82/233 (35%), Gaps = 48/233 (20%)

Query: 27  QQSAPTGQWGIRADSSPRRRTVRTAQPKERSPLRRGSCARGEKRPPGPTGGQWRWGVSPR 86
           QQ A   +    A  +P     RT + K + P  R +   G++R  G +    + G   R
Sbjct: 79  QQRAQDREEEAAAAPAPTSSGHRTEKRKPQQPQCRPAGGTGQRRGSGSSPSADQQGAQDR 138

Query: 87  PSAASATGPPQDRAARGCELREGRAEAGHSSTSQKDTLSKGSAAAATAVAAGKHLAVPET 146
              A+A   P  R              GH +  +K    +   AA T    G   +    
Sbjct: 139 EEEAAAAPAPTSR--------------GHRTEKRKPQQPQRRPAAGTGQRRGSRSSPSAD 184

Query: 147 RGGVPARRRETANPPGPGRARPGGSPFGHRPLLQPRQIFAPLQAITNQVRPQQRPEPPPA 206
           + G   R  E A  P P       +  GHR   + RQ            +PQ RP     
Sbjct: 185 QQGAQDREEEAAAAPVP-------TSRGHRTEKRKRQ------------QPQCRP----- 220

Query: 207 GSAARSGRRAQRPGRKRSPQSAPAESGELSIDR-EEGGPRQAPQGAGHALEGR 258
                    A   G++R  +S+P+   + + DR EE     AP  +GH  E R
Sbjct: 221 ---------AAGTGQRRGSRSSPSADQQRAQDREEEAAAAPAPTSSGHRTEKR 264



 Score = 42.7 bits (99), Expect = 3e-04
 Identities = 53/240 (22%), Positives = 85/240 (35%), Gaps = 46/240 (19%)

Query: 26  MQQSAPTGQWGIRADSSPRRRTVRTAQPKERSPLRRGSCARGEKRPPGPTGGQWRWGVSP 85
           +++S+   QW     + P R         ++ P  + +   G++R  G +    + G   
Sbjct: 15  LRESSEGDQWLENEKTKPLR--------PQQQPQCQPAGGTGQRRGSGSSPSADQQGAQD 66

Query: 86  RPSAASATGPPQDRAARGCELREGRAEA----GHSSTSQKDTLSKGSAAAATAVAAGKHL 141
           R   A+A      + A+  E     A A    GH +  +K    +   A  T    G   
Sbjct: 67  REEEAAAAPAADQQRAQDREEEAAAAPAPTSSGHRTEKRKPQQPQCRPAGGTGQRRGSGS 126

Query: 142 AVPETRGGVPARRRETANPPGPGRARPGGSPFGHRPLLQPRQIFAPLQAITNQVRPQQRP 201
           +    + G   R  E A  P P       +  GHR               T + +PQQ  
Sbjct: 127 SPSADQQGAQDREEEAAAAPAP-------TSRGHR---------------TEKRKPQQPQ 164

Query: 202 EPPPAGSAARSGRRAQRPGRKRSPQSAPAESGELSIDR-EEGGPRQAPQGAGHALEGRGR 260
             P AG+           G++R  +S+P+   + + DR EE      P   GH  E R R
Sbjct: 165 RRPAAGT-----------GQRRGSRSSPSADQQGAQDREEEAAAAPVPTSRGHRTEKRKR 213


>gi|37537692 proline-rich protein BstNI subfamily 4 precursor [Homo
           sapiens]
          Length = 247

 Score = 46.6 bits (109), Expect = 2e-05
 Identities = 48/185 (25%), Positives = 67/185 (36%), Gaps = 14/185 (7%)

Query: 67  GEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHSSTSQKDTLSK 126
           G+ + P P GG    G  P P      G P+ R  +G    +G     H    ++     
Sbjct: 55  GKPQGPPPQGGNQSQGPPPPP------GKPEGRPPQGGNQSQGPPP--HPGKPERPPPQG 106

Query: 127 GSAAAATAVAAGKHLAVPETRGGVPARRRETANPPG-PGRARPGGSPFGHRPLLQPRQIF 185
           G+ +  T    GK    P  +GG  + R     PPG P R  P G      P   P +  
Sbjct: 107 GNQSQGTPPPPGKPER-PPPQGGNQSHRPPP--PPGKPERPPPQGGNQSQGPPPHPGKPE 163

Query: 186 APLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSPQSAPAESGELSIDREEGGPR 245
            P     N+ R  + P   P G   + G + Q P     PQ  P   G  +  + +  P 
Sbjct: 164 GPPPQEGNKSRSARSPPGKPQGPPQQEGNKPQGPPPPGKPQGPPPAGG--NPQQPQAPPA 221

Query: 246 QAPQG 250
             PQG
Sbjct: 222 GKPQG 226



 Score = 39.3 bits (90), Expect = 0.004
 Identities = 44/171 (25%), Positives = 61/171 (35%), Gaps = 30/171 (17%)

Query: 55  ERSPLRRGSCARGEKRPPG------PTGGQWRWGVSPRPSAASATGPPQDRAARGCELRE 108
           ER P + G+ ++G   PPG      P GG       P P       P     ++G     
Sbjct: 100 ERPPPQGGNQSQGTPPPPGKPERPPPQGGNQSHRPPPPPGKPERPPPQGGNQSQGPPPHP 159

Query: 109 GRAEAGHSSTSQKDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRARP 168
           G+ E             +G+ + +     GK    P+  G  P        PP PG+ + 
Sbjct: 160 GKPEGPPPQ--------EGNKSRSARSPPGKPQGPPQQEGNKPQ------GPPPPGKPQ- 204

Query: 169 GGSPFGHRPLLQPRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRP 219
           G  P G  P  QP+   AP        +PQ  P PP  G   R  +  Q P
Sbjct: 205 GPPPAGGNPQ-QPQ---APPAG-----KPQGPPPPPQGGRPPRPAQGQQPP 246



 Score = 37.7 bits (86), Expect = 0.011
 Identities = 50/208 (24%), Positives = 72/208 (34%), Gaps = 36/208 (17%)

Query: 41  SSPRRRTVRTAQPKERSPLRRGSCARGEKRPPG------PTGGQWRWGVSPRPSAASATG 94
           + P+R      +P+   P + G+ ++G   PPG      P GG    G  P P       
Sbjct: 45  NQPQRPPPPPGKPQGPPP-QGGNQSQGPPPPPGKPEGRPPQGGNQSQGPPPHPGKPERPP 103

Query: 95  PPQDRAARGCELREGRAE-----AGHSSTSQKDTLSK--------GSAAAATAVAAGKHL 141
           P     ++G     G+ E      G+ S        K        G+ +       GK  
Sbjct: 104 PQGGNQSQGTPPPPGKPERPPPQGGNQSHRPPPPPGKPERPPPQGGNQSQGPPPHPGKPE 163

Query: 142 AVPETRGGVPARRRETANPPGPGRARP---GGSPFGHRPLLQPRQIFAPLQAITNQVRPQ 198
             P   G    + R   +PPG  +  P   G  P G  P  +P+    P  A  N  +PQ
Sbjct: 164 GPPPQEGN---KSRSARSPPGKPQGPPQQEGNKPQGPPPPGKPQ---GPPPAGGNPQQPQ 217

Query: 199 -------QRPEPPPAGSAARSGRRAQRP 219
                  Q P PPP G       + Q+P
Sbjct: 218 APPAGKPQGPPPPPQGGRPPRPAQGQQP 245


>gi|39930517 sterile alpha motif domain containing 1 [Homo sapiens]
          Length = 538

 Score = 46.2 bits (108), Expect = 3e-05
 Identities = 53/182 (29%), Positives = 62/182 (34%), Gaps = 43/182 (23%)

Query: 43  PRRRTVRTAQPKERSPLRRGSCARGEKRPPGPTGGQWRWGVS-------PRPSAASATGP 95
           PRR     A P  R+P    + A     PP P        V+       PR +AA+AT P
Sbjct: 102 PRRGATPPAPP--RAPRGAPAAAAAAAPPPTPAPPPPPAPVAAAAPARAPRAAAAAATAP 159

Query: 96  PQDRAARGCELREGRAEAGHSSTSQKDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRR 155
           P                            S G A           LA P      PA   
Sbjct: 160 P----------------------------SPGPAQPGPRAQRAAPLAAPPP---APAAPP 188

Query: 156 ETANPPGPGRARPGGSPFGHRPLLQPRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRR 215
             A P GP RA P        PL  P Q  AP Q    Q  P  +P+PPP G A R+G  
Sbjct: 189 AVAPPAGPRRAPPPAVAAREPPLPPPPQPPAPPQ---QQQPPPPQPQPPPEGGAVRAGGA 245

Query: 216 AQ 217
           A+
Sbjct: 246 AR 247


>gi|91208420 bassoon protein [Homo sapiens]
          Length = 3926

 Score = 45.8 bits (107), Expect = 4e-05
 Identities = 39/148 (26%), Positives = 54/148 (36%), Gaps = 10/148 (6%)

Query: 91   SATGPPQDRAARGCELREGRAEAGHSSTSQKDTLSKGSAAAATAVAAGKHLAVPETRGGV 150
            +A GP Q ++    ++  G A     +  Q+  L       A   A  +  + P TRG  
Sbjct: 3752 AAPGPQQSQSPSSRQIPSGAASRQPQTQQQQQGLGLQPPQQALTQARLQQQSQPTTRGSA 3811

Query: 151  PARRRETANP-PGPGRA---RPGGSPFGHRPLLQPRQIFAPLQAITNQVRPQQRPEPPPA 206
            PA  +    P PGP  A   +P G P   +         AP Q    Q +P   P P   
Sbjct: 3812 PAASQPAGKPQPGPSTATGPQPAGPPRAEQTNGSKGTAKAPQQGRAPQAQPAPGPGPAGV 3871

Query: 207  GSAARSGRRAQRPGRKRSPQSAPAESGE 234
             + AR G      G   +P   P   GE
Sbjct: 3872 KAGARPG------GTPGAPAGQPGADGE 3893



 Score = 39.7 bits (91), Expect = 0.003
 Identities = 56/210 (26%), Positives = 84/210 (40%), Gaps = 30/210 (14%)

Query: 49   RTAQPKERSPLRRGSCARGEKRP---PGPTGGQWRWGVSPRPSAASATGPPQDRAARGCE 105
            R A+P  R         R E RP   P       + G    PS+A  + P +  +A    
Sbjct: 3665 RAAKPHARD------LGRHEARPHSQPSSAPAMPKKGQPGYPSSAEYSQPSRASSAYH-- 3716

Query: 106  LREGRAEAGHSSTSQKDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGR 165
                     H+S S+K +    S  AA    A    A P+ +G   A   + +  P   R
Sbjct: 3717 ---------HASDSKKGSRQAHSGPAALQSKAEPQ-AQPQLQGRQAAPGPQQSQSPS-SR 3765

Query: 166  ARPGGSPFGHRPLLQPRQ----IFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGR 221
              P G+    +P  Q +Q    +  P QA+T Q R QQ+ +P   GSA  + + A +P  
Sbjct: 3766 QIPSGAA-SRQPQTQQQQQGLGLQPPQQALT-QARLQQQSQPTTRGSAPAASQPAGKPQP 3823

Query: 222  KRSPQSAPAESGELSIDREEG--GPRQAPQ 249
              S  + P  +G    ++  G  G  +APQ
Sbjct: 3824 GPSTATGPQPAGPPRAEQTNGSKGTAKAPQ 3853



 Score = 31.2 bits (69), Expect = 0.99
 Identities = 27/103 (26%), Positives = 40/103 (38%), Gaps = 3/103 (2%)

Query: 161 PGPGRARPGGSPFGHRPLLQPRQIFAPLQAITNQVRPQQRPEPPPA-GSAARSGRRAQRP 219
           PGPG  +P  +P G   L  P    A   A+     P   P P P  GS +R     +  
Sbjct: 29  PGPGAGKPPSAPAGGGQL--PAAGAARSTAVPPVPGPGPGPGPGPGPGSTSRRLDPKEPL 86

Query: 220 GRKRSPQSAPAESGELSIDREEGGPRQAPQGAGHALEGRGRVI 262
           G +R+    P ++   +   E     +A   AG   +G  R +
Sbjct: 87  GNQRAASPTPKQASATTPGHESPRETRAQGPAGQEADGPRRTL 129


>gi|110832843 TBP-associated factor 4 [Homo sapiens]
          Length = 1085

 Score = 45.4 bits (106), Expect = 5e-05
 Identities = 63/268 (23%), Positives = 86/268 (32%), Gaps = 37/268 (13%)

Query: 23  GGSMQQSAPTGQWGIRADSSPRRRTVRTAQPKERSPLRRGSCA----------------R 66
           GG  Q+  P         + P     +   P E S    GSCA                 
Sbjct: 99  GGGPQRPGPPSPRRPLVPAGPAPPAAKLRPPPEGSA---GSCAPVPAAAAVAAGPEPAPA 155

Query: 67  GEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHSSTSQKDTLSK 126
           G  +P GP     R G  P P      GP   + A       G A+  + S +  ++   
Sbjct: 156 GPAKPAGPAALAARAGPGPGPGPGPGPGPGPGKPA-----GPGAAQTLNGSAALLNS-HH 209

Query: 127 GSAAAATAVAAGKHLAVPETRGGVPARRRET------ANPPGPGRARPGGSPFGHRPLLQ 180
            +A A + V  G    +P  +   P    +T      A PP P    P  +P    P   
Sbjct: 210 AAAPAVSLVNNGPAALLPLPKPAAPGTVIQTPPFVGAAAPPAPAAPSPPAAPAPAAPAAA 269

Query: 181 PRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRK--RSPQSAPAESGELSID 238
           P     P  A     RP   P  PP  + A     A + G     +P  APA  G   + 
Sbjct: 270 PP---PPPPAPATLARPPGHPAGPPTAAPAVPPPAAAQNGGSAGAAPAPAPAAGGPAGVS 326

Query: 239 REEG-GPRQAPQGAGHALEGRGRVINGS 265
            + G G   A    G   E   RV+  +
Sbjct: 327 GQPGPGAAAAAPAPGVKAESPKRVVQAA 354



 Score = 41.2 bits (95), Expect = 0.001
 Identities = 44/154 (28%), Positives = 53/154 (34%), Gaps = 23/154 (14%)

Query: 131 AATAVAAGKHLAVPETRGGV---PARRRETA-----NPPGPGRARPGGSPFGHRPLLQPR 182
           AA A A G H+      G     PA   E A      PP  GRARPGG          PR
Sbjct: 52  AAAAGALGNHVVSGSPAGAAGAGPAAPAEGAPGAAPEPPPAGRARPGGGGPQRPGPPSPR 111

Query: 183 QIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSPQSAPA----ESGELSID 238
           +   P         P  +  PPP GSA               P+ APA     +G  ++ 
Sbjct: 112 RPLVP----AGPAPPAAKLRPPPEGSAGSCAPVPAAAAVAAGPEPAPAGPAKPAGPAALA 167

Query: 239 REEG-------GPRQAPQGAGHALEGRGRVINGS 265
              G       GP   P     A  G  + +NGS
Sbjct: 168 ARAGPGPGPGPGPGPGPGPGKPAGPGAAQTLNGS 201


>gi|116256356 alpha 4 type IV collagen precursor [Homo sapiens]
          Length = 1690

 Score = 45.4 bits (106), Expect = 5e-05
 Identities = 65/253 (25%), Positives = 84/253 (33%), Gaps = 42/253 (16%)

Query: 6    GPRGSP--PGCKAALWSTLGGSMQQSAP--TGQWGIRADSSPRRRTVRTAQPKERS-PLR 60
            GP+G P  PGC        G S +Q  P   G  G      P   +     P +   P  
Sbjct: 1085 GPKGEPGSPGCPGHF----GASGEQGLPGIQGPRGSPGRPGPPGSSGPPGCPGDHGMPGL 1140

Query: 61   RGSCARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHSSTSQ 120
            RG    GE   PGP G Q   G+   P     +G P      G + ++G           
Sbjct: 1141 RGQ--PGEMGDPGPRGLQGDPGIPGPPGIKGPSGSPGLNGLHGLKGQKG----------- 1187

Query: 121  KDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRARPGGSPFGHR-PLL 179
                        T  A+G H   P    G+P  + E  +P  PG + PG  P G + P  
Sbjct: 1188 ------------TKGASGLHDVGPPGPVGIPGLKGERGDPGSPGISPPG--PRGKKGPPG 1233

Query: 180  QPRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSPQSAPAESGELSIDR 239
             P     P  A      P+  P+P P G     G     P   R     P   G + + R
Sbjct: 1234 PPGSSGPPGPAGATGRAPKDIPDPGPPGDQGPPG-----PDGPRGAPGPPGLPGSVDLLR 1288

Query: 240  EEGGPRQAPQGAG 252
             E G    P   G
Sbjct: 1289 GEPGDCGLPGPPG 1301



 Score = 30.0 bits (66), Expect = 2.2
 Identities = 31/111 (27%), Positives = 36/111 (32%), Gaps = 7/111 (6%)

Query: 67  GEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHSSTSQKDTLSK 126
           G +  PG +G     G    P  A   GPP  R   G     G  E G S          
Sbjct: 692 GPQGAPGLSGSDGHKGRPGTPGTAEIPGPPGFRGDMGDPGFGG--EKGSSPVGPPGPPGS 749

Query: 127 GSAAAATAV---AAGKHLAVPETRG--GVPARRRETANPPGPGRARPGGSP 172
                   +    A  HL  P  RG  GVP  +    +P  PG   P G P
Sbjct: 750 PGVNGQKGIPGDPAFGHLGPPGKRGLSGVPGIKGPRGDPGCPGAEGPAGIP 800



 Score = 29.6 bits (65), Expect = 2.9
 Identities = 55/211 (26%), Positives = 66/211 (31%), Gaps = 38/211 (18%)

Query: 67   GEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRA--EAGHSSTSQKDTL 124
            GE+  PG  G     G    P     +G P DR  RG +   G    E   +  SQK T 
Sbjct: 918  GERGKPGAEGCP---GAKGEPGEKGMSGLPGDRGLRGAKGAIGPPGDEGEMAIISQKGTP 974

Query: 125  SKGSAAAATAVAAGKHLAVPETRG-----GVPARRRETA--NPPGPGRARPG--GSPFGH 175
             +                 P  RG     G+  RR E     PPG  R  PG  G P   
Sbjct: 975  GEPGPPGDD--------GFPGERGDKGTPGMQGRRGEPGRYGPPGFHRGEPGEKGQPGPP 1026

Query: 176  RPLLQPRQI----FAPLQAITNQVRPQQRPEPP----------PAGSAARSGRRAQRPGR 221
             P   P       F     +         P PP          P G+          PG 
Sbjct: 1027 GPPGPPGSTGLRGFIGFPGLPGDQGEPGSPGPPGFSGIDGARGPKGNKGDPASHFGPPGP 1086

Query: 222  KRSPQSAPAESGELSIDREEGGPR-QAPQGA 251
            K  P S P   G      E+G P  Q P+G+
Sbjct: 1087 KGEPGS-PGCPGHFGASGEQGLPGIQGPRGS 1116



 Score = 28.5 bits (62), Expect = 6.4
 Identities = 32/115 (27%), Positives = 43/115 (37%), Gaps = 8/115 (6%)

Query: 67  GEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHSSTSQKDTLSK 126
           G K  PGP G +   G+   P    A+GPP ++ A+G ++   R + GH      D    
Sbjct: 524 GTKGDPGPPGAEGPPGL---PGKHGASGPPGNKGAKG-DMVVSRVK-GHKGERGPDG-PP 577

Query: 127 GSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRARPGGSPFGHRPLLQP 181
           G      +     H       G  P    E A P G G   P G P    P+  P
Sbjct: 578 GFPGQPGSHGRDGHAGEKGDPG--PPGDHEDATPGGKGFPGPLGPPGKAGPVGPP 630


>gi|33667117 MICAL-like 2 isoform 1 [Homo sapiens]
          Length = 904

 Score = 45.4 bits (106), Expect = 5e-05
 Identities = 67/249 (26%), Positives = 91/249 (36%), Gaps = 53/249 (21%)

Query: 4   GSGPRGSPPGCKAALWSTLGGSMQQSAPTGQWGIRADSSPRRRTVRTAQPKERSPLRRG- 62
           G   R  PP    A  ST   S   + P       A+SS   R    ++PK  +P+ +G 
Sbjct: 509 GLPSRMEPP----APLSTSSTSQASALPPAGRRNLAESSGVGRVGAGSRPKPEAPMAKGK 564

Query: 63  --------SCARGEKRPPGPTGGQWRWGVSP--RPSAASATGPPQDRAARGCELREGRAE 112
                   S +  E +  GP G  WR  + P  R S A  T  P++  A    L E RA 
Sbjct: 565 STTLTQDMSTSLQEGQEDGPAG--WRANLKPVDRRSPAERTLKPKEPRA----LAEPRAG 618

Query: 113 AGHSSTSQKDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRARPGGSP 172
                 S       GS A +  +               P R   T  P  PG + P  SP
Sbjct: 619 EAPRKVS-------GSFAGSVHITL------------TPVRPDRTPRPASPGPSLPARSP 659

Query: 173 FGHRPLLQPRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQR-------PGRKRSP 225
               P  +   + A L    N +RP    EPP   +  +S +  ++       PGR  SP
Sbjct: 660 --SPPRRRRLAVPASLDVCDNWLRP----EPPGQEARVQSWKEEEKKPHLQGKPGRPLSP 713

Query: 226 QSAPAESGE 234
            + PA  GE
Sbjct: 714 ANVPALPGE 722


>gi|239755873 PREDICTED: hypothetical protein [Homo sapiens]
          Length = 577

 Score = 44.7 bits (104), Expect = 9e-05
 Identities = 62/210 (29%), Positives = 79/210 (37%), Gaps = 21/210 (10%)

Query: 61  RGSCARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRA--ARGCELREGRAEAGHSST 118
           RG  A   KR      GQW+   SP P+ A A   PQ R     GC     R E  H + 
Sbjct: 350 RGFDAAVRKRKSPALSGQWQ-SPSPPPALAGAPPTPQQRQNFGPGCAGLSPRPEGPHGAG 408

Query: 119 S-QKDTLSKGSAAAATAVAAGKHLAVPE--TRGGVPARRRETANPPGPGRARPGGSPFGH 175
                 L        T +   +  ++P   +RG   ARR       G GR  PGG   G 
Sbjct: 409 GCPMGELGMSLRVWGTLLPLERARSLPSWNSRGFCGARRGG-----GSGRFLPGGGWRGA 463

Query: 176 RPLLQPRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSPQSAPAESGEL 235
           R  L+     AP Q+  +         P P GSA R+   A+    +RS    PA  G  
Sbjct: 464 RTKLRVCCTPAPGQSRGSLA-------PRPTGSAGRALAAARLFAPQRSAAQLPARCGCF 516

Query: 236 SIDREEGGPRQAPQGA---GHALEGRGRVI 262
              R    PR+   GA     A   RG+V+
Sbjct: 517 GSARGMRAPRRLLVGAEETAGARSKRGKVL 546


>gi|224548936 hypothetical protein LOC100170229 [Homo sapiens]
          Length = 715

 Score = 44.3 bits (103), Expect = 1e-04
 Identities = 55/214 (25%), Positives = 90/214 (42%), Gaps = 24/214 (11%)

Query: 49  RTAQPKERSPLRRGSCARGEKRPPG--PTGGQWRW-GVSPRPSAASATGPPQDRAARGCE 105
           R    + R+P RRGS  R  KR P    T G+ R  G  P  ++   T   Q + +RG  
Sbjct: 115 RGTHSRGRTPGRRGS--RSSKRSPSRASTPGRIRTHGARPGMASRVRTPTSQQKGSRGKS 172

Query: 106 LREGRAEAGHSSTSQKDTLSKGS--AAAATAVAAGKHLAVP------ETRGGVPARRRET 157
               R      S SQ   LSK S      + +     LAV       +T  G+P+  +E 
Sbjct: 173 YGRPRTSNRERSDSQPRNLSKKSYRPPGGSGIGRSSELAVTPSTAKCQTPTGIPS--KEK 230

Query: 158 ANPPGPGRARPGGSPFGHRPLLQPRQIFAPLQAIT-----NQVRPQQRPEPPPAGSAARS 212
           ++ P P  +R   S +G   +    + ++P +  +     NQ   + RP+   + S +RS
Sbjct: 231 SDNPSPSSSRKVKS-YGQMIIPSREKSYSPTEMSSRVKSYNQASTRSRPQ---SHSQSRS 286

Query: 213 GRRAQRPGRKRSPQSAPAESGELSIDREEGGPRQ 246
            RR++   +KR+     + S + +  R     R+
Sbjct: 287 PRRSRSGSQKRTHSRVRSHSWKRNHSRARSRTRK 320


>gi|33286446 opioid growth factor receptor [Homo sapiens]
          Length = 677

 Score = 44.3 bits (103), Expect = 1e-04
 Identities = 56/200 (28%), Positives = 68/200 (34%), Gaps = 36/200 (18%)

Query: 88  SAASATGPPQDRAARGCELREGRAEAGHSSTS-QKDTLSKGSAAAATAVAAGKHLAVPET 146
           SAA A+G  Q  A  G     G  +AGHS    ++DT  +      T    G     P  
Sbjct: 468 SAAVASGGAQTLALAGSPAPSGHPKAGHSENGVEEDTEGRTGPKEGT---PGSPSETPGP 524

Query: 147 RGGVPARRRETANP---PGPGRARPGG----------------SPFGHRPLLQPRQIFAP 187
               PA      +P   PGP  A P G                 P G  P   P +   P
Sbjct: 525 SPAGPAGDEPAESPSETPGPRPAGPAGDEPAESPSETPGPRPAGPAGDEPAESPSETPGP 584

Query: 188 LQAITNQVRPQQRPE----PPPAGSAARSGRRAQRPGRKRSPQSA------PAESGELSI 237
             A   +  P + P     P PAG A      A+ P     P+ A      PAES   + 
Sbjct: 585 SPAGPTRDEPAESPSETPGPRPAGPA--GDEPAESPSETPGPRPAGPAGDEPAESPSETP 642

Query: 238 DREEGGP-RQAPQGAGHALE 256
                GP R  P  AG A E
Sbjct: 643 GPSPAGPTRDEPAKAGEAAE 662


>gi|239758013 PREDICTED: hypothetical protein [Homo sapiens]
          Length = 819

 Score = 44.3 bits (103), Expect = 1e-04
 Identities = 70/275 (25%), Positives = 98/275 (35%), Gaps = 32/275 (11%)

Query: 3   RGSGPRGSPPGCKAALWSTLGGSMQQS--APTGQWGIRADSSPRRRTVRTAQPKERSPLR 60
           RG+   G  P  K       GG   +S      +WG R    PRRR  +  +       R
Sbjct: 289 RGAATAGEKPAAKGR-----GGRGPKSLGGQNPRWG-RGGKKPRRRGQKATKAAAAGTKR 342

Query: 61  RGSCARGEKRPPGPTG-----GQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGH 115
            G    G ++     G      Q R G  PR   ASA G      +RG   ++G  EA  
Sbjct: 343 GG----GGQQKAAVVGVKSRKKQRRRGKKPRRQKASAAGAK----SRGGGDKKGLREAKR 394

Query: 116 SSTSQKDTLSKGSAAAATAVAAGKHLAVPETRGG---VPARRRETANPPGPGRARPGGSP 172
               +K    +     A    A K       RGG    P RR   A     G++R GG  
Sbjct: 395 RGDGRKKPRWQNPRRRAQKAPAAK------PRGGGSKKPRRRGPKAAKSRGGKSRSGGGK 448

Query: 173 FGHRPLLQPRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRK--RSPQSAPA 230
              R   + RQ  A   A +   + Q R +   AG+ +R G      G+   R  Q A A
Sbjct: 449 KPRRRGQKARQKAAEAGAESRGDKKQGRQKASAAGTKSRGGGLKSHSGKNLWRRGQKAEA 508

Query: 231 ESGELSIDREEGGPRQAPQGAGHALEGRGRVINGS 265
              + +    +   R+ P+    A+ G    + GS
Sbjct: 509 AGEKATAAGAKSRGRKQPRRQKPAVAGAKSRVGGS 543



 Score = 39.7 bits (91), Expect = 0.003
 Identities = 74/304 (24%), Positives = 107/304 (35%), Gaps = 63/304 (20%)

Query: 9   GSPPGCKAALWSTLGGSMQQSAPTGQWGIRADSSPRRRTV----------RTAQPKERSP 58
           G+     AA   + GG   Q A       R+   PRRR            R  + K R  
Sbjct: 156 GAQKAAAAAGVKSQGGEKPQKAAAAGAKSRSGKKPRRRGAKSRGDEDKKGRQREAKSRFG 215

Query: 59  LRRGSCARGEKRP-PGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHSS 117
             RG   +  +RP P   G + R G   +P      G    +AA G + R G++  G   
Sbjct: 216 KTRGGRRKKHRRPKPAAGGAKSRRGGGKKPQKQRRRG---QKAATGGKSRVGKSRGGGDK 272

Query: 118 TSQKDTLSKGSA-----AAATA----VAAGKHLAVPET---------RGGVPARRR--ET 157
             ++   +   A      AATA     A G+    P++         RGG   RRR  + 
Sbjct: 273 KLRRQGQNAALAGAKRRGAATAGEKPAAKGRGGRGPKSLGGQNPRWGRGGKKPRRRGQKA 332

Query: 158 ANPPGPGRARPGGSPFGHRPLLQPRQIFAPLQAITNQVRPQQRP---EPPPAGSAARSG- 213
                 G  R GG         Q +     +++   Q R  ++P   +   AG+ +R G 
Sbjct: 333 TKAAAAGTKRGGGG--------QQKAAVVGVKSRKKQRRRGKKPRRQKASAAGAKSRGGG 384

Query: 214 -----RRAQRPG----------RKRSPQSAPAES--GELSIDREEGGPRQAPQGAGHALE 256
                R A+R G           +R  Q APA    G  S      GP+ A    G +  
Sbjct: 385 DKKGLREAKRRGDGRKKPRWQNPRRRAQKAPAAKPRGGGSKKPRRRGPKAAKSRGGKSRS 444

Query: 257 GRGR 260
           G G+
Sbjct: 445 GGGK 448



 Score = 30.8 bits (68), Expect = 1.3
 Identities = 28/112 (25%), Positives = 43/112 (38%), Gaps = 12/112 (10%)

Query: 52  QPKERSPLRRGSCARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAA--------RG 103
           QP+ + P   GS    E       G +      PR   A + G     AA        RG
Sbjct: 660 QPRWQKPAAAGS----ESSAAAAVGQEAEARKKPRRRRAKSRGDGGQNAAVGINPWQRRG 715

Query: 104 CELREGRAEAGHSSTSQKDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRR 155
            + R G++  G  + S+    +K   +      +GK  A  ++ GG  A+RR
Sbjct: 716 KKPRLGKSRGGGGAKSRGSGGAKSRGSGGAKSRSGKKAAASKSHGGGGAKRR 767


>gi|237757308 synaptojanin 1 isoform b [Homo sapiens]
          Length = 1350

 Score = 43.9 bits (102), Expect = 1e-04
 Identities = 55/231 (23%), Positives = 82/231 (35%), Gaps = 35/231 (15%)

Query: 43   PRRRTVRTAQPKERSPLRR---GSCARGEKRPPGPTGGQWRWGVSPRPSAASATG---PP 96
            P++   +  +PK   P R     +     +RPP P+G +     SP P+     G   PP
Sbjct: 1134 PQKDPAQPLEPKRPPPPRPVAPPTRPAPPQRPPPPSGAR-----SPAPTRKEFGGIGAPP 1188

Query: 97   QDRAARGCELREGRAEAGHSSTSQKDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRE 156
                AR    RE  A      T++KD + +   +    +A          R  +P R   
Sbjct: 1189 SPGVAR----REMEAPKS-PGTTRKDNIGRSQPSPQAGLAGPGPAGYSTARPTIPPRAGV 1243

Query: 157  TANPPGPGRARPG---------------GSPFGHRPLLQPRQIFAPLQAITNQVRPQQRP 201
             + P    RA  G               GS F   PL +P+  F P  ++    +  Q P
Sbjct: 1244 ISAPQSHARASAGRLTPESQSKTSETSKGSTFLPEPL-KPQAAFPPQSSLPPPAQRLQEP 1302

Query: 202  EPPPAGSAARSGRRAQRPGRKRSPQSAPAESGELSIDREEGGPRQAPQGAG 252
              P A    +SG    +P  +  PQ  P      S+  E     Q  Q +G
Sbjct: 1303 LVPVAAPMPQSG---PQPNLETPPQPPPRSRSSHSLPSEASSQPQQEQPSG 1350


>gi|110349772 alpha 1 type I collagen preproprotein [Homo sapiens]
          Length = 1464

 Score = 43.9 bits (102), Expect = 1e-04
 Identities = 68/281 (24%), Positives = 95/281 (33%), Gaps = 39/281 (13%)

Query: 5   SGPRGSP-----PGCKAALWST----LGGSMQQSAPTGQWGIRADSSPRRRTVRTAQPKE 55
           +GP G+P     PG K A  +       G      P+G  G      P+  +     P  
Sbjct: 382 AGPAGNPGADGQPGAKGANGAPGIAGAPGFPGARGPSGPQGPGGPPGPKGNSGEPGAPGS 441

Query: 56  RSPLRRGSCARGE------KRPPGPTGGQWRWGVSPRPSAASATGPPQDR---AARGCEL 106
           +      + A+GE      + PPGP G + + G    P      GPP +R    +RG   
Sbjct: 442 KGD----TGAKGEPGPVGVQGPPGPAGEEGKRGARGEPGPTGLPGPPGERGGPGSRGFPG 497

Query: 107 REGRAEAGHSSTSQKDTLSKGSAAAATAVAAGKHLAVPETRG-----GVPARRRETANPP 161
            +G A     +  +      G   +           +P  +G     G P    +T  PP
Sbjct: 498 ADGVAGPKGPAGERGSPGPAGPKGSPGEAGRPGEAGLPGAKGLTGSPGSPGPDGKT-GPP 556

Query: 162 GP----GRARPGGSPFGHRPLLQPRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGR--- 214
           GP    GR  P G P G R          P  A     +  +R  P P G+   +G+   
Sbjct: 557 GPAGQDGRPGPPGPP-GARGQAGVMGFPGPKGAAGEPGKAGERGVPGPPGAVGPAGKDGE 615

Query: 215 -RAQRPGRKRSPQSAPAESGELSIDREEG--GPRQAPQGAG 252
             AQ P     P     E G       +G  GP   P  AG
Sbjct: 616 AGAQGPPGPAGPAGERGEQGPAGSPGFQGLPGPAGPPGEAG 656



 Score = 42.7 bits (99), Expect = 3e-04
 Identities = 72/277 (25%), Positives = 88/277 (31%), Gaps = 54/277 (19%)

Query: 2    DRGS----GPRG--SPPGCKAALWSTLGGSMQQSAPTGQWGIRADSSPRRRTVRTAQP-- 53
            DRG     GP G   PPG          G        G  G + D+ P         P  
Sbjct: 801  DRGEPGPPGPAGFAGPPGAD--------GQPGAKGEPGDAGAKGDAGPPGPAGPAGPPGP 852

Query: 54   --KERSPLRRGSCARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREG-R 110
                 +P  +G  ARG   PPG TG     G    P  +   GPP      G E  +G R
Sbjct: 853  IGNVGAPGAKG--ARGSAGPPGATGFPGAAGRVGPPGPSGNAGPPGPPGPAGKEGGKGPR 910

Query: 111  AEAGHSS-------------TSQKDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRET 157
             E G +                +K +      A A      + +A      G+P +R E 
Sbjct: 911  GETGPAGRPGEVGPPGPPGPAGEKGSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGER 970

Query: 158  ANPPGPGRARPGGSPFGHRPLLQPRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQ 217
              P  PG   P G P              P  A   +  P     P  AG    SGR   
Sbjct: 971  GFPGLPG---PSGEPGKQ----------GPSGASGERGPPGPMGPPGLAGPPGESGREG- 1016

Query: 218  RPGRKRSP--QSAPAESGELSIDREEGGPRQAPQGAG 252
             PG + SP    +P   G    DR E GP   P   G
Sbjct: 1017 APGAEGSPGRDGSPGAKG----DRGETGPAGPPGAPG 1049



 Score = 41.6 bits (96), Expect = 7e-04
 Identities = 75/260 (28%), Positives = 93/260 (35%), Gaps = 45/260 (17%)

Query: 24  GSMQQSAPTGQWGIRADSSPRRRTVRTAQPKERSPLRRGSCARGEKRPPGPTGGQWRWGV 83
           G M  S P G  G      P+       +P E       S   G + PPGP G     G 
Sbjct: 179 GPMGPSGPRGLPGPPGAPGPQGFQGPPGEPGEPG----ASGPMGPRGPPGPPGKNGDDGE 234

Query: 84  SPRPSAASATGPPQDRAARGCELREG-RAEAGHSSTSQKDTLSKGSAAAA----TAVAAG 138
           + +P      GPP  + ARG     G     GH   S  D  +KG A  A       + G
Sbjct: 235 AGKPGRPGERGPPGPQGARGLPGTAGLPGMKGHRGFSGLDG-AKGDAGPAGPKGEPGSPG 293

Query: 139 KHLA--------VPETRG--GVP----ARRRE----TANPPGP-GRARPGGSP-----FG 174
           ++ A        +P  RG  G P    AR  +     A PPGP G A P G P      G
Sbjct: 294 ENGAPGQMGPRGLPGERGRPGAPGPAGARGNDGATGAAGPPGPTGPAGPPGFPGAVGAKG 353

Query: 175 HRPLLQPRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQ--RPGRKRSPQSAPAES 232
                 PR    P Q +  +  P     P PAG+A  +G      +PG K     AP  +
Sbjct: 354 EAGPQGPRGSEGP-QGVRGEPGP-----PGPAGAAGPAGNPGADGQPGAK-GANGAPGIA 406

Query: 233 GELSIDREEGGPRQAPQGAG 252
           G        G     PQG G
Sbjct: 407 GAPGFPGARG--PSGPQGPG 424



 Score = 33.1 bits (74), Expect = 0.26
 Identities = 64/267 (23%), Positives = 84/267 (31%), Gaps = 41/267 (15%)

Query: 6   GPRGSPPGCKAALWSTLGGSMQQSAPTGQWGIRADSSPRRRTVR----TAQPKERSPLRR 61
           GP G P          + G +    P+G  G R    P  R V+     A P+  +    
Sbjct: 647 GPAGPPGEAGKPGEQGVPGDLGAPGPSGARGERG--FPGERGVQGPPGPAGPRGANGAPG 704

Query: 62  GSCARGEKRPPGPTGGQWRWGVSPRP---SAASATGPPQDRAARGCELREGR--AEAGHS 116
              A+G+   PG  G Q   G+   P    AA   GP  DR   G +  +G    +    
Sbjct: 705 NDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAGLPGPKGDRGDAGPKGADGSPGKDGVRG 764

Query: 117 STSQKDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGP-GRARPGGSPFGH 175
            T          A      +     A P    G P  R E   PPGP G A P G+    
Sbjct: 765 LTGPIGPPGPAGAPGDKGESGPSGPAGPTGARGAPGDRGE-PGPPGPAGFAGPPGA---- 819

Query: 176 RPLLQPRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSPQSAPAESGEL 235
                               +P  + EP  AG+   +G     P     P   P   G +
Sbjct: 820 ------------------DGQPGAKGEPGDAGAKGDAG-----PPGPAGPAGPPGPIGNV 856

Query: 236 SIDREEGGPRQA-PQGAGHALEGRGRV 261
                +G    A P GA       GRV
Sbjct: 857 GAPGAKGARGSAGPPGATGFPGAAGRV 883



 Score = 30.0 bits (66), Expect = 2.2
 Identities = 49/189 (25%), Positives = 66/189 (34%), Gaps = 18/189 (9%)

Query: 66  RGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREG-RAEAGHSSTSQKDTL 124
           +G   P GP G   + G    P    A GP   R  RG     G +   G +     +  
Sbjct: 643 QGLPGPAGPPGEAGKPGEQGVPGDLGAPGPSGARGERGFPGERGVQGPPGPAGPRGANGA 702

Query: 125 SKGSAAAATAVAAGKHLAVPETRG-----GVPARRRETANPPGP----GRARPGGSPFGH 175
                A   A A G     P ++G     G+P   R  A  PGP    G A P G+  G 
Sbjct: 703 PGNDGAKGDAGAPG----APGSQGAPGLQGMPG-ERGAAGLPGPKGDRGDAGPKGAD-GS 756

Query: 176 RPLLQPRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSPQSAPAESGEL 235
                 R +  P+        P  + E  P+G A  +G R   PG +  P   P  +G  
Sbjct: 757 PGKDGVRGLTGPIGPPGPAGAPGDKGESGPSGPAGPTGARG-APGDRGEP-GPPGPAGFA 814

Query: 236 SIDREEGGP 244
                +G P
Sbjct: 815 GPPGADGQP 823



 Score = 29.6 bits (65), Expect = 2.9
 Identities = 21/73 (28%), Positives = 26/73 (35%), Gaps = 1/73 (1%)

Query: 31   PTGQWGIRADSSPRRRTVRTAQPKERSPLRRGSCARGEKRPPGPTGGQWRWGVSPRPSAA 90
            P G  G      PR     T +  +R  ++      G + PPGP G     G S     A
Sbjct: 1080 PVGARGPAGPQGPRGDKGETGEQGDRG-IKGHRGFSGLQGPPGPPGSPGEQGPSGASGPA 1138

Query: 91   SATGPPQDRAARG 103
               GPP    A G
Sbjct: 1139 GPRGPPGSAGAPG 1151



 Score = 29.3 bits (64), Expect = 3.8
 Identities = 24/82 (29%), Positives = 31/82 (37%), Gaps = 2/82 (2%)

Query: 24   GSMQQSAPTGQWGIRADSSPRRRTVRTAQP-KERSPLRRGSC-ARGEKRPPGPTGGQWRW 81
            G+  +  P G  G    + P   + R   P  E SP R GS  A+G++   GP G     
Sbjct: 989  GASGERGPPGPMGPPGLAGPPGESGREGAPGAEGSPGRDGSPGAKGDRGETGPAGPPGAP 1048

Query: 82   GVSPRPSAASATGPPQDRAARG 103
            G    P      G   DR   G
Sbjct: 1049 GAPGAPGPVGPAGKSGDRGETG 1070



 Score = 29.3 bits (64), Expect = 3.8
 Identities = 64/252 (25%), Positives = 78/252 (30%), Gaps = 35/252 (13%)

Query: 5    SGPRGSP-PGCKAALWSTLGGSMQQSAPTGQWGIRADSSPRRRTVRTAQPKERSPLRRGS 63
            SG RG P P     L    G S ++ AP  +     D SP  +  R        P     
Sbjct: 991  SGERGPPGPMGPPGLAGPPGESGREGAPGAEGSPGRDGSPGAKGDRGETGPAGPP----- 1045

Query: 64   CARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREG-RAEAGHSSTSQKD 122
             A G    PGP G   + G       A   GP     ARG    +G R + G +   Q D
Sbjct: 1046 GAPGAPGAPGPVGPAGKSGDRGETGPAGPAGPVGPVGARGPAGPQGPRGDKGETG-EQGD 1104

Query: 123  TLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPG-PGRARPGGSPFGHRPLLQP 181
               KG            H      +G           PPG PG   P G+     P   P
Sbjct: 1105 RGIKG------------HRGFSGLQG--------PPGPPGSPGEQGPSGASGPAGPRGPP 1144

Query: 182  RQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSPQSAPAESGELSIDREE 241
                AP +   N + P     P P G    +G     P     P   P   G  S   + 
Sbjct: 1145 GSAGAPGKDGLNGL-PGPIGPPGPRGRTGDAG-----PVGPPGPPGPPGPPGPPSAGFDF 1198

Query: 242  GGPRQAPQGAGH 253
                Q PQ   H
Sbjct: 1199 SFLPQPPQEKAH 1210



 Score = 28.9 bits (63), Expect = 4.9
 Identities = 49/207 (23%), Positives = 62/207 (29%), Gaps = 32/207 (15%)

Query: 67   GEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHSSTSQKDTLSK 126
            GE    GP+G     G           GPP +    G    EG      S  ++ D    
Sbjct: 980  GEPGKQGPSGASGERGPPGPMGPPGLAGPPGESGREGAPGAEGSPGRDGSPGAKGDRGET 1039

Query: 127  GSAAAATAVAAGKHLAVPETRGGVPARRRET-----ANPPGPGRARPGGSPFGHR----- 176
            G A    A  A      P    G    R ET     A P GP  AR    P G R     
Sbjct: 1040 GPAGPPGAPGA-PGAPGPVGPAGKSGDRGETGPAGPAGPVGPVGARGPAGPQGPRGDKGE 1098

Query: 177  ------PLLQPRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSPQSAPA 230
                    ++  + F+ LQ           P  PP     +    A  P   R P  +  
Sbjct: 1099 TGEQGDRGIKGHRGFSGLQG----------PPGPPGSPGEQGPSGASGPAGPRGPPGSAG 1148

Query: 231  ESGELSIDREEG-----GPRQAPQGAG 252
              G+  ++   G     GPR     AG
Sbjct: 1149 APGKDGLNGLPGPIGPPGPRGRTGDAG 1175


>gi|4502951 collagen type III alpha 1 preproprotein [Homo sapiens]
          Length = 1466

 Score = 43.9 bits (102), Expect = 1e-04
 Identities = 61/231 (26%), Positives = 79/231 (34%), Gaps = 22/231 (9%)

Query: 43  PRRRTVRTAQPKERSPLRRGSCARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAAR 102
           P+  T  T  P  + P       +G+  PPG  G     G+  +P +  + GPP      
Sbjct: 89  PQPPTAPTRPPNGQGP----QGPKGDPGPPGIPGRNGDPGIPGQPGSPGSPGPP--GICE 142

Query: 103 GCELREGRAEAGHSSTSQKDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETA--NP 160
            C          + S   K  ++ G  A     A       P    G P          P
Sbjct: 143 SCPTGPQNYSPQYDSYDVKSGVAVGGLAGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGP 202

Query: 161 PG-PGRARPGGSPFGHRPLLQPRQIFAPLQAITNQVRPQQR--PEPP----PAGSAARSG 213
           PG PG+A P G P G    + P              RP +R  P PP    PAG     G
Sbjct: 203 PGEPGQAGPSGPP-GPPGAIGPSGPAGKDGESGRPGRPGERGLPGPPGIKGPAGIPGFPG 261

Query: 214 RRAQR--PGR--KRSPQSAPAESGELSIDREEGGPRQAPQGAGHALEGRGR 260
            +  R   GR  ++    AP   GE  +  E G P   P G   A   RGR
Sbjct: 262 MKGHRGFDGRNGEKGETGAPGLKGENGLPGENGAP--GPMGPRGAPGERGR 310



 Score = 40.8 bits (94), Expect = 0.001
 Identities = 71/254 (27%), Positives = 84/254 (33%), Gaps = 57/254 (22%)

Query: 9   GSPPGCKAALWSTLGGSMQQSAPTGQWGIRADSSPRRRTVRTAQPKERSP--LRRGSCAR 66
           G PPG          G   +  P G  G  A  +P  +    A P ER P  L      R
Sbjct: 647 GGPPG--------ENGKPGEPGPKGDAG--APGAPGGKGDAGA-PGERGPPGLAGAPGLR 695

Query: 67  GEKRPPGPTGGQWRWGVSPRPSAASA---TGPPQDRAARGCELREGRAEAGHSSTSQKDT 123
           G   PPGP GG+   G    P AA      G P +R   G    +G  + G       D 
Sbjct: 696 GGAGPPGPEGGKGAAGPPGPPGAAGTPGLQGMPGERGGLGSPGPKG--DKGEPGGPGADG 753

Query: 124 LSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRARPGGSPFGHRPLLQPRQ 183
           +             G     P    G P  + E   P  PG A P GSP G R    P  
Sbjct: 754 VPGKDGPRGPTGPIG-----PPGPAGQPGDKGEGGAPGLPGIAGPRGSP-GERGETGP-- 805

Query: 184 IFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSPQSAPAESGELSIDREEGG 243
                              P PAG     G+  + PG K   + AP E G       EGG
Sbjct: 806 -------------------PGPAGFPGAPGQNGE-PGGK-GERGAPGEKG-------EGG 837

Query: 244 P---RQAPQGAGHA 254
           P      P G+G A
Sbjct: 838 PPGVAGPPGGSGPA 851



 Score = 37.4 bits (85), Expect = 0.014
 Identities = 70/282 (24%), Positives = 92/282 (32%), Gaps = 36/282 (12%)

Query: 5   SGPRGSP-----PGCKAALWSTLGGSMQQSAPTGQWGIRADSSPRRRTVRTAQPKERSPL 59
           SGPRG P     PG K        G+  ++   G  G      P  +   T       P 
Sbjct: 569 SGPRGQPGVMGFPGPKGN-----DGAPGKNGERGGPGGPGPQGPPGKNGETGPQGPPGPT 623

Query: 60  RRGSCARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCE-LREGRAEAGHSST 118
             G   +G+  PPGP G Q   G    P      G P  +   G      G+ +AG    
Sbjct: 624 GPGG-DKGDTGPPGPQGLQGLPGTGGPPGENGKPGEPGPKGDAGAPGAPGGKGDAGAPGE 682

Query: 119 SQKDTLSKG----SAAAATAVAAGKHLAVPE-------TRG--GVPARRRETANP-PGPG 164
                L+        A       GK  A P        T G  G+P  R    +P P   
Sbjct: 683 RGPPGLAGAPGLRGGAGPPGPEGGKGAAGPPGPPGAAGTPGLQGMPGERGGLGSPGPKGD 742

Query: 165 RARPGGSPFGHRPLLQ-PRQIFAPLQAITNQVRPQQRPE------PPPAGSAARSGRRAQ 217
           +  PGG      P    PR    P+       +P  + E      P  AG     G R +
Sbjct: 743 KGEPGGPGADGVPGKDGPRGPTGPIGPPGPAGQPGDKGEGGAPGLPGIAGPRGSPGERGE 802

Query: 218 R--PGRKRSPQSAPAESGELSIDREEGGPRQAPQGAGHALEG 257
              PG    P  AP ++GE     E G P +  +G    + G
Sbjct: 803 TGPPGPAGFP-GAPGQNGEPGGKGERGAPGEKGEGGPPGVAG 843



 Score = 35.0 bits (79), Expect = 0.068
 Identities = 70/269 (26%), Positives = 82/269 (30%), Gaps = 56/269 (20%)

Query: 6   GPRGSPPGCKAALWSTLGGSMQQSAPTGQWGIRADSSPRRRTVRTAQPKERSPLRRGSCA 65
           G +G P G  A       G    + P G  G       +        P    P  RGS  
Sbjct: 741 GDKGEPGGPGADGVPGKDGPRGPTGPIGPPGPAGQPGDKGEGGAPGLPGIAGP--RGSPG 798

Query: 66  -RGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHSSTSQKDTL 124
            RGE  PPGP G          P A    G P  +  RG    +G               
Sbjct: 799 ERGETGPPGPAG---------FPGAPGQNGEPGGKGERGAPGEKG--------------- 834

Query: 125 SKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRARPGGSPFGHRPLLQPRQI 184
            +G          G   A P    GV   + E  +P GPG A   G P G R L  P   
Sbjct: 835 -EGGPPGVAGPPGGSGPAGPPGPQGV---KGERGSPGGPGAA---GFP-GARGLPGPPGS 886

Query: 185 FAPLQAITNQVRPQQRPEPPPAGSAARSGR--------RAQRPGRKRSP--QSAPAESGE 234
                       P +   P PAG+    G          A +PG K SP  Q  P   G 
Sbjct: 887 NGNPGPPGPSGSPGKDGPPGPAGNTGAPGSPGVSGPKGDAGQPGEKGSPGAQGPPGAPGP 946

Query: 235 LSIDREEG-----------GPRQAPQGAG 252
           L I    G           GPR +P   G
Sbjct: 947 LGIAGITGARGLAGPPGMPGPRGSPGPQG 975



 Score = 33.5 bits (75), Expect = 0.20
 Identities = 52/191 (27%), Positives = 64/191 (33%), Gaps = 24/191 (12%)

Query: 67  GEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGRAEAGHSSTSQKDTLSK 126
           GE   PGP G +   G   RP    A G   +  ARG + + G      ++       +K
Sbjct: 291 GENGAPGPMGPRGAPGERGRPGLPGAAGARGNDGARGSDGQPGPPGPPGTAGFPGSPGAK 350

Query: 127 GSAAAATAVAAGKHLAVPETRGGV-PARRRETANPPGPGRARPG--GSPFGHRPLLQPRQ 183
           G    A +  +      P  RG   P        PPGP    PG  GSP G   +     
Sbjct: 351 GEVGPAGSPGSN---GAPGQRGEPGPQGHAGAQGPPGP----PGINGSPGGKGEM----- 398

Query: 184 IFAPLQAITNQVRPQQRPEPPPAGSAARSGRR--AQRPGRKRSPQSAPAESGELSIDREE 241
              P            R  P PAG+    G R  A  PG K   +  P   GE    R E
Sbjct: 399 --GPAGIPGAPGLMGARGPPGPAGANGAPGLRGGAGEPG-KNGAKGEPGPRGE----RGE 451

Query: 242 GGPRQAPQGAG 252
            G    P   G
Sbjct: 452 AGIPGVPGAKG 462



 Score = 33.5 bits (75), Expect = 0.20
 Identities = 59/254 (23%), Positives = 76/254 (29%), Gaps = 43/254 (16%)

Query: 5    SGPRGSP-PGCKAALWSTLGGSMQQSAPTGQWGIRADSSPRRRTVRTAQPKERSPLRRGS 63
            SG RG P P     L  T G   +   P        D SP  +  R       +P     
Sbjct: 989  SGERGPPGPQGLPGLAGTAGEPGRDGNPGSDGLPGRDGSPGGKGDRGENGSPGAP----- 1043

Query: 64   CARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREG-RAEAGHSSTSQKD 122
             A G   PPGP G   + G       A   G P    +RG    +G R + G       +
Sbjct: 1044 GAPGHPGPPGPVGPAGKSGDRGESGPAGPAGAPGPAGSRGAPGPQGPRGDKG-------E 1096

Query: 123  TLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRARPGG--SPFGHRPLLQ 180
            T  +G+A                   G+   R    NP  PG   P G     G      
Sbjct: 1097 TGERGAA-------------------GIKGHRGFPGNPGAPGSPGPAGQQGAIGSPGPAG 1137

Query: 181  PRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSPQSAPAESGELSIDRE 240
            PR    P+       +      P P G     G R +     R  + +P   G+      
Sbjct: 1138 PR---GPVGPSGPPGKDGTSGHPGPIGPPGPRGNRGE-----RGSEGSPGHPGQPGPPGP 1189

Query: 241  EGGPRQAPQGAGHA 254
             G P     G G A
Sbjct: 1190 PGAPGPCCGGVGAA 1203



 Score = 31.2 bits (69), Expect = 0.99
 Identities = 49/195 (25%), Positives = 61/195 (31%), Gaps = 14/195 (7%)

Query: 67  GEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREG-------RAEAGHSSTS 119
           G   PPGP+G + + GV   P      G P     RG     G         E G     
Sbjct: 561 GRPGPPGPSGPRGQPGVMGFPGPKGNDGAPGKNGERGGPGGPGPQGPPGKNGETG--PQG 618

Query: 120 QKDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRARPGGSPFGHRPLL 179
                  G     T     + L      GG P    +   P   G A   G+P G     
Sbjct: 619 PPGPTGPGGDKGDTGPPGPQGLQGLPGTGGPPGENGKPGEPGPKGDAGAPGAPGGKGDAG 678

Query: 180 QPRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRKRSP--QSAPAESGELSI 237
            P +   P  A    +R    P P P G    +G     PG   +P  Q  P E G L  
Sbjct: 679 APGERGPPGLAGAPGLRGGAGP-PGPEGGKGAAGPPGP-PGAAGTPGLQGMPGERGGLGS 736

Query: 238 DREEGGPRQAPQGAG 252
              + G +  P G G
Sbjct: 737 PGPK-GDKGEPGGPG 750



 Score = 29.3 bits (64), Expect = 3.8
 Identities = 50/204 (24%), Positives = 61/204 (29%), Gaps = 25/204 (12%)

Query: 65  ARGEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCELREGR-AEAGHS------- 116
           ARG    PGP G     G    P A    GP     + G   + G     GH+       
Sbjct: 325 ARGSDGQPGPPGPPGTAGFPGSPGAKGEVGPAGSPGSNGAPGQRGEPGPQGHAGAQGPPG 384

Query: 117 ------STSQKDTLSKGSAAAATAVAAGKHLAVPETRGGVPARRRETANPPGPGRARPGG 170
                 S   K  +       A  +   +    P    G P   R  A  PG   A+   
Sbjct: 385 PPGINGSPGGKGEMGPAGIPGAPGLMGARGPPGPAGANGAPG-LRGGAGEPGKNGAKGEP 443

Query: 171 SPFGHRPLLQPRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRPGRK--RSPQSA 228
            P G R       I     A     +     EP   G    +G R   PG +    P   
Sbjct: 444 GPRGER---GEAGIPGVPGAKGEDGKDGSPGEPGANGLPGAAGERG-APGFRGPAGPNGI 499

Query: 229 PAESGELSIDREEGGPRQA-PQGA 251
           P E G      E G P  A P+GA
Sbjct: 500 PGEKGPAG---ERGAPGPAGPRGA 520



 Score = 29.3 bits (64), Expect = 3.8
 Identities = 29/113 (25%), Positives = 40/113 (35%), Gaps = 9/113 (7%)

Query: 2    DRG----SGPRGSPPGCKAALWSTLGGSMQQSAPTGQWGIRADSSPRRRTVRTAQPKERS 57
            DRG    +GP G+P    +       G       TG+ G       R        P    
Sbjct: 1063 DRGESGPAGPAGAPGPAGSRGAPGPQGPRGDKGETGERGAAGIKGHRGFPGNPGAPGSPG 1122

Query: 58   PL-RRGSCAR----GEKRPPGPTGGQWRWGVSPRPSAASATGPPQDRAARGCE 105
            P  ++G+       G + P GP+G   + G S  P      GP  +R  RG E
Sbjct: 1123 PAGQQGAIGSPGPAGPRGPVGPSGPPGKDGTSGHPGPIGPPGPRGNRGERGSE 1175


>gi|239754474 PREDICTED: similar to mucin [Homo sapiens]
          Length = 417

 Score = 43.9 bits (102), Expect = 1e-04
 Identities = 50/215 (23%), Positives = 76/215 (35%), Gaps = 47/215 (21%)

Query: 27  QQSAPTGQWGIRADSSPRRRTVRTAQPKERSPLRRGSCARGEKRPPGPTGGQWRWGVSPR 86
           QQ A   +    A  +P  R  RT + K + P RR +   G++R  G +    + G   R
Sbjct: 238 QQRAQDREEEAAAAPAPTSRGHRTEKRKPQQPQRRPAGGTGQRRGSGYSPSADQQGAQDR 297

Query: 87  PSAASATGPPQDRAARGCELREGRAEAGHSSTSQKDTLSKGSAAAATAVAAGKHLAVPET 146
              A+A   P                +GH +  +K    +   A  T    G   +    
Sbjct: 298 EEEAAAAPAP--------------TSSGHRTEKRKRLQLQCQPAGGTGQRRGSGCSSSAN 343

Query: 147 RGGVPARRRETANPPGPGRARPGGSPFGHRPLLQPRQIFAPLQAITNQVRPQQRPEPPPA 206
           + G   R  E A  P P  +       GHR               T + +PQQ    P A
Sbjct: 344 QQGAQDREEEAAAAPVPTSS-------GHR---------------TEKRKPQQPQRRPAA 381

Query: 207 GSAARSGRRAQRPGRKRSPQSAPAESGELSIDREE 241
           G+           G++R   S+P+   + + DREE
Sbjct: 382 GT-----------GQRRGSGSSPSADQQRAQDREE 405



 Score = 42.0 bits (97), Expect = 6e-04
 Identities = 56/229 (24%), Positives = 82/229 (35%), Gaps = 47/229 (20%)

Query: 39  ADSSPRRRTVRTAQPKERSPLRRGSCARGEKR--------PPGPTGGQWRWGVSPRPSAA 90
           A ++ ++R     +    +P+   S  R EKR        P G TG   R G    PSA 
Sbjct: 74  APAADQQRAQDREEEAAAAPVPTSSGHRTEKRKRLQLQCQPAGGTGQ--RRGSRSSPSAD 131

Query: 91  SATGPPQDRAARGCELREGRAEAGHSSTSQKDTLSKGSAAAATAVAAGKHLAVPETRGGV 150
                 ++  A    +   R   GH +  +K    +   A  T    G   +    + G 
Sbjct: 132 QQRAQDREEEAAAAPVPTSR---GHRTEKRKRQQPQRRPAGGTGQRRGSGSSPSADQQGA 188

Query: 151 PARRRETANPPGPGRARPGGSPFGHRPLLQPRQIFAPLQAITNQVRPQQRPEPPPAGSAA 210
             R  E A  P P       +  GHR               T + +PQQ P+  PAG   
Sbjct: 189 QDREEEAAAAPAP-------TSRGHR---------------TEKRKPQQ-PQCRPAGGT- 224

Query: 211 RSGRRAQRPGRKRSPQSAPAESGELSIDR-EEGGPRQAPQGAGHALEGR 258
                    G++R  +S+P+   + + DR EE     AP   GH  E R
Sbjct: 225 ---------GQRRGSRSSPSADQQRAQDREEEAAAAPAPTSRGHRTEKR 264



 Score = 37.0 bits (84), Expect = 0.018
 Identities = 24/79 (30%), Positives = 34/79 (43%), Gaps = 9/79 (11%)

Query: 192 TNQVRPQQRPEPPPAGSAARSGRRAQRPG--------RKRSPQSAPAESGELSIDR-EEG 242
           T  +RPQQ+P+  PAG   +       P         R+    +APA   + + DR EE 
Sbjct: 30  TKPLRPQQQPQCQPAGGTGQRRGSGSSPSADQQGAQDREEEAAAAPAADQQRAQDREEEA 89

Query: 243 GPRQAPQGAGHALEGRGRV 261
                P  +GH  E R R+
Sbjct: 90  AAAPVPTSSGHRTEKRKRL 108


>gi|38150007 small nuclear ribonucleoprotein polypeptide B/B'
           isoform B' [Homo sapiens]
          Length = 240

 Score = 43.9 bits (102), Expect = 1e-04
 Identities = 50/159 (31%), Positives = 57/159 (35%), Gaps = 30/159 (18%)

Query: 82  GVSPRPSAASATGPPQDRAA-RGCELREGRAEA---------GHSSTSQKDTLSKG---- 127
           G++  P A +A GP   RAA RG        +A         G    SQ+    +G    
Sbjct: 91  GIARVPLAGAAGGPGIGRAAGRGIPAGVPMPQAPAGLAGPVRGVGGPSQQVMTPQGRGTV 150

Query: 128 --SAAAATAVAAGKHLAVPETRGGVPARRRETANPPG-----PGRARPGGSPFGHRPLLQ 180
             +AAAATA  AG     P  RGG P      A PPG     PG   P G P G  P   
Sbjct: 151 AAAAAAATASIAGAPTQYPPGRGGPPPPMGRGAPPPGMMGPPPGMRPPMGPPMGIPPGRG 210

Query: 181 PRQIFAPLQAITNQVRPQQRPEPPPAGSAARSGRRAQRP 219
                 P         P  RP PP        G R  RP
Sbjct: 211 TPMGMPP---------PGMRPPPPGMRGPPPPGMRPPRP 240


>gi|207113162 Treacher Collins-Franceschetti syndrome 1 isoform e
           [Homo sapiens]
          Length = 1451

 Score = 43.9 bits (102), Expect = 1e-04
 Identities = 63/262 (24%), Positives = 94/262 (35%), Gaps = 33/262 (12%)

Query: 7   PRGSPPGCKAALWSTLGGSMQQSAPTGQWGIRADSSPRRR------TVRTAQPKERSPLR 60
           P   P   KA+  ST     +++AP    G   D +P+ +        R  +P+E S   
Sbjct: 216 PSVKPAQVKASSVSTKESPARKAAPAP--GKVGDVTPQVKGGALPPAKRAKKPEEESESS 273

Query: 61  RGSCARGEKRPPGPTGG--------QWRWGVSPR---PSAASATGPPQDRAARGCELREG 109
                  E+ P G            Q R   +P    P   +   PP    A   + + G
Sbjct: 274 EEGSESEEEAPAGTRSQVKASEKILQVRAASAPAKGTPGKGATPAPPGKAGAVASQTKAG 333

Query: 110 RAEAGHSSTSQKDTLSKGSAAAATAV----AAGKHLAVPETRGGVPARRRETANPPGPGR 165
           + E    S+S++ + S+    AA A+    A+GK   V           R+ A P  PG+
Sbjct: 334 KPEEDSESSSEESSDSEEETPAAKALLQAKASGKTSQVGAASAPAKESPRKGAAPAPPGK 393

Query: 166 ARPGGSPFGHRPLLQPRQIF---------APLQAITNQVRPQQRPEPPPAGSAARSGRRA 216
             P  +        +  Q           AP QA  +   PQ R    PA  + R G  A
Sbjct: 394 TGPAVAKAQAGKREEDSQSSSEESDSEEEAPAQAKPSGKAPQVRAASAPAKESPRKG-AA 452

Query: 217 QRPGRKRSPQSAPAESGELSID 238
             P RK  P +A  + G+   D
Sbjct: 453 PAPPRKTGPAAAQVQVGKQEED 474



 Score = 34.3 bits (77), Expect = 0.12
 Identities = 45/185 (24%), Positives = 73/185 (39%), Gaps = 17/185 (9%)

Query: 7   PRGSPPGCKAALWSTLG-GSMQQSAPTGQWGIRADSSPRRRTVRTAQPKERSPLRRGSCA 65
           P G  P  K A  ST+G G + + A     G    ++P  +  +  +  E S       +
Sbjct: 499 PLGKSPQVKPA--STMGMGPLGKGAGPVPPGKVGPATPSAQVGKWEEDSESSSEESSDSS 556

Query: 66  RGEKRPPGPTGGQWRWG--VSPRPSAASATGPPQDRAARGCELR-EGRAEAGHSSTSQKD 122
            GE         +   G  +  +P+++ A GPPQ       +++ E   +   SS    D
Sbjct: 557 DGEVPTAVAPAQEKSLGNILQAKPTSSPAKGPPQKAGPVAVQVKAEKPMDNSESSEESSD 616

Query: 123 TL-SKGSAAAATAVAAGKHLAVPETRGGVPARRRETAN---------PPGPGRARPGGSP 172
           +  S+ + AA TA  A   L +P+T+   P +   TA+            P +A    SP
Sbjct: 617 SADSEEAPAAMTAAQAKPALKIPQTK-ACPKKTNTTASAKVAPVRVGTQAPRKAGTATSP 675

Query: 173 FGHRP 177
            G  P
Sbjct: 676 AGSSP 680


>gi|207113160 Treacher Collins-Franceschetti syndrome 1 isoform d
           [Homo sapiens]
          Length = 1488

 Score = 43.9 bits (102), Expect = 1e-04
 Identities = 63/262 (24%), Positives = 94/262 (35%), Gaps = 33/262 (12%)

Query: 7   PRGSPPGCKAALWSTLGGSMQQSAPTGQWGIRADSSPRRR------TVRTAQPKERSPLR 60
           P   P   KA+  ST     +++AP    G   D +P+ +        R  +P+E S   
Sbjct: 216 PSVKPAQVKASSVSTKESPARKAAPAP--GKVGDVTPQVKGGALPPAKRAKKPEEESESS 273

Query: 61  RGSCARGEKRPPGPTGG--------QWRWGVSPR---PSAASATGPPQDRAARGCELREG 109
                  E+ P G            Q R   +P    P   +   PP    A   + + G
Sbjct: 274 EEGSESEEEAPAGTRSQVKASEKILQVRAASAPAKGTPGKGATPAPPGKAGAVASQTKAG 333

Query: 110 RAEAGHSSTSQKDTLSKGSAAAATAV----AAGKHLAVPETRGGVPARRRETANPPGPGR 165
           + E    S+S++ + S+    AA A+    A+GK   V           R+ A P  PG+
Sbjct: 334 KPEEDSESSSEESSDSEEETPAAKALLQAKASGKTSQVGAASAPAKESPRKGAAPAPPGK 393

Query: 166 ARPGGSPFGHRPLLQPRQIF---------APLQAITNQVRPQQRPEPPPAGSAARSGRRA 216
             P  +        +  Q           AP QA  +   PQ R    PA  + R G  A
Sbjct: 394 TGPAVAKAQAGKREEDSQSSSEESDSEEEAPAQAKPSGKAPQVRAASAPAKESPRKG-AA 452

Query: 217 QRPGRKRSPQSAPAESGELSID 238
             P RK  P +A  + G+   D
Sbjct: 453 PAPPRKTGPAAAQVQVGKQEED 474



 Score = 34.3 bits (77), Expect = 0.12
 Identities = 45/185 (24%), Positives = 73/185 (39%), Gaps = 17/185 (9%)

Query: 7   PRGSPPGCKAALWSTLG-GSMQQSAPTGQWGIRADSSPRRRTVRTAQPKERSPLRRGSCA 65
           P G  P  K A  ST+G G + + A     G    ++P  +  +  +  E S       +
Sbjct: 499 PLGKSPQVKPA--STMGMGPLGKGAGPVPPGKVGPATPSAQVGKWEEDSESSSEESSDSS 556

Query: 66  RGEKRPPGPTGGQWRWG--VSPRPSAASATGPPQDRAARGCELR-EGRAEAGHSSTSQKD 122
            GE         +   G  +  +P+++ A GPPQ       +++ E   +   SS    D
Sbjct: 557 DGEVPTAVAPAQEKSLGNILQAKPTSSPAKGPPQKAGPVAVQVKAEKPMDNSESSEESSD 616

Query: 123 TL-SKGSAAAATAVAAGKHLAVPETRGGVPARRRETAN---------PPGPGRARPGGSP 172
           +  S+ + AA TA  A   L +P+T+   P +   TA+            P +A    SP
Sbjct: 617 SADSEEAPAAMTAAQAKPALKIPQTK-ACPKKTNTTASAKVAPVRVGTQAPRKAGTATSP 675

Query: 173 FGHRP 177
            G  P
Sbjct: 676 AGSSP 680


  Database: hs.faa
    Posted date:  Aug 4, 2009  4:42 PM
  Number of letters in database: 18,247,518
  Number of sequences in database:  37,866
  
Lambda     K      H
   0.311    0.130    0.399 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 15,307,151
Number of Sequences: 37866
Number of extensions: 1016118
Number of successful extensions: 8462
Number of sequences better than 10.0: 30
Number of HSP's better than 10.0 without gapping: 121
Number of HSP's successfully gapped in prelim test: 1022
Number of HSP's that attempted gapping in prelim test: 5924
Number of HSP's gapped (non-prelim): 2868
length of query: 267
length of database: 18,247,518
effective HSP length: 100
effective length of query: 167
effective length of database: 14,460,918
effective search space: 2414973306
effective search space used: 2414973306
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 61 (28.1 bits)

Search results were obtained with NCBI BLAST and RefSeq entries.


Home | Table of Contents | Search text | Search genes | Search sequences | Purchase | FAQ | Blog | Help

Guide to the Human Genome
Copyright © 2010 by Stewart Scherer. All rights reserved.

CSHL Press