BLASTX 2.0.10 [Aug-26-1999] 

Query= XF-02A11-GL38
         (1431 letters)

Database: nr
           455,460 sequences; 140,124,617 total letters

Graphical Overview:

                                                                Score     E
Sequences producing significant alignments:                     (bits)  Value

gi|1175678|sp|P37765|YCIL_ECOLI HYPOTHETICAL 32.7 KD PROTEIN IN...   239    4e-62
gi|602963 (U18111) ORF4 [Escherichia coli]                           238    1e-61
gi|1175679|sp|P45104|YCIL_HAEIN HYPOTHETICAL PROTEIN HI1199 >gi...   234    2e-60
gi|1175677|sp|P42395|YCIL_BUCAP HYPOTHETICAL 30.2 KD PROTEIN IN...   159    7e-38
gi|466190|sp|P35159|RLUB_BACSU RIBOSOMAL LARGE SUBUNIT PSEUDOUR...   129    6e-29
gi|6458615|gb|AAF10472.1|AE001942_4 (AE001942) ribosomal large ...   123    3e-27
gi|4980760|gb|AAD35352.1|AE001708_20 (AE001708) 16S pseudouridy...   120    3e-26
gi|3915347|sp|O51155|Y129_BORBU HYPOTHETICAL PROTEIN BB0129 >gi...   118    1e-25
gi|6647946|sp|Q9ZD06|Y544_RICPR HYPOTHETICAL PROTEIN RP544 >gi|...   116    4e-25
gi|5738484|emb|CAB52832.1| (AL109848) putative pseudouridine sy...   115    1e-24
gi|3915377|sp|Q55578|Y361_SYNY3 HYPOTHETICAL 28.2 KD PROTEIN SL...   113    3e-24
gi|3915445|sp|O67444|YE64_AQUAE HYPOTHETICAL PROTEIN AQ_1464 >g...   110    2e-23
gi|3915547|sp|O33210|YH11_MYCTU HYPOTHETICAL 27.6 KD PROTEIN RV...   108    2e-22
gi|3915542|sp|O05668|YH11_MYCLE HYPOTHETICAL 28.1 KD PROTEIN CB...   106    4e-22
gi|3915394|sp|O66829|Y554_AQUAE HYPOTHETICAL PROTEIN AQ_554 >gi...   105    1e-21
gi|4155973 (AE001558) putative [Helicobacter pylori J99]             103    3e-21
gi|2501526|sp|P55986|YE59_HELPY HYPOTHETICAL PROTEIN HP1459 >gi...   102    5e-21
gi|418534|sp|P32684|YJBC_ECOLI HYPOTHETICAL 32.5 KD PROTEIN IN ...    96    9e-19
gi|3329180|gb|AAC68318.1| (AE001343) predicted pseudouridine sy...    83    7e-15
gi|465610|sp|P33918|RSUA_ECOLI RIBOSOMAL SMALL SUBUNIT PSEUDOUR...    83    7e-15
gi|2145990|pir||S72955 u0247g protein - Mycobacterium leprae >g...    82    9e-15
gi|4377181|gb|AAD19002| (AE001667) predicted pseudouridine synt...    82    1e-14
gi|3322747 (AE001223) conserved hypothetical protein [Treponema...    78    2e-13
gi|1175857|sp|P45124|RSUA_HAEIN RIBOSOMAL SMALL SUBUNIT PSEUDOU...    74    3e-12
gi|3845303 (AE001423) pseudouridine synthetase (RsuA fam.); 1st...    67    3e-10
gi|1175289|sp|P44827|YMFC_HAEIN HYPOTHETICAL PROTEIN HI0694 >gi...    67    4e-10
gi|1787380 (AE000213) orf, hypothetical protein [Escherichia coli]    58    2e-07
gi|3916025|sp|P75966|YMFC_ECOLI HYPOTHETICAL 24.9 KD PROTEIN IN...    58    2e-07
gi|3402689|gb|AAC28992.1| (AC004697) unknown protein [Arabidops...    54    2e-06
gi|3915559|sp|O32068|YTZF_BACSU HYPOTHETICAL 17.7 KD PROTEIN IN...    54    3e-06
gi|3402691|gb|AAC28994.1| (AC004697) unknown protein [Arabidops...    52    2e-05
gi|6689331|emb|CAB65436.1| (AJ250721) hypothetical protein [Syn...    49    8e-05
gi|3915395|sp|P72581|Y612_SYNY3 HYPOTHETICAL 21.0 KD PROTEIN SL...    48    1e-04
gi|1651652|dbj|BAA16580| (D90899) hypothetical protein [Synecho...    42    0.010
gi|1175470|sp|Q09801|YAAA_SCHPO HYPOTHETICAL 37.3 KD PROTEIN C2...    42    0.014
gi|294136|gb|AAA29551.1| (M83173) circumsporozoite protein [Pla...    41    0.018
gi|117587|sp|P08307|CSP_PLAFW CIRCUMSPOROZOITE PROTEIN PRECURSO...    41    0.018
gi|48921|emb|CAA42462| (X59794) traC-4 [Escherichia coli] >gi|1...    41    0.024
gi|48920|emb|CAA42461| (X59794) traC-3 [Escherichia coli] >gi|1...    41    0.024
gi|294138|gb|AAA29552.1| (M83174) circumsporozoite protein [Pla...    41    0.024
gi|136173|sp|P27190|TRC5_ECOLI DNA PRIMASE TRAC (REPLICATION PR...    41    0.024
gi|117591|sp|P26694|CSP_PLARE CIRCUMSPOROZOITE PROTEIN PRECURSO...    41    0.031
gi|117584|sp|P05691|CSP_PLAFL CIRCUMSPOROZOITE PROTEIN (CS) >gi...    41    0.031
gi|294119|gb|AAA29542.1| (M83164) circumsporozoite protein [Pla...    40    0.040
gi|294125|gb|AAA29545.1| (M83167) circumsporozoite protein [Pla...    40    0.040
gi|552191 (M57499) circumsporozoite protein [Plasmodium falcipa...    40    0.040
gi|294129|gb|AAA29547.1| (M83169) circumsporozoite protein [Pla...    40    0.040
gi|117586|sp|P13814|CSP_PLAFT CIRCUMSPOROZOITE PROTEIN PRECURSO...    40    0.040
gi|84198|pir||S05428 circumsporozoite protein - Plasmodium falc...    40    0.040
gi|294121|gb|AAA29543.1| (M83165) circumsporozoite protein [Pla...    40    0.040
gi|294123|gb|AAA29544.1| (M83166) circumsporozoite protein [Pla...    40    0.040
gi|294134|gb|AAA29550.1| (M83172) circumsporozoite protein [Pla...    40    0.040
gi|117583|sp|P02893|CSP_PLAFA CIRCUMSPOROZOITE PROTEIN PRECURSO...    40    0.040
gi|294151|gb|AAA29569.1| (M83156) circumsporozoite protein [Pla...    40    0.040
gi|4493889|emb|CAB38998.1| (AL034558) predicted using hexExon; ...    40    0.040
gi|6226848|sp|P19597|CSP_PLAFO CIRCUMSPOROZOITE PROTEIN PRECURS...    40    0.040
gi|552190 (M57498) circumsporozoite protein [Plasmodium falcipa...    40    0.040
gi|224051|prf||1008275A protein,circumsporozoite [Plasmodium sp.]     39    0.069
gi|294158|gb|AAA29574.1| (M83161) circumsporozoite protein [Pla...    39    0.12
gi|2462758 (AC002292) putative RNA-binding protein [Arabidopsis...    39    0.12
gi|283464|pir||A60540 sporozoite surface protein 2 - Plasmodium...    39    0.12
gi|548996|sp|Q01443|SSP2_PLAYO SPOROZOITE SURFACE PROTEIN 2 PRE...    39    0.12
gi|1582641|prf||2119210A mucin [Homo sapiens]                         38    0.16
gi|2135764|pir||I53641 mucin - human (fragment) >gi|945219 (L46...    38    0.16
gi|6580291|emb|CAB63354.1| (AL032627) cDNA EST EMBL:D65921 come...    38    0.20
gi|677949 (U20969) Plasmodium falciparum circumsporozoite prote...    38    0.20
gi|93140|pir||A40505 early protein EP0 - suid herpesvirus 1 (st...    38    0.27
gi|124138|sp|P29129|ICP0_PRVIF TRANS-ACTING TRANSCRIPTIONAL PRO...    38    0.27
gi|3875920|emb|CAB04111.1| (Z81503) predicted using Genefinder;...    38    0.27
gi|6681425|dbj|BAA88672.1| (AB030233) CiMsi [Ciona intestinalis]      38    0.27
gi|2498734|sp|Q60865|P137_MOUSE GPI-ANCHORED PROTEIN P137 >gi|1...    37    0.35
gi|6322611|ref|NP_012685.1|YJR151C| Yjr151cp >gi|1352944|sp|P47...    37    0.46
gi|4758666|ref|NP_004681.1|| LATS (large tumor suppressor, Dros...    36    0.60
gi|3172134 (U90209) RNA polymerase II largest subunit [Bonnemai...    36    0.60
gi|2689219|emb|CAA67983.1| (X99669) dynamin like protein [Dicty...    36    0.60
gi|3064231|gb|AAC14254.1| (AF036460) mucin-like protein [Trypan...    36    0.60
gi|112945|sp|P14196|AAC2_DICDI AAC-RICH MRNA CLONE AAC11 PROTEI...    36    0.60
gi|969095 (U31961) no-on transient A-like protein [Drosophila m...    36    0.60
gi|1082604|pir||S53363 mucin 5AC (clone JER58) - human (fragmen...    36    0.60
gi|4324420|gb|AAD16881| (AF104350) prespore-specific protein [D...    36    0.60
gi|2497729|sp|Q49537|VLPE_MYCHR VARIANT SURFACE ANTIGEN E PRECU...    36    0.60
gi|2114108|dbj|BAA20059| (AB003911) OX40 precursor [Oryctolagus...    36    0.79
gi|82698|pir||JQ0985 hydroxyproline-rich glycoprotein precursor...    36    0.79
gi|4220540|emb|CAA23013| (AL035356) hypothetical protein [Arabi...    36    0.79
gi|5114426|gb|AAD40313.1|AF157503_1 (AF157503) chitinase 1 [Pen...    36    0.79
gi|228937|prf||1814452B Hyp-rich glycoprotein [Zea mays]              36    0.79
gi|106291|pir||S16681 homeotic protein - human                        36    1.0
gi|1170758|sp|P38486|LEG3_CANFA GALECTIN-3 (GALACTOSE-SPECIFIC ...    36    1.0
gi|1724014|sp|P54604|YHCT_BACSU HYPOTHETICAL 33.7 KD PROTEIN IN...    36    1.0
gi|3242649|dbj|BAA29028.1| (AB015440) alpha 1 type I collagen [...    35    1.4
gi|4808164|emb|CAB42795.1| (Y18878) largest subunit of the RNA ...    35    1.4
gi|4808166|emb|CAB42797.1| (Y18879) largest subunit of the RNA ...    35    1.4
gi|1589837 (U68729) cuticle preprocollagen [Meloidogyne incognita]    35    1.4
gi|563375|emb|CAA84031| (Z34277) mucin [Homo sapiens]                 35    1.4
gi|1519696 (U67956) coded for by C. elegans cDNA yk126f9.5; cod...    35    1.4
gi|1184072 (U40766) COL-1 [Meloidogyne incognita]                     35    1.4
gi|2245693|gb|AAB62577.1| (AF010328) short form of CHIP [Drosop...    35    1.8
gi|4324436|gb|AAD16883| (AF104414) large tumor suppressor 1 [Mu...    35    1.8
gi|3877422|emb|CAB05195.1| (Z82268) predicted using Genefinder;...    35    1.8
gi|1280114|gb|AAC25888.1| (U55373) Similar to cuticular collage...    35    1.8
gi|2245687|gb|AAB62574.1| (AF010325) CHIP [Drosophila melanogas...    35    1.8
gi|119712|sp|P14918|EXTN_MAIZE EXTENSIN PRECURSOR (PROLINE-RICH...    34    2.3
gi|227614|prf||1707318A Thr rich extensin [Zea mays]                  34    2.3
gi|3810839|emb|CAA21800| (AL032684) conserved hypothetical zinc...    34    2.3
gi|228938|prf||1814452C Hyp-rich glycoprotein [Zea diploperennis]     34    2.3
gi|1082938|pir||A49688 lactose-binding lectin L-29 - dog              34    2.3
gi|3851592 (AF093575) surface antigen ariel1 [Entamoeba histoly...    34    2.3
gi|71823|pir||FGHUA fibrinogen alpha chain precursor, short spl...    34    2.3
gi|437331 (L23429) beta-galactosides-binding lectin [Canis fami...    34    2.3
gi|120943|sp|P13816|GARP_PLAFF GLUTAMIC ACID-RICH PROTEIN PRECU...    34    2.3
gi|283032|pir||S22456 hydroxyproline-rich glycoprotein - perenn...    34    2.3
gi|3834294 (U80846) No definition line found [Caenorhabditis el...    34    2.3
gi|168457 (M36913) cell wall protein (put.); putative [Zea mays]      34    2.3
gi|4503689|ref|NP_000499.1|| fibrinogen, A alpha polypeptide >g...    34    2.3
gi|3834293 (U80846) No definition line found [Caenorhabditis el...    34    2.3
gi|3924913|emb|CAB03474.1| (Z81138) predicted using Genefinder;...    34    3.0
gi|3924914|emb|CAB03475.1| (Z81138) predicted using Genefinder;...    34    3.0
gi|6580246|emb|CAB63321.1| (Z81138) cDNA EST EMBL:D65528 comes ...    34    3.0
gi|1280073 (U55366) Similar to cuticle collagen [Caenorhabditis...    34    3.0
gi|5901942|ref|NP_008971.1|| E1B-55kDa-associated protein 5 >gi...    34    3.0
gi|2501678|sp|Q45480|YLYB_BACSU HYPOTHETICAL 33.7 KD PROTEIN IN...    34    3.0
gi|2633919|emb|CAB13420| (Z99112) alternate gene name: ylmL; si...    34    3.0
gi|2135766|pir||S53362 mucin 5AC (clone JER47) - human (fragment)     34    3.0
gi|1139597 (U43400) H1 gene product [Human herpesvirus 7] >gi|1...    34    3.0
gi|628112|pir||S32988 hypothetical protein - human herpesvirus 4      34    4.0
gi|628185|pir||S42447 nuclear protein EBNA2 - Epstein-Barr viru...    34    4.0
gi|5833486|gb|AAD53528.1| (AF162149) variable surface lipoprote...    34    4.0
gi|283045|pir||S28264 hydroxyproline-rich glycoprotein - maize ...    34    4.0
gi|3880306|emb|CAA90995.1| (Z54238) similar to cuticle collagen...    34    4.0
gi|2833307|sp|Q21184|YXWK_CAEEL PUTATIVE CUTICLE COLLAGEN F55C1...    34    4.0
gi|3877661|emb|CAA98486.1| (Z74036) predicted using Genefinder;...    34    4.0
gi|543543|pir||JN0690 glutenin, high-molecular-weight Bx7 chain...    34    4.0
gi|102059|pir||D41710 promastigote surface antigen-2 (clone 4.6...    34    4.0
gi|119111|sp|P12978|EBN2_EBV EBNA-2 NUCLEAR PROTEIN >gi|1083964...    34    4.0
gi|100864|pir||S08315 cell wall protein - maize (fragment) >gi|...    33    5.2
gi|2135765|pir||A43932 mucin 2 precursor, intestinal - human (f...    33    5.2
gi|5852937|gb|AAD54275.1|AF169793_1 (AF169793) HET-C protein [P...    33    5.2
gi|4503493|ref|NP_001955.1|| early growth response 1 >gi|119242...    33    5.2
gi|2833328|sp|Q27200|FBRL_TETTH FIBRILLARIN >gi|2654298|emb|CAA...    33    5.2
gi|2707270 (AF036171) homeobox-containing protein [Dictyosteliu...    33    5.2
gi|3879844|emb|CAB04725.1| (Z81592) predicted using Genefinder;...    33    5.2
gi|418972|pir||S31035 retrovirus-related gag polyprotein - mous...    33    5.2
gi|6594249|emb|CAB63561.1| (AL031746) hypothetical garp protein...    33    5.2
gi|6601502|gb|AAF19004.1|AF151366_1 (AF151366) arginine/serine-...    33    5.2
gi|476822|pir||A42893 penicillin-binding protein 1A - Streptoco...    33    5.2
gi|282331|pir||S28037 penicillin-binding protein 1a - Streptoco...    33    5.2
gi|585647|sp|Q04707|PBPA_STRPN PENICILLIN-BINDING PROTEIN 1A (P...    33    5.2
gi|4140029|dbj|BAA36973.1| (AB015438) alpha 1 type I collagen [...    33    5.2
gi|5410459|gb|AAD43067.1|AF139884_1 (AF139884) penicillin-bindi...    33    5.2
gi|6563337|gb|AAF17255.1|AF210745_1 (AF210745) penicillin-bindi...    33    5.2
gi|6563341|gb|AAF17257.1|AF210747_1 (AF210747) penicillin-bindi...    33    5.2
gi|102058|pir||C41710 promastigote surface antigen-2 (clone 2.5...    33    5.2
gi|186396 (M94131) mucin [Homo sapiens]                               33    5.2
gi|3319463 (AF077544) unknown [Caenorhabditis elegans]                33    5.2
gi|5815245|gb|AAD52614.1|AF175223_1 (AF175223) SANT domain prot...    33    5.2
gi|4505285|ref|NP_002448.1|| mucin 2, intestinal/tracheal >gi|2...    33    5.2
gi|82601|pir||A30843 glutenin high molecular weight chain Bx7 p...    33    6.8
gi|2388676 (AF015539) precollagen P [Mytilus edulis]                  33    6.8
gi|1085433|pir||S55316 mucin (clone PGM-2B) - pig >gi|915207 (U...    33    6.8
gi|2314597|gb|AAD08465.1| (AE000642) conserved hypothetical pro...    33    6.8
gi|330361 (M10593) major outer envelope glycoprotein gp220 [Eps...    33    6.8
gi|1841851 (U86876) chitinase-like protein [Bombyx mori]              33    6.8
gi|2854193 (AF045645) Similar to cuticular collagen; coded for ...    33    6.8
gi|3875708|emb|CAA84658.1| (Z35598) Asparagine, Serine and Glyc...    33    6.8
gi|2119159|pir||I50694 alpha-1 collagen type III - chicken (fra...    33    6.8
gi|543541|pir||JC2099 glutenin, high molecular weight chain Bx1...    33    6.8
gi|1118137 (U41746) coded for by C. elegans cDNA yk68a8.5 [Caen...    32    9.0
gi|3876265|emb|CAB02976.1| (Z81067) predicted using Genefinder;...    32    9.0
gi|3873739|emb|CAA86059.1| (Z37983) weak similarity with putati...    32    9.0
gi|1870123|emb|CAB06788| (Z86109) unknown [Saccharomyces pastor...    32    9.0
gi|46691|emb|CAA43604| (X61307) protein A [Staphylococcus aureu...    32    9.0
gi|1019435 (U32447) mucin-like protein [Trypanosoma cruzi]            32    9.0
gi|6324130|ref|NP_014200.1|GCR2| Transcription factor; Gcr2p >g...    32    9.0
gi|1136289 (U42597) histidine kinase A [Dictyostelium discoideum]     32    9.0
gi|2193933|emb|CAB09584| (Z96800) hypothetical protein Rv0312 [...    32    9.0
gi|2493778|sp|Q09456|YQ35_CAEEL PUTATIVE CUTICLE COLLAGEN C09G5...    32    9.0
gi|182424 (J00127) alpha-fibrinogen precursor [Homo sapiens]          32    9.0
gi|1707117 (U80453) C23H3.9 [Caenorhabditis elegans]                  32    9.0
gi|5901828|gb|AAD55422.1|AF181636_1 (AF181636) BcDNA.GH07269 [D...    32    9.0
gi|4033468|sp|P92965|RS40_ARATH ARGININE/SERINE-RICH SPLICING F...    32    9.0
gi|2981221 (AF053091) eyelid [Drosophila melanogaster]                32    9.0
gi|5824601|emb|CAA82571.2| (Z29443) similar to Annexin; cDNA ES...    32    9.0
gi|6273747|gb|AAF06358.1|AF102865_1 (AF102865) calymmin [Danio ...    32    9.0
gi|1523876|emb|CAA99996| (Z75666) BRCA2 [Chlorocebus aethiops]        32    9.0
gi|1448962 (L79944) OFR1 of non-LTR retrotransposon; putative [...    32    9.0
gi|188864 (M74027) mucin [Homo sapiens]                               32    9.0
gi|1256180|dbj|BAA12287| (D84250) chitinase [Penaeus japonicus]       32    9.0
gi|4996369|dbj|BAA78427.1| (AB021267) polyprotein [Arabidopsis ...    32    9.0
gi|223918|prf||1004351A fibrinogen alphaA [Homo sapiens]              32    9.0
gi|2952545 (AF051898) coronin binding protein [Dictyostelium di...    32    9.0

>gi|1175678|sp|P37765|YCIL_ECOLI HYPOTHETICAL 32.7 KD PROTEIN IN
           TRPL-BTUR INTERGENIC REGION (ORF4)
           >gi|1742064|dbj|BAA14806| (D90764) ORF_ID:o253#9;
           similar to [SwissProt Accession Number P37765]
           [Escherichia coli] >gi|1742080|dbj|BAA14821| (D90765)
           ORF_ID:o253#9; similar to [SwissProt Accession Number
           P37765] [Escherichia coli] >gi|1787524 (AE000225) orf,
           hypothetical protein [Escherichia coli]
           Length = 291
           
 Score =  239 bits (604), Expect = 4e-62
 Identities = 136/270 (50%), Positives = 168/270 (61%), Gaps = 3/270 (1%)
 Frame = +1

Query: 73  LEERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLG--MSVKSGDKIELDGRSF-VA 243
           + E+L KVLA+AG GSRR +E  I  G + V+G IA+LG  + V  G KI +DG    V 
Sbjct: 1   MSEKLQKVLARAGHGSRREIESIIEAGRVSVDGKIAKLGDRVEVTPGLKIRIDGHLISVR 60

Query: 244 SALTEPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXX 423
            +  +  RVL Y KPEGE+ TR DPEGRPTVF+ LP L+GARWIA+GRLD+N        
Sbjct: 61  ESAEQICRVLAYYKPEGELCTRNDPEGRPTVFDRLPKLRGARWIAVGRLDVNTCGLLLFT 120

Query: 424 XDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERI 603
            DGELAN +MHPS E+EREY VRV        V D  L  L+RGV LEDG A F TI+  
Sbjct: 121 TDGELANRLMHPSREVEREYAVRVFG-----QVDDAKLRDLSRGVQLEDGPAAFKTIKFS 175

Query: 604 GNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKT 783
           G    + W+ V + EGRNREVRRLWE+ G QVSRL R RYG + LP+ L RG  TEL   
Sbjct: 176 GGEGINQWYNVTLTEGRNREVRRLWEAVGVQVSRLIRVRYGDIPLPKGLPRGGWTELDLA 235

Query: 784 QVEALRTQLKLEKDMPLALTLQPIIGQRRSAKA 882
           Q   LR  ++L  +    + ++     RR  KA
Sbjct: 236 QTNYLRELVELPPETSSKVAVEK---DRRRMKA 265


>gi|602963 (U18111) ORF4 [Escherichia coli]
           Length = 243
           
 Score =  238 bits (600), Expect = 1e-61
 Identities = 131/243 (53%), Positives = 157/243 (63%), Gaps = 3/243 (1%)
 Frame = +1

Query: 73  LEERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLG--MSVKSGDKIELDGRSF-VA 243
           + E+L KVLA+AG GSRR +E  I  G + V+G IA+LG  + V  G KI +DG    V 
Sbjct: 1   MSEKLQKVLARAGHGSRREIESIIEAGRVSVDGKIAKLGDRVEVTPGLKIRIDGHLISVR 60

Query: 244 SALTEPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXX 423
            +  +  RVL Y KPEGE+ TR DPEGRPTVF+ LP L+GARWIA+GRLD+N        
Sbjct: 61  ESAEQICRVLAYYKPEGELCTRNDPEGRPTVFDRLPKLRGARWIAVGRLDVNTCGLLLFT 120

Query: 424 XDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERI 603
            DGELAN +MHPS E+EREY VRV        V D  L  L+RGV LEDG A F TI+  
Sbjct: 121 TDGELANRLMHPSREVEREYAVRVFG-----QVDDAKLRDLSRGVQLEDGPAAFKTIKFS 175

Query: 604 GNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKT 783
           G    + W+ V + EGRNREVRRLWE+ G QVSRL R RYG + LP+ L RG  TEL   
Sbjct: 176 GGEGINQWYNVTLTEGRNREVRRLWEAVGVQVSRLIRVRYGDIPLPKGLPRGGWTELDLA 235

Query: 784 QVEALR 801
           Q   LR
Sbjct: 236 QTNYLR 241


>gi|1175679|sp|P45104|YCIL_HAEIN HYPOTHETICAL PROTEIN HI1199
           >gi|1074680|pir||A64169 hypothetical protein HI1199 -
           Haemophilus influenzae (strain Rd KW20) >gi|1574128
           (U32799) conserved hypothetical protein [Haemophilus
           influenzae Rd]
           Length = 357
           
 Score =  234 bits (590), Expect = 2e-60
 Identities = 137/290 (47%), Positives = 174/290 (59%), Gaps = 4/290 (1%)
 Frame = +1

Query: 55  ATEAPKLE-ERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLG--MSVKSGDKIELD 225
           A+  PK E E+L KVLA+AG GSRR +E  I+ G + V G IA LG  + V SG K+ +D
Sbjct: 67  ASNQPKAEGEKLQKVLARAGQGSRREIETMIAAGRVSVEGKIATLGDRIDVHSGVKVRID 126

Query: 226 GRSF-VASALTEPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINX 402
           G+   ++    E  RVL+Y KPEGE+ TR DPEGR TVF+ LP L G+RWIA+GRLDIN 
Sbjct: 127 GQIINLSHTQKEICRVLMYYKPEGELCTRSDPEGRATVFDRLPRLTGSRWIAVGRLDINT 186

Query: 403 XXXXXXXXDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAK 582
                   DGELAN +MHPS E+EREY VRV        V D +L +L +GV LEDG A 
Sbjct: 187 SGLLLFTTDGELANRLMHPSREVEREYSVRVFG-----QVDDAMLARLRKGVQLEDGLAN 241

Query: 583 FDTIERIGNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQ 762
           F  I+  G    + W+ V + EGRNREVRRLWESQG QVSRL R RYG++ L + L RG 
Sbjct: 242 FKEIKFTGGVGINQWYDVTLMEGRNREVRRLWESQGIQVSRLIRIRYGNIKLMKGLPRGG 301

Query: 763 STELPKTQVEALRTQLKLEKDMPLALTLQPIIGQRRSAKATLHVNRNDNNKHAY 924
             E+    V  LR  + L  +    L ++    + +S +    V R       Y
Sbjct: 302 WEEMDLENVNYLRELVGLPAETETKLDVKQASRRPKSGQIRKAVKRYSEMNKRY 355


>gi|1175677|sp|P42395|YCIL_BUCAP HYPOTHETICAL 30.2 KD PROTEIN IN
           TRPA 3'REGION >gi|480102|pir||S36431 hypothetical
           protein - Buchnera aphidicola >gi|396661|emb|CAA79503|
           (Z19055) unknown open reading frame [Buchnera
           aphidicola]
           Length = 258
           
 Score =  159 bits (397), Expect = 7e-38
 Identities = 92/249 (36%), Positives = 144/249 (56%), Gaps = 5/249 (2%)
 Frame = +1

Query: 82  RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLG--MSVKSGDKIELDGRSFVASALT 255
           ++ K+L+  G GSRR +E  I  G I +NG+ A +G  ++ K+  +I +D +  +     
Sbjct: 4   KIQKILSDLGYGSRRFIECMIKCGKISINGEKAIIGQYLNKKNPGEILIDKKKIIVKRNK 63

Query: 256 EPARVLIYN-KPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDG 432
              +VLIYN KP GEV TR+D + R TVF+ LP L   RW+++GRLDIN         DG
Sbjct: 64  NLPKVLIYNNKPIGEVCTRDDFQKRLTVFDKLPKLNLNRWVSVGRLDINTKGLLLFTNDG 123

Query: 433 ELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERI--G 606
            LAN +MHP S+IEREY +R+        +    +  L +GV +  G   F  I  +   
Sbjct: 124 TLANKLMHPRSQIEREYNIRIFG-----EMNKNKINILRKGVKIIHGYVSFKEIVPLYDK 178

Query: 607 NTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQ 786
               + WF+ ++ EG+NRE+R +++S  CQV++L R RYG+++LP+ L  GQ   L    
Sbjct: 179 KEGKNKWFKGILCEGKNREIRLMFKSIQCQVNQLIRVRYGNIILPKNLKEGQWMMLNSIF 238

Query: 787 VEALRTQLKLEKDM 828
           ++ L   +  +K++
Sbjct: 239 LKKLYNLINFDKEI 252


>gi|466190|sp|P35159|RLUB_BACSU RIBOSOMAL LARGE SUBUNIT
           PSEUDOURIDINE SYNTHASE B (PSEUDOURIDYLATE SYNTHASE)
           (URACIL HYDROLYASE) >gi|629120|pir||S45555 hypothetical
           protein X13 - Bacillus subtilis >gi|410137 (L09228)
           ORFX13 [Bacillus subtilis] >gi|2634751|emb|CAB14248|
           (Z99116) similar to hypothetical proteins [Bacillus
           subtilis]
           Length = 229
           
 Score =  129 bits (321), Expect = 6e-29
 Identities = 90/236 (38%), Positives = 133/236 (56%), Gaps = 9/236 (3%)
 Frame = +1

Query: 79  ERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIA-QLGMSVKSGDKIELDGRSFVASALT 255
           ERL KV+A AG+ SR   E+ I  G +KVNG +  +LG+ V   D+IE++G         
Sbjct: 2   ERLQKVIAHAGVASRSKAEELIKEGKVKVNGKVVTELGVKVTGSDQIEVNGLKVERE--- 58

Query: 256 EPARVLIYNKPEGEVTTREDPEGRPTV---FETLPVLKGARWIAIGRLDINXXXXXXXXX 426
           EP   L+Y KP G ++  +D +GR  V   F+ +P     R   IGRLD +         
Sbjct: 59  EPVYFLLY-KPRGVISAAQDDKGRKVVTDFFKNIP----QRIYPIGRLDYDTSGLLLLTN 113

Query: 427 DGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDG-----TAKFDT 591
           DGE AN +MHP  EI++ YV +V+    +     ELL +L RG+ LE+G      AK  +
Sbjct: 114 DGEFANKLMHPKYEIDKTYVAKVKGIPPK-----ELLRKLERGIRLEEGKTAPAKAKLLS 168

Query: 592 IERIGNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTE 771
           +++   T      ++ + EGRNR+VRR++E+ G +V +LKR  Y  + L R L  G + E
Sbjct: 169 LDKKKQTSI---IQLTIHEGRNRQVRRMFEAIGHEVIKLKREEYAFLNL-RGLHTGDARE 224

Query: 772 LPKTQ 786
           L  T+
Sbjct: 225 LRLTK 229


>gi|6458615|gb|AAF10472.1|AE001942_4 (AE001942) ribosomal large
           subunit pseudouridine synthase B [Deinococcus
           radiodurans]
           Length = 257
           
 Score =  123 bits (306), Expect = 3e-27
 Identities = 89/240 (37%), Positives = 119/240 (49%)
 Frame = +1

Query: 79  ERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSVKSGDKIELDGRSFVASALTE 258
           ERLHK LA+AG+ SRRA E+ I  G + VNG  A LG  V   D + +DGR  V     E
Sbjct: 4   ERLHKRLARAGIASRRAAEELIRAGRVTVNGQTAGLGQGVNDTDDVRVDGR-LVELTRPE 62

Query: 259 PARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGEL 438
                +Y KP G VTT  D  GR  V + +P + G     +GRLD +         DG+L
Sbjct: 63  TVTYALY-KPVGFVTTAHDEYGRRNVLDAMPDVPGLH--PVGRLDKDSEGLLLLTNDGDL 119

Query: 439 ANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTDS 618
              + HP    E+ Y       EG E      L+ L RG+ ++DG A     + +    +
Sbjct: 120 TLTLTHPRYGHEKAYRAWT---EGREPPTQAELDVLVRGIAMDDGPA-----QALSAAPA 171

Query: 619 HDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQVEAL 798
            D   VV+ EGRNR+VRR+ E+ G  V RL R R G + L  +L  G+  EL    +E L
Sbjct: 172 EDGAYVVLGEGRNRQVRRMLEALGHPVGRLVRYRVGGLWL-GDLNPGEYRELGPRDLEQL 230


>gi|4980760|gb|AAD35352.1|AE001708_20 (AE001708) 16S pseudouridylate
           synthase [Thermotoga maritima]
           Length = 239
           
 Score =  120 bits (298), Expect = 3e-26
 Identities = 80/250 (32%), Positives = 136/250 (54%), Gaps = 2/250 (0%)
 Frame = +1

Query: 82  RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIA-QLGMSVKSGDKIELDGRSFVASALTE 258
           RL + L+ +G+G+R+ +++ I  G + VNG +    G  V   D + LDG         +
Sbjct: 2   RLDRYLSNSGVGTRKEVKKLIKQGRVTVNGRVVLDPGHPVLENDAVALDGE---VVRFHK 58

Query: 259 PARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGEL 438
              +L Y KP G VT+ +DP    T+ E LP LKG     +GRLD +         DG+ 
Sbjct: 59  KVYILFY-KPSGYVTSTKDPHSE-TIMEFLPPLKGI--FPVGRLDKDAEGLLIITNDGDF 114

Query: 439 ANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGT-AKFDTIERIGNTD 615
           A+ ++ P   +E+EY+V+V   EGE  V ++ +E+L  GV L DG  AK   +E++ N  
Sbjct: 115 AHRVISPKWSVEKEYIVKV---EGE--VTEDKIEKLKNGVTLRDGFFAKAKRVEKLSN-- 167

Query: 616 SHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQVEA 795
             D  ++V+ EG+  +++R+  + G +   LKRTR G ++LP ++  G+   L + +V+ 
Sbjct: 168 --DTLKIVITEGKYHQIKRMTAAVGLKTVHLKRTRIGGLVLPDDMKPGEYRFLSEEEVKK 225

Query: 796 LRTQLKLEKDMP 831
           +  +   ++D P
Sbjct: 226 VFEREDQKEDTP 237


>gi|3915347|sp|O51155|Y129_BORBU HYPOTHETICAL PROTEIN BB0129
           >gi|2688006 (AE001124) conserved hypothetical protein
           [Borrelia burgdorferi]
           Length = 249
           
 Score =  118 bits (292), Expect = 1e-25
 Identities = 74/248 (29%), Positives = 129/248 (51%), Gaps = 1/248 (0%)
 Frame = +1

Query: 82  RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSVKSGDKIELDGRSFVASALTEP 261
           R+H  LA+ G+GSRR  E+ I   L++VN  IA+LG  V  GD+I    + FV       
Sbjct: 8   RVHVFLAEKGVGSRRFCEELIRKKLVRVNNTIAKLGDKVTLGDRIIYKKQIFVFKDFQIN 67

Query: 262 ARV-LIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGEL 438
            R+ L  NKP   + +  D +GR      +  L   R  +IGRLD           DG+ 
Sbjct: 68  NRIYLALNKPRNYLCSNFDVDGRKLAISLVQPLFKERVFSIGRLDFKSSGLLLFTNDGKF 127

Query: 439 ANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTDS 618
           AN ++HP  ++EREY++     E ++ + + LL     G+ ++    K  + E +    +
Sbjct: 128 ANDIIHPRQKVEREYII-----ESKKDIDENLLISFKSGIKVKKEFFKLKSYEILNKNSA 182

Query: 619 HDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQVEAL 798
               R+++ EG+NRE+R+++ S+   + ++ R R G++ L   L  GQ   +P +++ +L
Sbjct: 183 ----RLILDEGKNREIRKVFLSKNIFLKKIHRIRIGNINLD-SLKEGQVKIVPLSKINSL 237

Query: 799 RTQLKLEKD 825
           +++L+   D
Sbjct: 238 KSRLEKLND 246


>gi|6647946|sp|Q9ZD06|Y544_RICPR HYPOTHETICAL PROTEIN RP544
           >gi|3861093|emb|CAA14993| (AJ235272) unknown [Rickettsia
           prowazekii]
           Length = 235
           
 Score =  116 bits (288), Expect = 4e-25
 Identities = 78/217 (35%), Positives = 123/217 (55%), Gaps = 1/217 (0%)
 Frame = +1

Query: 82  RLHKVLAQAGLGSRRALEQRISNGLIKVNG-DIAQLGMSVKSGDKIELDGRSFVASALTE 258
           RL K+++ AG+ SRR  E+ I  G +K++G  I     +V   ++IE+ GR       T+
Sbjct: 3   RLAKIISNAGVCSRRNAEKLIVGGKVKIDGITILSPATNVDMSNQIEVSGRLINN---TQ 59

Query: 259 PARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGEL 438
             R+ IY KP G +TT +DP  R TVFE L  L   R I+IGRLD+N          G+L
Sbjct: 60  KPRLWIYYKPVGLITTHKDPLSRKTVFEQLIGLP--RVISIGRLDLNSEGLLLLTNSGDL 117

Query: 439 ANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTDS 618
           A+    P+S+++R Y VR     G  ++   LL+   + + ++       +I+ +    S
Sbjct: 118 AHQFEMPASKLKRVYNVRAY---GNPNI---LLKNNYKNLKIDGIFYNPHSIKLLRQNKS 171

Query: 619 HDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSV 732
           + WF VV+ EG+NRE+RR++E  G QV++L R +YG++
Sbjct: 172 NSWFEVVLFEGKNREIRRIFEYFGLQVNKLIRIQYGAL 209


>gi|5738484|emb|CAB52832.1| (AL109848) putative pseudouridine
           synthase [Streptomyces coelicolor A3(2)]
           Length = 371
           
 Score =  115 bits (284), Expect = 1e-24
 Identities = 80/246 (32%), Positives = 123/246 (49%), Gaps = 2/246 (0%)
 Frame = +1

Query: 79  ERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIA-QLGMSVK-SGDKIELDGRSFVASAL 252
           ERL KVLA+AG GSRRA E+ I    +++NG+I  + G  V    D++++DG     +  
Sbjct: 135 ERLQKVLARAGYGSRRACEELIEQARVEINGEIVLEQGRRVDPEKDEVKVDG----LTVA 190

Query: 253 TEPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDG 432
           T+  +    NKP G V+T EDPEGR  + + +   +  R   +GRLD            G
Sbjct: 191 TQSYQFFSLNKPAGVVSTMEDPEGRQCLGDYV-TNRETRLFHVGRLDTETEGVILLTNHG 249

Query: 433 ELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNT 612
           ELA+ + HP   +++ Y+  +  P     +  +L ++L  G+ LEDG A+ D    +  T
Sbjct: 250 ELAHRLTHPRYGVKKTYLAHIVGP-----IPRDLGKRLKDGIQLEDGYARADHFRVVEQT 304

Query: 613 DSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQVE 792
             +    V + EGR   VRR+    G  V  L RT +G + L  +   G    L  T+V 
Sbjct: 305 GKNYLVEVTLHEGRKHIVRRMLAEAGFPVDNLVRTAFGPITL-GDQKSGWLRRLSNTEVG 363

Query: 793 ALRTQLKL 816
            L  ++ L
Sbjct: 364 MLMQEVDL 371


>gi|3915377|sp|Q55578|Y361_SYNY3 HYPOTHETICAL 28.2 KD PROTEIN
           SLR0361 >gi|1001457|dbj|BAA10082| (D63999) hypothetical
           protein [Synechocystis sp.]
           Length = 249
           
 Score =  113 bits (281), Expect = 3e-24
 Identities = 83/248 (33%), Positives = 122/248 (48%), Gaps = 6/248 (2%)
 Frame = +1

Query: 73  LEERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSVK-SGDKIELDGRSFVASA 249
           + ER+ K+L+Q G+ SRR  E+ I  G + VNG +A LG       D + +DG+   A  
Sbjct: 1   MAERIQKLLSQWGIASRRHAEEMILAGRVSVNGKVANLGDKADPQQDFLSVDGKQIKADN 60

Query: 250 LTEPARVLIYNKPEGEVTTREDPEGRPTVFETLP--VLKGARWIAIGRLDINXXXXXXXX 423
                 +L+ NKP   ++T +DP GR TV + LP  + +G     +GRLD N        
Sbjct: 61  RPRDIYLLV-NKPRDVLSTCDDPRGRKTVLDLLPQDLQRGKGLHPVGRLDRNSTGALLLT 119

Query: 424 XDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERI 603
            DGEL   + HP   + + Y V +     E +  DE LE+   G+ML+       T+E I
Sbjct: 120 NDGELTLRLTHPRYHLPKTYDVWL-----EGNPSDEDLEKWRSGMMLDGKKTLPATLEVI 174

Query: 604 GNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLL---PRELLRGQSTEL 774
                     V + EGRNR++RRL E  G  V +L R   G + L    + L  GQ   L
Sbjct: 175 SENKDQIHLLVTLTEGRNRQIRRLAEELGLTVLKLHRRTIGPLQLHTRGKVLGSGQFRFL 234

Query: 775 PKTQVEALRTQLKL 816
              ++  L+ Q+ L
Sbjct: 235 SPAEIRLLKKQVNL 248


>gi|3915445|sp|O67444|YE64_AQUAE HYPOTHETICAL PROTEIN AQ_1464
           >gi|2983856 (AE000741) hypothetical protein [Aquifex
           aeolicus]
           Length = 249
           
 Score =  110 bits (273), Expect = 2e-23
 Identities = 81/236 (34%), Positives = 126/236 (53%), Gaps = 10/236 (4%)
 Frame = +1

Query: 73  LEERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQ-LGMSVKSG-DKIELDGRSFVAS 246
           +E R++K L++AG+ SRR  E+ I  G +KVNG++ + LG+ V    D +E+DG+     
Sbjct: 2   MEVRINKFLSEAGVASRRKAEKLILEGRVKVNGEVVRSLGVKVNPEVDIVEVDGKP---- 57

Query: 247 ALTEPARVLIYNKPEGEVTTR-EDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXX 423
              +  R +I NKP   +T     P+GR T+ E +  +   R   +GRLD N        
Sbjct: 58  VKPQRKRYIILNKPCCYLTQLGRSPDGRKTIEELIKDIP-ERVFPVGRLDYNTEGLLILT 116

Query: 424 XDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERI 603
            DGELAN ++HP  ++ + Y+  V     E  V  + L+++ +G+ LEDG AK D I  +
Sbjct: 117 NDGELANRILHPRYKLPKVYLALV-----EGKVDQKTLKRMKQGIELEDGFAKPDNIRIV 171

Query: 604 GNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLP-------RELLRGQ 762
                +    +   EGR   V+R   + G +V RLKR   G + L        REL +G+
Sbjct: 172 RYEGKNTLLEITFHEGRKHLVKRFLGAFGHKVKRLKRIAIGPIKLGKLSPGKWRELNQGE 231

Query: 763 STELPK 780
             +L K
Sbjct: 232 LAQLFK 237


>gi|3915547|sp|O33210|YH11_MYCTU HYPOTHETICAL 27.6 KD PROTEIN RV1711
           >gi|2326754|emb|CAB10968| (Z98268) hypothetical protein
           Rv1711 [Mycobacterium tuberculosis]
           Length = 254
           
 Score =  108 bits (266), Expect = 2e-22
 Identities = 79/222 (35%), Positives = 111/222 (49%), Gaps = 4/222 (1%)
 Frame = +1

Query: 82  RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIA-QLGMSVKSGDKI-ELDGRSFVASALT 255
           RL KVL+QAG+ SRRA E+ I +G ++V+G +  +LG  V     +  +DG   V   L 
Sbjct: 15  RLQKVLSQAGIASRRAAEKMIVDGRVEVDGHVVTELGTRVDPQVAVVRVDGARVV---LD 71

Query: 256 EPARVLIYNKPEGEVTTREDPEGRPTVFETLP--VLKGARWIAIGRLDINXXXXXXXXXD 429
           +    L  NKP G  +T  D  GRP + + +   V    +   +GRLD +         D
Sbjct: 72  DSLVYLALNKPRGMHSTMSDDRGRPCIGDLIERKVRGTKKLFHVGRLDADTEGLMLLTND 131

Query: 430 GELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGN 609
           GELA+ +MHPS E+ + Y+  V        V   L   L  G+ L+DG A  D    +  
Sbjct: 132 GELAHRLMHPSHEVPKTYLATVTGS-----VPRGLGRTLRAGIELDDGPAFVDDFAVVDA 186

Query: 610 TDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRE 747
                  RV + EGRNR VRRL  + G  V  L RT  G+V L ++
Sbjct: 187 IPGKTLVRVTLHEGRNRIVRRLLAAAGFPVEALVRTDIGAVSLGKQ 232


>gi|3915542|sp|O05668|YH11_MYCLE HYPOTHETICAL 28.1 KD PROTEIN
           CB1351.03C >gi|2065214|emb|CAB08276| (Z95117)
           MLC1351.03c, unknown, len: 256 aa, similar to eg.
           YCIL_ECOLI P37765 hypothetical 32.7 kd protein in trpl-
           (291 aa), fasta clones, opt: 481 z-score: 570.9 E():
           8.5e-25, (42.4% identity in 229 aa overlap); contains
           PS011...
           Length = 256
           
 Score =  106 bits (263), Expect = 4e-22
 Identities = 81/233 (34%), Positives = 119/233 (50%), Gaps = 11/233 (4%)
 Frame = +1

Query: 82  RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIA-QLGMSVKSG-DKIELDGRSFVASALT 255
           RL K+L++AG+ SRRA E+ I  G ++V+G +  +LG  V      + +DG   V   + 
Sbjct: 17  RLQKILSRAGIASRRAAEKLIIEGRVEVDGQLVRELGTRVDPDVSVVRVDG---VKVVVD 73

Query: 256 EPARVLIYNKPEGEVTTREDPEGRPTVFETLP--VLKGARWIAIGRLDINXXXXXXXXXD 429
           +    L  NKP G  +T  D  GRP V + +   V    +   +GRLD +         D
Sbjct: 74  DSLVYLALNKPRGMYSTMSDDRGRPCVGDLIERRVRGNKKLFHVGRLDADTEGLILLTND 133

Query: 430 GELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGN 609
           GELA+ +MHPS E+ + Y+  V+       V   L ++L+ G+ L+DG A  D    +  
Sbjct: 134 GELAHRLMHPSHEVSKTYLATVKGA-----VPRGLGKKLSVGLELDDGPAHVDDFAVVDA 188

Query: 610 TDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLP-------RELLRGQST 768
                  R+ + EGR R VRRL  + G  V  L RT  G+V L        R LLR +  
Sbjct: 189 IPGKTLVRLTLHEGRKRIVRRLLTAAGFPVEMLVRTDIGAVSLGDQRPGCLRALLRDEIR 248

Query: 769 ELPK 780
           +L K
Sbjct: 249 QLYK 252


>gi|3915394|sp|O66829|Y554_AQUAE HYPOTHETICAL PROTEIN AQ_554
           >gi|2983194 (AE000695) hypothetical protein [Aquifex
           aeolicus]
           Length = 238
           
 Score =  105 bits (259), Expect = 1e-21
 Identities = 74/244 (30%), Positives = 135/244 (55%), Gaps = 4/244 (1%)
 Frame = +1

Query: 82  RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIA-QLGMSVKSGDKIELDGRSFVASALTE 258
           RL K L+++   SR+  ++ I  G +KV+G +  Q    VK G+++E++G+S       +
Sbjct: 2   RLDKYLSKSLHISRKEAKELIREGRVKVSGKVVKQAEYRVKEGEEVEVEGKS------VK 55

Query: 259 PAR--VLIYNKPEGEVTTREDPEGRPTVFETLPV-LKGARWIAIGRLDINXXXXXXXXXD 429
           P +   L+  KP+G ++T E+ +  P+  E +       +  + GRLD++         D
Sbjct: 56  PKKNVYLMLYKPKGYLSTTEEDKKYPSFLELIREHFPSRKLFSAGRLDVDAEGLLLITDD 115

Query: 430 GELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGN 609
           GELA+ + HP  ++E+EY+VR+     +  + DE L++L   V LE+   +    E++  
Sbjct: 116 GELAHRLTHPKWKVEKEYIVRL-----DRDIGDEELKKLYE-VKLEEKPVQLVKAEKL-- 167

Query: 610 TDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQV 789
             S D  + ++ EGR+  V+RL+++ G  V  LKRTR G++ L   +  G+  EL + +V
Sbjct: 168 --SGDTVKAILTEGRHHVVKRLFKAVGHNVVYLKRTRVGNLRLDENMEPGEWRELTEEEV 225

Query: 790 EALRTQLK 813
           + L+  +K
Sbjct: 226 KELKRLVK 233


>gi|4155973 (AE001558) putative [Helicobacter pylori J99]
           Length = 262
           
 Score =  103 bits (255), Expect = 3e-21
 Identities = 74/219 (33%), Positives = 119/219 (53%), Gaps = 9/219 (4%)
 Frame = +1

Query: 82  RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSVKSGDKIELDGRSFVASALTEP 261
           R+++ LA     SRR  E+ +  G +K+N + A+L   VK  DK+ LD R  +     + 
Sbjct: 7   RINQFLAHYTKHSRREAEKLVLEGRVKINHEHAKLASVVKENDKVFLDKR-LIKPLKNKK 65

Query: 262 ARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGELA 441
             VL+Y+KP+GE+ ++ DP  R  ++E+L   K A +  +GRLD              + 
Sbjct: 66  FSVLVYHKPKGELVSKADPLKRHVIYESLEK-KYAHFAPVGRLDFASEGVLLLSDSKAVV 124

Query: 442 NAMMHPSSEIEREYVVRVR---SPEGEEHVQDEL-LEQLTRGVMLEDGT-----AKFDTI 594
           +A+MH  + +E+EY+V+++   + E E  +Q+ L LE  T+G   +        A F   
Sbjct: 125 SALMH--ANLEKEYLVKIQGFVTREMENAMQEGLKLENATKGAHQKTPIKSMEFAPFIGY 182

Query: 595 ERIGNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLL 738
           E I N   +   RV++ EG+NRE+RR +      V  L+R RYG V L
Sbjct: 183 EIIKNHAKYSKLRVIINEGKNRELRRFFAFFNAGVLDLRRVRYGFVNL 230


>gi|2501526|sp|P55986|YE59_HELPY HYPOTHETICAL PROTEIN HP1459
           >gi|2314637|gb|AAD08501.1| (AE000646) conserved
           hypothetical protein [Helicobacter pylori 26695]
           Length = 262
           
 Score =  102 bits (253), Expect = 5e-21
 Identities = 72/219 (32%), Positives = 120/219 (53%), Gaps = 9/219 (4%)
 Frame = +1

Query: 82  RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSVKSGDKIELDGRSFVASALTEP 261
           R+++ LA     SRR  E+ +  G +K+N + A+L   VK  D++ LD R  +     + 
Sbjct: 7   RINQFLAHYTKHSRREAEKLVLEGRVKINHEHAKLASVVKENDRVFLDKR-LIKPLKNKK 65

Query: 262 ARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGELA 441
             VL+Y+KP+GE+ ++ DP  R  ++E+L   K A +  +GRLD              + 
Sbjct: 66  FSVLVYHKPKGELVSKADPLKRRVIYESLEK-KYAHFAPVGRLDFASEGVLLLSDSKAVV 124

Query: 442 NAMMHPSSEIEREYVVRVR---SPEGEEHVQDEL-LEQLTRGVMLEDGT-----AKFDTI 594
           +A+MH  +++E+EY+++++   + E E  +Q+ L LE  T+G   +        A F   
Sbjct: 125 SALMH--ADLEKEYLIKIQGFVTREMENAMQEGLKLENATKGAHQKTPIKSMEFAPFIGY 182

Query: 595 ERIGNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLL 738
           E I N   +   RV++ EG+NRE+RR +      V  L+R RYG V L
Sbjct: 183 EIIKNHAKYSKLRVIINEGKNRELRRFFAFFNAGVLDLRRVRYGFVNL 230


>gi|418534|sp|P32684|YJBC_ECOLI HYPOTHETICAL 32.5 KD PROTEIN IN
           PEPE-LYSC INTERGENIC REGION >gi|396357 (U00006) No
           definition line found [Escherichia coli] >gi|1790453
           (AE000475) orf, hypothetical protein [Escherichia coli]
           Length = 290
           
 Score = 95.6 bits (234), Expect = 9e-19
 Identities = 66/224 (29%), Positives = 117/224 (51%)
 Frame = +1

Query: 67  PKLEERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSVKSGDKIELDGRSFVAS 246
           P    RL+K ++++G+ SRR  ++ I  G + +NG  A +G  VK GD ++++G+  +  
Sbjct: 3   PDSSVRLNKYISESGICSRREADRYIEQGNVFLNGKRATIGDQVKPGDVVKVNGQ-LIEP 61

Query: 247 ALTEPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXX 426
              E   ++  NKP G V+T ED E R  + +   V    R   IGRLD +         
Sbjct: 62  REAEDLVLIALNKPVGIVSTTEDGE-RDNIVDF--VNHSKRVFPIGRLDKDSQGLIFLTN 118

Query: 427 DGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIG 606
            G+L N ++   ++ E+EY+V V  P     + +E +  ++ GV +     K   +++  
Sbjct: 119 HGDLVNKILRAGNDHEKEYLVTVDKP-----ITEEFIRGMSAGVPILGTVTKKCKVKK-- 171

Query: 607 NTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLL 738
             ++   FR+ + +G NR++RR+ E  G +V +L+RTR  +V L
Sbjct: 172 --EAPFVFRITLVQGLNRQIRRMCEHFGYEVKKLERTRIMNVSL 213


>gi|3329180|gb|AAC68318.1| (AE001343) predicted pseudouridine
           synthase [Chlamydia trachomatis]
           Length = 241
           
 Score = 82.7 bits (201), Expect = 7e-15
 Identities = 71/238 (29%), Positives = 117/238 (48%), Gaps = 4/238 (1%)
 Frame = +1

Query: 82  RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSV---KSGDKIELDGRSFVASAL 252
           RL+K LA AG+ SRR  ++ I  G + VNG +A  G  V   +  D +E+ G+   A   
Sbjct: 5   RLNKFLASAGVASRRKCDEIIFAGSVTVNGRVAA-GPFVTVDEEFDSVEVGGQRIGA--- 60

Query: 253 TEPARVLIYNKPEGEVTTREDP-EGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXD 429
            E     + +KP G + + E    G   V + L      R   +GRLD           D
Sbjct: 61  -EKKVYFMVHKPLGYLCSSERKFPGSKLVIDLLSHCP-YRLFTVGRLDKETSGLILVTND 118

Query: 430 GELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGN 609
           GE AN ++HPS  I +EY+++V        V    LE L  G +++    +  +++++  
Sbjct: 119 GEFANRVIHPSFGITKEYLLKV-----SRDVTARDLETLMAGTVIDGKVVRPVSVKKV-- 171

Query: 610 TDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQV 789
                  +++V EG+  E+R   E+ G Q+  LKR R GS++L   L  G+  EL  +++
Sbjct: 172 --RRGTIKIIVNEGKKHEIRLFAEAAGLQLLELKRIRIGSLVL-GGLPYGKYRELTDSEL 228

Query: 790 EA 795
           ++
Sbjct: 229 DS 230


>gi|465610|sp|P33918|RSUA_ECOLI RIBOSOMAL SMALL SUBUNIT
           PSEUDOURIDINE SYNTHASE A (16S PSEUDOURIDYLATE 516
           SYNTHASE) (16S PSEUDOURIDINE 516 SYNTHASE) (URACIL
           HYDROLYASE) >gi|1042177|bbs|169371 16S RNA pseudouridine
           516 synthase, 16S RNA psi 516 synthase=rsuA gene product
           [Escherichia coli, Peptide, 231 aa] >gi|405907 (U00008)
           yejD [Escherichia coli] >gi|1788510 (AE000308) 16S
           pseudouridylate 516 synthase [Escherichia coli]
           Length = 231
           
 Score = 82.7 bits (201), Expect = 7e-15
 Identities = 71/239 (29%), Positives = 111/239 (45%), Gaps = 4/239 (1%)
 Frame = +1

Query: 82  RLHKVLAQAGLGSRRALEQR-ISNGLIKVNGDIAQ-LGMSVKSGDKIELDGRSFVASALT 255
           RL K +AQ  LG  RA+  R I    + V+G+I +     +     +  DG      A  
Sbjct: 2   RLDKFIAQQ-LGVSRAIAGREIRGNRVTVDGEIVRNAAFKLLPEHDVAYDGNPL---AQQ 57

Query: 256 EPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGE 435
              R  + NKP+G V + +DP+  PTV   L      +  A GRLDI+         DG+
Sbjct: 58  HGPRYFMLNKPQGYVCSTDDPD-HPTVLYFLDEPVAWKLHAAGRLDIDTTGLVLMTDDGQ 116

Query: 436 LANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVML--EDGTAKFDTIERIGN 609
            ++ +  P    E+ Y+V + SP     V D+  EQ  +GV L  E    K   +E I  
Sbjct: 117 WSHRITSPRHHCEKTYLVTLESP-----VADDTAEQFAKGVQLHNEKDLTKPAVLEVITP 171

Query: 610 TDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQV 789
           T      R+ + EGR  +V+R++ + G  V  L R R G + L  +L  G+   L + ++
Sbjct: 172 TQ----VRLTISEGRYHQVKRMFAAVGNHVVELHRERIGGITLDADLAPGEYRPLTEEEI 227

Query: 790 EAL 798
            ++
Sbjct: 228 ASV 230


>gi|2145990|pir||S72955 u0247g protein - Mycobacterium leprae
           >gi|467162 (U00021) u0247g [Mycobacterium leprae]
           Length = 186
           
 Score = 82.3 bits (200), Expect = 9e-15
 Identities = 60/170 (35%), Positives = 84/170 (49%), Gaps = 9/170 (5%)
 Frame = +1

Query: 271 LIYNKPEGEVTTREDPEGRPTVFETLP--VLKGARWIAIGRLDINXXXXXXXXXDGELAN 444
           L  NKP G  +T  D  GRP V + +   V    +   +GRLD +         DGELA+
Sbjct: 9   LALNKPRGMYSTMSDDRGRPCVGDLIERRVRGNKKLFHVGRLDADTEGLILLTNDGELAH 68

Query: 445 AMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTDSHD 624
            +MHPS E+ + Y+  V+       V   L ++L+ G+ L+DG A  D    +       
Sbjct: 69  RLMHPSHEVSKTYLATVKGA-----VPRGLGKKLSVGLELDDGPAHVDDFAVVDAIPGKT 123

Query: 625 WFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLP-------RELLRGQSTELPK 780
             R+ + EGR R VRRL  + G  V  L RT  G+V L        R LLR +  +L K
Sbjct: 124 LVRLTLHEGRKRIVRRLLTAAGFPVEMLVRTDIGAVSLGDQRPGCLRALLRDEIRQLYK 182


>gi|4377181|gb|AAD19002| (AE001667) predicted pseudouridine synthase
           [Chlamydia pneumoniae]
           Length = 235
           
 Score = 81.9 bits (199), Expect = 1e-14
 Identities = 71/245 (28%), Positives = 118/245 (47%), Gaps = 1/245 (0%)
 Frame = +1

Query: 82  RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLG-MSVKSGDKIELDGRSFVASALTE 258
           RL+K LA AG+ SRR  ++ I +G + VNG +A+   + V   DK+++ G S     LT+
Sbjct: 5   RLNKFLASAGVASRRKCDEIIFSGSVTVNGRVAEGPFVLVDPEDKVQVGGTSV---HLTK 61

Query: 259 PARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGEL 438
               +++       ++ +   G   V +    L   R   +GRLD           DGE 
Sbjct: 62  KVYFMVHKAIGYLCSSEKKFPGTKLVIDLFAHLP-YRVFTVGRLDKETSGLILVTNDGEF 120

Query: 439 ANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTDS 618
           AN ++HPSS I +EY+++V        V  + L +L  G  ++    +  ++ +I     
Sbjct: 121 ANKIIHPSSGITKEYLLKV-----SRDVSAKDLGKLMEGTFIDGKHVRPVSVTKI----R 171

Query: 619 HDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQVEAL 798
               ++VV EG+  E+R   ++ G  +  LKR R GS++L   L  G+  EL   +   L
Sbjct: 172 RGTVKIVVSEGKKHEIRLFADAAGFPILELKRIRIGSLVL-GGLRYGEYRELTDAE---L 227

Query: 799 RTQLKL 816
            T +KL
Sbjct: 228 GTYMKL 233


>gi|3322747 (AE001223) conserved hypothetical protein [Treponema
           pallidum]
           Length = 261
           
 Score = 78.0 bits (189), Expect = 2e-13
 Identities = 70/224 (31%), Positives = 108/224 (47%), Gaps = 17/224 (7%)
 Frame = +1

Query: 67  PKLEERLHKVLAQAGLGSRRALEQRISNGLIKVNGD-IAQLGMSVKSGDKIELDGRSFVA 243
           P    RL   LA++G  SRRA E  I++G + V+G  +   G +V + + + +DG     
Sbjct: 6   PFFRLRLQVYLARSGCASRRACEALIASGRVTVDGQTVTTQGRTVCAQNVVCVDG---TV 62

Query: 244 SALTEPARVLIYNKPEGEVTTR--EDPEG-----------RPTVFETLPVLKGA---RWI 375
             L    R ++  KP G + +   + P G           +      + +++ A   R  
Sbjct: 63  VQLERVQRYVLLYKPVGYICSLAPQFPAGYAHTQVRAGPSKQEYARAIDLVQPAYQERLY 122

Query: 376 AIGRLDINXXXXXXXXXDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRG 555
            IGRLD+          DG  A A+ HP S IE+EY+V  R P     V   LL    RG
Sbjct: 123 HIGRLDVRSEGALLFTNDGSFAQALGHPRSGIEKEYIVETREP-----VPAALLSSFVRG 177

Query: 556 VMLEDGTAKFDTIERIGNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVL 735
           V +E    +      +    +    ++V+ EG+ RE+R ++E+ G  V RL R R G V 
Sbjct: 178 VWVEGCRYRCVRARHL----AAQCVQLVLVEGKKREIRVVFEAWGQDVVRLVRVRIGRVR 233

Query: 736 L 738
           L
Sbjct: 234 L 234


>gi|1175857|sp|P45124|RSUA_HAEIN RIBOSOMAL SMALL SUBUNIT
           PSEUDOURIDINE SYNTHASE A (16S PSEUDOURIDYLATE 516
           SYNTHASE) (16S PSEUDOURIDINE 516 SYNTHASE) (URACIL
           HYDROLYASE) >gi|1074693|pir||F64169 hypothetical protein
           HI1243 - Haemophilus influenzae (strain Rd KW20)
           >gi|1574175 (U32804) 16s pseudouridylate 516 synthase
           (rsuA) [Haemophilus influenzae Rd]
           Length = 232
           
 Score = 73.7 bits (178), Expect = 3e-12
 Identities = 60/239 (25%), Positives = 110/239 (45%), Gaps = 2/239 (0%)
 Frame = +1

Query: 82  RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSVKSGDKIELDGRSFVASALT-- 255
           RL K +A+    +R    + I    +K+NG+I + G SV+   + E+    F    LT  
Sbjct: 2   RLDKFIAENVGLTRSQATKAIRQSAVKINGEIVKSG-SVQISQEDEI---YFEDELLTWI 57

Query: 256 EPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGE 435
           E  +  + NKP+G V + +D +  PT+++        +  + GRLD++         DG+
Sbjct: 58  EEGQYFMLNKPQGCVCSNDDGD-YPTIYQFFDYPLAGKLHSAGRLDVDTTGLVLLTDDGQ 116

Query: 436 LANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTD 615
            ++ +  P    E+ Y+V +  P  E +        L RG       AK + ++      
Sbjct: 117 WSHRITSPKHHCEKTYLVTLADPVEENYSAACAEGILLRGEKEPTKPAKLEILDDYN--- 173

Query: 616 SHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQVEA 795
                 + + EGR  +V+R++ + G +V  L R + G V+L   L  G+   L ++++E 
Sbjct: 174 ----VNLTISEGRYHQVKRMFAALGNKVVGLHRWKIGDVVLDESLEEGEYRPLTQSEIEK 229

Query: 796 L 798
           L
Sbjct: 230 L 230


>gi|3845303 (AE001423) pseudouridine synthetase (RsuA fam.); 1st
           euk. member (OO) [Plasmodium falciparum]
           Length = 338
           
 Score = 67.1 bits (161), Expect = 3e-10
 Identities = 65/240 (27%), Positives = 120/240 (49%), Gaps = 53/240 (22%)
 Frame = +1

Query: 82  RLHKVLAQAGLGSRRALEQRISNGLIKVNGDI-AQLGMSVKSGD--------KIELDGR- 231
           RL+K+++     SRR  ++ I +G +K+N  I    G  V  G         KI+L    
Sbjct: 47  RLNKLISMKRNISRRKSDEFIKDGKVKINNKIITNPGTHVHIGKDSLRIYDKKIKLTNII 106

Query: 232 SFVASALTEPARVLIYNKPEGEVTTREDPEGRPTVFETLP--VLKGARWIAIGRLDINXX 405
           + +     +  + ++ +KP+G + T  D + R +++   P  +L+  R + +GRLD N  
Sbjct: 107 NMIKQNENKLHKWIVLHKPKGLLCTSNDEKNRKSIYTLFPEEMLQKYRLVTVGRLDRNTS 166

Query: 406 XXXXXXXDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDG---- 573
                  D    N + HP  +  R Y V +  P     V+   L++L RG+ LE+     
Sbjct: 167 GVLLLTNDYAWVNKLTHPKYQRIRTYRVHIEGP-----VKMNALKELARGIYLEEDEKTQ 221

Query: 574 -------------------------------------TAKFDTIERIGNTDSHDWFRVVV 642
                                                  + + I+   +T       + +
Sbjct: 222 PKKIYNYKESREKSNIDDKKKKKMSKMKKKTNPAFIEILREEKIKIKEDTKKITVLNISI 281

Query: 643 KEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQVEALR 801
           KEGRNR++R++++     V ++KRT + ++ L       Q  EL + +V  L+
Sbjct: 282 KEGRNRQIRKMFQQINQPVIKIKRTSFENITLKNIYFPKQYRELNQKEVNDLK 334


>gi|1175289|sp|P44827|YMFC_HAEIN HYPOTHETICAL PROTEIN HI0694
           >gi|1074484|pir||I64156 hypothetical protein HI0694 -
           Haemophilus influenzae (strain Rd KW20) >gi|1573697
           (U32752) conserved hypothetical protein [Haemophilus
           influenzae Rd]
           Length = 240
           
 Score = 66.7 bits (160), Expect = 4e-10
 Identities = 57/187 (30%), Positives = 89/187 (47%), Gaps = 14/187 (7%)
 Frame = +1

Query: 256 EPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGE 435
           +  +V+++NKP   +T   D +GR T+ + + +       A GRLD +         +GE
Sbjct: 49  DETKVVLFNKPFDVLTQFTDEQGRATLKDFISI---PNVYAAGRLDRDSEGLLILTNNGE 105

Query: 436 LANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTD 615
           L + +  P  + E+ Y V+V   EG     D  L QL +GV L+DG  K   +  I   +
Sbjct: 106 LQHRLADPKFKTEKTYWVQV---EGIPEETD--LAQLRKGVELKDGVTKSAKVRLISEPN 160

Query: 616 SHD--------------WFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELL 753
             +              W  + + EGRNR+VRR+    G    RL R   G +L    L 
Sbjct: 161 LWERNPPIRERKNIPTSWLEIKISEGRNRQVRRMTAHIGFPTLRLVRVSMG-LLSINGLE 219

Query: 754 RGQSTELPKTQVEALRTQLKL 816
            G    L   +++AL   +KL
Sbjct: 220 NGSFRLLSLDEIKALFQTVKL 240


>gi|1787380 (AE000213) orf, hypothetical protein [Escherichia coli]
           Length = 207
           
 Score = 57.8 bits (137), Expect = 2e-07
 Identities = 51/161 (31%), Positives = 74/161 (45%), Gaps = 15/161 (9%)
 Frame = +1

Query: 256 EPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGE 435
           +P RV+++NKP   +    D  GR T+ E +PV +G    A GRLD +         +G 
Sbjct: 27  QPTRVILFNKPYDVLPQFTDEAGRKTLKEFIPV-QGV--YAAGRLDRDSEGLLVLTNNGA 83

Query: 436 LANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGT---AKFDTIE--- 597
           L   +  P     + Y V+V     E     + LE L  GV L DG    A  + ++   
Sbjct: 84  LQARLTQPGKRTGKIYYVQV-----EGIPTQDALEALRNGVTLNDGPTLPAGAELVDEPA 138

Query: 598 ---------RIGNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLL 738
                    R   +    W ++ + EGRNR+VRR+    G    RL R   G   L
Sbjct: 139 WLWPRNPPIRERKSIPTSWLKITLYEGRNRQVRRMTAHVGFPTLRLIRYAMGDYSL 194


>gi|3916025|sp|P75966|YMFC_ECOLI HYPOTHETICAL 24.9 KD PROTEIN IN
           TRMU-ICDA INTERGENIC REGION >gi|4062699|dbj|BAA35957|
           (D90748) Hypothetical protein HI0694 [Escherichia coli]
           >gi|4062717|dbj|BAA35966| (D90749) Hypothetical protein
           HI0694 [Escherichia coli]
           Length = 217
           
 Score = 57.8 bits (137), Expect = 2e-07
 Identities = 51/161 (31%), Positives = 74/161 (45%), Gaps = 15/161 (9%)
 Frame = +1

Query: 256 EPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGE 435
           +P RV+++NKP   +    D  GR T+ E +PV +G    A GRLD +         +G 
Sbjct: 37  QPTRVILFNKPYDVLPQFTDEAGRKTLKEFIPV-QGV--YAAGRLDRDSEGLLVLTNNGA 93

Query: 436 LANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGT---AKFDTIE--- 597
           L   +  P     + Y V+V     E     + LE L  GV L DG    A  + ++   
Sbjct: 94  LQARLTQPGKRTGKIYYVQV-----EGIPTQDALEALRNGVTLNDGPTLPAGAELVDEPA 148

Query: 598 ---------RIGNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLL 738
                    R   +    W ++ + EGRNR+VRR+    G    RL R   G   L
Sbjct: 149 WLWPRNPPIRERKSIPTSWLKITLYEGRNRQVRRMTAHVGFPTLRLIRYAMGDYSL 204


>gi|3402689|gb|AAC28992.1| (AC004697) unknown protein [Arabidopsis
           thaliana]
           Length = 303
           
 Score = 54.3 bits (128), Expect = 2e-06
 Identities = 41/134 (30%), Positives = 63/134 (46%), Gaps = 12/134 (8%)
 Frame = +1

Query: 79  ERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMS--VKSGDKIELDGRSFVASAL 252
           +RL KVLA AG+ SRR  E+ I +G + VNG +     +    S D I ++G        
Sbjct: 160 QRLSKVLAAAGVASRRTSEELIFDGKVTVNGILCNTPQTRVDPSRDIIYVNGNRIPKK-- 217

Query: 253 TEPARVLIYNKPEGEVTTREDPEGRPTVF----------ETLPVLKGARWIAIGRLDINX 402
             P      NKP+G + +  + E +  +           +  P     R   +GRLD+  
Sbjct: 218 LPPKVYFALNKPKGYICSSGEKEIKSAISLFDEYLSSWDKRNPGTPKPRLFTVGRLDVAT 277

Query: 403 XXXXXXXXDGELANAMMHPSSEIERE 480
                   DG+ A  + HPSS + +E
Sbjct: 278 TGLIVVTNDGDFAQKLSHPSSSLPKE 303


>gi|3915559|sp|O32068|YTZF_BACSU HYPOTHETICAL 17.7 KD PROTEIN IN
           AMYX-OPUD INTERGENIC REGION >gi|2635487|emb|CAB14981|
           (Z99119) similar to hypothetical proteins [Bacillus
           subtilis]
           Length = 157
           
 Score = 53.9 bits (127), Expect = 3e-06
 Identities = 38/139 (27%), Positives = 64/139 (45%), Gaps = 1/139 (0%)
 Frame = +1

Query: 382 GRLDINXXXXXXXXXDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVM 561
           GRLD +         DG+LA+ ++ P   + + Y V ++S    E + D     L  GV 
Sbjct: 18  GRLDKDTEGFLLLTNDGQLAHRLLSPKKHVPKTYEVHLKSQISREDISD-----LETGVY 72

Query: 562 LEDGTAKFDTIERIGNTDS-HDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLL 738
           +E G         I   DS +    + + EG+  +V+++ ++ G +V  LKR   G V L
Sbjct: 73  IEGGYKTKPAKAEIKTNDSGNTVIYLTITEGKYHQVKQMAKAVGNEVVYLKRLSMGRVSL 132

Query: 739 PRELLRGQSTELPKTQVEAL 798
              L  G+  EL + ++  L
Sbjct: 133 DPALAPGEYRELTEEELHLL 152


>gi|3402691|gb|AAC28994.1| (AC004697) unknown protein [Arabidopsis
           thaliana]
           Length = 86
           
 Score = 51.5 bits (121), Expect = 2e-05
 Identities = 26/56 (46%), Positives = 37/56 (65%)
 Frame = +1

Query: 631 RVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQVEAL 798
           R+VV EGRN EVR L ++ G +V  LKR R G   LP +L  G+  EL +++++AL
Sbjct: 27  RIVVHEGRNHEVRELVKNAGLEVHSLKRVRIGGFRLPSDLGLGKHVELKQSELKAL 82


>gi|6689331|emb|CAB65436.1| (AJ250721) hypothetical protein
           [Synechococcus leopoliensis]
           Length = 199
           
 Score = 49.2 bits (115), Expect = 8e-05
 Identities = 40/150 (26%), Positives = 69/150 (45%), Gaps = 14/150 (9%)
 Frame = +1

Query: 265 RVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGELAN 444
           R L+++KP   V  +  P  RP        +       +GRLD +         +G L +
Sbjct: 4   RYLLFHKPYDAVC-QFSPSDRPDQQTLKDYIDVPEVYPVGRLDRDSEGLLLLTNNGALQH 62

Query: 445 AMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTDSHD 624
            + HP    +R Y V+V     E    +  L+ L +GV ++D   +   ++R+ +    +
Sbjct: 63  RLCHPRFGHDRTYWVQV-----EREPTEAALQALRQGVQIQDYRTRPAKVQRLDDPQIPE 117

Query: 625 --------------WFRVVVKEGRNREVRRLWESQGCQVSRLKR 714
                         W  + ++EGRNR+VRR+  + G    RL R
Sbjct: 118 RDPPIRFRKTVPTAWLALTLQEGRNRQVRRMTAAVGHPTLRLIR 161


>gi|3915395|sp|P72581|Y612_SYNY3 HYPOTHETICAL 21.0 KD PROTEIN
           SLR0612
           Length = 261
           
 Score = 48.4 bits (113), Expect = 1e-04
 Identities = 46/158 (29%), Positives = 71/158 (44%), Gaps = 18/158 (11%)
 Frame = +1

Query: 247 ALTEPARVLIYNKPEGEVT--TREDPEGRPTV--FETLPVLKGARWIAIGRLDINXXXXX 414
           AL +  + +++ KP G +   T      RPT+  +  LP L       +GRLD +     
Sbjct: 34  ALNKTPQTIVFYKPYGVLCQFTDNSAHPRPTLKDYINLPDL-----YPVGRLDQDSEGLL 88

Query: 415 XXXXDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTI 594
               +G+L + + H     +R Y  +V     E    DE LE L RG+   D   +    
Sbjct: 89  LLTSNGKLQHRLAHREFAHQRTYFAQV-----EGSPTDEDLEPLRRGITFADYPTRPAIA 143

Query: 595 ERIGNTD--------------SHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTR 720
           + I   D                 W  + + EGRNR+VRR+  + G    RL R +
Sbjct: 144 KIITEPDFPPRNPPIRYRASIPTSWLSITLTEGRNRQVRRMTAAVGFPTLRLVRVQ 199


>gi|1651652|dbj|BAA16580| (D90899) hypothetical protein
           [Synechocystis sp.]
           Length = 185
           
 Score = 42.2 bits (97), Expect = 0.010
 Identities = 34/114 (29%), Positives = 51/114 (43%), Gaps = 14/114 (12%)
 Frame = +1

Query: 379 IGRLDINXXXXXXXXXDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGV 558
           +GRLD +         +G+L + + H     +R Y  +V     E    DE LE L RG+
Sbjct: 1   MGRLDQDSEGLLLLTSNGKLQHRLAHREFAHQRTYFAQV-----EGSPTDEDLEPLRRGI 55

Query: 559 MLEDGTAKFDTIERIGNTD--------------SHDWFRVVVKEGRNREVRRLWESQGCQ 696
              D   +    + I   D                 W  + + EGRNR+VRR+  + G  
Sbjct: 56  TFADYPTRPAIAKIITEPDFPPRNPPIRYRASIPTSWLSITLTEGRNRQVRRMTAAVGFP 115

Query: 697 VSRLKRTR 720
             RL R +
Sbjct: 116 TLRLVRVQ 123


>gi|1175470|sp|Q09801|YAAA_SCHPO HYPOTHETICAL 37.3 KD PROTEIN C22G7.10
            IN CHROMOSOME I >gi|2130331|pir||S62454 hypothetical
            protein SPAC22G7.10 - fission yeast (Schizosaccharomyces
            pombe) >gi|1009460|emb|CAA91134.1| (Z54328) hypothetical
            protein [Schizosaccharomyces pombe]
            Length = 344
            
 Score = 41.8 bits (96), Expect = 0.014
 Identities = 26/75 (34%), Positives = 34/75 (44%), Gaps = 4/75 (5%)
 Frame = +1

Query: 1195 AGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSD----HATPTFNPYGNPGQK 1362
            +G G H    +PN    N P    R+S  + P N+PN  S     HA PT NP  + G  
Sbjct: 237  SGGGVHSGAATPNAYVNNNPSSSRRES--ESPANSPNITSSAGMTHAQPTHNPTSSYG-- 292

Query: 1363 TGAGRPNNSGGKYNRNRGP 1419
                  N +   YN +R P
Sbjct: 293  ------NGASTNYNASRPP 305


 Score = 32.8 bits (73), Expect = 6.8
 Identities = 25/84 (29%), Positives = 37/84 (43%), Gaps = 4/84 (4%)
 Frame = +1

Query: 1162 STGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAP---NFP-SDHATP 1329
            S  P N  N  +  G    + + NP +    G  T  +  + P+N P   N+P S    P
Sbjct: 263  SESPANSPNITSSAGMTHAQPTHNPTSSYGNGASTNYNASRPPSNHPHSSNYPSSSRRKP 322

Query: 1330 TFNPYGNPGQKTGAGRPNNSGGKYNRNR 1413
            + + Y N   +        SGG+Y RNR
Sbjct: 323  SPDRYSNYSSR-------GSGGRYRRNR 343


>gi|294136|gb|AAA29551.1| (M83173) circumsporozoite protein
            [Plasmodium falciparum]
            Length = 442
            
 Score = 41.4 bits (95), Expect = 0.018
 Identities = 38/172 (22%), Positives = 56/172 (32%)
 Frame = +1

Query: 895  NRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKP 1074
            N N NN +  +NN     E ++  + D   +D  + +   H K +    G          
Sbjct: 80   NDNGNNNNGNNNNGDNGREGKDEDKRDGNNEDNEKLRKPKHKKLKQPGDGNPDPNANPNV 139

Query: 1075 FKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTP 1254
                 P  D + +  +             +     + N  A   A+PN  +PN N    P
Sbjct: 140  DPNANPNVDPNANPNANPNANP-----NANPNANPNANPNANPNANPNA-NPNANPNANP 193

Query: 1255 GQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
                  +P   PN  PN    +A P  NP  NP     A    N     N N
Sbjct: 194  NANPNANPNANPNANPNV-DPNANPNANPNANPNANPNANPNANPNANPNAN 244


 Score = 40.2 bits (92), Expect = 0.040
 Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN  + +A P  NP  NP    
Sbjct: 248  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 305

Query: 1366 GA---GRPNNSGGKYNRNRG 1416
             A     PN +  K N+  G
Sbjct: 306  NANPNANPNANPNKNNQGNG 325


>gi|117587|sp|P08307|CSP_PLAFW CIRCUMSPOROZOITE PROTEIN PRECURSOR (CS)
            >gi|627052|pir||A54529 circumsporozoite protein -
            Plasmodium falciparum (strain Wellcome) >gi|160215
            (M15505) circumsporozoite protein [Plasmodium falciparum]
            Length = 442
            
 Score = 41.4 bits (95), Expect = 0.018
 Identities = 38/172 (22%), Positives = 56/172 (32%)
 Frame = +1

Query: 895  NRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKP 1074
            N N NN +  +NN     E ++  + D   +D  + +   H K +    G          
Sbjct: 80   NDNGNNNNGNNNNGDNGREGKDEDKRDGNNEDNEKLRKPKHKKLKQPGDGNPDPNANPNV 139

Query: 1075 FKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTP 1254
                 P  D + +  +             +     + N  A   A+PN  +PN N    P
Sbjct: 140  DPNANPNVDPNANPNANPNANP-----NANPNANPNANPNANPNANPNA-NPNANPNANP 193

Query: 1255 GQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
                  +P   PN  PN    +A P  NP  NP     A    N     N N
Sbjct: 194  NANPNANPNANPNANPNV-DPNANPNANPNANPNANPNANPNANPNANPNAN 244


 Score = 40.2 bits (92), Expect = 0.040
 Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN  + +A P  NP  NP    
Sbjct: 248  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 305

Query: 1366 GA---GRPNNSGGKYNRNRG 1416
             A     PN +  K N+  G
Sbjct: 306  NANPNANPNANPNKNNQGNG 325


>gi|48921|emb|CAA42462| (X59794) traC-4 [Escherichia coli] >gi|1572574
            (U67194) TraC4 [Enterobacter aerogenes]
            Length = 747
            
 Score = 41.0 bits (94), Expect = 0.024
 Identities = 43/146 (29%), Positives = 65/146 (44%), Gaps = 16/146 (10%)
 Frame = +1

Query: 862  QRRSAKATLHVNRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVS 1041
            +R +  A +   R   + +A   + S A E+R+      + +D    Q +    +R    
Sbjct: 137  ERAAKLARVQEERVRRDPNATEEDISAAKEARKTAEASAMLNDSD-AQRRAAELERQERD 195

Query: 1042 GEAAAKQAHKPFKQY-----KPKND-RSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGN 1203
             + A  QA KP +QY     K K++ +SL        QSWYVP G    P      GA  
Sbjct: 196  RQQAQPQAEKPERQYINVPYKEKDEAKSLGARWDRQQQSWYVPPGTDAAPFAKWAQGAAT 255

Query: 1204 GA-HPNKKSPNPNTR---NTPGQQTRKS------PYKYPNNA 1299
             A  P    P P  +     P QQ +++      PY+  N A
Sbjct: 256  AAVEPRSAQPAPEAQGEAQKPAQQAQQARQYLAVPYEQRNAA 297


>gi|48920|emb|CAA42461| (X59794) traC-3 [Escherichia coli] >gi|1572575
            (U67194) TraC3 [Enterobacter aerogenes]
            Length = 1230
            
 Score = 41.0 bits (94), Expect = 0.024
 Identities = 43/146 (29%), Positives = 65/146 (44%), Gaps = 16/146 (10%)
 Frame = +1

Query: 862  QRRSAKATLHVNRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVS 1041
            +R +  A +   R   + +A   + S A E+R+      + +D    Q +    +R    
Sbjct: 620  ERAAKLARVQEERVRRDPNATEEDISAAKEARKTAEASAMLNDSD-AQRRAAELERQERD 678

Query: 1042 GEAAAKQAHKPFKQY-----KPKND-RSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGN 1203
             + A  QA KP +QY     K K++ +SL        QSWYVP G    P      GA  
Sbjct: 679  RQQAQPQAEKPERQYINVPYKEKDEAKSLGARWDRQQQSWYVPPGTDAAPFAKWAQGAAT 738

Query: 1204 GA-HPNKKSPNPNTR---NTPGQQTRKS------PYKYPNNA 1299
             A  P    P P  +     P QQ +++      PY+  N A
Sbjct: 739  AAVEPRSAQPAPEAQGEAQKPAQQAQQARQYLAVPYEQRNAA 780


>gi|294138|gb|AAA29552.1| (M83174) circumsporozoite protein
            [Plasmodium falciparum]
            Length = 420
            
 Score = 41.0 bits (94), Expect = 0.024
 Identities = 43/161 (26%), Positives = 56/161 (34%), Gaps = 3/161 (1%)
 Frame = +1

Query: 928  NNHSTADESRELRRFDTLRDDRGRG--QGKHHFK-DRLTVSGEAAAKQAHKPFKQYKPKN 1098
            N +S    SR L   D   ++ G    +GK   K D      E   K  HK  KQ    N
Sbjct: 61   NWYSLKKNSRSLGENDDGNNNNGDNGREGKDEDKRDGNNEDNEKLRKPKHKKLKQPGDGN 120

Query: 1099 DRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSP 1278
                +  +     +  V    +     + N  A   A+PN  +PN N    P      +P
Sbjct: 121  PDPNANPNVDPNANPNVDPNANPNVDPNANPNANPNANPNA-NPNANPNANPNANPNANP 179

Query: 1279 YKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
               PN  PN    +A P  NP  NP     A    N     N N
Sbjct: 180  NANPNANPNV-DPNANPNANPNANPNANPNANPNANPNANPNAN 222


 Score = 40.2 bits (92), Expect = 0.040
 Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN  + +A P  NP  NP    
Sbjct: 226  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 283

Query: 1366 GA---GRPNNSGGKYNRNRG 1416
             A     PN +  K N+  G
Sbjct: 284  NANPNANPNANPNKNNQGNG 303


>gi|136173|sp|P27190|TRC5_ECOLI DNA PRIMASE TRAC (REPLICATION PRIMASE)
            >gi|481041|pir||S37669 traC-2 protein - Escherichia coli
            >gi|48919|emb|CAA42460| (X59794) traC-2 [Escherichia
            coli] >gi|1572573 (U67194) TraC2 [Enterobacter aerogenes]
            Length = 1448
            
 Score = 41.0 bits (94), Expect = 0.024
 Identities = 43/146 (29%), Positives = 65/146 (44%), Gaps = 16/146 (10%)
 Frame = +1

Query: 862  QRRSAKATLHVNRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVS 1041
            +R +  A +   R   + +A   + S A E+R+      + +D    Q +    +R    
Sbjct: 838  ERAAKLARVQEERVRRDPNATEEDISAAKEARKTAEASAMLNDSD-AQRRAAELERQERD 896

Query: 1042 GEAAAKQAHKPFKQY-----KPKND-RSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGN 1203
             + A  QA KP +QY     K K++ +SL        QSWYVP G    P      GA  
Sbjct: 897  RQQAQPQAEKPERQYINVPYKEKDEAKSLGARWDRQQQSWYVPPGTDAAPFAKWAQGAAT 956

Query: 1204 GA-HPNKKSPNPNTR---NTPGQQTRKS------PYKYPNNA 1299
             A  P    P P  +     P QQ +++      PY+  N A
Sbjct: 957  AAVEPRSAQPAPEAQGEAQKPAQQAQQARQYLAVPYEQRNAA 998


>gi|117591|sp|P26694|CSP_PLARE CIRCUMSPOROZOITE PROTEIN PRECURSOR (CS)
            >gi|102373|pir||A39756 circumsporozoite protein -
            Plasmodium reichenowi >gi|160229 (M60972)
            circumsporozoite protein [Plasmodium reichenowi]
            Length = 388
            
 Score = 40.6 bits (93), Expect = 0.031
 Identities = 25/75 (33%), Positives = 30/75 (39%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN  + +A P  NP  NP    
Sbjct: 194  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 251

Query: 1366 GAGRPNNSGGKYNRN 1410
             A    N     NRN
Sbjct: 252  NANPNANPNANPNRN 266


 Score = 40.6 bits (93), Expect = 0.031
 Identities = 41/167 (24%), Positives = 56/167 (32%), Gaps = 3/167 (1%)
 Frame = +1

Query: 910  NKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYK 1089
            N ++   N  +  E+ +    D    D G  + + H   R     E   K  H   KQ  
Sbjct: 61   NWYSLKKNSRSLGENDDADNGDADNGDEGIDENRRH---RNKEGKEKLKKPKHNKLKQ-- 115

Query: 1090 PKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPN---KKSPNPNTRNTPGQ 1260
            P ND      +P       V    +     + N      A+PN     +PN N    P  
Sbjct: 116  PGNDNVDPNANPN------VDPNANPNVDPNANPNVDPNANPNVDPNANPNVNPNANPNV 169

Query: 1261 QTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
                +P   PN  PN  + +A P  NP  NP     A    N     N N
Sbjct: 170  DPNANPNVNPNANPNV-NPNANPNVNPNANPNANPNANPNANPNANPNAN 218


>gi|117584|sp|P05691|CSP_PLAFL CIRCUMSPOROZOITE PROTEIN (CS)
            >gi|552195 (M17802) circumsporozoite protein [Plasmodium
            falciparum]
            Length = 315
            
 Score = 40.6 bits (93), Expect = 0.031
 Identities = 43/161 (26%), Positives = 56/161 (34%), Gaps = 6/161 (3%)
 Frame = +1

Query: 928  NNHSTADESRELRRFDTLRDDRGRG--QGKHHFK-DRLTVSGEAAAKQAHKPFKQYKPKN 1098
            N +S    SR L   D   ++ G    +GK   K D      E   K  HK  KQ    N
Sbjct: 45   NWYSLKKNSRSLGENDDGNNNNGDNGREGKDEDKRDGNNEDNEKLRKPKHKKLKQPADGN 104

Query: 1099 DRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKK---SPNPNTRNTPGQQTR 1269
                +  +     +  V    +     + N  A   A+PN     +PN N    P     
Sbjct: 105  PDPNANPNVDPNANPNVDPNANPNVDPNANPNANPNANPNANPNANPNANPNANPNANPN 164

Query: 1270 KSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
             +P   PN  PN    +A P  NP  NP     A    N     N N
Sbjct: 165  ANPNANPNANPNV-DPNANPNANPNANPNANPNANPNANPNANPNAN 210


 Score = 40.2 bits (92), Expect = 0.040
 Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN  + +A P  NP  NP    
Sbjct: 198  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 255

Query: 1366 GA---GRPNNSGGKYNRNRG 1416
             A     PN +  K N+  G
Sbjct: 256  NANPNANPNANPNKNNQGNG 275


>gi|294119|gb|AAA29542.1| (M83164) circumsporozoite protein
            [Plasmodium falciparum] >gi|294142|gb|AAA29563.1|
            (M83150) circumsporozoite protein [Plasmodium falciparum]
            >gi|294161|gb|AAA29576.1| (M83163) circumsporozoite
            protein [Plasmodium falciparum]
            Length = 436
            
 Score = 40.2 bits (92), Expect = 0.040
 Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN  + +A P  NP  NP    
Sbjct: 242  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 299

Query: 1366 GA---GRPNNSGGKYNRNRG 1416
             A     PN +  K N+  G
Sbjct: 300  NANPNANPNANPNKNNQGNG 319


 Score = 38.3 bits (87), Expect = 0.16
 Identities = 43/161 (26%), Positives = 56/161 (34%), Gaps = 14/161 (8%)
 Frame = +1

Query: 928  NNHSTADESRELRRFDTLRDDRGRG--QGKHHFK-DRLTVSGEAAAKQAHKPFKQYKPKN 1098
            N +S    SR L   D   ++ G    +GK   K D      E   K  HK  KQ    N
Sbjct: 61   NWYSLKKNSRSLGENDDGNNNNGDNGREGKDEDKRDGNNEDNEKLRKPKHKKLKQPGDGN 120

Query: 1099 DRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKK-----------SPNPNTR 1245
                +  +     +  V    +     + N  A   A+PN             +PN N  
Sbjct: 121  PDPNANPNVDPNANPNVDPNANPNANPNANPNANPNANPNANPNANPNANPNANPNANPN 180

Query: 1246 NTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
              P      +P   PN  PN    +A P  NP  NP     A    N     N N
Sbjct: 181  ANPNANPNANPNANPNANPNV-DPNANPNANPNANPNANPNANPNANPNANPNAN 234


>gi|294125|gb|AAA29545.1| (M83167) circumsporozoite protein
            [Plasmodium falciparum]
            Length = 436
            
 Score = 40.2 bits (92), Expect = 0.040
 Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN  + +A P  NP  NP    
Sbjct: 242  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 299

Query: 1366 GA---GRPNNSGGKYNRNRG 1416
             A     PN +  K N+  G
Sbjct: 300  NANPNANPNANPNKNNQGNG 319


 Score = 33.6 bits (75), Expect = 4.0
 Identities = 21/80 (26%), Positives = 30/80 (37%)
 Frame = +1

Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
            P++ +    G+G      +PN +    P      +P   PN  PN  + +A P  NP  N
Sbjct: 108  PKHKKLKQPGDGNPDPNANPNVDPNANPNVDPNANPNVDPNANPN-ANPNANPNANPNAN 166

Query: 1351 PGQKTGAGRPNNSGGKYNRN 1410
            P     A    N     N N
Sbjct: 167  PNANPNANPNANPNANPNAN 186


>gi|552191 (M57499) circumsporozoite protein [Plasmodium falciparum]
            Length = 424
            
 Score = 40.2 bits (92), Expect = 0.040
 Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN  + +A P  NP  NP    
Sbjct: 230  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 287

Query: 1366 GA---GRPNNSGGKYNRNRG 1416
             A     PN +  K N+  G
Sbjct: 288  NANPNANPNANPNKNNQGNG 307


 Score = 33.2 bits (74), Expect = 5.2
 Identities = 21/80 (26%), Positives = 29/80 (36%)
 Frame = +1

Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
            P++ +    G+G      +PN +    P      +P   PN  PN    +A P  NP  N
Sbjct: 108  PKHKKLKQPGDGNPDPNANPNVDPNANPNVDPNANPNVDPNANPNV-DPNANPNANPNAN 166

Query: 1351 PGQKTGAGRPNNSGGKYNRN 1410
            P     A    N     N N
Sbjct: 167  PNANPNANPNANPNANPNAN 186


>gi|294129|gb|AAA29547.1| (M83169) circumsporozoite protein
            [Plasmodium falciparum] >gi|294140|gb|AAA29562.1|
            (M83149) circumsporozoite protein [Plasmodium falciparum]
            Length = 424
            
 Score = 40.2 bits (92), Expect = 0.040
 Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN  + +A P  NP  NP    
Sbjct: 230  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 287

Query: 1366 GA---GRPNNSGGKYNRNRG 1416
             A     PN +  K N+  G
Sbjct: 288  NANPNANPNANPNKNNQGNG 307


 Score = 33.6 bits (75), Expect = 4.0
 Identities = 21/80 (26%), Positives = 30/80 (37%)
 Frame = +1

Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
            P++ +    G+G      +PN +    P      +P   PN  PN  + +A P  NP  N
Sbjct: 108  PKHKKLKQPGDGNPDPNANPNVDPNANPNVDPNANPNVDPNANPN-ANPNANPNANPNAN 166

Query: 1351 PGQKTGAGRPNNSGGKYNRN 1410
            P     A    N     N N
Sbjct: 167  PNANPNANPNANPNANPNAN 186


>gi|117586|sp|P13814|CSP_PLAFT CIRCUMSPOROZOITE PROTEIN PRECURSOR (CS)
            >gi|627051|pir||A54533 circumsporozoite protein -
            Plasmodium falciparum (strain T4, Thailand) >gi|160217
            (M19752) circumsporozoite protein [Plasmodium falciparum]
            Length = 424
            
 Score = 40.2 bits (92), Expect = 0.040
 Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN  + +A P  NP  NP    
Sbjct: 230  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 287

Query: 1366 GA---GRPNNSGGKYNRNRG 1416
             A     PN +  K N+  G
Sbjct: 288  NANPNANPNANPNKNNQGNG 307


 Score = 33.6 bits (75), Expect = 4.0
 Identities = 21/80 (26%), Positives = 30/80 (37%)
 Frame = +1

Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
            P++ +    G+G      +PN +    P      +P   PN  PN  + +A P  NP  N
Sbjct: 108  PKHKKLKQPGDGNPDPNANPNVDPNANPNVDPNANPNVDPNANPN-ANPNANPNANPNAN 166

Query: 1351 PGQKTGAGRPNNSGGKYNRN 1410
            P     A    N     N N
Sbjct: 167  PNANPNANPNANPNANPNAN 186


>gi|84198|pir||S05428 circumsporozoite protein - Plasmodium falciparum
            (isolate NF54) >gi|160169 (M22982) circumsporozoite
            protein [Plasmodium falciparum]
            Length = 405
            
 Score = 40.2 bits (92), Expect = 0.040
 Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN  + +A P  NP  NP    
Sbjct: 211  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 268

Query: 1366 GA---GRPNNSGGKYNRNRG 1416
             A     PN +  K N+  G
Sbjct: 269  NANPNANPNANPNKNNQGNG 288


 Score = 37.9 bits (86), Expect = 0.20
 Identities = 24/75 (32%), Positives = 28/75 (37%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN    +A P  NP  NP    
Sbjct: 167  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNV-DPNANPNANPNANPNANP 224

Query: 1366 GAGRPNNSGGKYNRN 1410
             A    N     N N
Sbjct: 225  NANPNANPNANPNAN 239


>gi|294121|gb|AAA29543.1| (M83165) circumsporozoite protein
            [Plasmodium falciparum]
            Length = 432
            
 Score = 40.2 bits (92), Expect = 0.040
 Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN  + +A P  NP  NP    
Sbjct: 238  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 295

Query: 1366 GA---GRPNNSGGKYNRNRG 1416
             A     PN +  K N+  G
Sbjct: 296  NANPNANPNANPNKNNQGNG 315


 Score = 33.6 bits (75), Expect = 4.0
 Identities = 21/80 (26%), Positives = 30/80 (37%)
 Frame = +1

Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
            P++ +    G+G      +PN +    P      +P   PN  PN  + +A P  NP  N
Sbjct: 108  PKHKKLKQPGDGNPDPNANPNVDPNANPNVDPNANPNVDPNANPN-ANPNANPNANPNAN 166

Query: 1351 PGQKTGAGRPNNSGGKYNRN 1410
            P     A    N     N N
Sbjct: 167  PNANPNANPNANPNANPNAN 186


>gi|294123|gb|AAA29544.1| (M83166) circumsporozoite protein
            [Plasmodium falciparum] >gi|294127|gb|AAA29546.1|
            (M83168) circumsporozoite protein [Plasmodium falciparum]
            >gi|294131|gb|AAA29548.1| (M83170) circumsporozoite
            protein [Plasmodium falciparum] >gi|294145|gb|AAA29565.1|
            (M83152) circumsporozoite protein [Plasmodium falciparum]
            >gi|294149|gb|AAA29568.1| (M83155) circumsporozoite
            protein [Plasmodium falciparum] >gi|294154|gb|AAA29571.1|
            (M83158) circumsporozoite protein [Plasmodium falciparum]
            Length = 432
            
 Score = 40.2 bits (92), Expect = 0.040
 Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN  + +A P  NP  NP    
Sbjct: 238  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 295

Query: 1366 GA---GRPNNSGGKYNRNRG 1416
             A     PN +  K N+  G
Sbjct: 296  NANPNANPNANPNKNNQGNG 315


 Score = 33.2 bits (74), Expect = 5.2
 Identities = 21/80 (26%), Positives = 29/80 (36%)
 Frame = +1

Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
            P++ +    G+G      +PN +    P      +P   PN  PN    +A P  NP  N
Sbjct: 108  PKHKKLKQPGDGNPDPNANPNVDPNANPNVDPNANPNVDPNANPNV-DPNANPNANPNAN 166

Query: 1351 PGQKTGAGRPNNSGGKYNRN 1410
            P     A    N     N N
Sbjct: 167  PNANPNANPNANPNANPNAN 186


>gi|294134|gb|AAA29550.1| (M83172) circumsporozoite protein
            [Plasmodium falciparum]
            Length = 416
            
 Score = 40.2 bits (92), Expect = 0.040
 Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN  + +A P  NP  NP    
Sbjct: 222  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 279

Query: 1366 GA---GRPNNSGGKYNRNRG 1416
             A     PN +  K N+  G
Sbjct: 280  NANPNANPNANPNKNNQGNG 299


 Score = 37.9 bits (86), Expect = 0.20
 Identities = 24/75 (32%), Positives = 28/75 (37%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN    +A P  NP  NP    
Sbjct: 174  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNV-DPNANPNANPNANPNANP 231

Query: 1366 GAGRPNNSGGKYNRN 1410
             A    N     N N
Sbjct: 232  NANPNANPNANPNAN 246


 Score = 33.6 bits (75), Expect = 4.0
 Identities = 21/80 (26%), Positives = 30/80 (37%)
 Frame = +1

Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
            P++ +    G+G      +PN +    P      +P   PN  PN  + +A P  NP  N
Sbjct: 108  PKHKKLKQPGDGNPDPNANPNVDPNANPNVDPNANPNVDPNANPN-ANPNANPNANPNAN 166

Query: 1351 PGQKTGAGRPNNSGGKYNRN 1410
            P     A    N     N N
Sbjct: 167  PNANPNANPNANPNANPNAN 186


>gi|117583|sp|P02893|CSP_PLAFA CIRCUMSPOROZOITE PROTEIN PRECURSOR (CS)
            >gi|72381|pir||OZZQAF circumsporozoite protein -
            Plasmodium falciparum (isolate IMTM22) >gi|160161
            (K02194) circumsporozoite protein [Plasmodium falciparum]
            Length = 412
            
 Score = 40.2 bits (92), Expect = 0.040
 Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN  + +A P  NP  NP    
Sbjct: 218  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 275

Query: 1366 GA---GRPNNSGGKYNRNRG 1416
             A     PN +  K N+  G
Sbjct: 276  NANPNANPNANPNKNNQGNG 295


 Score = 37.9 bits (86), Expect = 0.20
 Identities = 24/75 (32%), Positives = 28/75 (37%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN    +A P  NP  NP    
Sbjct: 170  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNV-DPNANPNANPNANPNANP 227

Query: 1366 GAGRPNNSGGKYNRN 1410
             A    N     N N
Sbjct: 228  NANPNANPNANPNAN 242


 Score = 33.6 bits (75), Expect = 4.0
 Identities = 21/80 (26%), Positives = 30/80 (37%)
 Frame = +1

Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
            P++ +    G+G      +PN +    P      +P   PN  PN  + +A P  NP  N
Sbjct: 108  PKHKKLKQPGDGNPDPNANPNVDPNANPNVDPNANPNVDPNANPN-ANPNANPNANPNAN 166

Query: 1351 PGQKTGAGRPNNSGGKYNRN 1410
            P     A    N     N N
Sbjct: 167  PNANPNANPNANPNANPNAN 186


>gi|294151|gb|AAA29569.1| (M83156) circumsporozoite protein
            [Plasmodium falciparum]
            Length = 452
            
 Score = 40.2 bits (92), Expect = 0.040
 Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN  + +A P  NP  NP    
Sbjct: 258  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 315

Query: 1366 GA---GRPNNSGGKYNRNRG 1416
             A     PN +  K N+  G
Sbjct: 316  NANPNANPNANPNKNNQGNG 335


 Score = 34.4 bits (77), Expect = 2.3
 Identities = 21/80 (26%), Positives = 30/80 (37%)
 Frame = +1

Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
            P++ +    G+G      +PN +    P      +P   PN  PN  + +A P  NP  N
Sbjct: 108  PKHKKLKQPGDGNPDPNANPNVDPNANPNVDPNANPNANPNANPN-ANPNANPNANPNAN 166

Query: 1351 PGQKTGAGRPNNSGGKYNRN 1410
            P     A    N     N N
Sbjct: 167  PNANPNANPNANPNANPNAN 186


>gi|4493889|emb|CAB38998.1| (AL034558) predicted using hexExon;
            MAL3P2.11 (PFC0210c), Circumsporozoite (CS) protein, len:
            397 aa; Similarity to many Plasmodium CS proteins.
            [Plasmodium falciparum]
            Length = 396
            
 Score = 40.2 bits (92), Expect = 0.040
 Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN  + +A P  NP  NP    
Sbjct: 202  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 259

Query: 1366 GA---GRPNNSGGKYNRNRG 1416
             A     PN +  K N+  G
Sbjct: 260  NANPNANPNANPNKNNQGNG 279


 Score = 37.9 bits (86), Expect = 0.20
 Identities = 24/75 (32%), Positives = 28/75 (37%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN    +A P  NP  NP    
Sbjct: 158  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNV-DPNANPNANPNANPNANP 215

Query: 1366 GAGRPNNSGGKYNRN 1410
             A    N     N N
Sbjct: 216  NANPNANPNANPNAN 230


>gi|6226848|sp|P19597|CSP_PLAFO CIRCUMSPOROZOITE PROTEIN PRECURSOR
            (CS) >gi|160153 (M83886) circumsporozoite protein
            [Plasmodium falciparum] >gi|2276342|emb|CAA33421|
            (X15363) circumsporozoite protein (AA 1 - 405)
            [Plasmodium falciparum]
            Length = 397
            
 Score = 40.2 bits (92), Expect = 0.040
 Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN  + +A P  NP  NP    
Sbjct: 203  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 260

Query: 1366 GA---GRPNNSGGKYNRNRG 1416
             A     PN +  K N+  G
Sbjct: 261  NANPNANPNANPNKNNQGNG 280


 Score = 37.9 bits (86), Expect = 0.20
 Identities = 24/75 (32%), Positives = 28/75 (37%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN    +A P  NP  NP    
Sbjct: 159  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNV-DPNANPNANPNANPNANP 216

Query: 1366 GAGRPNNSGGKYNRN 1410
             A    N     N N
Sbjct: 217  NANPNANPNANPNAN 231


>gi|552190 (M57498) circumsporozoite protein [Plasmodium falciparum]
            Length = 393
            
 Score = 40.2 bits (92), Expect = 0.040
 Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN  + +A P  NP  NP    
Sbjct: 199  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 256

Query: 1366 GA---GRPNNSGGKYNRNRG 1416
             A     PN +  K N+  G
Sbjct: 257  NANPNANPNANPNKNNQGNG 276


 Score = 37.9 bits (86), Expect = 0.20
 Identities = 24/75 (32%), Positives = 28/75 (37%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN    +A P  NP  NP    
Sbjct: 151  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNV-DPNANPNANPNANPNANP 208

Query: 1366 GAGRPNNSGGKYNRN 1410
             A    N     N N
Sbjct: 209  NANPNANPNANPNAN 223


>gi|224051|prf||1008275A protein,circumsporozoite [Plasmodium sp.]
            Length = 100
            
 Score = 39.5 bits (90), Expect = 0.069
 Identities = 24/75 (32%), Positives = 30/75 (40%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN  + +A P  NP  NP    
Sbjct: 24   NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 81

Query: 1366 GAGRPNNSGGKYNRN 1410
             A    N     N+N
Sbjct: 82   NANPNANPNANPNKN 96


>gi|294158|gb|AAA29574.1| (M83161) circumsporozoite protein
            [Plasmodium falciparum]
            Length = 420
            
 Score = 38.7 bits (88), Expect = 0.12
 Identities = 26/77 (33%), Positives = 31/77 (39%), Gaps = 3/77 (3%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN     A P  NP  NP    
Sbjct: 230  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-----ANPNANPNANPNANP 283

Query: 1366 GA---GRPNNSGGKYNRNRG 1416
             A     PN +  K N+  G
Sbjct: 284  NANPNANPNANPNKNNQGNG 303


 Score = 37.9 bits (86), Expect = 0.20
 Identities = 24/75 (32%), Positives = 28/75 (37%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN    +A P  NP  NP    
Sbjct: 186  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNV-DPNANPNANPNANPNANP 243

Query: 1366 GAGRPNNSGGKYNRN 1410
             A    N     N N
Sbjct: 244  NANPNANPNANPNAN 258


 Score = 33.6 bits (75), Expect = 4.0
 Identities = 21/80 (26%), Positives = 30/80 (37%)
 Frame = +1

Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
            P++ +    G+G      +PN +    P      +P   PN  PN  + +A P  NP  N
Sbjct: 108  PKHKKLKQPGDGNPDPNANPNVDPNANPNVDPNANPNVDPNANPN-ANPNANPNANPNAN 166

Query: 1351 PGQKTGAGRPNNSGGKYNRN 1410
            P     A    N     N N
Sbjct: 167  PNANPNANPNANPNANPNAN 186


>gi|2462758 (AC002292) putative RNA-binding protein [Arabidopsis
            thaliana]
            Length = 292
            
 Score = 38.7 bits (88), Expect = 0.12
 Identities = 23/71 (32%), Positives = 37/71 (51%), Gaps = 1/71 (1%)
 Frame = +1

Query: 904  DNNKHAYHNNHSTADESRELRRFDTLRDDR-GRGQGKHHFKDRLTVSGEAAAKQAHKPFK 1080
            D ++  Y +     +  RE R FD   D R  R  G++ ++DR   SG+    + H PF+
Sbjct: 154  DGHRDRYGDRDLERERERE-REFDRYMDGRRDRDGGRYSYRDRFD-SGDKYEPRDHYPFE 211

Query: 1081 QYKPKNDRSLSE 1116
            +Y P  DR +S+
Sbjct: 212  RYAPPGDRFVSD 223


>gi|283464|pir||A60540 sporozoite surface protein 2 - Plasmodium
            yoelii (fragment)
            Length = 477
            
 Score = 38.7 bits (88), Expect = 0.12
 Identities = 24/84 (28%), Positives = 29/84 (33%)
 Frame = +1

Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
            P N  N    N  +      NPN  N P      +    PNN PN P++   P      N
Sbjct: 93   PNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNN-PNNPNNPNNP-----NN 146

Query: 1351 PGQKTGAGRPNNSGGKYNRNRGPR 1422
            P        PNN     N N  P+
Sbjct: 147  PNNPNNPNNPNNPNDPSNPNNHPK 170


 Score = 37.9 bits (86), Expect = 0.20
 Identities = 27/86 (31%), Positives = 33/86 (37%), Gaps = 3/86 (3%)
 Frame = +1

Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNA--PNFPSDHATPTFNPY 1344
            P N  N    N  +      NPN  N P      +    PNN   PN P++   P  NP 
Sbjct: 96   PNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPN-NP- 153

Query: 1345 GNPGQKTGAGRPNNSGGKYN-RNRGPRYP 1428
             NP        PNN   + N + R P  P
Sbjct: 154  NNPNNPNDPSNPNNHPKRRNPKRRNPNKP 182


 Score = 37.1 bits (84), Expect = 0.35
 Identities = 25/88 (28%), Positives = 31/88 (34%)
 Frame = +1

Query: 1147 VPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHAT 1326
            +P  +   P N       N  +      NPN  N P      +    PNN PN P++   
Sbjct: 67   IPNKIPEKPSNPEEPVNPNDPNDPNNPNNPNNPNNPNNPNNPNNPNNPNN-PNNPNNPNN 125

Query: 1327 PTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
            P  NP  NP        PNN     N N
Sbjct: 126  PN-NP-NNPNNPNNPNNPNNPNNPNNPN 151


 Score = 37.1 bits (84), Expect = 0.35
 Identities = 21/68 (30%), Positives = 29/68 (41%)
 Frame = +1

Query: 1207 AHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNN 1386
            ++PNK  PNPN  + P +     P       PN PS+   P  N   NP + +    P+N
Sbjct: 198  SNPNK--PNPNEPSNPNKPNPNEPSNPNKPNPNEPSNPNKPNPNEPLNPNEPSNPNEPSN 255

Query: 1387 SGGKYNRN 1410
                 N N
Sbjct: 256  PNAPSNPN 263


 Score = 36.4 bits (82), Expect = 0.60
 Identities = 26/82 (31%), Positives = 30/82 (35%), Gaps = 8/82 (9%)
 Frame = +1

Query: 1165 TGPRNH--------RNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDH 1320
            + P NH        RN        PN   PNPN  + P +     P       PN PS+ 
Sbjct: 163  SNPNNHPKRRNPKRRNPNKPKPNKPNPNKPNPNEPSNPNKPNPNEPSNPNKPNPNEPSNP 222

Query: 1321 ATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
              P  N   NP  K     P N     N N
Sbjct: 223  NKPNPNEPSNP-NKPNPNEPLNPNEPSNPN 251


 Score = 36.4 bits (82), Expect = 0.60
 Identities = 23/80 (28%), Positives = 38/80 (46%), Gaps = 9/80 (11%)
 Frame = +1

Query: 1171 PRNHRNAGAGNGAHPNKKSPN----PNTRNTPGQQTRKSPYKYPN-----NAPNFPSDHA 1323
            P N         ++PNK +PN    PN  + P + +  +    PN     N P+ P++ +
Sbjct: 219  PSNPNKPNPNEPSNPNKPNPNEPLNPNEPSNPNEPSNPNAPSNPNEPSNPNEPSNPNEPS 278

Query: 1324 TPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
             P  N   NP + +   +P+N     N N
Sbjct: 279  NP--NEPSNPNEPSNPKKPSNPNEPSNPN 305


 Score = 35.6 bits (80), Expect = 1.0
 Identities = 24/70 (34%), Positives = 29/70 (41%)
 Frame = +1

Query: 1201 NGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRP 1380
            N  +PN  + NPN  N P      +    PNN PN P++   P  NP  NP        P
Sbjct: 92   NPNNPNNPN-NPNNPNNPNNPNNPNNPNNPNN-PNNPNNPNNPN-NP-NNPNNPNNPNNP 147

Query: 1381 NNSGGKYNRN 1410
            NN     N N
Sbjct: 148  NNPNNPNNPN 157


 Score = 35.6 bits (80), Expect = 1.0
 Identities = 27/88 (30%), Positives = 32/88 (35%)
 Frame = +1

Query: 1147 VPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHAT 1326
            +PE  S  P    N    N  +      NPN  N P      +    PNN PN P++   
Sbjct: 71   IPEKPSN-PEEPVNPNDPNDPNNPNNPNNPNNPNNPNNPNNPNNPNNPNN-PNNPNNPNN 128

Query: 1327 PTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
            P  NP  NP        PNN     N N
Sbjct: 129  PN-NP-NNPNNPNNPNNPNNPNNPNNPN 154


 Score = 33.2 bits (74), Expect = 5.2
 Identities = 24/80 (30%), Positives = 34/80 (42%), Gaps = 1/80 (1%)
 Frame = +1

Query: 1171 PRNHRNAGAGNGAHPNKKSPN-PNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYG 1347
            P N         ++PNK +PN P+  N P      +P K PN  PN P +   P+     
Sbjct: 197  PSNPNKPNPNEPSNPNKPNPNEPSNPNKPNPNEPSNPNK-PN--PNEPLNPNEPS----- 248

Query: 1348 NPGQKTGAGRPNNSGGKYNRN 1410
            NP + +    P+N     N N
Sbjct: 249  NPNEPSNPNAPSNPNEPSNPN 269


>gi|548996|sp|Q01443|SSP2_PLAYO SPOROZOITE SURFACE PROTEIN 2 PRECURSOR
            >gi|323142|pir||A45559 sporozoite surface protein 2 -
            Plasmodium yoelii >gi|160693 (M84732) sporozoite surface
            protein [Plasmodium yoelii]
            Length = 826
            
 Score = 38.7 bits (88), Expect = 0.12
 Identities = 24/84 (28%), Positives = 29/84 (33%)
 Frame = +1

Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
            P N  N    N  +      NPN  N P      +    PNN PN P++   P      N
Sbjct: 315  PNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNN-PNNPNNPNNP-----NN 368

Query: 1351 PGQKTGAGRPNNSGGKYNRNRGPR 1422
            P        PNN     N N  P+
Sbjct: 369  PNNPNNPNNPNNPNDPSNPNNHPK 392


 Score = 37.9 bits (86), Expect = 0.20
 Identities = 27/86 (31%), Positives = 33/86 (37%), Gaps = 3/86 (3%)
 Frame = +1

Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNA--PNFPSDHATPTFNPY 1344
            P N  N    N  +      NPN  N P      +    PNN   PN P++   P  NP 
Sbjct: 318  PNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPN-NP- 375

Query: 1345 GNPGQKTGAGRPNNSGGKYN-RNRGPRYP 1428
             NP        PNN   + N + R P  P
Sbjct: 376  NNPNNPNDPSNPNNHPKRRNPKRRNPNKP 404


 Score = 37.1 bits (84), Expect = 0.35
 Identities = 25/88 (28%), Positives = 31/88 (34%)
 Frame = +1

Query: 1147 VPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHAT 1326
            +P  +   P N       N  +      NPN  N P      +    PNN PN P++   
Sbjct: 289  IPNKIPEKPSNPEEPVNPNDPNDPNNPNNPNNPNNPNNPNNPNNPNNPNN-PNNPNNPNN 347

Query: 1327 PTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
            P  NP  NP        PNN     N N
Sbjct: 348  PN-NP-NNPNNPNNPNNPNNPNNPNNPN 373


 Score = 37.1 bits (84), Expect = 0.35
 Identities = 21/68 (30%), Positives = 29/68 (41%)
 Frame = +1

Query: 1207 AHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNN 1386
            ++PNK  PNPN  + P +     P       PN PS+   P  N   NP + +    P+N
Sbjct: 420  SNPNK--PNPNEPSNPNKPNPNEPSNPNKPNPNEPSNPNKPNPNEPLNPNEPSNPNEPSN 477

Query: 1387 SGGKYNRN 1410
                 N N
Sbjct: 478  PNAPSNPN 485


 Score = 36.4 bits (82), Expect = 0.60
 Identities = 26/82 (31%), Positives = 30/82 (35%), Gaps = 8/82 (9%)
 Frame = +1

Query: 1165 TGPRNH--------RNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDH 1320
            + P NH        RN        PN   PNPN  + P +     P       PN PS+ 
Sbjct: 385  SNPNNHPKRRNPKRRNPNKPKPNKPNPNKPNPNEPSNPNKPNPNEPSNPNKPNPNEPSNP 444

Query: 1321 ATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
              P  N   NP  K     P N     N N
Sbjct: 445  NKPNPNEPSNP-NKPNPNEPLNPNEPSNPN 473


 Score = 36.4 bits (82), Expect = 0.60
 Identities = 23/80 (28%), Positives = 38/80 (46%), Gaps = 9/80 (11%)
 Frame = +1

Query: 1171 PRNHRNAGAGNGAHPNKKSPN----PNTRNTPGQQTRKSPYKYPN-----NAPNFPSDHA 1323
            P N         ++PNK +PN    PN  + P + +  +    PN     N P+ P++ +
Sbjct: 441  PSNPNKPNPNEPSNPNKPNPNEPLNPNEPSNPNEPSNPNAPSNPNEPSNPNEPSNPNEPS 500

Query: 1324 TPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
             P  N   NP + +   +P+N     N N
Sbjct: 501  NP--NEPSNPNEPSNPKKPSNPNEPSNPN 527


 Score = 35.6 bits (80), Expect = 1.0
 Identities = 24/70 (34%), Positives = 29/70 (41%)
 Frame = +1

Query: 1201 NGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRP 1380
            N  +PN  + NPN  N P      +    PNN PN P++   P  NP  NP        P
Sbjct: 314  NPNNPNNPN-NPNNPNNPNNPNNPNNPNNPNN-PNNPNNPNNPN-NP-NNPNNPNNPNNP 369

Query: 1381 NNSGGKYNRN 1410
            NN     N N
Sbjct: 370  NNPNNPNNPN 379


 Score = 35.6 bits (80), Expect = 1.0
 Identities = 27/88 (30%), Positives = 32/88 (35%)
 Frame = +1

Query: 1147 VPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHAT 1326
            +PE  S  P    N    N  +      NPN  N P      +    PNN PN P++   
Sbjct: 293  IPEKPSN-PEEPVNPNDPNDPNNPNNPNNPNNPNNPNNPNNPNNPNNPNN-PNNPNNPNN 350

Query: 1327 PTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
            P  NP  NP        PNN     N N
Sbjct: 351  PN-NP-NNPNNPNNPNNPNNPNNPNNPN 376


 Score = 33.2 bits (74), Expect = 5.2
 Identities = 24/80 (30%), Positives = 34/80 (42%), Gaps = 1/80 (1%)
 Frame = +1

Query: 1171 PRNHRNAGAGNGAHPNKKSPN-PNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYG 1347
            P N         ++PNK +PN P+  N P      +P K PN  PN P +   P+     
Sbjct: 419  PSNPNKPNPNEPSNPNKPNPNEPSNPNKPNPNEPSNPNK-PN--PNEPLNPNEPS----- 470

Query: 1348 NPGQKTGAGRPNNSGGKYNRN 1410
            NP + +    P+N     N N
Sbjct: 471  NPNEPSNPNAPSNPNEPSNPN 491


>gi|1582641|prf||2119210A mucin [Homo sapiens]
           Length = 164
           
 Score = 38.3 bits (87), Expect = 0.16
 Identities = 26/76 (34%), Positives = 38/76 (49%), Gaps = 4/76 (5%)
 Frame = +3

Query: 663 STSAMGIARLPSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHA---- 830
           ST++      PS P  + TL    PTT TT+ P   TT  P+ STT   +T    A    
Sbjct: 62  STTSAPTTSTPSAPTTSTTL---APTTSTTSAPTTSTTSTPTSSTTSTPQTSTTSASTTS 118

Query: 831 VSTHPATDYRPTPFSQSHTA 890
           +++ P T   P P + + +A
Sbjct: 119 ITSGPGTTPSPVPTTSTTSA 138


>gi|2135764|pir||I53641 mucin - human (fragment) >gi|945219 (L46721)
           mucin [Homo sapiens]
           Length = 164
           
 Score = 38.3 bits (87), Expect = 0.16
 Identities = 27/76 (35%), Positives = 37/76 (48%), Gaps = 4/76 (5%)
 Frame = +3

Query: 663 STSAMGIARLPSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVSTH 842
           ST++      PS P  + TL    PTT TT+ P   TT  P+ STT   +T    A +T 
Sbjct: 62  STTSAPTTSTPSAPTTSTTL---APTTSTTSAPTTSTTSTPTSSTTSTPQTSTTSASTTS 118

Query: 843 ----PATDYRPTPFSQSHTA 890
               P T   P P + + +A
Sbjct: 119 ITCGPGTTPSPVPTTSTTSA 138


>gi|6580291|emb|CAB63354.1| (AL032627) cDNA EST EMBL:D65921 comes from
            this gene; cDNA EST yk385f3.3 comes from this gene; cDNA
            EST yk385f3.5 comes from this gene; cDNA EST EMBL:D66141
            comes from this gene; cDNA EST EMBL:D69818 comes from
            this gene; cDN...
            Length = 373
            
 Score = 37.9 bits (86), Expect = 0.20
 Identities = 32/87 (36%), Positives = 41/87 (46%), Gaps = 20/87 (22%)
 Frame = +1

Query: 1156 GVSTGPRNHRNAGAGNGAHPNKKSPNPNTRN-TPGQQTRKSPYKYPNNAPNFPSDHATPT 1332
            G S     + N   GNG++ N    N N  N   G    + PY  P +   +P  +  P 
Sbjct: 249  GNSGNGNGNSNGNNGNGSNGNGNGNNGNGNNGNNGNGHPQYPYPVPPHG-YYPPGYPYPP 307

Query: 1333 FNPYGNPGQ---------------KTGAGRPN----NSGGKYNRNRG 1416
              PY  PG                + G G+PN    NSGG   RNRG
Sbjct: 308  GYPYPPPGAFYYPPGGIPQNGMNGQNGNGQPNIIVINSGGNKKRNRG 354


 Score = 32.8 bits (73), Expect = 6.8
 Identities = 30/89 (33%), Positives = 40/89 (44%), Gaps = 5/89 (5%)
 Frame = +1

Query: 1162 STGPRNH--RNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTF 1335
            S+G  N+   N+G+GNG   N  S N N+ N  G           NNA N  + +     
Sbjct: 215  SSGNSNYGSNNSGSGNG---NSNSGNGNSGNGNG-----------NNAGNSGNGNG---- 256

Query: 1336 NPYGNPGQKTGAGRPNNSGGKYNRNRG---PRYP 1428
            N  GN G  +      N+G   N N G   P+YP
Sbjct: 257  NSNGNNGNGSNGNGNGNNGNGNNGNNGNGHPQYP 290


>gi|677949 (U20969) Plasmodium falciparum circumsporozoite protein
            (CS) gene, complete cds. [Plasmodium falciparum]
            Length = 408
            
 Score = 37.9 bits (86), Expect = 0.20
 Identities = 24/75 (32%), Positives = 28/75 (37%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN    +A P  NP  NP    
Sbjct: 170  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNV-DPNANPNANPNANPNANP 227

Query: 1366 GAGRPNNSGGKYNRN 1410
             A    N     N N
Sbjct: 228  NANPNANPNANPNAN 242


 Score = 33.6 bits (75), Expect = 4.0
 Identities = 22/78 (28%), Positives = 28/78 (35%)
 Frame = +1

Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
            N  A   A+PN  +PN N    P      +P   PN  PN  +       N   NP +  
Sbjct: 246  NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNKNNQGNGQGHNMPNNPNRNV 304

Query: 1366 GAGRPNNSGGKYNRNRGP 1419
                  N+  K N N  P
Sbjct: 305  DENANANNAVKNNNNEEP 322


>gi|93140|pir||A40505 early protein EP0 - suid herpesvirus 1 (strain
            Indiana-Funkhuser or Becker)
            Length = 410
            
 Score = 37.5 bits (85), Expect = 0.27
 Identities = 44/176 (25%), Positives = 70/176 (39%), Gaps = 12/176 (6%)
 Frame = +1

Query: 862  QRRSAKATLHVNRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVS 1041
            QRR+  +   VN  D ++   H++   +    E         D G      H +D LT +
Sbjct: 228  QRRAPMSHQGVNYIDTSESEAHSDSEVSSPDEE---------DSGASSSGVHTED-LTEA 277

Query: 1042 GEAAAKQAHKPFK-----------QYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRN 1188
             E+A  Q   P +           + + +  R L  G          PE  S+G  +   
Sbjct: 278  SESADDQRPAPRRSPRRARRAAVLRREQRRTRCLRRGRTGGQAQGETPEAPSSGEGSSAQ 337

Query: 1189 AGA-GNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
             GA G GA P   +   + R++P      S  + P+     PS  A  T  P G P   +
Sbjct: 338  HGASGAGAGPGSANTAASARSSPSSSPSSS-MRRPS-----PSASAPETAAPRGGPPASS 391

Query: 1366 GAGRPNNS 1389
             +G P ++
Sbjct: 392  SSGSPRSA 399


>gi|124138|sp|P29129|ICP0_PRVIF TRANS-ACTING TRANSCRIPTIONAL PROTEIN
            ICP0 (EARLY PROTEIN 0) (EP0) >gi|334048 (M57504) EPO
            [Pseudorabies virus]
            Length = 410
            
 Score = 37.5 bits (85), Expect = 0.27
 Identities = 44/176 (25%), Positives = 70/176 (39%), Gaps = 12/176 (6%)
 Frame = +1

Query: 862  QRRSAKATLHVNRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVS 1041
            QRR+  +   VN  D ++   H++   +    E         D G      H +D LT +
Sbjct: 228  QRRAPMSHQGVNYIDTSESEAHSDSEVSSPDEE---------DSGASSSGVHTED-LTEA 277

Query: 1042 GEAAAKQAHKPFK-----------QYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRN 1188
             E+A  Q   P +           + + +  R L  G          PE  S+G  +   
Sbjct: 278  SESADDQRPAPRRSPRRARRAAVLRREQRRTRCLRRGRTGGQAQGETPEAPSSGEGSSAQ 337

Query: 1189 AGA-GNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
             GA G GA P   +   + R++P      S  + P+     PS  A  T  P G P   +
Sbjct: 338  HGASGAGAGPGSANTAASARSSPSSSPSSS-MRRPS-----PSASAPETAAPRGGPPASS 391

Query: 1366 GAGRPNNS 1389
             +G P ++
Sbjct: 392  SSGSPRSA 399


>gi|3875920|emb|CAB04111.1| (Z81503) predicted using Genefinder;
            similar to collagen; cDNA EST EMBL:D65450 comes from this
            gene; cDNA EST EMBL:D68888 comes from this gene
            [Caenorhabditis elegans]
            Length = 305
            
 Score = 37.5 bits (85), Expect = 0.27
 Identities = 36/92 (39%), Positives = 39/92 (42%), Gaps = 15/92 (16%)
 Frame = +1

Query: 1150 PEGVSTGPRNHRNAGA----GNGAH--------PNKKSPN--PNTRNTPGQQTRKSPYKY 1287
            P+G    P N   AGA    GN AH        P    P   P     PG      P   
Sbjct: 190  PKGPRGAPGNSGRAGAPGQPGNDAHGYGGGVGAPGPAGPRGAPGPAGHPGSSGGGRPGPA 249

Query: 1288 -PNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRY 1425
             P  AP  P         P G+PGQ    GRP  SGG  NR   P+Y
Sbjct: 250  GPKGAPGQPGRPG-----PDGHPGQP---GRPGQSGGSGNRGVCPKY 288


>gi|6681425|dbj|BAA88672.1| (AB030233) CiMsi [Ciona intestinalis]
            Length = 393
            
 Score = 37.5 bits (85), Expect = 0.27
 Identities = 30/97 (30%), Positives = 36/97 (36%), Gaps = 1/97 (1%)
 Frame = +1

Query: 1117 GSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPG-QQTRKSPYKYPN 1293
            G+    Q  Y P  +    R   N   GNGA PN  S  P    + G   T    Y   N
Sbjct: 299  GNMGNMQGGYQPGMMGMQGRGVNN---GNGAQPNAASTYPQNPTSYGPMPTSGGGYNQGN 355

Query: 1294 NAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNR 1407
               N  S  A       G+ GQK+G G  N+    Y R
Sbjct: 356  TGSNNSSGQANTGNTGGGSYGQKSGGGSNNSGYHPYRR 393


>gi|2498734|sp|Q60865|P137_MOUSE GPI-ANCHORED PROTEIN P137 >gi|1098569
            (U27838) glycosyl-phosphatidyl-inositol-anchored protein
            homolog [Mus musculus]
            Length = 656
            
 Score = 37.1 bits (84), Expect = 0.35
 Identities = 30/112 (26%), Positives = 38/112 (33%), Gaps = 2/112 (1%)
 Frame = +1

Query: 1081 QYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNA--GAGNGAHPNKKSPNPNTRNTP 1254
            Q  P+ +      S   + S  V  G S G R   N   G  NG         P+  NTP
Sbjct: 535  QQPPQQNTGFPRSSQPYYNSRGVSRGGSRGARGLMNGYRGPANGFRGGYDGYRPSFSNTP 594

Query: 1255 GQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRG 1416
                 +S +  P +   +  D     F      GQ    G P   GG    NRG
Sbjct: 595  NSGYSQSQFTAPRDYSGYQRDGYQQNFK--RGSGQSGPRGAPRGRGGPPRPNRG 646


>gi|6322611|ref|NP_012685.1|YJR151C| Yjr151cp
           >gi|1352944|sp|P47179|YJ9P_YEAST HYPOTHETICAL 118.4 KD
           PROTEIN IN BAT2-DAL5 INTERGENIC REGION PRECURSOR
           >gi|1078284|pir||S57180 probable membrane protein
           YJR151c - yeast (Saccharomyces cerevisiae)
           >gi|1015903|emb|CAA89684| (Z49651) ORF YJR151c
           [Saccharomyces cerevisiae]
           Length = 1161
           
 Score = 36.7 bits (83), Expect = 0.46
 Identities = 28/83 (33%), Positives = 35/83 (41%), Gaps = 1/83 (1%)
 Frame = +3

Query: 651 PQPRSTSAMGIARLPSQPPQTYTLWFRPPTTRTTAWPIDRTTQD-PSRSTTHPTETRKRH 827
           P   +TS        S  P T T    P T+ T+  P   TT   P+ STT  T T    
Sbjct: 162 PTTSTTSTTPTTSTTSTTPTTSTTSTTPTTSTTSTTPTTSTTSTTPTTSTTSTTPTTS-- 219

Query: 828 AVSTHPATDYRPTPFSQSHTARES 899
             ST P T   PT  + S T++ S
Sbjct: 220 TTSTTPTTSTTPTTSTTSTTSQTS 243


 Score = 32.8 bits (73), Expect = 6.8
 Identities = 25/79 (31%), Positives = 33/79 (41%), Gaps = 5/79 (6%)
 Frame = +3

Query: 663 STSAMGIARLPSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTE-----TRKRH 827
           +TS        S   QT T    P T+ T+  P   TT   S ++T PT      T    
Sbjct: 226 TTSTTPTTSTTSTTSQTSTKSTTPTTSSTSTTPTTSTTPTTSTTSTAPTTSTTSTTSTTS 285

Query: 828 AVSTHPATDYRPTPFSQSHTARES 899
            +ST P T    + FS S  +  S
Sbjct: 286 TISTAPTTSTTSSTFSTSSASASS 309


>gi|4758666|ref|NP_004681.1|| LATS (large tumor suppressor,
            Drosophila) homolog 1 >gi|4324434|gb|AAD16882| (AF104413)
            large tumor suppressor 1 [Homo sapiens]
            >gi|5738136|gb|AAD50272.1|AF164041_1 (AF164041) WARTS
            protein kinase [Homo sapiens]
            Length = 1130
            
 Score = 36.4 bits (82), Expect = 0.60
 Identities = 20/88 (22%), Positives = 40/88 (44%)
 Frame = +1

Query: 1120 SPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNA 1299
            +P+++ +  +P+ +    RN  N    N + P  ++  P + + P Q +  S ++ P   
Sbjct: 396  APSSYTNGSIPQSMMVPNRNSHNMELYNISVPGLQTNWPQSSSAPAQSSPSSGHEIPTWQ 455

Query: 1300 PNFPSDHATPTFNPYGNPGQKTGAGRPN 1383
            PN P   +    NP GN    +   +P+
Sbjct: 456  PNIPV-RSNSFNNPLGNRASHSANSQPS 482


>gi|3172134 (U90209) RNA polymerase II largest subunit [Bonnemaisonia
            hamifera]
            Length = 1732
            
 Score = 36.4 bits (82), Expect = 0.60
 Identities = 33/127 (25%), Positives = 45/127 (34%), Gaps = 4/127 (3%)
 Frame = +1

Query: 1048 AAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKS 1227
            AA   A  P  QY P   R   + +P T  S                 GA     P+  S
Sbjct: 1591 AAQSPAQSPGVQYSPDKSRVQVQRAPPTAPS-------------AAGGGASRSYSPSSPS 1637

Query: 1228 PNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPY----GNPGQKTGAGRPNNSGG 1395
             N     +PG     +   Y  ++P   S  +   F+P     G     T A   N +  
Sbjct: 1638 YNGRGAASPGANYVAASPGYSPSSPGAYSPSSPAAFSPSSPAAGGYSPSTPAYTANAAAN 1697

Query: 1396 KYNRNRGPRYP 1428
            +Y+  R PRYP
Sbjct: 1698 QYSYARSPRYP 1708


>gi|2689219|emb|CAA67983.1| (X99669) dynamin like protein
            [Dictyostelium discoideum]
            Length = 853
            
 Score = 36.4 bits (82), Expect = 0.60
 Identities = 28/115 (24%), Positives = 44/115 (37%), Gaps = 13/115 (11%)
 Frame = +1

Query: 1060 QAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPN-- 1233
            Q+  PF Q + +       G PA  Q    P  ++ GP+N           PN+  P+  
Sbjct: 569  QSTNPFLQQQQQGQNKYPGGPPAQQQPNQQPNQLNKGPQN---------MPPNQSKPSSI 619

Query: 1234 ----PNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRP------- 1380
                PN  N       +  ++  +   +F      P+   YG       +  P       
Sbjct: 620  PQNGPNNNNNNNNNNNRQDHQQGSFFSSFFRASPDPSLGQYGGANNSNNSNNPTSPINSS 679

Query: 1381 NNSGGKYN 1404
            +NSG  YN
Sbjct: 680  SNSGNNYN 687


>gi|3064231|gb|AAC14254.1| (AF036460) mucin-like protein
           [Trypanosoma cruzi]
           Length = 119
           
 Score = 36.4 bits (82), Expect = 0.60
 Identities = 30/77 (38%), Positives = 36/77 (45%), Gaps = 6/77 (7%)
 Frame = +3

Query: 633 CRCQRRPQPRSTSAMGIARLPSQPPQTYTLWF-RPPTTRTTAWPIDRTTQDPSRSTT--- 800
           C    +P P  +S        ++PP T T    RPPTT TT      TTQ P+ STT   
Sbjct: 13  CVADAQPVPEGSSNTTTTTTTTKPPTTTTTTTTRPPTTTTTT-----TTQAPTTSTTTAP 67

Query: 801 --HPTETRKRHAVSTHPATDYRP 863
               T T +  AVST  A    P
Sbjct: 68  EAPSTTTTEAPAVSTTRAPSRLP 90


>gi|112945|sp|P14196|AAC2_DICDI AAC-RICH MRNA CLONE AAC11 PROTEIN
            >gi|84120|pir||S05355 hypothetical protein (clone AAC11)
            - slime mold (Dictyostelium discoideum) (fragment)
            >gi|7174|emb|CAA34529| (X16522) coding region (AA 448)
            [Dictyostelium discoideum]
            Length = 448
            
 Score = 36.4 bits (82), Expect = 0.60
 Identities = 51/210 (24%), Positives = 79/210 (37%), Gaps = 4/210 (1%)
 Frame = +1

Query: 781  TQVEALRTQLKLEKDMPLALTLQPIIGQRRSAKATLHVNRNDNNKHAYHNNHST--ADES 954
            T +  L   ++ +  +P     QPI     +     ++N N+NN +  +NN+++     S
Sbjct: 95   TNLNGLSLAIQNQSSLP-----QPINNNNNNNNNNSNINNNNNNSNNNNNNNNSNLGINS 149

Query: 955  RELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATF 1134
               +      D R RG      + R     E       K  +   PK D    EG+P   
Sbjct: 150  SPTQSSANSADKRSRG------RPRKNPPSEPKDTSGPKRKRGRPPKMD---EEGNP--- 197

Query: 1135 QSWYVPEGVSTGPRNH--RNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNF 1308
            Q   VP+  S   R    +        + N    + NT  TP ++ R  P K    +P+ 
Sbjct: 198  QPKPVPQPGSNKKRGRPKKPKDENESDYNNTSFSDSNTDGTPKKRGR--PPKAKGESPS- 254

Query: 1309 PSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
                A+PT N  GN     G    NN+    N N
Sbjct: 255  ----ASPTHNTLGN-----GILNSNNNNNNNNNN 279


>gi|969095 (U31961) no-on transient A-like protein [Drosophila
            melanogaster]
            Length = 642
            
 Score = 36.4 bits (82), Expect = 0.60
 Identities = 26/82 (31%), Positives = 37/82 (44%), Gaps = 4/82 (4%)
 Frame = +1

Query: 1177 NHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSP-YKYPNNAPNFPSDHAT-PTFNPY-G 1347
            N  +AG G    PN  +     +   G+Q  + P ++     PN P+ +A     N Y G
Sbjct: 139  NELSAGGGGQNQPNHSNKGQGNQGDQGEQGNQGPNFRGRGGGPNQPNQNANQEQSNGYPG 198

Query: 1348 NPG-QKTGAGRPNNSGGKYNRNRGPR 1422
            N G  K G G+    GGK+ R    R
Sbjct: 199  NQGDNKGGQGQRGAGGGKHQRGNRSR 224


>gi|1082604|pir||S53363 mucin 5AC (clone JER58) - human (fragment)
           >gi|563377|emb|CAA84032| (Z34278) mucin [Homo sapiens]
           Length = 279
           
 Score = 36.4 bits (82), Expect = 0.60
 Identities = 27/75 (36%), Positives = 35/75 (46%)
 Frame = +3

Query: 657 PRSTSAMGIARLPSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVS 836
           P +++  G    PS  P T T     PTTRTT+ P   TT   + STT   ET  R   +
Sbjct: 4   PTTSTTSGPGTTPSPVPTTSTT--SAPTTRTTSAPKSSTTSAATTSTTSGPETTPRPVPT 61

Query: 837 THPATDYRPTPFSQS 881
           T  +T   PT  + S
Sbjct: 62  T--STTSSPTTSTTS 74


 Score = 34.4 bits (77), Expect = 2.3
 Identities = 20/52 (38%), Positives = 27/52 (51%)
 Frame = +3

Query: 735 PTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVSTHPATDYRPTPFSQSHTA 890
           PTT TT+ P  RTT  P  STT  T T    + ++ P T   P P + + +A
Sbjct: 100 PTTSTTSAPTTRTTSAPISSTTSATTT----STTSGPGTTPSPVPTTSTTSA 147


 Score = 33.2 bits (74), Expect = 5.2
 Identities = 24/78 (30%), Positives = 36/78 (45%), Gaps = 1/78 (1%)
 Frame = +3

Query: 657 PRSTSAMGIARLPSQPPQTYTLWFRP-PTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAV 833
           P+S++        +  P+T     RP PTT TT+ P   TT  P+ STT  + T      
Sbjct: 36  PKSSTTSAATTSTTSGPETTP---RPVPTTSTTSSPTTSTTSAPTTSTTSASTTSTTSGA 92

Query: 834 STHPATDYRPTPFSQSHTA 890
            T P+    P P + + +A
Sbjct: 93  GTTPS----PVPTTSTTSA 107


>gi|4324420|gb|AAD16881| (AF104350) prespore-specific protein
            [Dictyostelium discoideum]
            Length = 1231
            
 Score = 36.4 bits (82), Expect = 0.60
 Identities = 32/149 (21%), Positives = 54/149 (35%), Gaps = 1/149 (0%)
 Frame = +1

Query: 964  RRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKND-RSLSEGSPATFQS 1140
            R F +LRDD G    KH+++   ++  + A   + K  K     N+  +L+  +P     
Sbjct: 781  RLFGSLRDDIG----KHNYQQNASLFFDFATFLSKKSNKNLGDINNLNNLNNNNP----- 831

Query: 1141 WYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDH 1320
                   +  P N+          PN  + NPN  N        +     NN  N  +++
Sbjct: 832  -------NNNPNNN----------PNNNNNNPNNNNNNNNNNNNNNNNNNNNNNNNNNNN 874

Query: 1321 ATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
                +N + N          NN+    N N
Sbjct: 875  NNTNYNNFNNTNNNNNNSNKNNNNNNNNNN 904


>gi|2497729|sp|Q49537|VLPE_MYCHR VARIANT SURFACE ANTIGEN E PRECURSOR
            (VLPE PROLIPOPROTEIN) >gi|1039437 (U35016) VlpE
            prolipoprotein [Mycoplasma hyorhinis]
            >gi|1583723|prf||2121355B Vlp surface protein [Mycoplasma
            hyorhinis]
            Length = 243
            
 Score = 36.4 bits (82), Expect = 0.60
 Identities = 22/108 (20%), Positives = 40/108 (36%)
 Frame = +1

Query: 1096 NDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKS 1275
            N    + G+ ++  S   P+G  + P N   +        N  + +P   N     T   
Sbjct: 75   NQSGSASGNGSSNSSVSTPDGQHSNPSNPTTSDPKESNPSNPTTSDPKESNPSNPTTSDG 134

Query: 1276 PYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGP 1419
             +  P+N        + P+ NP  + GQ +    P  S G+++    P
Sbjct: 135  QHSNPSNPTTSDPKESNPS-NPTTSDGQHSNPSNPTTSDGQHSNPSNP 181


>gi|2114108|dbj|BAA20059| (AB003911) OX40 precursor [Oryctolagus
           cuniculus]
           Length = 267
           
 Score = 36.0 bits (81), Expect = 0.79
 Identities = 24/52 (46%), Positives = 32/52 (61%), Gaps = 5/52 (9%)
 Frame = +3

Query: 642 QRRPQPRSTSAMGIAR----LPSQPPQTYTLWFRPPTTRT-TAWPIDRTTQDPSRST 797
           +R  QP S+ +  +      L +QP +T +  +RPPT RT TAWP  RT Q PS  T
Sbjct: 146 KRTLQPASSISDAVCEDRSSLATQPWETPSAPYRPPTARTSTAWP--RTAQGPSTPT 200


>gi|82698|pir||JQ0985 hydroxyproline-rich glycoprotein precursor -
           maize >gi|257041|bbs|115226 (S45164) hydroxyproline-rich
           glycoprotein, HRGP [maize, Peptide, 328 aa] [Zea mays]
           >gi|4007865|emb|CAA10387| (AJ131535) Hydroxyproline-rich
           Glycoprotein (HRGP) [Zea mays]
           Length = 328
           
 Score = 36.0 bits (81), Expect = 0.79
 Identities = 26/80 (32%), Positives = 35/80 (43%)
 Frame = +3

Query: 648 RPQPRSTSAMGIARLPSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRH 827
           +P P + +       P   P TYT   +PPT + T      + + P+   T PT T    
Sbjct: 243 KPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPTPK 302

Query: 828 AVSTHPATDYRPTPFSQSHT 887
             +T P T Y PTP   SHT
Sbjct: 303 PPATKPPT-YTPTP-PVSHT 320


 Score = 32.8 bits (73), Expect = 6.8
 Identities = 20/59 (33%), Positives = 27/59 (44%)
 Frame = +3

Query: 693 PSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVSTHPATDYRPTP 869
           P   P TYT   +PPT + T      + + P+   T PT T      +T P T  +PTP
Sbjct: 138 PKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPATKPPTP-KPTP 195


>gi|4220540|emb|CAA23013| (AL035356) hypothetical protein [Arabidopsis
            thaliana]
            Length = 319
            
 Score = 36.0 bits (81), Expect = 0.79
 Identities = 44/170 (25%), Positives = 68/170 (39%), Gaps = 20/170 (11%)
 Frame = +1

Query: 907  NNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQY 1086
            +N  A  +NH    +S E +RFD   D          FK   T   +  +  +H+     
Sbjct: 40   SNPLAETSNHQ--QDSFETQRFDYYTDPMAAYSS---FKKNKTPKQQYISSPSHQGSSPV 94

Query: 1087 KPKNDRSLSEGSPAT-----------FQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPN 1233
             P+   S+  GS  +             + Y P G++    +HR   AG     N   P 
Sbjct: 95   PPQFPPSVPPGSLCSEYQAQTNHGGFHAAHYEPRGMAHLSPSHRGPPAGWN---NNFRPP 151

Query: 1234 PNTRNTPGQQTRKSPYKYPNNAPNFPSD---------HATPTFNPYGNPGQKTGAGRPNN 1386
            P   + P Q   + P+ +    PN  ++         +  P F+ YG      G     N
Sbjct: 152  PVNHSGPPQWVPR-PFPFSQEMPNMGNNRFGGRGSYNNTPPQFSNYGRQNANWGGNTYPN 210

Query: 1387 SGGKYNRNRG 1416
            SG   +R RG
Sbjct: 211  SGRGRSRGRG 220


>gi|5114426|gb|AAD40313.1|AF157503_1 (AF157503) chitinase 1 [Penaeus
           monodon]
           Length = 620
           
 Score = 36.0 bits (81), Expect = 0.79
 Identities = 18/36 (50%), Positives = 23/36 (63%), Gaps = 2/36 (5%)
 Frame = +3

Query: 693 PSQPPQTYTLWFRPPTTRTTAW--PIDRTTQDPSRSTT 800
           P+ PP T + W+ PPTT TT     I  TT+DP+  TT
Sbjct: 423 PTLPPTTTSPWWTPPTTTTTTRDPSITTTTRDPNLPTT 460


>gi|228937|prf||1814452B Hyp-rich glycoprotein [Zea mays]
           Length = 327
           
 Score = 36.0 bits (81), Expect = 0.79
 Identities = 26/80 (32%), Positives = 35/80 (43%)
 Frame = +3

Query: 648 RPQPRSTSAMGIARLPSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRH 827
           +P P + +       P   P TYT   +PPT + T      + + P+   T PT T    
Sbjct: 242 KPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPTPK 301

Query: 828 AVSTHPATDYRPTPFSQSHT 887
             +T P T Y PTP   SHT
Sbjct: 302 PPATKPPT-YTPTP-PVSHT 319


 Score = 32.8 bits (73), Expect = 6.8
 Identities = 20/59 (33%), Positives = 27/59 (44%)
 Frame = +3

Query: 693 PSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVSTHPATDYRPTP 869
           P   P TYT   +PPT + T      + + P+   T PT T      +T P T  +PTP
Sbjct: 137 PKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPATKPPTP-KPTP 194


>gi|106291|pir||S16681 homeotic protein - human
            Length = 316
            
 Score = 35.6 bits (80), Expect = 1.0
 Identities = 36/143 (25%), Positives = 60/143 (41%), Gaps = 3/143 (2%)
 Frame = +1

Query: 988  DRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVST 1167
            D+G G+     +D      E    + H P +Q +P    S +    +  ++   P G +T
Sbjct: 169  DKGSGRRLRTLRDSDPEEDEDEDDEDHFPLQQRRPW---STASSDCSVGRTGIAPRGPAT 225

Query: 1168 GPRNHRNAGAGNGAHPNKKSPNP-NTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPY 1344
             PR  R+  A + + P + +P P  +   PG  T           P  P + A P   P+
Sbjct: 226  SPRPSRSPAAQDRSRPARSAPGPAASPGGPGAWTH----------PARPREQARPP--PH 273

Query: 1345 GNPGQKTGAG--RPNNSGGKYNRNRG 1416
            G P  + GAG  R  +  G++   +G
Sbjct: 274  G-PLAQAGAGGIRRGSGPGRFPFKQG 298


>gi|1170758|sp|P38486|LEG3_CANFA GALECTIN-3 (GALACTOSE-SPECIFIC LECTIN
            3) (MAC-2 ANTIGEN) (IGE-BINDING PROTEIN) (35 KD LECTIN)
            (CARBOHYDRATE BINDING PROTEIN 35) (CBP 35)
            (LAMININ-BINDING PROTEIN) (LECTIN L-29)
            Length = 296
            
 Score = 35.6 bits (80), Expect = 1.0
 Identities = 36/111 (32%), Positives = 43/111 (38%), Gaps = 10/111 (9%)
 Frame = +1

Query: 1096 NDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGN-------GAHPNKKSPN--PNTRN 1248
            ND     G+P   Q W        GP  ++ AGAG        GA+P +  P   P    
Sbjct: 8    NDALSGSGNPNP-QGW-------PGPWGNQPAGAGGYPGASYPGAYPGQAPPGGYPGQAP 59

Query: 1249 TPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNS-GGKYNRNRGPRY 1425
              G   +  P  YP  AP        P   P G PGQ    G P  +  G Y     P Y
Sbjct: 60   PGGYPGQAPPGGYPGQAPPGGYPGQAP---PGGYPGQAPPGGYPGQAPPGTYPGPTAPAY 116

Query: 1426 P 1428
            P
Sbjct: 117  P 117


>gi|1724014|sp|P54604|YHCT_BACSU HYPOTHETICAL 33.7 KD PROTEIN IN
           CSPB-GLPP INTERGENIC REGION >gi|1239996|emb|CAA65704.1|
           (X96983) hypothetical protein [Bacillus subtilis]
           >gi|2633244|emb|CAB12749| (Z99108) similar to
           hypothetical proteins [Bacillus subtilis]
           Length = 302
           
 Score = 35.6 bits (80), Expect = 1.0
 Identities = 28/81 (34%), Positives = 44/81 (53%), Gaps = 10/81 (12%)
 Frame = +1

Query: 85  LHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSVKSGDKIELDGRSFVASA----- 249
           L  VL  A   S+  ++  +S+  IKVN +     M VK GD++ +D +   AS+     
Sbjct: 21  LFSVLKTALKASKPVIQDWMSHQQIKVNHESVLNNMIVKKGDRVFIDLQESEASSVIPEY 80

Query: 250 -----LTEPARVLIYNKPEGEVTTREDPEGR 327
                L E   +LI NKP G + T  + +G+
Sbjct: 81  GELDILFEDNHMLIINKPAG-IATHPNEDGQ 110


>gi|3242649|dbj|BAA29028.1| (AB015440) alpha 1 type I collagen [Rana
            catesbeiana]
            Length = 1445
            
 Score = 35.2 bits (79), Expect = 1.4
 Identities = 31/83 (37%), Positives = 35/83 (41%), Gaps = 7/83 (8%)
 Frame = +1

Query: 1150 PEGVSTGPRNHRNA----GAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPN--NAPNFP 1311
            P+G S GP   + A    GA   A P   + NP T   PG    K     P    AP FP
Sbjct: 342  PQG-SRGPDGPQGARGEPGAPGQAGPAGSAGNPGTDGQPGA---KGATGAPGIAGAPGFP 397

Query: 1312 SDHATP-TFNPYGNPGQKTGAGRPNNSGGK 1398
                 P    P G+PG K   G P   G K
Sbjct: 398  GARGAPGPQGPGGSPGPKGNNGEPGAQGNK 427


>gi|4808164|emb|CAB42795.1| (Y18878) largest subunit of the RNA
            polymerase II complex [Drosophila guanche]
            Length = 1889
            
 Score = 35.2 bits (79), Expect = 1.4
 Identities = 41/127 (32%), Positives = 55/127 (43%), Gaps = 24/127 (18%)
 Frame = +1

Query: 1048 AAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGP-----RNHRNAGAGNGAH 1212
            +AA  A      + P +  S S  SPA   S Y P   S  P       +  A +  GA 
Sbjct: 1537 SAASDASGMSPSWSPAHPGS-SPSSPAPSMSPYYPASPSVSPSYSPTSPNYTASSPGGAS 1595

Query: 1213 PNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNF--------PS----DHATPTFNPYGN-P 1353
            PN    +PN   T       SP +Y +  PNF        PS       +P ++P  N P
Sbjct: 1596 PNYSPSSPNYSPTSPLYAAPSP-RYASTTPNFNPQSTGYSPSASGYSPTSPVYSPTSNFP 1654

Query: 1354 GQKTGAG------RPNNSGGKYNRNRGPRYP 1428
               + AG       P N+    + N  P  P
Sbjct: 1655 SSPSFAGSGSNMYSPGNAYSPSSTNYSPNSP 1685


>gi|4808166|emb|CAB42797.1| (Y18879) largest subunit of the RNA
            polymerase II complex [Drosophila pseudoobscura]
            Length = 1811
            
 Score = 35.2 bits (79), Expect = 1.4
 Identities = 41/127 (32%), Positives = 55/127 (43%), Gaps = 24/127 (18%)
 Frame = +1

Query: 1048 AAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGP-----RNHRNAGAGNGAH 1212
            +AA  A      + P +  S S  SPA   S Y P   S  P       +  A +  GA 
Sbjct: 1459 SAASDASGMSPSWSPAHPGS-SPSSPAPSMSPYYPASPSVSPSYSPTSPNYTASSPGGAS 1517

Query: 1213 PNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNF--------PS----DHATPTFNPYGN-P 1353
            PN    +PN   T       SP +Y +  PNF        PS       +P ++P  N P
Sbjct: 1518 PNYSPSSPNYSPTSPLYAAASP-RYASTTPNFNPQSTGYSPSASGYSPTSPVYSPTSNFP 1576

Query: 1354 GQKTGAG------RPNNSGGKYNRNRGPRYP 1428
               + AG       P N+    + N  P  P
Sbjct: 1577 SSPSFAGSGSNMYSPGNAYSPSSTNYSPNSP 1607


>gi|1589837 (U68729) cuticle preprocollagen [Meloidogyne incognita]
            Length = 308
            
 Score = 35.2 bits (79), Expect = 1.4
 Identities = 24/72 (33%), Positives = 35/72 (48%), Gaps = 4/72 (5%)
 Frame = +1

Query: 1213 PNKKSPNPNTRNTPGQ--QTRKSPYKYPNNAPNFPSDHATPTF-NPYGNPGQKTGAGRPN 1383
            P  +S NP    +PGQ  +++++P +     P  P     P    P G PGQ  G G+P 
Sbjct: 121  PPGRSGNPGAPGSPGQPGKSQQAPCQQVTTPPCKPCPGGQPGQPGPPGPPGQPGGPGQPG 180

Query: 1384 NSGGKYNRNR-GPRYP 1428
              GG+ +    GP  P
Sbjct: 181  QPGGQASPGEPGPAGP 196


>gi|563375|emb|CAA84031| (Z34277) mucin [Homo sapiens]
           Length = 477
           
 Score = 35.2 bits (79), Expect = 1.4
 Identities = 28/97 (28%), Positives = 41/97 (41%)
 Frame = +3

Query: 603 RQHRLTRLVSCRCQRRPQPRSTSAMGIARLPSQPPQTYTLWFRPPTTRTTAWPIDRTTQD 782
           ++ R T LV+      PQ  +TSA                    PTT TT+ P   TT  
Sbjct: 154 QKSRTTTLVTTSTTSTPQTSTTSA--------------------PTTSTTSAPTTSTTSA 193

Query: 783 PSRSTTHPTETRKRHAVSTHPATDYRPTPFSQSHTAR 893
           P+ STT   +T    ++S+ P +     P S + +AR
Sbjct: 194 PTTSTTSTPQT----SISSAPTSSTTSAPTSSTISAR 226


 Score = 34.0 bits (76), Expect = 3.0
 Identities = 25/65 (38%), Positives = 32/65 (48%), Gaps = 2/65 (3%)
 Frame = +3

Query: 705 PQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAV--STHPATDYRPTPFSQ 878
           P T T  F  PTT TT+     TT  P+ STT   +T K  A   ST   +   P+P + 
Sbjct: 234 PTTSTTSF--PTTSTTSATTTSTTSAPTSSTTSTPQTSKTSAATSSTTSGSGTTPSPVTT 291

Query: 879 SHTARES 899
           + TA  S
Sbjct: 292 TSTASVS 298


>gi|1519696 (U67956) coded for by C. elegans cDNA yk126f9.5; coded
           for by C. elegans cDNA yk159h6.3; coded for by C.
           elegans cDNA yk126f9.3; coded for by C. elegans cDNA
           yk159h6.5 [Caenorhabditis elegans]
           Length = 1229
           
 Score = 35.2 bits (79), Expect = 1.4
 Identities = 23/78 (29%), Positives = 32/78 (40%)
 Frame = +3

Query: 654 QPRSTSAMGIARLPSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAV 833
           QP ST+      LP    QT T     PTT   +    + T      +T  T T K+ + 
Sbjct: 554 QPTSTAESTTTALPFTTEQTVTT--EEPTTAEKSTATQKPTTTQESVSTEKTSTTKKAST 611

Query: 834 STHPATDYRPTPFSQSHT 887
           +  P T   PT  ++S T
Sbjct: 612 TEEPTTTDEPTTTTESST 629


>gi|1184072 (U40766) COL-1 [Meloidogyne incognita]
            Length = 309
            
 Score = 35.2 bits (79), Expect = 1.4
 Identities = 24/72 (33%), Positives = 35/72 (48%), Gaps = 4/72 (5%)
 Frame = +1

Query: 1213 PNKKSPNPNTRNTPGQ--QTRKSPYKYPNNAPNFPSDHATPTF-NPYGNPGQKTGAGRPN 1383
            P  +S NP    +PGQ  +++++P +     P  P     P    P G PGQ  G G+P 
Sbjct: 122  PPGRSGNPGAPGSPGQPGKSQQAPCQQVTTPPCKPCPGGQPGQPGPPGPPGQPGGPGQPG 181

Query: 1384 NSGGKYNRNR-GPRYP 1428
              GG+ +    GP  P
Sbjct: 182  QPGGQASPGEPGPAGP 197


>gi|2245693|gb|AAB62577.1| (AF010328) short form of CHIP [Drosophila
            melanogaster]
            Length = 365
            
 Score = 34.8 bits (78), Expect = 1.8
 Identities = 24/94 (25%), Positives = 38/94 (39%), Gaps = 6/94 (6%)
 Frame = +1

Query: 1144 YVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHA 1323
            Y P G S+ P  +++    N   P   +P   +   P   +   P    N   N+P    
Sbjct: 65   YPPAGQSS-PAGNQSIVFQNSNQPGSNTPQYTSSPAPSGSSTPGPVGAQNIPGNYPQSAT 123

Query: 1324 TPTFN-----PYGNPGQKTGA-GRPNNSGGKYNRNRGPRY 1425
               FN     P+G+P    G   RP +SG  +N  +   +
Sbjct: 124  AGNFNGPVGGPFGSPSSGLGQFSRPASSGTPFNSGQAGHF 163


>gi|4324436|gb|AAD16883| (AF104414) large tumor suppressor 1 [Mus
            musculus]
            Length = 962
            
 Score = 34.8 bits (78), Expect = 1.8
 Identities = 30/128 (23%), Positives = 52/128 (40%), Gaps = 6/128 (4%)
 Frame = +1

Query: 1000 GQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLS------EGSPATFQSWYVPEGV 1161
            G G+  F     V   +  +Q   P+    P N +S S        +P +F +  VP+ +
Sbjct: 183  GGGQSDFIVHQNVPTGSVTRQPPPPYP-LTPANGQSPSALQTGASAAPPSFANGNVPQSM 241

Query: 1162 STGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNP 1341
                RN  N    N   P  ++  P + + P Q +    ++ P   PN P   +    NP
Sbjct: 242  MVPNRNSHNMELYNINVPGLQTAWPQSSSAPAQSSPSGGHEIPTWQPNIPV-RSNSFNNP 300

Query: 1342 YGNPGQKTGAGRPN 1383
             G+    +   +P+
Sbjct: 301  LGSRASHSANSQPS 314


>gi|3877422|emb|CAB05195.1| (Z82268) predicted using Genefinder;
            similar to CUTICLE COLLAGEN 34; cDNA EST EMBL:D65629
            comes from this gene; cDNA EST EMBL:D68754 comes from
            this gene; cDNA EST EMBL:D68791 comes from this gene;
            cDNA EST EMBL:D68988 comes ...
            Length = 304
            
 Score = 34.8 bits (78), Expect = 1.8
 Identities = 26/85 (30%), Positives = 33/85 (38%)
 Frame = +1

Query: 1150 PEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATP 1329
            P+G S  P N   AGA     P   + + +    PGQ   + P   P  +P  P     P
Sbjct: 190  PKGASGAPGNPGQAGAPG--QPGADAQSESIPGAPGQAGPQGP-PGPAGSPGAPGGPGQP 246

Query: 1330 TFNPYGNPGQKTGAGRPNNSGGKYN 1404
                 G PGQK  +G P   G   N
Sbjct: 247  -----GAPGQKGPSGAPGQPGADGN 266


>gi|1280114|gb|AAC25888.1| (U55373) Similar to cuticular collagen;
            coded for by C. elegans cDNA yk92h9.3; coded for by C.
            elegans cDNA yk100f8.5; coded for by C. elegans cDNA
            yk123h6.5; coded for by C. elegans cDNA yk125b5.5; coded
            for by C. elegans cDN...
            Length = 289
            
 Score = 34.8 bits (78), Expect = 1.8
 Identities = 28/81 (34%), Positives = 33/81 (40%), Gaps = 3/81 (3%)
 Frame = +1

Query: 1150 PEGVSTGPRNHRNAGA-GNGAHPNKKSPN--PNTRNTPGQQTRKSPYKYPNNAPNFPSDH 1320
            P G   GP    + G  GN   P    P   P T   PGQ  R  P       P  P   
Sbjct: 176  PPGGPGGPGEGGDGGRPGNPGRPGPAGPRGEPGTEYKPGQPGRPGP-------PG-PRGE 227

Query: 1321 ATPTFNPYGNPGQKTGAGRPNNSG 1392
            A P   P G+PG    +G+P N+G
Sbjct: 228  AGPAGQP-GSPGNDGESGKPGNAG 250


>gi|2245687|gb|AAB62574.1| (AF010325) CHIP [Drosophila melanogaster]
            Length = 577
            
 Score = 34.8 bits (78), Expect = 1.8
 Identities = 24/94 (25%), Positives = 38/94 (39%), Gaps = 6/94 (6%)
 Frame = +1

Query: 1144 YVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHA 1323
            Y P G S+ P  +++    N   P   +P   +   P   +   P    N   N+P    
Sbjct: 65   YPPAGQSS-PAGNQSIVFQNSNQPGSNTPQYTSSPAPSGSSTPGPVGAQNIPGNYPQSAT 123

Query: 1324 TPTFN-----PYGNPGQKTGA-GRPNNSGGKYNRNRGPRY 1425
               FN     P+G+P    G   RP +SG  +N  +   +
Sbjct: 124  AGNFNGPVGGPFGSPSSGLGQFSRPASSGTPFNSGQAGHF 163


>gi|119712|sp|P14918|EXTN_MAIZE EXTENSIN PRECURSOR (PROLINE-RICH
           GLYCOPROTEIN) >gi|100863|pir||S08314 cell wall
           glycoprotein - maize >gi|22508|emb|CAA31854| (X13499)
           cell wall protein (AA 1-267) [Zea mays] >gi|168455
           (M36912) cell wall protein (put.); putative [Zea mays]
           >gi|226756|prf||1604465A cell wall protein [Zea mays]
           Length = 267
           
 Score = 34.4 bits (77), Expect = 2.3
 Identities = 28/80 (35%), Positives = 36/80 (45%), Gaps = 5/80 (6%)
 Frame = +3

Query: 648 RPQPRSTSAMGIARLPSQPPQTYTLWFRPPTTRT-----TAWPIDRTTQDPSRSTTHPTE 812
           +P P + +       P   P TYT   +PPT +      T  P    T+ P+   T PT 
Sbjct: 177 KPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPATKPPTPKPTPPTY 236

Query: 813 TRKRHAVSTHPATDYRPTPFSQSHT 887
           T      +T P T Y PTP   SHT
Sbjct: 237 TPTPKPPATKPPT-YTPTP-PVSHT 259


 Score = 33.2 bits (74), Expect = 5.2
 Identities = 21/67 (31%), Positives = 31/67 (45%)
 Frame = +3

Query: 693 PSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVSTHPATDYRPTPF 872
           P   P TYT   +PPT + T      + + P+   T PT T      +T P T  +PTP 
Sbjct: 176 PKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPATKPPTP-KPTPP 234

Query: 873 SQSHTAR 893
           + + T +
Sbjct: 235 TYTPTPK 241


>gi|227614|prf||1707318A Thr rich extensin [Zea mays]
           Length = 251
           
 Score = 34.4 bits (77), Expect = 2.3
 Identities = 28/80 (35%), Positives = 36/80 (45%), Gaps = 5/80 (6%)
 Frame = +3

Query: 648 RPQPRSTSAMGIARLPSQPPQTYTLWFRPPTTRT-----TAWPIDRTTQDPSRSTTHPTE 812
           +P P + +       P   P TYT   +PPT +      T  P    T+ P+   T PT 
Sbjct: 161 KPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPATKPPTPKPTPPTY 220

Query: 813 TRKRHAVSTHPATDYRPTPFSQSHT 887
           T      +T P T Y PTP   SHT
Sbjct: 221 TPTPKPPATKPPT-YTPTP-PVSHT 243


 Score = 33.2 bits (74), Expect = 5.2
 Identities = 21/67 (31%), Positives = 31/67 (45%)
 Frame = +3

Query: 693 PSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVSTHPATDYRPTPF 872
           P   P TYT   +PPT + T      + + P+   T PT T      +T P T  +PTP 
Sbjct: 160 PKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPATKPPTP-KPTPP 218

Query: 873 SQSHTAR 893
           + + T +
Sbjct: 219 TYTPTPK 225


>gi|3810839|emb|CAA21800| (AL032684) conserved hypothetical
            zinc-finger protein [Schizosaccharomyces pombe]
            Length = 482
            
 Score = 34.4 bits (77), Expect = 2.3
 Identities = 39/154 (25%), Positives = 62/154 (39%), Gaps = 1/154 (0%)
 Frame = +1

Query: 841  TLQPIIGQRRSAKATLHVNRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHF 1020
            TL P   ++R  +A +      N+K++  +   T+D++         R+D     G + F
Sbjct: 330  TLNPDYQKQREIEAVVKSVLGSNSKNS--DKVGTSDDNNTPMSEKRKREDDD-ANGPNKF 386

Query: 1021 KDRLT-VSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGA 1197
              R + V  +A A+ A K              +G PA F  + +P G+   P    NA A
Sbjct: 387  AARSSAVFSKATAEPAFKSAMAIPDMPSMPHVQGFPAPFPPFMMP-GLPQMPPMMMNAIA 445

Query: 1198 GNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAP 1302
            G   H N+  P  N+R  P   +   P     N P
Sbjct: 446  GQVYHNNRNPPRTNSR--PSNASVPPPSSLHKNPP 478


>gi|228938|prf||1814452C Hyp-rich glycoprotein [Zea diploperennis]
           Length = 349
           
 Score = 34.4 bits (77), Expect = 2.3
 Identities = 28/80 (35%), Positives = 36/80 (45%), Gaps = 5/80 (6%)
 Frame = +3

Query: 648 RPQPRSTSAMGIARLPSQPPQTYTLWFRPPTTRT-----TAWPIDRTTQDPSRSTTHPTE 812
           +P P + +       P   P TYT   +PPT +      T  P    T+ P+   T PT 
Sbjct: 259 KPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPATKPPTPKPTPPTY 318

Query: 813 TRKRHAVSTHPATDYRPTPFSQSHT 887
           T      +T P T Y PTP   SHT
Sbjct: 319 TPTPKPPATKPPT-YTPTP-PVSHT 341


>gi|1082938|pir||A49688 lactose-binding lectin L-29 - dog
            Length = 294
            
 Score = 34.4 bits (77), Expect = 2.3
 Identities = 30/87 (34%), Positives = 36/87 (40%), Gaps = 10/87 (11%)
 Frame = +1

Query: 1168 GPRNHRNAGAGN-------GAHPNKKSPN--PNTRNTPGQQTRKSPYKYPNNAPNFPSDH 1320
            GP  ++ AGAG        GA+P +  P   P      G   +  P  YP  AP      
Sbjct: 22   GPWGNQPAGAGGYPGASYPGAYPGQAPPGGYPGQAPPGGYPGQAPPGGYPGQAPPGGYPG 81

Query: 1321 ATPTFNPYGNPGQKTGAGRPNNS-GGKYNRNRGPRYP 1428
              P   P G PGQ    G P  +  G Y     P YP
Sbjct: 82   QAP---PGGYPGQAPPGGYPGQAPPGTYPGPTAPAYP 115


>gi|3851592 (AF093575) surface antigen ariel1 [Entamoeba histolytica]
            Length = 215
            
 Score = 34.4 bits (77), Expect = 2.3
 Identities = 25/102 (24%), Positives = 40/102 (38%)
 Frame = +1

Query: 1093 KNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRK 1272
            + D+  S  S     S   P+  S    N  +    N +  NK  PN ++ N P + +  
Sbjct: 47   EEDKKSSSNSELDENSNNQPDESSNNKPNESSDNKPNESSDNK--PNESSNNKPSESSNN 104

Query: 1273 SPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGK 1398
             P +  NN PN  SD+  P  +    P + +      +S  K
Sbjct: 105  KPDESSNNKPNESSDN-KPNESSNNKPNESSNNKPSESSNNK 145


 Score = 33.2 bits (74), Expect = 5.2
 Identities = 22/80 (27%), Positives = 35/80 (43%)
 Frame = +1

Query: 1150 PEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATP 1329
            P+  S    N  +    N +  NK  PN ++ N P + +   P +  NN PN  SD+  P
Sbjct: 106  PDESSNNKPNESSDNKPNESSNNK--PNESSNNKPSESSNNKPDESSNNKPNESSDN-KP 162

Query: 1330 TFNPYGNPGQKTGAGRPNNS 1389
              +    P + +   +PN S
Sbjct: 163  NESSNNKPNESSD-NKPNES 181


>gi|71823|pir||FGHUA fibrinogen alpha chain precursor, short splice
            form - human >gi|182426 (J00128) A-alpha fibrinogen [Homo
            sapiens] >gi|458554 (M64982) common fibrinogen alpha
            chain [Homo sapiens] >gi|4033511 (M58569) fibrinogen
            alpha subunit [Homo sapiens]
            Length = 644
            
 Score = 34.4 bits (77), Expect = 2.3
 Identities = 32/109 (29%), Positives = 48/109 (43%), Gaps = 5/109 (4%)
 Frame = +1

Query: 1093 KNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNT--PGQQT 1266
            +N  S   G  AT++      G STG  N  ++G G+  + N  SP P +  T  PG   
Sbjct: 308  RNPGSSGTGGTATWKPGSSGPG-STGSWNSGSSGTGSTGNQNPGSPRPGSTGTWNPGSSE 366

Query: 1267 RKSPYKYPNNAPNFPSDHATPTFNPYGNPGQ---KTGAGRPNNSGGKYNRNRGP 1419
            R S            + H T   +  G+ GQ   ++G+ RP++ G    R   P
Sbjct: 367  RGS------------AGHWTSESSVSGSTGQWHSESGSFRPDSPGSGNARPNNP 408


>gi|437331 (L23429) beta-galactosides-binding lectin [Canis
            familiaris]
            Length = 285
            
 Score = 34.4 bits (77), Expect = 2.3
 Identities = 30/87 (34%), Positives = 36/87 (40%), Gaps = 10/87 (11%)
 Frame = +1

Query: 1168 GPRNHRNAGAGN-------GAHPNKKSPN--PNTRNTPGQQTRKSPYKYPNNAPNFPSDH 1320
            GP  ++ AGAG        GA+P +  P   P      G   +  P  YP  AP      
Sbjct: 13   GPWGNQPAGAGGYPGASYPGAYPGQAPPGGYPGQAPPGGYPGQAPPGGYPGQAPPGGYPG 72

Query: 1321 ATPTFNPYGNPGQKTGAGRPNNS-GGKYNRNRGPRYP 1428
              P   P G PGQ    G P  +  G Y     P YP
Sbjct: 73   QAP---PGGYPGQAPPGGYPGQAPPGTYPGPTAPAYP 106


>gi|120943|sp|P13816|GARP_PLAFF GLUTAMIC ACID-RICH PROTEIN PRECURSOR
            >gi|627054|pir||A54514 glutamic acid-rich protein
            precursor - Plasmodium falciparum
            >gi|160299|gb|AAA29605.1| (J03998) glutamic acid-rich
            protein [Plasmodium falciparum]
            Length = 678
            
 Score = 34.4 bits (77), Expect = 2.3
 Identities = 41/164 (25%), Positives = 69/164 (42%), Gaps = 11/164 (6%)
 Frame = +1

Query: 796  LRTQLKLEKDMPLALTLQPIIGQRRSAKATLHVNRNDNNKHAYHNNH--STADESRELRR 969
            L  + +LEK+       + ++ + +  K  +    NDN K+A++NN   S+ D +  +  
Sbjct: 49   LLNETELEKNKDDNSKSETLLKEEKDEKDDVPTTSNDNLKNAHNNNEISSSTDPTNIINV 108

Query: 970  FD----TLRDDRGRGQGKHHFKDRLTV-----SGEAAAKQAHKPFKQYKPKNDRSLSEGS 1122
             D       D +   + K H KD+          E   K+  K  K+ K K D+   E S
Sbjct: 109  NDKDNENSVDKKKDKKEKKHKKDKKEKKEKKDKKEKKDKKEKKHKKEKKHKKDKKKKENS 168

Query: 1123 PATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKY 1287
                 S Y      TG    +NA      + +++  +    N  G     SPY+Y
Sbjct: 169  EV--MSLY-----KTGQHKPKNATEHGEENLDEEMVSEINNNAQGGLLLSSPYQY 216


>gi|283032|pir||S22456 hydroxyproline-rich glycoprotein - perennial
           teosinte >gi|22092|emb|CAA45514| (X64173)
           hydroxyproline-rich  glycoprotein [Zea diploperennis]
           Length = 350
           
 Score = 34.4 bits (77), Expect = 2.3
 Identities = 28/80 (35%), Positives = 36/80 (45%), Gaps = 5/80 (6%)
 Frame = +3

Query: 648 RPQPRSTSAMGIARLPSQPPQTYTLWFRPPTTRT-----TAWPIDRTTQDPSRSTTHPTE 812
           +P P + +       P   P TYT   +PPT +      T  P    T+ P+   T PT 
Sbjct: 260 KPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPATKPPTPKPTPPTY 319

Query: 813 TRKRHAVSTHPATDYRPTPFSQSHT 887
           T      +T P T Y PTP   SHT
Sbjct: 320 TPTPKPPATKPPT-YTPTP-PVSHT 342


>gi|3834294 (U80846) No definition line found [Caenorhabditis elegans]
            Length = 2232
            
 Score = 34.4 bits (77), Expect = 2.3
 Identities = 27/97 (27%), Positives = 40/97 (40%), Gaps = 1/97 (1%)
 Frame = +1

Query: 1105 SLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNT-RNTPGQQTRKSPY 1281
            S   GS  T QS       + G  +     + +   P+ +SP PNT   TP Q + +SP 
Sbjct: 564  STVSGSTGTSQSTLASSTATPGSSS--TVPSSSSPQPSSQSPAPNTGSTTPSQTSSQSPS 621

Query: 1282 KYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGG 1395
               N + + P+  +  T  P G+      A  P  S G
Sbjct: 622  PSMNPSSSTPTGSSQSTITPEGST-----ASSPTGSTG 654


>gi|168457 (M36913) cell wall protein (put.); putative [Zea mays]
           Length = 109
           
 Score = 34.4 bits (77), Expect = 2.3
 Identities = 28/80 (35%), Positives = 36/80 (45%), Gaps = 5/80 (6%)
 Frame = +3

Query: 648 RPQPRSTSAMGIARLPSQPPQTYTLWFRPPTTRT-----TAWPIDRTTQDPSRSTTHPTE 812
           +P P + +       P   P TYT   +PPT +      T  P    T+ P+   T PT 
Sbjct: 19  KPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPATKPPTPKPTPPTY 78

Query: 813 TRKRHAVSTHPATDYRPTPFSQSHT 887
           T      +T P T Y PTP   SHT
Sbjct: 79  TPTPKPPATKPPT-YTPTP-PVSHT 101


 Score = 33.2 bits (74), Expect = 5.2
 Identities = 21/67 (31%), Positives = 31/67 (45%)
 Frame = +3

Query: 693 PSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVSTHPATDYRPTPF 872
           P   P TYT   +PPT + T      + + P+   T PT T      +T P T  +PTP 
Sbjct: 18  PKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPATKPPTP-KPTPP 76

Query: 873 SQSHTAR 893
           + + T +
Sbjct: 77  TYTPTPK 83


>gi|4503689|ref|NP_000499.1|| fibrinogen, A alpha polypeptide
            >gi|1706799|sp|P02671|FIBA_HUMAN FIBRINOGEN ALPHA/ALPHA-E
            CHAIN PRECURSOR >gi|2135107|pir||D44234 fibrinogen alpha
            chain precursor, extended splice form - human >gi|182407
            (M58569) fibrinogen alpha subunit precursor [Homo
            sapiens] >gi|458555 (M64982) fibrinogen alpha-E chain
            [Homo sapiens]
            Length = 866
            
 Score = 34.4 bits (77), Expect = 2.3
 Identities = 32/109 (29%), Positives = 48/109 (43%), Gaps = 5/109 (4%)
 Frame = +1

Query: 1093 KNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNT--PGQQT 1266
            +N  S   G  AT++      G STG  N  ++G G+  + N  SP P +  T  PG   
Sbjct: 308  RNPGSSGTGGTATWKPGSSGPG-STGSWNSGSSGTGSTGNQNPGSPRPGSTGTWNPGSSE 366

Query: 1267 RKSPYKYPNNAPNFPSDHATPTFNPYGNPGQ---KTGAGRPNNSGGKYNRNRGP 1419
            R S            + H T   +  G+ GQ   ++G+ RP++ G    R   P
Sbjct: 367  RGS------------AGHWTSESSVSGSTGQWHSESGSFRPDSPGSGNARPNNP 408


>gi|3834293 (U80846) No definition line found [Caenorhabditis elegans]
            Length = 1032
            
 Score = 34.4 bits (77), Expect = 2.3
 Identities = 27/97 (27%), Positives = 40/97 (40%), Gaps = 1/97 (1%)
 Frame = +1

Query: 1105 SLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNT-RNTPGQQTRKSPY 1281
            S   GS  T QS       + G  +     + +   P+ +SP PNT   TP Q + +SP 
Sbjct: 564  STVSGSTGTSQSTLASSTATPGSSS--TVPSSSSPQPSSQSPAPNTGSTTPSQTSSQSPS 621

Query: 1282 KYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGG 1395
               N + + P+  +  T  P G+      A  P  S G
Sbjct: 622  PSMNPSSSTPTGSSQSTITPEGST-----ASSPTGSTG 654


>gi|3924913|emb|CAB03474.1| (Z81138) predicted using Genefinder; cDNA
            EST EMBL:D65543 comes from this gene [Caenorhabditis
            elegans]
            Length = 304
            
 Score = 34.0 bits (76), Expect = 3.0
 Identities = 30/92 (32%), Positives = 40/92 (42%), Gaps = 12/92 (13%)
 Frame = +1

Query: 1150 PEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPN-----NAPNFPS 1314
            P+G    P N   AGA     P +   +  + ++PG   +  P   P       AP  P 
Sbjct: 190  PKGAPGAPGNPGQAGA-----PGQPGSDAQSESSPGAPGQAGPQGPPGPAGSPGAPGGPG 244

Query: 1315 DHATP-TFNPYGNPGQ-----KTGA-GRPNNSGGKYNRNRGPRY 1425
                P    P G PGQ       GA G+P  SGG   +   P+Y
Sbjct: 245  QAGAPGPKGPSGAPGQPGADGNPGAPGQPGQSGGAGEKGICPKY 288


>gi|3924914|emb|CAB03475.1| (Z81138) predicted using Genefinder; cDNA
            EST EMBL:D69494 comes from this gene; cDNA EST
            EMBL:D69317 comes from this gene [Caenorhabditis elegans]
            Length = 304
            
 Score = 34.0 bits (76), Expect = 3.0
 Identities = 30/92 (32%), Positives = 40/92 (42%), Gaps = 12/92 (13%)
 Frame = +1

Query: 1150 PEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPN-----NAPNFPS 1314
            P+G    P N   AGA     P +   +  + ++PG   +  P   P       AP  P 
Sbjct: 190  PKGAPGAPGNPGQAGA-----PGQPGSDAQSESSPGAPGQAGPQGPPGPAGSPGAPGGPG 244

Query: 1315 DHATP-TFNPYGNPGQ-----KTGA-GRPNNSGGKYNRNRGPRY 1425
                P    P G PGQ       GA G+P  SGG   +   P+Y
Sbjct: 245  QAGAPGPKGPSGAPGQPGADGNPGAPGQPGQSGGAGEKGICPKY 288


>gi|6580246|emb|CAB63321.1| (Z81138) cDNA EST EMBL:D65528 comes from
            this gene [Caenorhabditis elegans]
            Length = 304
            
 Score = 34.0 bits (76), Expect = 3.0
 Identities = 30/92 (32%), Positives = 40/92 (42%), Gaps = 12/92 (13%)
 Frame = +1

Query: 1150 PEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPN-----NAPNFPS 1314
            P+G    P N   AGA     P +   +  + ++PG   +  P   P       AP  P 
Sbjct: 190  PKGAPGAPGNPGQAGA-----PGQPGSDAQSESSPGAPGQAGPQGPPGPAGSPGAPGGPG 244

Query: 1315 DHATP-TFNPYGNPGQ-----KTGA-GRPNNSGGKYNRNRGPRY 1425
                P    P G PGQ       GA G+P  SGG   +   P+Y
Sbjct: 245  QAGAPGPKGPSGAPGQPGADGNPGAPGQPGQSGGAGEKGICPKY 288


>gi|1280073 (U55366) Similar to cuticle collagen [Caenorhabditis
            elegans]
            Length = 310
            
 Score = 34.0 bits (76), Expect = 3.0
 Identities = 29/92 (31%), Positives = 39/92 (41%), Gaps = 10/92 (10%)
 Frame = +1

Query: 1150 PEGVSTGP----RNHRNAGAG----NGAHPNKKSPN--PNTRNTPGQQTRKSPYKYPNNA 1299
            P+G   GP    R+ +   AG    + + P +  PN  P  R  PGQ         P   
Sbjct: 195  PKGAPGGPGQPGRDGQPGQAGQPGSSSSEPGQPGPNGQPGPRGPPGQAGSPGGNGQPGG- 253

Query: 1300 PNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRY 1425
            P  P    +      GN GQ    G+P  SGG   +   P+Y
Sbjct: 254  PGQPGQRGSD--GQPGNDGQPGAPGQPGQSGGSGEKGICPKY 293


>gi|5901942|ref|NP_008971.1|| E1B-55kDa-associated protein 5
            >gi|3319956|emb|CAA07548| (AJ007509) E1B-55kDa-associated
            protein [Homo sapiens]
            Length = 856
            
 Score = 34.0 bits (76), Expect = 3.0
 Identities = 24/72 (33%), Positives = 34/72 (46%), Gaps = 3/72 (4%)
 Frame = +1

Query: 1213 PNKKSPNPN---TRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPN 1383
            P +  P P+    RN PG  T    Y   +N P   ++ +TPT + Y +P Q + +  P 
Sbjct: 708  PQQPPPPPSYSPARNPPGAST----YNKNSNIPGSSANTSTPTVSSY-SPPQPSYSQPPY 762

Query: 1384 NSGGKYNRNRGPRYP 1428
            N GG      GP  P
Sbjct: 763  NQGGYSQGYTGPPPP 777


>gi|2501678|sp|Q45480|YLYB_BACSU HYPOTHETICAL 33.7 KD PROTEIN IN
           LSP-PYRR INTERGENIC REGION (ORF-X) >gi|1373157 (U48870)
           orf-X; hypothetical protein;  Method: conceptual
           translation supplied by author
           Length = 303
           
 Score = 34.0 bits (76), Expect = 3.0
 Identities = 28/83 (33%), Positives = 44/83 (52%), Gaps = 11/83 (13%)
 Frame = +1

Query: 52  TATEAPKLEERLHKVLAQAGLG-SRRALEQRISNGLIKVNGDIAQLGMSVKSGDKI---- 216
           TA+E  K  ER+ K LA      SR  ++Q + +G + VNG   +    ++ GD++    
Sbjct: 7   TASEEQK-SERIDKFLASTENDWSRTQVQQWVKDGQVVVNGSAVKANYKIQPGDQVTVTV 65

Query: 217 -ELDGRSFVASALT-----EPARVLIYNKPEGEV 300
            E +    +A  +      E   VL+ NKP G V
Sbjct: 66  PEPEALDVLAEPMDLDIYYEDQDVLVVNKPRGMV 99


>gi|2633919|emb|CAB13420| (Z99112) alternate gene name: ylmL;
           similar to hypothetical proteins [Bacillus subtilis]
           Length = 273
           
 Score = 34.0 bits (76), Expect = 3.0
 Identities = 28/83 (33%), Positives = 44/83 (52%), Gaps = 11/83 (13%)
 Frame = +1

Query: 52  TATEAPKLEERLHKVLAQAGLG-SRRALEQRISNGLIKVNGDIAQLGMSVKSGDKI---- 216
           TA+E  K  ER+ K LA      SR  ++Q + +G + VNG   +    ++ GD++    
Sbjct: 7   TASEEQK-SERIDKFLASTENDWSRTQVQQWVKDGQVVVNGSAVKANYKIQPGDQVTVTV 65

Query: 217 -ELDGRSFVASALT-----EPARVLIYNKPEGEV 300
            E +    +A  +      E   VL+ NKP G V
Sbjct: 66  PEPEALDVLAEPMDLDIYYEDQDVLVVNKPRGMV 99


>gi|2135766|pir||S53362 mucin 5AC (clone JER47) - human (fragment)
           Length = 477
           
 Score = 34.0 bits (76), Expect = 3.0
 Identities = 25/65 (38%), Positives = 32/65 (48%), Gaps = 2/65 (3%)
 Frame = +3

Query: 705 PQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAV--STHPATDYRPTPFSQ 878
           P T T  F  PTT TT+     TT  P+ STT   +T K  A   ST   +   P+P + 
Sbjct: 234 PTTSTTSF--PTTSTTSATTTSTTSAPTTSTTSTPQTSKTSAATSSTTSGSGTTPSPVTT 291

Query: 879 SHTARES 899
           + TA  S
Sbjct: 292 TSTASVS 298


 Score = 34.0 bits (76), Expect = 3.0
 Identities = 27/93 (29%), Positives = 38/93 (40%)
 Frame = +3

Query: 603 RQHRLTRLVSCRCQRRPQPRSTSAMGIARLPSQPPQTYTLWFRPPTTRTTAWPIDRTTQD 782
           ++ R T LV+      PQ  +TSA                    PTT TT+ P   TT  
Sbjct: 154 QKSRTTTLVTTSTTSTPQTSTTSA--------------------PTTSTTSAPTTSTTSA 193

Query: 783 PSRSTTHPTETRKRHAVSTHPATDYRPTPFSQS 881
           P+ STT   +T    ++S+ P T     P S +
Sbjct: 194 PTTSTTSTPQT----SISSAPTTSTTSAPTSST 222


>gi|1139597 (U43400) H1 gene product [Human herpesvirus 7] >gi|1139696
            (U43400) H1' gene product [Human herpesvirus 7]
            Length = 169
            
 Score = 34.0 bits (76), Expect = 3.0
 Identities = 18/42 (42%), Positives = 21/42 (49%)
 Frame = +1

Query: 1204 GAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATP 1329
            GA+PN   PNPN  + P      +P   PN  PN PS H  P
Sbjct: 9    GANPN---PNPNPSSKPNPSPNPNPSSKPNPNPN-PSSHCHP 46


>gi|628112|pir||S32988 hypothetical protein - human herpesvirus 4
            Length = 512
            
 Score = 33.6 bits (75), Expect = 4.0
 Identities = 35/122 (28%), Positives = 56/122 (45%), Gaps = 6/122 (4%)
 Frame = +1

Query: 976  TLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPE 1155
            T    RG+ +G+   + R    G++  KQ  KP   ++P+ + S    SP+      +PE
Sbjct: 361  TQGQSRGQSRGRGRGRGRGRGKGKSRDKQ-RKPGGPWRPEPNTS----SPS------MPE 409

Query: 1156 GVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYP--NNAPN----FPSD 1317
             +S     H+  GAG+   P   +  P  RN+       SP   P  +N+P     FP D
Sbjct: 410  -LSPVLGLHQGQGAGDSPTPGPSNAAPVCRNSHTATPNVSPIHEPESHNSPEAPILFPDD 468

Query: 1318 HATPTFNP 1341
               P+ +P
Sbjct: 469  WYPPSIDP 476


>gi|628185|pir||S42447 nuclear protein EBNA2 - Epstein-Barr virus
            >gi|330444 (K03333) nuclear protein EBNA2 [Epstein-Barr
            virus]
            Length = 490
            
 Score = 33.6 bits (75), Expect = 4.0
 Identities = 35/122 (28%), Positives = 56/122 (45%), Gaps = 6/122 (4%)
 Frame = +1

Query: 976  TLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPE 1155
            T    RG+ +G+   + R    G++  KQ  KP   ++P+ + S    SP+      +PE
Sbjct: 339  TQGQSRGQSRGRGRGRGRGRGKGKSRDKQ-RKPGGPWRPEPNTS----SPS------MPE 387

Query: 1156 GVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYP--NNAPN----FPSD 1317
             +S     H+  GAG+   P   +  P  RN+       SP   P  +N+P     FP D
Sbjct: 388  -LSPVLGLHQGQGAGDSPTPGPSNAAPVCRNSHTATPNVSPIHEPESHNSPEAPILFPDD 446

Query: 1318 HATPTFNP 1341
               P+ +P
Sbjct: 447  WYPPSIDP 454


>gi|5833486|gb|AAD53528.1| (AF162149) variable surface lipoprotein
            [Mycoplasma bovis]
            Length = 202
            
 Score = 33.6 bits (75), Expect = 4.0
 Identities = 27/87 (31%), Positives = 39/87 (44%), Gaps = 2/87 (2%)
 Frame = +1

Query: 1153 EGVSTGPRNHR--NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHAT 1326
            +G  T P      N G G  A+P++ +P    + TP    + +P       P  P D  T
Sbjct: 105  QGTPTNPDQGTPANPGQGTPANPDQGTPTNPDQGTPANPGQGTPANPDQGTPANP-DQGT 163

Query: 1327 PTFNPYGNPGQKTGAGRPNNSGGKYNRNR 1413
            PT     NPGQ T A +P+ S  + N  +
Sbjct: 164  PT-----NPGQGTPA-KPHFSPEEENAEK 186


>gi|283045|pir||S28264 hydroxyproline-rich glycoprotein - maize
           >gi|22333|emb|CAA44844| (X63134) hydroxyproline-rich
           glycoprotein [Zea mays] >gi|228936|prf||1814452A
           Hyp-rich glycoprotein [Zea mays]
           Length = 303
           
 Score = 33.6 bits (75), Expect = 4.0
 Identities = 23/65 (35%), Positives = 30/65 (45%)
 Frame = +3

Query: 693 PSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVSTHPATDYRPTPF 872
           P   P TYT   +PPT + +      + + P+   T PT T      +T P T Y PTP 
Sbjct: 233 PKPTPPTYTPSPKPPTPKPSPPTYTPSPKPPTPKPTPPTYTPTPKPPATKPPT-YTPTP- 290

Query: 873 SQSHT 887
             SHT
Sbjct: 291 PVSHT 295


 Score = 33.6 bits (75), Expect = 4.0
 Identities = 22/74 (29%), Positives = 32/74 (42%)
 Frame = +3

Query: 648 RPQPRSTSAMGIARLPSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRH 827
           +P P + +       P   P TYT   +PPT + T      + + P+   T PT T    
Sbjct: 165 KPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPK 224

Query: 828 AVSTHPATDYRPTP 869
             +T P T  +PTP
Sbjct: 225 PPATKPPTP-KPTP 237


>gi|3880306|emb|CAA90995.1| (Z54238) similar to cuticle collagen; cDNA
            EST EMBL:T01150 comes from this gene; cDNA EST
            EMBL:D33882 comes from this gene; cDNA EST EMBL:D65956
            comes from this gene; cDNA EST EMBL:D66123 comes from
            this gene; cDNA EST EMBL:D... >gi|3880308|emb|CAA90997.1|
            (Z54238) similar to cuticle collagen [Caenorhabditis
            elegans]
            Length = 299
            
 Score = 33.6 bits (75), Expect = 4.0
 Identities = 23/76 (30%), Positives = 30/76 (39%)
 Frame = +1

Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
            P N    GA     P+  + NP     PGQ  + +P +        P+  A P   P G 
Sbjct: 174  PGNDGQPGAPGNKGPSGPNGNPGAPGAPGQPGQDAPSEPITPGAPGPAGPAGPQ-GPPGA 232

Query: 1351 PGQKTGAGRPNNSGGK 1398
            PGQ    G+P   G K
Sbjct: 233  PGQPGHDGQPGAPGPK 248


>gi|2833307|sp|Q21184|YXWK_CAEEL PUTATIVE CUTICLE COLLAGEN F55C10.3
            >gi|3877662|emb|CAA98487.1| (Z74036) similar to collagen
            [Caenorhabditis elegans]
            Length = 266
            
 Score = 33.6 bits (75), Expect = 4.0
 Identities = 31/93 (33%), Positives = 37/93 (39%), Gaps = 7/93 (7%)
 Frame = +1

Query: 1150 PEGVSTGPRNHRNAGA-GNGAHPNKKSP-----NPNTRNTPGQQTRKSPYKYPNNAPNFP 1311
            P G + G  N  + G  G    P  K P     NP     PGQ  + +P +     P  P
Sbjct: 128  PSGDAGGNGNPGSPGQDGQPGAPGNKGPSGPNGNPGAPGAPGQPGQDAPSE--PITPGAP 185

Query: 1312 SDHATP-TFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRYP 1428
                TP    P G PGQ    G+P   G K   N  P  P
Sbjct: 186  GPQGTPGPQGPPGQPGQPGHDGQPGAPGPK-GPNGNPGQP 224


>gi|3877661|emb|CAA98486.1| (Z74036) predicted using Genefinder;
            similar to collagen [Caenorhabditis elegans]
            Length = 299
            
 Score = 33.6 bits (75), Expect = 4.0
 Identities = 31/93 (33%), Positives = 37/93 (39%), Gaps = 7/93 (7%)
 Frame = +1

Query: 1150 PEGVSTGPRNHRNAGA-GNGAHPNKKSP-----NPNTRNTPGQQTRKSPYKYPNNAPNFP 1311
            P G + G  N  + G  G    P  K P     NP     PGQ  + +P +     P  P
Sbjct: 161  PSGDAGGNGNPGSPGQDGQPGAPGNKGPSGPNGNPGAPGAPGQPGQDAPSE--PITPGAP 218

Query: 1312 SDHATP-TFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRYP 1428
                TP    P G PGQ    G+P   G K   N  P  P
Sbjct: 219  GPQGTPGPQGPPGQPGQPGHDGQPGAPGPK-GPNGNPGQP 257


>gi|543543|pir||JN0690 glutenin, high-molecular-weight Bx7 chain
            precursor - wheat
            Length = 791
            
 Score = 33.6 bits (75), Expect = 4.0
 Identities = 28/136 (20%), Positives = 45/136 (32%)
 Frame = +1

Query: 1000 GQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRN 1179
            GQG+ H +           +Q + P    +P   + L +G P      Y P     G + 
Sbjct: 139  GQGQQHQQP-------GQRQQGYYPTSPQQPGQGQQLGQGQPG-----YYPTSQQPGQKQ 186

Query: 1180 HRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQ 1359
                G  +G       P    ++  GQQ  +    Y   +P        P        GQ
Sbjct: 187  QAGQGQQSGQGQQGYYPTSPQQSGQGQQPGQGQPGYYPTSPQQSGQWQQPGQGQQPGQGQ 246

Query: 1360 KTGAGRPNNSGGKYNR 1407
            ++G G+     G+  R
Sbjct: 247  QSGQGQQGQQPGQGQR 262


>gi|102059|pir||D41710 promastigote surface antigen-2 (clone 4.6) -
           Leishmania major (fragment) >gi|9583|emb|CAA40414|
           (X57135) surface antigen P2 [Leishmania major]
           Length = 327
           
 Score = 33.6 bits (75), Expect = 4.0
 Identities = 21/66 (31%), Positives = 32/66 (47%)
 Frame = +3

Query: 696 SQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVSTHPATDYRPTPFS 875
           ++PP T T   +PPTT TT   +  TT   + +T  PT T      +T   T  +P P +
Sbjct: 151 TKPPTTTTTTTKPPTTTTTTTKLPTTT---TTTTKPPTTTTTTTTTTTTTTTTTKP-PIT 206

Query: 876 QSHTAR 893
            + T +
Sbjct: 207 TATTTK 212


 Score = 33.2 bits (74), Expect = 5.2
 Identities = 18/40 (45%), Positives = 23/40 (57%), Gaps = 2/40 (5%)
 Frame = +3

Query: 696 SQPPQTYTLWFRPPT--TRTTAWPIDRTTQDPSRSTTHPTET 815
           ++PP T T   +PPT  T TT  P   TT+ P+  TT  T T
Sbjct: 211 TKPPTTTTTTTKPPTTITSTTKLPTTTTTEAPAEPTTTATPT 252


>gi|119111|sp|P12978|EBN2_EBV EBNA-2 NUCLEAR PROTEIN
            >gi|1083964|pir||S42442 EBNA2 protein - human herpesvirus
            4 >gi|1632787|emb|CAA24877.1| (V01555) BYRF1, encodes
            EBNA-2 (Dambaugh et al, 1984; Dillner et al, 1984) [Human
            herpesvirus 4]
            Length = 487
            
 Score = 33.6 bits (75), Expect = 4.0
 Identities = 35/122 (28%), Positives = 56/122 (45%), Gaps = 6/122 (4%)
 Frame = +1

Query: 976  TLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPE 1155
            T    RG+ +G+   + R    G++  KQ  KP   ++P+ + S    SP+      +PE
Sbjct: 336  TQGQSRGQSRGRGRGRGRGRGKGKSRDKQ-RKPGGPWRPEPNTS----SPS------MPE 384

Query: 1156 GVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYP--NNAPN----FPSD 1317
             +S     H+  GAG+   P   +  P  RN+       SP   P  +N+P     FP D
Sbjct: 385  -LSPVLGLHQGQGAGDSPTPGPSNAAPVCRNSHTATPNVSPIHEPESHNSPEAPILFPDD 443

Query: 1318 HATPTFNP 1341
               P+ +P
Sbjct: 444  WYPPSIDP 451


>gi|100864|pir||S08315 cell wall protein - maize (fragment)
           >gi|22269|emb|CAA31860| (X13506) cell wall protein (108
           AA) [Zea mays] >gi|168459 (M36914) cell wall protein
           (put.); putative [Zea mays]
           Length = 108
           
 Score = 33.2 bits (74), Expect = 5.2
 Identities = 21/67 (31%), Positives = 31/67 (45%)
 Frame = +3

Query: 693 PSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVSTHPATDYRPTPF 872
           P   P TYT   +PPT + T      + + P+   T PT T      +T P T  +PTP 
Sbjct: 18  PKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPATKPPTP-KPTPP 76

Query: 873 SQSHTAR 893
           + + T +
Sbjct: 77  TYTPTPK 83


>gi|2135765|pir||A43932 mucin 2 precursor, intestinal - human
            (fragments)
            Length = 3020
            
 Score = 33.2 bits (74), Expect = 5.2
 Identities = 28/79 (35%), Positives = 35/79 (43%), Gaps = 9/79 (11%)
 Frame = +3

Query: 651  PQPRSTSAMGIARLPSQPPQTYTLWFRPPTT------RTTAWPIDRTTQDPSRSTT---H 803
            P P +T+ +     PS P  T T    PPTT       TT  P+  TT  P  STT    
Sbjct: 1408 PPPTTTTTLPPTTTPSPPTTTTTT--PPPTTTPSPPITTTTTPLPTTTPSPPISTTTTPP 1465

Query: 804  PTETRKRHAVSTHPATDYRPTPFSQSHT 887
            PT T      +  P T     P + + T
Sbjct: 1466 PTTTPSPPTTTPSPPTTTPSPPTTTTTT 1493


>gi|5852937|gb|AAD54275.1|AF169793_1 (AF169793) HET-C protein
            [Podospora anserina]
            Length = 735
            
 Score = 33.2 bits (74), Expect = 5.2
 Identities = 34/146 (23%), Positives = 54/146 (36%), Gaps = 7/146 (4%)
 Frame = +1

Query: 925  HNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDR 1104
            H+NH  A  S    R  +     G G    H +   ++  +   +QA     +Y P+   
Sbjct: 576  HHNHILASNSSSSSRSVSAPSHGGNGCDHGHGRPAGSLWEQVKKQQADA---RYSPRPGS 632

Query: 1105 SLSEGSPATFQSWYVPEGVST----GPRNHRNAGAGNGAHPNKKSP---NPNTRNTPGQQ 1263
            S          S Y   G  +     P+   + G G+GA+P ++ P           G Q
Sbjct: 633  SGGGYGQRPGSSGYGSGGGGSYGRPSPQPGYSGGGGSGAYPPQQQPQYGGGGYGGPGGYQ 692

Query: 1264 TRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQK 1362
                P+ +      +P  H  P     G PGQ+
Sbjct: 693  QPPPPHHHGQYGGGYPGQHPPPPPQGGGYPGQQ 725


>gi|4503493|ref|NP_001955.1|| early growth response 1
            >gi|119242|sp|P18146|EGR1_HUMAN EARLY GROWTH RESPONSE
            PROTEIN 1 (EGR-1) (KROX24) (ZIF268) (TRANSCRIPTION FACTOR
            ETR103) (ZINC FINGER PROTEIN 225) (AT225)
            >gi|87347|pir||A41211 early growth response protein 1 -
            human >gi|31130|emb|CAA36777| (X52541) early growth
            response protein 1 (AA 1-543) [Homo sapiens] >gi|182263
            (M62829) ETR103 [Homo sapiens]
            >gi|5420379|emb|CAB46678.1| (AJ243425) early growth
            response protein 1 [Homo sapiens]
            Length = 543
            
 Score = 33.2 bits (74), Expect = 5.2
 Identities = 24/82 (29%), Positives = 37/82 (44%), Gaps = 2/82 (2%)
 Frame = +1

Query: 1066 HKPFKQYKPKNDRS--LSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPN 1239
            H P     PK +    LS G+P    +   PEG  +   +  + G G G   +  S + +
Sbjct: 25   HSPTMDNYPKLEEMMLLSNGAPQFLGAAGAPEGSGSNSSSSSSGGGGGGGGGSNSSSSSS 84

Query: 1240 TRNTPGQQTRKSPYKYPNNAPNFP 1311
            T N P   T + PY++   A +FP
Sbjct: 85   TFN-PQADTGEQPYEH-LTAESFP 106


>gi|2833328|sp|Q27200|FBRL_TETTH FIBRILLARIN >gi|2654298|emb|CAA54924|
            (X77962) fibrillarin [Tetrahymena thermophila]
            Length = 294
            
 Score = 33.2 bits (74), Expect = 5.2
 Identities = 15/28 (53%), Positives = 17/28 (60%)
 Frame = +1

Query: 1345 GNPGQKTGAGRPNNSGGKYNRNRGPRYP 1428
            G PG K G GRP   GGK+   +GPR P
Sbjct: 37   GGPGGKFGGGRPGGPGGKFGA-KGPRGP 63


>gi|2707270 (AF036171) homeobox-containing protein [Dictyostelium
            discoideum]
            Length = 534
            
 Score = 33.2 bits (74), Expect = 5.2
 Identities = 31/174 (17%), Positives = 61/174 (34%), Gaps = 5/174 (2%)
 Frame = +1

Query: 889  HVNRNDNNKHAYHNNHSTADESRELRRFDTL-----RDDRGRGQGKHHFKDRLTVSGEAA 1053
            H N N+NN + Y+N +S ++ +R              ++      + H  D         
Sbjct: 285  HNNNNNNNSNNYNNGNSNSNNNRNNNNNYNYNNYINNNNYNNNNNRQHCDDE-------- 336

Query: 1054 AKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPN 1233
             ++  + F      N+ + +     +    Y  +  +    N+ N       + N  + N
Sbjct: 337  -EEDEQYFNNNNNNNNNNNNNRISDSSDDQYFSDDTNNNNDNYNNGNNNYNNNNNNNNFN 395

Query: 1234 PNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
             N  N        + Y   NN  ++ + +    FN   N   +      NN+  +YN N
Sbjct: 396  NNYMNNYNNNYNNNNY---NNNNSYNNSNGNNNFNNNNNNNNQN--NNNNNNNNQYNNN 449


>gi|3879844|emb|CAB04725.1| (Z81592) predicted using Genefinder; cDNA
            EST EMBL:M89005 comes from this gene [Caenorhabditis
            elegans]
            Length = 695
            
 Score = 33.2 bits (74), Expect = 5.2
 Identities = 25/91 (27%), Positives = 34/91 (36%), Gaps = 1/91 (1%)
 Frame = +1

Query: 1150 PEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNT-PGQQTRKSPYKYPNNAPNFPSDHAT 1326
            P   S+G   +RN G GN  + NK S N N  N   G       Y   N+  +F +    
Sbjct: 539  PPPRSSGANGNRNGGGGNRRNNNKNSSNSNNNNNFNGNGNGDGSYNNNNDNCDFENRCGG 598

Query: 1327 PTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPR 1422
                  GN  Q+  + +        N N G R
Sbjct: 599  QGGFENGNENQRFSSRKQPPPKPSANNNNGDR 630


 Score = 32.8 bits (73), Expect = 6.8
 Identities = 26/79 (32%), Positives = 33/79 (40%), Gaps = 4/79 (5%)
 Frame = +1

Query: 1192 GAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGA 1371
            G GNG  P    P P  +    ++T        NN  N  + +  P   P  N G   G 
Sbjct: 458  GGGNGPVPPIPEPKPLCKGLKFKKTANGGGGNNNNNNNNNNRNNGP---PPRNNGNNNGN 514

Query: 1372 GR----PNNSGGKYNRNRGPRYP 1428
            GR    P++SG   NR  GP  P
Sbjct: 515  GRPMKPPSSSGSGSNRRSGPPPP 537


>gi|418972|pir||S31035 retrovirus-related gag polyprotein - mouse
            intracisternal A-particle MIAD8 (fragment)
            Length = 717
            
 Score = 33.2 bits (74), Expect = 5.2
 Identities = 19/52 (36%), Positives = 27/52 (51%)
 Frame = +1

Query: 1123 PATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSP 1278
            PA  QS Y+P+  S+GPR+      GN      + P  + R+ PG+ TR  P
Sbjct: 402  PADSQSAYMPKNGSSGPRSQGPQRYGN---QFVEDPGSSQRDDPGRPTRVEP 450


>gi|6594249|emb|CAB63561.1| (AL031746) hypothetical garp protein
            [Plasmodium falciparum]
            Length = 673
            
 Score = 33.2 bits (74), Expect = 5.2
 Identities = 41/164 (25%), Positives = 68/164 (41%), Gaps = 11/164 (6%)
 Frame = +1

Query: 796  LRTQLKLEKDMPLALTLQPIIGQRRSAKATLHVNRNDNNKHAYHNNH--STADESRELRR 969
            L  + +LEK+       + ++ + +  K  +    NDN K+A++NN   S+ D +  +  
Sbjct: 49   LLNETELEKNKDDNSKSETLLKEEKDEKDDVPTTSNDNLKNAHNNNEISSSTDPTNIINV 108

Query: 970  FD----TLRDDRGRGQGKHHFKDRLTV-----SGEAAAKQAHKPFKQYKPKNDRSLSEGS 1122
             D       D +   + K H KD+          E   K+  K  K+ K K D+   E S
Sbjct: 109  NDKDNENSVDKKKDKKEKKHKKDKKEKKEKKDKKEKKDKKEKKHKKEKKHKKDKKKEENS 168

Query: 1123 PATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKY 1287
                 S Y      TG    +NA      +  ++  +    N  G     SPY+Y
Sbjct: 169  EV--MSLY-----KTGQHKPKNATEHGEENLYEEMVSEINNNAQGGLLLSSPYQY 216


>gi|6601502|gb|AAF19004.1|AF151366_1 (AF151366) arginine/serine-rich
            protein [Arabidopsis thaliana]
            Length = 414
            
 Score = 33.2 bits (74), Expect = 5.2
 Identities = 31/114 (27%), Positives = 46/114 (40%), Gaps = 7/114 (6%)
 Frame = +1

Query: 1078 KQYKPKNDRSLSE----GSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTR 1245
            K+  PK+D + ++    G P   +           PR  R+     G  P ++SP+   R
Sbjct: 193  KRDAPKSDNAAADAEKDGGPRRPRETSPQRKTGLSPRR-RSPLPRRGLSPRRRSPDSPHR 251

Query: 1246 NTPGQQTRKSPYKYPNNAPNFPSDHATPTFNP---YGNPGQKTGAGRPNNSGGKYNRNRG 1416
              PG   R+     P   P  PS   +P+  P   Y +P +    G P    G   R R 
Sbjct: 252  RRPGSPIRRRGDTPPRRRPASPSRGRSPSSPPPRRYRSPPR----GSPRRIRGSPVRRRS 307

Query: 1417 P 1419
            P
Sbjct: 308  P 308


>gi|476822|pir||A42893 penicillin-binding protein 1A - Streptococcus
            pneumoniae >gi|153768 (M90527) penicillin-binding protein
            [Streptococcus pneumoniae]
            Length = 719
            
 Score = 33.2 bits (74), Expect = 5.2
 Identities = 30/107 (28%), Positives = 43/107 (40%), Gaps = 2/107 (1%)
 Frame = +1

Query: 1108 LSEGSPATFQSWYVPEGVSTGPRNHRNAGA--GNGAHPNKKSPNPNTRNTPGQQTRKSPY 1281
            LSEGS    + W +PEG+      +RN      NGA     SP P    +    +  S  
Sbjct: 625  LSEGSNP--EDWNIPEGL------YRNGEFVFKNGARSTWSSPAPQQPPSTESSSSSSDS 676

Query: 1282 KYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRYP 1428
                ++   PS + + T NP  N  Q       N +  + N+N  P  P
Sbjct: 677  STSQSSSTTPSTNNSTTTNPNNNTQQS------NTTPDQQNQNPQPAQP 719


>gi|282331|pir||S28037 penicillin-binding protein 1a - Streptococcus
            pneumoniae (strain 63915) (fragment)
            >gi|47418|emb|CAA48072| (X67872) penicillin-binding
            protein 1a [Streptococcus pneumoniae]
            Length = 719
            
 Score = 33.2 bits (74), Expect = 5.2
 Identities = 30/107 (28%), Positives = 43/107 (40%), Gaps = 2/107 (1%)
 Frame = +1

Query: 1108 LSEGSPATFQSWYVPEGVSTGPRNHRNAGA--GNGAHPNKKSPNPNTRNTPGQQTRKSPY 1281
            LSEGS    + W +PEG+      +RN      NGA     SP P    +    +  S  
Sbjct: 625  LSEGSNP--EDWNIPEGL------YRNGEFVFKNGARSTWNSPAPQQPPSTESSSSSSDS 676

Query: 1282 KYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRYP 1428
                ++   PS + + T NP  N  Q       N +  + N+N  P  P
Sbjct: 677  STSQSSSTTPSTNNSTTTNPNNNTQQS------NTTPDQQNQNPQPAQP 719


>gi|585647|sp|Q04707|PBPA_STRPN PENICILLIN-BINDING PROTEIN 1A (PBP-1A)
            (EXPORTED PROTEIN 2) >gi|282329|pir||S28038
            penicillin-binding protein 1a - Streptococcus pneumoniae
            (strain 45607) (fragment) >gi|47420|emb|CAA48073|
            (X67873) penicillin-binding protein 1a [Streptococcus
            pneumoniae]
            Length = 719
            
 Score = 33.2 bits (74), Expect = 5.2
 Identities = 30/107 (28%), Positives = 43/107 (40%), Gaps = 2/107 (1%)
 Frame = +1

Query: 1108 LSEGSPATFQSWYVPEGVSTGPRNHRNAGA--GNGAHPNKKSPNPNTRNTPGQQTRKSPY 1281
            LSEGS    + W +PEG+      +RN      NGA     SP P    +    +  S  
Sbjct: 625  LSEGSNP--EDWNIPEGL------YRNGEFVFKNGARSTWSSPAPQQPPSTESSSSSSDS 676

Query: 1282 KYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRYP 1428
                ++   PS + + T NP  N  Q       N +  + N+N  P  P
Sbjct: 677  STSQSSSTTPSTNNSTTTNPNNNTQQS------NTTPDQQNQNPQPAQP 719


>gi|4140029|dbj|BAA36973.1| (AB015438) alpha 1 type I collagen [Cynops
            pyrrhogaster]
            Length = 1450
            
 Score = 33.2 bits (74), Expect = 5.2
 Identities = 26/77 (33%), Positives = 29/77 (36%), Gaps = 10/77 (12%)
 Frame = +1

Query: 1168 GPRNHRNAGAGNGAHPNKKSPNP-------NTRNTPGQQTRKSPYKYPN--NAPNFPSDH 1320
            GP+  R +    GA     +P P           T GQ   K     P    AP FP   
Sbjct: 342  GPQGSRGSEGPQGARGEPGAPGPAGAAGPSGNPGTDGQPGGKGATGSPGIAGAPGFPGAR 401

Query: 1321 ATP-TFNPYGNPGQKTGAGRPNNSGGK 1398
              P    P G PG K   G P   G K
Sbjct: 402  GAPGPQGPAGAPGPKGNNGEPGAQGNK 428


>gi|5410459|gb|AAD43067.1|AF139884_1 (AF139884) penicillin-binding
            protein 1a [Streptococcus pneumoniae]
            >gi|5410461|gb|AAD43068.1|AF139885_1 (AF139885)
            penicillin-binding protein 1a [Streptococcus pneumoniae]
            >gi|5410463|gb|AAD43069.1|AF139886_1 (AF139886)
            penicillin-binding protein 1a [Streptococcus pneumoniae]
            Length = 719
            
 Score = 33.2 bits (74), Expect = 5.2
 Identities = 30/107 (28%), Positives = 43/107 (40%), Gaps = 2/107 (1%)
 Frame = +1

Query: 1108 LSEGSPATFQSWYVPEGVSTGPRNHRNAGA--GNGAHPNKKSPNPNTRNTPGQQTRKSPY 1281
            LSEGS    + W +PEG+      +RN      NGA     SP P    +    +  S  
Sbjct: 625  LSEGSNP--EDWNIPEGL------YRNGEFVFKNGARSTWSSPAPQQPPSTESSSSSSDS 676

Query: 1282 KYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRYP 1428
                ++   PS + + T NP  N  Q       N +  + N+N  P  P
Sbjct: 677  STSQSSSTTPSTNNSTTTNPNNNTQQS------NTTPDQQNQNPQPAQP 719


>gi|6563337|gb|AAF17255.1|AF210745_1 (AF210745) penicillin-binding
            protein 1A [Streptococcus pneumoniae]
            Length = 719
            
 Score = 33.2 bits (74), Expect = 5.2
 Identities = 30/107 (28%), Positives = 43/107 (40%), Gaps = 2/107 (1%)
 Frame = +1

Query: 1108 LSEGSPATFQSWYVPEGVSTGPRNHRNAGA--GNGAHPNKKSPNPNTRNTPGQQTRKSPY 1281
            LSEGS    + W +PEG+      +RN      NGA     SP P    +    +  S  
Sbjct: 625  LSEGSNP--EDWNIPEGL------YRNGEFVFKNGARSTWNSPAPQQPPSTESSSSSSDS 676

Query: 1282 KYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRYP 1428
                ++   PS + + T NP  N  Q       N +  + N+N  P  P
Sbjct: 677  STSQSSSTTPSTNNSTTTNPNNNTQQS------NTTPDQQNQNPQPAQP 719


>gi|6563341|gb|AAF17257.1|AF210747_1 (AF210747) penicillin-binding
            protein 1A [Streptococcus pneumoniae]
            Length = 719
            
 Score = 33.2 bits (74), Expect = 5.2
 Identities = 30/107 (28%), Positives = 43/107 (40%), Gaps = 2/107 (1%)
 Frame = +1

Query: 1108 LSEGSPATFQSWYVPEGVSTGPRNHRNAGA--GNGAHPNKKSPNPNTRNTPGQQTRKSPY 1281
            LSEGS    + W +PEG+      +RN      NGA     SP P    +    +  S  
Sbjct: 625  LSEGSNP--EDWNIPEGL------YRNGEFVFKNGARSTWSSPAPQQPPSTESSSSSSDS 676

Query: 1282 KYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRYP 1428
                ++   PS + + T NP  N  Q       N +  + N+N  P  P
Sbjct: 677  STSQSSSTTPSTNNSTTTNPNNNTQQS------NTTPDQQNQNPQPAQP 719


>gi|102058|pir||C41710 promastigote surface antigen-2 (clone 2.5) -
           Leishmania major (fragment) >gi|9581|emb|CAA40413|
           (X57134) surface antigen P2 [Leishmania major]
           Length = 371
           
 Score = 33.2 bits (74), Expect = 5.2
 Identities = 18/40 (45%), Positives = 23/40 (57%), Gaps = 2/40 (5%)
 Frame = +3

Query: 696 SQPPQTYTLWFRPPT--TRTTAWPIDRTTQDPSRSTTHPTET 815
           ++PP T T   +PPT  T TT  P   TT+ P+  TT  T T
Sbjct: 255 TKPPTTTTTTTKPPTTITSTTKLPTTTTTEAPAEPTTTATPT 296


 Score = 32.5 bits (72), Expect = 9.0
 Identities = 20/52 (38%), Positives = 26/52 (49%), Gaps = 2/52 (3%)
 Frame = +3

Query: 696 SQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRK--RHAVSTHPAT 851
           ++PP T T   +PPTT TT      TT   ++  T  T T K    A +T P T
Sbjct: 206 TKPPTTTTTTTKPPTTTTTTTKPPTTTTTTTKPPTTTTTTTKPLTTATTTKPPT 259


>gi|186396 (M94131) mucin [Homo sapiens]
            Length = 1270
            
 Score = 33.2 bits (74), Expect = 5.2
 Identities = 28/79 (35%), Positives = 35/79 (43%), Gaps = 9/79 (11%)
 Frame = +3

Query: 651  PQPRSTSAMGIARLPSQPPQTYTLWFRPPTT------RTTAWPIDRTTQDPSRSTT---H 803
            P P +T+ +     PS P  T T    PPTT       TT  P+  TT  P  STT    
Sbjct: 783  PPPTTTTTLPPTTTPSPPTTTTTT--PPPTTTPSPPITTTTTPLPTTTPSPPISTTTTPP 840

Query: 804  PTETRKRHAVSTHPATDYRPTPFSQSHT 887
            PT T      +  P T     P + + T
Sbjct: 841  PTTTPSPPTTTPSPPTTTPSPPTTTTTT 868


>gi|3319463 (AF077544) unknown [Caenorhabditis elegans]
            Length = 235
            
 Score = 33.2 bits (74), Expect = 5.2
 Identities = 22/75 (29%), Positives = 26/75 (34%)
 Frame = +1

Query: 1189 AGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTG 1368
            +G  NG HPN   PN N  N              +           PT +PY N G   G
Sbjct: 132  SGYNNGPHPNGNFPNLNGYNNGPSSFNGGNTNVDDGIKGSVGAAVEPTKSPYPNNGY--G 189

Query: 1369 AGRPNNSGGKYNRNR 1413
             G  N  G  +  NR
Sbjct: 190  YGNRNGYGNNFGFNR 204


>gi|5815245|gb|AAD52614.1|AF175223_1 (AF175223) SANT domain protein
            SMRTER [Drosophila melanogaster]
            Length = 3469
            
 Score = 33.2 bits (74), Expect = 5.2
 Identities = 32/113 (28%), Positives = 52/113 (45%), Gaps = 13/113 (11%)
 Frame = +1

Query: 1060 QAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPN 1239
            Q  +  +Q + +  R++S GS A+                    G G G   +K+SP+P 
Sbjct: 2462 QGQQQQQQQQQQQQRNMSRGSSAS--------------------GGGGGGGSDKESPSP- 2500

Query: 1240 TRNTPGQQTRKSPYKY-------PNNAPNF-----PSDHATPTFNPYGN-PGQKTGAGRP 1380
             RN+ G     S + Y       P   P +     P+DH   T +P+   P Q+ G  + 
Sbjct: 2501 -RNSVGS---ASGFAYGGDKESAPRGRPEYSSRASPADHVNSTPSPHRTPPPQRQGVIQR 2556

Query: 1381 NNSGGK 1398
            +N+G K
Sbjct: 2557 HNTGSK 2562


>gi|4505285|ref|NP_002448.1|| mucin 2, intestinal/tracheal
            >gi|2506877|sp|Q02817|MUC2_HUMAN MUCIN 2 PRECURSOR
            (INTESTINAL MUCIN 2) >gi|454154 (L21998) mucin [Homo
            sapiens]
            Length = 5179
            
 Score = 33.2 bits (74), Expect = 5.2
 Identities = 28/79 (35%), Positives = 35/79 (43%), Gaps = 9/79 (11%)
 Frame = +3

Query: 651  PQPRSTSAMGIARLPSQPPQTYTLWFRPPTT------RTTAWPIDRTTQDPSRSTT---H 803
            P P +T+ +     PS P  T T    PPTT       TT  P+  TT  P  STT    
Sbjct: 1408 PPPTTTTTLPPTTTPSPPTTTTTT--PPPTTTPSPPITTTTTPLPTTTPSPPISTTTTPP 1465

Query: 804  PTETRKRHAVSTHPATDYRPTPFSQSHT 887
            PT T      +  P T     P + + T
Sbjct: 1466 PTTTPSPPTTTPSPPTTTPSPPTTTTTT 1493


>gi|82601|pir||A30843 glutenin high molecular weight chain Bx7
            precursor - wheat >gi|21749|emb|CAA32115| (X13927) HMW
            glutenin subunit (AA 1-789) [Triticum aestivum]
            >gi|170745 (M22209) high MW glutenin subunit (Bx7)
            [Triticum aestivum]
            Length = 789
            
 Score = 32.8 bits (73), Expect = 6.8
 Identities = 28/136 (20%), Positives = 48/136 (34%), Gaps = 5/136 (3%)
 Frame = +1

Query: 1000 GQGKHHFKDRLTVSGE-----AAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVS 1164
            GQG+   +++    G+        +Q + P    +P   + L +G P      Y P    
Sbjct: 127  GQGQQPGQEQQPGQGQQDQQPGQRQQGYYPTSPQQPGQGQQLGQGQPG-----YYPTSQQ 181

Query: 1165 TGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPY 1344
             G +     G  +G       P    ++  GQQ  +    Y   +P        P     
Sbjct: 182  PGQKQQAGQGQQSGQGQQGYYPTSPQQSGQGQQPGQGQPGYYPTSPQQSGQWQQPGQGQQ 241

Query: 1345 GNPGQKTGAGRPNNSGGKYNR 1407
               GQ++G G+     G+  R
Sbjct: 242  PGQGQQSGQGQQGQQPGQGQR 262


>gi|2388676 (AF015539) precollagen P [Mytilus edulis]
            Length = 902
            
 Score = 32.8 bits (73), Expect = 6.8
 Identities = 25/75 (33%), Positives = 32/75 (42%), Gaps = 5/75 (6%)
 Frame = +1

Query: 1204 GAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNA--PNFPSDHATPTF-NPYGNPGQKTGAG 1374
            G+ P  +  NP     PG   R      P  +  P  P    TP      G PGQ  G G
Sbjct: 256  GSTPPGRLGNPGPPGQPGNPGRPGSSGRPGGSGQPGGPGRPGTPGKPGNRGQPGQPGGPG 315

Query: 1375 RPNN--SGGKYNRNRGPRYP 1428
            +P +  +GG+  RN  P  P
Sbjct: 316  QPGHPGAGGQPGRNGNPGNP 335


>gi|1085433|pir||S55316 mucin (clone PGM-2B) - pig >gi|915207
           (U10281) gastric mucin [Sus scrofa]
           Length = 317
           
 Score = 32.8 bits (73), Expect = 6.8
 Identities = 44/201 (21%), Positives = 84/201 (40%), Gaps = 1/201 (0%)
 Frame = +2

Query: 134 NNASPTGSSKSMETLHSSACPLKVATKSNWTAAASSPAPSLNRRAY*STTNQKAK*PHAK 313
           ++++PT S+ S++   S + P   AT    +++ S+P  S       S+++     P + 
Sbjct: 71  SSSAPTTSATSVQPSSSGSAPTTSATSVQSSSSGSAPTTSATSVQPSSSSSP----PISS 126

Query: 314 TQRVAPRCSKHSP-SSKVHVGSPSAAWISTPLAYYCSPQTANLPMQ*CTPHRK*NANMSY 490
           T  V P  S  +P +S   V S S+    T  A    P +++ P    T   + +++ S 
Sbjct: 127 TISVQPSSSSSAPTTSATSVQSSSSGSAPTTSATSVQPSSSSSPPISSTISVQPSSSSS- 185

Query: 491 VYAPPKEKNMCRMSYSSN*RAASCWKTAPQNSTRLNASATPTHTTGFVSLSKKAATAKYV 670
             AP       + S SS+    S     P +S    ++ T + T+   S S     +  +
Sbjct: 186 --APTTSATSVQSSSSSSAPTTSATSVQPSSS---GSAPTTSATSVQSSSSSSPPISSTI 240

Query: 671 GYGNRKAAKSAASNVHVMVPSS 736
                 ++ S  ++   + PSS
Sbjct: 241 SVQTSSSSSSPTTSTTSVQPSS 262


>gi|2314597|gb|AAD08465.1| (AE000642) conserved hypothetical protein
           [Helicobacter pylori 26695]
           Length = 84
           
 Score = 32.8 bits (73), Expect = 6.8
 Identities = 19/47 (40%), Positives = 25/47 (52%), Gaps = 1/47 (2%)
 Frame = +1

Query: 82  RLHKVLAQAGLGSRRALEQRISN-GLIKVNGDIAQLGMSVKSGDKIEL 222
           R+ K L   GL  RR L   + N G + +NG  A+    VK+GD I L
Sbjct: 2   RIDKFLQSVGLVKRRVLATDMCNVGAVWLNGSCAKASKEVKAGDTISL 49


>gi|330361 (M10593) major outer envelope glycoprotein gp220
            [Epstein-Barr virus]
            Length = 658
            
 Score = 32.8 bits (73), Expect = 6.8
 Identities = 19/67 (28%), Positives = 28/67 (41%)
 Frame = +1

Query: 1225 SPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYN 1404
            SP P    +       SP  + N   + P  +AT   +P    GQKT      ++GGK N
Sbjct: 475  SPTPAGTTSGASPVTPSPSPWDNGTESTPPQNAT---SPQAPSGQKTAVPTVTSTGGKAN 531

Query: 1405 RNRGPRY 1425
               G ++
Sbjct: 532  STTGGKH 538


>gi|1841851 (U86876) chitinase-like protein [Bombyx mori]
           Length = 565
           
 Score = 32.8 bits (73), Expect = 6.8
 Identities = 23/54 (42%), Positives = 27/54 (49%), Gaps = 9/54 (16%)
 Frame = +3

Query: 684 ARLPSQP---------PQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVS 836
           AR PS P         P T T   +P TTRTTA P   TT+ P  +T    +   R  V 
Sbjct: 416 ARPPSTPSDPSEGDPIPTTTTTTVKPTTTRTTARPTTTTTKVPHGTTEEDFDINVRPEVE 475

Query: 837 THP 845
             P
Sbjct: 476 ELP 478


>gi|2854193 (AF045645) Similar to cuticular collagen; coded for by C.
            elegans cDNA yk69e12.3; coded for by C. elegans cDNA
            yk69e12.5; coded for by C. elegans cDNA yk307b3.5; coded
            for by C. elegans cDNA yk307b3.3 [Caenorhabditis elegans]
            Length = 314
            
 Score = 32.8 bits (73), Expect = 6.8
 Identities = 31/82 (37%), Positives = 36/82 (43%), Gaps = 9/82 (10%)
 Frame = +1

Query: 1150 PEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSD--HA 1323
            P G S  P ++ NAGA     P       +   TPG      P   P  AP  P    HA
Sbjct: 211  PPGPSGQPGSNGNAGA-----PGAPGHVVDVPGTPGPAGPPGPAG-PAGAPGQPGQAGHA 264

Query: 1324 TPTF-NPYGN------PGQKTGAGRPNNSGG 1395
             P    P G+      PGQ   AG+P N GG
Sbjct: 265  QPGQPGPQGDAGAPGAPGQPGSAGQPGNDGG 295


>gi|3875708|emb|CAA84658.1| (Z35598) Asparagine, Serine and Glycine
            rich predicted protein [Caenorhabditis elegans]
            >gi|3880108|emb|CAA86461.1| (Z46343) Asparagine, Serine
            and Glycine rich predicted protein [Caenorhabditis
            elegans]
            Length = 549
            
 Score = 32.8 bits (73), Expect = 6.8
 Identities = 24/83 (28%), Positives = 32/83 (37%)
 Frame = +1

Query: 1168 GPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYG 1347
            G  N+RN G G+ +  N  + N N     G         Y NN   + S++     N  G
Sbjct: 424  GSNNNRNDGWGSSSSNNNNNNNNNNNGGTGG--------YSNNGGGWGSNN-----NNNG 470

Query: 1348 NPGQKTGAGRPNNSGGKYNRNRG 1416
            N G    +    N GG  N N G
Sbjct: 471  NDGNNWESNNGGNGGGGDNWNNG 493


>gi|2119159|pir||I50694 alpha-1 collagen type III - chicken (fragment)
            >gi|537432 (U07973) alpha-1 collagen type III [Gallus
            gallus]
            Length = 886
            
 Score = 32.8 bits (73), Expect = 6.8
 Identities = 24/83 (28%), Positives = 27/83 (31%), Gaps = 1/83 (1%)
 Frame = +1

Query: 1150 PEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATP 1329
            P G S  P      G    A P      P    +PG +    P   P   P  P     P
Sbjct: 357  PAGASGNPGERGEPGPQGQAGPPGPQGPPGRAGSPGGKGEMGPSGIPGG-PGPPGGRGLP 415

Query: 1330 -TFNPYGNPGQKTGAGRPNNSGGK 1398
                  GNPG K   G P  +G K
Sbjct: 416  GPPGTSGNPGAKGTPGEPGKNGAK 439


>gi|543541|pir||JC2099 glutenin, high molecular weight chain Bx17 -
            wheat
            Length = 753
            
 Score = 32.8 bits (73), Expect = 6.8
 Identities = 28/136 (20%), Positives = 48/136 (34%), Gaps = 5/136 (3%)
 Frame = +1

Query: 1000 GQGKHHFKDRLTVSGE-----AAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVS 1164
            GQG+   +++    G+        +Q + P    +P   + L +G P      Y P    
Sbjct: 127  GQGQQPGQEQQPGQGQQDQQPGQRQQGYYPTSPQQPGQGQQLGQGQPG-----YYPTSQQ 181

Query: 1165 TGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPY 1344
             G +     G  +G       P    ++  GQQ  +    Y   +P        P     
Sbjct: 182  PGQKQQAGQGQQSGQGQQGYYPTSPQQSGQGQQPGQGQPGYYPTSPQQSGQWQQPGQGQQ 241

Query: 1345 GNPGQKTGAGRPNNSGGKYNR 1407
               GQ++G G+     G+  R
Sbjct: 242  PGQGQQSGQGQQGQQPGQGQR 262


>gi|1118137 (U41746) coded for by C. elegans cDNA yk68a8.5
           [Caenorhabditis elegans]
           Length = 586
           
 Score = 32.5 bits (72), Expect = 9.0
 Identities = 22/68 (32%), Positives = 32/68 (46%), Gaps = 3/68 (4%)
 Frame = +3

Query: 645 RRPQPRSTSAMGIARLP---SQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTET 815
           R   P +T+ +   R P   S PP+T     + P T+    P   TT+ P+ +T  PT  
Sbjct: 344 RATTPLATTPLATTRAPLPPSPPPRTS----KRPVTQAPTTPRATTTRRPTTTTPRPTPR 399

Query: 816 RKRHAVSTHPA 848
           R R   +T  A
Sbjct: 400 RTRRPKTTTAA 410


>gi|3876265|emb|CAB02976.1| (Z81067) predicted using Genefinder; cDNA
            EST yk488h9.3 comes from this gene [Caenorhabditis
            elegans]
            Length = 1307
            
 Score = 32.5 bits (72), Expect = 9.0
 Identities = 38/153 (24%), Positives = 55/153 (35%)
 Frame = +1

Query: 907  NNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQY 1086
            +  H   NN+  A       R    RD R    G    +   TVS E   + +H P    
Sbjct: 70   HGNHQLQNNYGGASSRGAQSRGSPPRDPRRHANGSSSHRRDKTVSDELQHENSHTP---- 125

Query: 1087 KPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQT 1266
                     E S +TF S + P   S+  R+ R +G+       +KSP+      P QQ 
Sbjct: 126  -------RQEESQSTFGSSFRPSQYSSILRDPRLSGSCPPG--QEKSPSNGHNLLPHQQ- 175

Query: 1267 RKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
                 K+  + P   +   + T N    P Q T
Sbjct: 176  -----KFGGSIPVSSTLSDSHTSNGGSTPNQDT 203


>gi|3873739|emb|CAA86059.1| (Z37983) weak similarity with putative
           zinc finger transcription factors.  Possesses the
           prosite motif for a C3HC4 type Zinc finger (Prosite
           accession number PS00518) [Caenorhabditis elegans]
           Length = 417
           
 Score = 32.5 bits (72), Expect = 9.0
 Identities = 37/151 (24%), Positives = 56/151 (36%)
 Frame = +2

Query: 302 PHAKTQRVAPRCSKHSPSSKVHVGSPSAAWISTPLAYYCSPQTANLPMQ*CTPHRK*NAN 481
           PH+    VAPR    S SS     + S   +STP     S +TA L       H    ++
Sbjct: 271 PHSLPTNVAPRIPPSSRSSFTQHSNDSGVVLSTP-PTSSSAKTAGL----SPTHNFSRSS 325

Query: 482 MSYVYAPPKEKNMCRMSYSSN*RAASCWKTAPQNSTRLNASATPTHTTGFVSLSKKAATA 661
            S     P  K     ++    RA    +     +  +N  A PT+ T  V ++ K    
Sbjct: 326 TSLRIPTPTTKIQKIQNFFETTRAPRISRIRMGVTDLVNTYAPPTYATSPVHINTKQFEC 385

Query: 662 KYVGYGNRKAAKSAASNVHVMVPSSYHANYC 754
             VG          +++ H    ++ H NYC
Sbjct: 386 CSVG------EFGVSTDTHKSPTTAIHLNYC 410


>gi|1870123|emb|CAB06788| (Z86109) unknown [Saccharomyces pastorianus]
            Length = 193
            
 Score = 32.5 bits (72), Expect = 9.0
 Identities = 22/67 (32%), Positives = 29/67 (42%)
 Frame = +1

Query: 1225 SPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYN 1404
            +P+P  + TPG+   K+P K P  AP      A P   P   PG+  G   P  + GK  
Sbjct: 98   TPSPQGKKTPGKAPGKAPGKAPGKAPGKAPGKA-PGKAPGKAPGKAPGKA-PGKAPGKAP 155

Query: 1405 RNRGPRY 1425
               G  Y
Sbjct: 156  GKAGRSY 162


>gi|46691|emb|CAA43604| (X61307) protein A [Staphylococcus aureus]
            >gi|384170|prf||1905280A protein A [Staphylococcus
            aureus]
            Length = 454
            
 Score = 32.5 bits (72), Expect = 9.0
 Identities = 21/69 (30%), Positives = 28/69 (40%), Gaps = 3/69 (4%)
 Frame = +1

Query: 1168 GPRNHRNAGAGNGAHPNK---KSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFN 1338
            G  ++   G  +G  P K   K P     N PG++  K P K   N P    D   P   
Sbjct: 289  GKEDNNKPGKEDGNKPGKEDNKKPGKEDGNKPGKEDNKKPGKEDGNKPG-KEDGNKPGKE 347

Query: 1339 PYGNPGQKTGAG 1374
                PG++ G G
Sbjct: 348  DGNKPGKEDGNG 359


>gi|1019435 (U32447) mucin-like protein [Trypanosoma cruzi]
           Length = 197
           
 Score = 32.5 bits (72), Expect = 9.0
 Identities = 21/64 (32%), Positives = 30/64 (46%), Gaps = 2/64 (3%)
 Frame = +3

Query: 696 SQPPQTYTLWFRPPTTRTTAW--PIDRTTQDPSRSTTHPTETRKRHAVSTHPATDYRPTP 869
           ++PP T T   +PPTT TT    P   TT   ++  T  T T  +   +T   T   PT 
Sbjct: 76  TKPPTTTTTTTKPPTTTTTTTTKPPTTTTTTTTKPPTTTTTTTTKPPTTTTTTTTKPPTT 135

Query: 870 FSQSHT 887
            + + T
Sbjct: 136 TTTTTT 141


>gi|6324130|ref|NP_014200.1|GCR2| Transcription factor; Gcr2p
            >gi|417039|sp|Q01722|GCR2_YEAST GLYCOLYTIC GENES
            TRANSCRIPTIONAL ACTIVATOR GCR2 >gi|320841|pir||S31300
            regulatory protein GCR2 - yeast (Saccharomyces
            cerevisiae) >gi|218427|dbj|BAA00985| (D10104) GCR2
            protein [Saccharomyces cerevisiae]
            >gi|600066|emb|CAA55509| (X78898) Gcr2; acc.#:D10104
            [Saccharomyces cerevisiae] >gi|1302197|emb|CAA96097|
            (Z71475) ORF YNL199c [Saccharomyces cerevisiae]
            Length = 534
            
 Score = 32.5 bits (72), Expect = 9.0
 Identities = 19/65 (29%), Positives = 30/65 (45%)
 Frame = +1

Query: 1111 SEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYP 1290
            ++GSP+  Q      G++ G  N  N   GNG++      N   +N P  +T+K   +  
Sbjct: 244  TKGSPSDLQ------GINNGNNNGNNGNIGNGSNIK----NYGNKNMPNNRTKKRGTRVA 293

Query: 1291 NNAPN 1305
             NA N
Sbjct: 294  KNAKN 298


>gi|1136289 (U42597) histidine kinase A [Dictyostelium discoideum]
            Length = 2150
            
 Score = 32.5 bits (72), Expect = 9.0
 Identities = 29/145 (20%), Positives = 54/145 (37%), Gaps = 1/145 (0%)
 Frame = +1

Query: 895  NRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKP 1074
            + N NN +  +NN++  D   + +R  T   + G  Q +   +  +          A+  
Sbjct: 229  SNNSNNNNNGNNNNNITDSPTKSKRHSTYETNIGSHQRRKSIQSLI----------ANSA 278

Query: 1075 FKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGN-GAHPNKKSPNPNTRNT 1251
               +    ++ LS  +P+T  +       S    N+ N   G+ GA P  +S + N    
Sbjct: 279  IHSFSKLKNKPLSSSTPSTVNTCGAVNNNSNNNNNNNNNSTGSLGAIPMDRSFDGNINTI 338

Query: 1252 PGQQTRKSPYKYPNNAPNFPSDHATP 1329
              + T  +     N   N  S+   P
Sbjct: 339  TEESTGGNNSPRSNCGSNCGSNGGIP 364


>gi|2193933|emb|CAB09584| (Z96800) hypothetical protein Rv0312
           [Mycobacterium tuberculosis]
           Length = 620
           
 Score = 32.5 bits (72), Expect = 9.0
 Identities = 20/57 (35%), Positives = 25/57 (43%)
 Frame = +3

Query: 702 PPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVSTHPATDYRPTPF 872
           PP T T    P TT TTA P   TT +P  +TT    T      +    T++   PF
Sbjct: 536 PPVTTTPRPSPTTTTTTAPPSTTTTTEPPVTTTSTIPTIPTTTTTVKMTTEWLHVPF 592


>gi|2493778|sp|Q09456|YQ35_CAEEL PUTATIVE CUTICLE COLLAGEN C09G5.5
            >gi|3874102|emb|CAA86758.1| (Z46791) similar to collagen
            [Caenorhabditis elegans]
            Length = 317
            
 Score = 32.5 bits (72), Expect = 9.0
 Identities = 26/81 (32%), Positives = 34/81 (41%), Gaps = 1/81 (1%)
 Frame = +1

Query: 1150 PEGVSTGPRNHRNAGA-GNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHAT 1326
            P G +    +   AGA GN   P +      +R  PG   R  P + P  AP  P   +T
Sbjct: 171  PPGPAGDAGSPGQAGAPGNPGRPGQSGQR--SRGLPGPSGRPGP-QGPPGAPGQPGSGST 227

Query: 1327 PTFNPYGNPGQKTGAGRPNNSG 1392
            P   P G PG     G+P + G
Sbjct: 228  P--GPAGPPGPPGPNGQPGHPG 247


>gi|182424 (J00127) alpha-fibrinogen precursor [Homo sapiens]
            Length = 644
            
 Score = 32.5 bits (72), Expect = 9.0
 Identities = 31/109 (28%), Positives = 47/109 (42%), Gaps = 5/109 (4%)
 Frame = +1

Query: 1093 KNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNT--PGQQT 1266
            +N  S   G  AT++      G S G  N  ++G G+  + N  SP P +  T  PG   
Sbjct: 308  RNPGSSGTGGTATWKPGSSGPG-SAGSWNSGSSGTGSTGNQNPGSPRPGSTGTWNPGSSE 366

Query: 1267 RKSPYKYPNNAPNFPSDHATPTFNPYGNPGQ---KTGAGRPNNSGGKYNRNRGP 1419
            R S            + H T   +  G+ GQ   ++G+ RP++ G    R   P
Sbjct: 367  RGS------------AGHWTSESSVSGSTGQWHSESGSFRPDSPGSGNARPNNP 408


>gi|1707117 (U80453) C23H3.9 [Caenorhabditis elegans]
           Length = 339
           
 Score = 32.5 bits (72), Expect = 9.0
 Identities = 23/73 (31%), Positives = 28/73 (37%)
 Frame = +3

Query: 651 PQPRSTSAMGIARLPSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHA 830
           P     SA    +  S   +T T     PTT TTA     TT  P+ +TT PT T     
Sbjct: 179 PTYNRISAEKALQKTSTTQETTTSTTAQPTTTTTATTTTTTTPLPTTTTTQPTTTTTEPT 238

Query: 831 VSTHPATDYRPTP 869
            +T        TP
Sbjct: 239 TTTTTTEPTTTTP 251


>gi|5901828|gb|AAD55422.1|AF181636_1 (AF181636) BcDNA.GH07269
            [Drosophila melanogaster]
            Length = 682
            
 Score = 32.5 bits (72), Expect = 9.0
 Identities = 35/128 (27%), Positives = 57/128 (44%), Gaps = 14/128 (10%)
 Frame = +1

Query: 1012 HHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNA 1191
            HH   RL+  G + ++    P  +       S +  +PA+   W VP      P  +  +
Sbjct: 70   HHDNVRLSYGGGSHSQ----PVSKVSSSQTHSAAPSAPASPIGWNVPAKPQGPPPAYSAS 125

Query: 1192 GAGNGAHPN--KKSP--NPNTRNTP-----GQQTR-----KSPYKYPNNAPNFPSDHATP 1329
                GAH N  ++ P  NP  +  P       QT      +SPY+ P  A +  +  ++ 
Sbjct: 126  NPVGGAHTNIHERPPAYNPAYKPAPPSYSAATQTHSNTNLQSPYR-PAGAASPGASSSSS 184

Query: 1330 TFNPYGNPGQKTGAGRPNNSGG 1395
              + YG        GR N++GG
Sbjct: 185  GSHYYGGAHNTAYRGRNNSTGG 206


>gi|4033468|sp|P92965|RS40_ARATH ARGININE/SERINE-RICH SPLICING FACTOR
            RSP40 >gi|2582641|emb|CAA67800| (X99437) splicing factor
            [Arabidopsis thaliana] >gi|2980800|emb|CAA18176.1|
            (AL022197) splicing factor At-SRp40 [Arabidopsis
            thaliana]
            Length = 350
            
 Score = 32.5 bits (72), Expect = 9.0
 Identities = 28/145 (19%), Positives = 58/145 (39%), Gaps = 2/145 (1%)
 Frame = +1

Query: 979  LRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPE- 1155
            ++DD  RG G  H  +R         +++  P+K+ +   D        A ++       
Sbjct: 167  VKDDDARGNG--HSPERRRDRSPERRRRSPSPYKRERGSPDYGRGASPVAAYRKERTSPD 224

Query: 1156 -GVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPT 1332
             G    P  ++ +  G+  +   +  N + R    ++   SP KY + +PN   +  +P 
Sbjct: 225  YGRRRSPSPYKKSRRGSPEYGRDRRGNDSPRR---RERVASPTKY-SRSPNNKRERMSPN 280

Query: 1333 FNPYGNPGQKTGAGRPNNSGGKYNRNR 1413
             +P+     + G G   +   +  R+R
Sbjct: 281  HSPFKKESPRNGVGEVESPIERRERSR 307


>gi|2981221 (AF053091) eyelid [Drosophila melanogaster]
            Length = 2715
            
 Score = 32.5 bits (72), Expect = 9.0
 Identities = 28/89 (31%), Positives = 40/89 (44%), Gaps = 13/89 (14%)
 Frame = +1

Query: 1150 PEGVSTGPRNHRNAGAGNG---AHPNKKSPNP--NTRNTPGQQTRKSPYKYPNNAPNFPS 1314
            P G   GP   + AG        +P ++ P P    +  P QQ ++ PY+     P    
Sbjct: 1537 PPGAPHGPPIQQPAGVAQWDQHRYPPQQGPPPPPQQQQQPQQQQQQPPYQQVAGPPGQQP 1596

Query: 1315 DHATPTFNPYGNPGQKTGAG--------RPNNSGGKYNRNRG 1416
              A P +    NPGQ   +G        RP +  G+ NR  G
Sbjct: 1597 PQAPPQWAQM-NPGQTAQSGIAPPGSPLRPPSGPGQQNRMPG 1637


 Score = 32.5 bits (72), Expect = 9.0
 Identities = 32/103 (31%), Positives = 42/103 (40%), Gaps = 3/103 (2%)
 Frame = +1

Query: 1117 GSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNN 1296
            G P      + P  V   P+ H    AG   +P   S  P    TP  +T  SP  YP+ 
Sbjct: 1311 GGPPPAPQQHGPGQVPPSPQQHVRPAAG-APYPPGGSGYP----TPVSRTPGSP--YPSQ 1363

Query: 1297 APNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKY---NRNRGPRY 1425
               +    ++  +N  G PGQ  G G      G+Y   NRN  P Y
Sbjct: 1364 PGAYGQYGSSDQYNATGPPGQPFGQG-----PGQYPPQNRNMYPPY 1404


>gi|5824601|emb|CAA82571.2| (Z29443) similar to Annexin; cDNA EST
            EMBL:C10640 comes from this gene; cDNA EST EMBL:C12433
            comes from this gene; cDNA EST yk192f7.5 comes from this
            gene; cDNA EST yk318c1.5 comes from this gene; cDNA EST
            yk494a12.3 comes fr...
            Length = 497
            
 Score = 32.5 bits (72), Expect = 9.0
 Identities = 25/75 (33%), Positives = 38/75 (50%), Gaps = 19/75 (25%)
 Frame = +1

Query: 1204 GAHPNKKSPNPNTRNTPGQQTR-KSPYKYPNNAPNFPSDHATP--------TFNPYGNPG 1356
            G   N K   P+++ +  + +   S   YPN  P++      P        +++PYG P 
Sbjct: 21   GLGGNNKQQQPSSQQSSQEPSNMNSGGGYPNQQPSYGGYGQPPQQPGYGNGSYDPYGQPQ 80

Query: 1357 QKT---GAGRP-------NNSGGKYNRNRGPRYP 1428
            Q+    G G+P       N  GG Y    G  YP
Sbjct: 81   QQPYPGGGGQPPYPGSNSNQGGGGYPGQGGAPYP 114


>gi|6273747|gb|AAF06358.1|AF102865_1 (AF102865) calymmin [Danio rerio]
            Length = 1207
            
 Score = 32.5 bits (72), Expect = 9.0
 Identities = 36/124 (29%), Positives = 49/124 (39%), Gaps = 20/124 (16%)
 Frame = +1

Query: 1051 AAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNA----GAGNGAHPN 1218
            +A Q  KP       N  + ++ +   FQ+   P G + GP+    A     AG+ A PN
Sbjct: 288  SAGQVAKPNGYGGYPNAGATNQPNGGPFQNMGYPNGGTKGPKPGYGAKAGPSAGHVAKPN 347

Query: 1219 KKSPNPNTRNTPGQQTRKSPYK-YPNNAPNFPSDH------------ATPTFN---PYGN 1350
                 PN   T       S +  YPN     P               A P  N   P G 
Sbjct: 348  GNGGYPNGGATSQHNGGSSQFMGYPNGGTKGPKSGYGANAGPSAGQVAKPNGNGRYPIGG 407

Query: 1351 PGQKTGAGRPNNSGGKYNRNRGPR 1422
               +   G   N G    R +GP+
Sbjct: 408  VANQPNRGSSQNMGYPNGRTKGPK 431


>gi|1523876|emb|CAA99996| (Z75666) BRCA2 [Chlorocebus aethiops]
            Length = 1548
            
 Score = 32.5 bits (72), Expect = 9.0
 Identities = 39/134 (29%), Positives = 62/134 (46%), Gaps = 10/134 (7%)
 Frame = +1

Query: 739  PRELLRGQSTELPKTQVEALRTQLKLEKDMPLALTLQPIIGQRRSAK---ATLHVN---- 897
            P  +L+  + E+P+ QV  L T  +  +D  L +   P IGQ  S+K    T+ +     
Sbjct: 463  PSYILQKNTFEVPENQVTILNTTTEENRDAGLVIMNAPSIGQVNSSKQFEGTVGIKQKFA 522

Query: 898  ---RNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAH 1068
               ++D NK A  + + T +   E R F             H  K  L VS EA  K A 
Sbjct: 523  GLLKSDCNKSA--SGYLTDENEVEFRGF----------YSAHGVK--LNVSTEALQK-AV 567

Query: 1069 KPFKQYKPKNDRSLSEGSPATFQS 1140
            K F   +  ++++ +E  P +  S
Sbjct: 568  KLFSDIENISEKTSAEVDPISLSS 591


>gi|1448962 (L79944) OFR1 of non-LTR retrotransposon; putative
            [Chironomus tentans]
            Length = 165
            
 Score = 32.5 bits (72), Expect = 9.0
 Identities = 19/81 (23%), Positives = 28/81 (34%)
 Frame = +1

Query: 1033 TVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAH 1212
            T SG A   Q ++     KP+N        P+        +G     +N+R  G G G  
Sbjct: 13   TTSGRAIHSQQNRATVNQKPQNTNPPPNNKPSN-------QGNENNQQNNRGKGRGKGKR 65

Query: 1213 PNKKSPNPNTRNTPGQQTRKS 1275
              +    P  +N   Q    S
Sbjct: 66   RRQNKSKPRNKNNKNQNKNSS 86


>gi|188864 (M74027) mucin [Homo sapiens]
           Length = 573
           
 Score = 32.5 bits (72), Expect = 9.0
 Identities = 26/73 (35%), Positives = 32/73 (43%), Gaps = 14/73 (19%)
 Frame = +3

Query: 651 PQPRSTSAMGIARLPSQPPQT-------------YTLWFRPPTTRTTAWPIDRTTQDPSR 791
           P P ST+ +     PS P  T              T    PP T T + PI  TT  P  
Sbjct: 66  PPPTSTTTLPPTTTPSPPTTTTTTPPPTTTPSPPITTTTTPPPTTTPSPPISTTTTPPPT 125

Query: 792 ST-THPTETRKRHAVSTHPATDYRPTP 869
           +T + PT T      +  P T    TP
Sbjct: 126 TTPSPPTTTPSPPTTTPSPPTTTTTTP 152


>gi|1256180|dbj|BAA12287| (D84250) chitinase [Penaeus japonicus]
           Length = 572
           
 Score = 32.5 bits (72), Expect = 9.0
 Identities = 18/36 (50%), Positives = 22/36 (61%), Gaps = 2/36 (5%)
 Frame = +3

Query: 693 PSQPPQTYTLWFRPPTTRTTAW--PIDRTTQDPSRSTT 800
           P+ PP T T  + PPTT TT     I  TT+DP+  TT
Sbjct: 423 PTLPPTTTTPHWTPPTTTTTTRDPSITTTTRDPNLPTT 460


>gi|4996369|dbj|BAA78427.1| (AB021267) polyprotein [Arabidopsis
            thaliana]
            Length = 1421
            
 Score = 32.5 bits (72), Expect = 9.0
 Identities = 26/81 (32%), Positives = 39/81 (48%), Gaps = 9/81 (11%)
 Frame = +1

Query: 1090 PKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAG----NGAHPNKKSPNPNTRNTPG 1257
            P +  S    S  T  S+  P+  +T P   +N+ +     N  +PN  SPN   +N+P 
Sbjct: 817  PSSSISSPSSSEPTAPSYNGPQP-TTQPHQTQNSNSNSPILNNPNPNSPSPNSPNQNSPL 875

Query: 1258 QQTRKSPYKYPN-----NAPNFPSDHATPT 1332
             Q+  S    P      + PN PS  +T T
Sbjct: 876  PQSPISSPHIPTPSTSISEPNSPSSSSTST 905


>gi|223918|prf||1004351A fibrinogen alphaA [Homo sapiens]
            Length = 462
            
 Score = 32.5 bits (72), Expect = 9.0
 Identities = 31/109 (28%), Positives = 47/109 (42%), Gaps = 5/109 (4%)
 Frame = +1

Query: 1093 KNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNT--PGQQT 1266
            +N  S   G  AT++      G S G  N  ++G G+  + N  SP P +  T  PG   
Sbjct: 237  RNPGSSGTGGTATWKPGSSGPG-SXGSWNSGSSGTGSTGNQNPGSPRPGSTGTWNPGSSE 295

Query: 1267 RKSPYKYPNNAPNFPSDHATPTFNPYGNPGQ---KTGAGRPNNSGGKYNRNRGP 1419
            R S            + H T   +  G+ GQ   ++G+ RP++ G    R   P
Sbjct: 296  RGS------------AGHWTSESSVSGSTGQWHSESGSFRPDSPGSGNARPNNP 337


>gi|2952545 (AF051898) coronin binding protein [Dictyostelium
            discoideum]
            Length = 560
            
 Score = 32.5 bits (72), Expect = 9.0
 Identities = 28/170 (16%), Positives = 63/170 (36%), Gaps = 13/170 (7%)
 Frame = +1

Query: 901  NDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFK 1080
            N+NN +   N++S  + +    R ++  +         +  +   ++  + A +++ P  
Sbjct: 316  NNNNSNNNSNSNSNNNNNGINNRNNSNNNSNNNSNNNSNNSNNRNITNGSNANKSNSPNN 375

Query: 1081 QYKPKNDR-------------SLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNK 1221
                 ND              + + G+     +  +    ++   ++ N+   +  + N+
Sbjct: 376  NLNTNNDNKNNNSNNNNNSNNNSNNGNSNNNNNNNIINNNNSNSNSNNNSNNNSNNNSNR 435

Query: 1222 KSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKY 1401
             SPN N        T  +     NN  N  +++     N   N      A   NN+    
Sbjct: 436  NSPNHNNNGDNDNNTNNNTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNYADNSNNNSSNS 495

Query: 1402 NRN 1410
            N N
Sbjct: 496  NNN 498


  Database: nr
    Posted date:  Feb 13, 2000  1:18 AM
  Number of letters in database: 140,124,617
  Number of sequences in database:  455,460
  
Lambda     K      H
   0.313    0.132    0.387 

Gapped
Lambda     K      H
   0.270   0.0470    0.230 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 349565694
Number of Sequences: 455460
Number of extensions: 7757399
Number of successful extensions: 26863
Number of sequences better than 10.0: 384
Number of HSP's better than 10.0 without gapping: 35
Number of HSP's successfully gapped in prelim test: 159
Number of HSP's that attempted gapping in prelim test: 26197
Number of HSP's gapped (non-prelim): 605
length of query: 477
length of database: 140,124,617
effective HSP length: 58
effective length of query: 418
effective length of database: 113,707,937
effective search space: 47529917666
effective search space used: 47529917666
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 42 (21.9 bits)
S2: 72 (32.5 bits)