BLASTX 2.0.10 [Aug-26-1999]
Query= XF-02A11-GL38
(1431 letters)
Database: nr
455,460 sequences; 140,124,617 total letters
Graphical Overview:
Score E
Sequences producing significant alignments: (bits) Value
gi|1175678|sp|P37765|YCIL_ECOLI HYPOTHETICAL 32.7 KD PROTEIN IN... 239 4e-62
gi|602963 (U18111) ORF4 [Escherichia coli] 238 1e-61
gi|1175679|sp|P45104|YCIL_HAEIN HYPOTHETICAL PROTEIN HI1199 >gi... 234 2e-60
gi|1175677|sp|P42395|YCIL_BUCAP HYPOTHETICAL 30.2 KD PROTEIN IN... 159 7e-38
gi|466190|sp|P35159|RLUB_BACSU RIBOSOMAL LARGE SUBUNIT PSEUDOUR... 129 6e-29
gi|6458615|gb|AAF10472.1|AE001942_4 (AE001942) ribosomal large ... 123 3e-27
gi|4980760|gb|AAD35352.1|AE001708_20 (AE001708) 16S pseudouridy... 120 3e-26
gi|3915347|sp|O51155|Y129_BORBU HYPOTHETICAL PROTEIN BB0129 >gi... 118 1e-25
gi|6647946|sp|Q9ZD06|Y544_RICPR HYPOTHETICAL PROTEIN RP544 >gi|... 116 4e-25
gi|5738484|emb|CAB52832.1| (AL109848) putative pseudouridine sy... 115 1e-24
gi|3915377|sp|Q55578|Y361_SYNY3 HYPOTHETICAL 28.2 KD PROTEIN SL... 113 3e-24
gi|3915445|sp|O67444|YE64_AQUAE HYPOTHETICAL PROTEIN AQ_1464 >g... 110 2e-23
gi|3915547|sp|O33210|YH11_MYCTU HYPOTHETICAL 27.6 KD PROTEIN RV... 108 2e-22
gi|3915542|sp|O05668|YH11_MYCLE HYPOTHETICAL 28.1 KD PROTEIN CB... 106 4e-22
gi|3915394|sp|O66829|Y554_AQUAE HYPOTHETICAL PROTEIN AQ_554 >gi... 105 1e-21
gi|4155973 (AE001558) putative [Helicobacter pylori J99] 103 3e-21
gi|2501526|sp|P55986|YE59_HELPY HYPOTHETICAL PROTEIN HP1459 >gi... 102 5e-21
gi|418534|sp|P32684|YJBC_ECOLI HYPOTHETICAL 32.5 KD PROTEIN IN ... 96 9e-19
gi|3329180|gb|AAC68318.1| (AE001343) predicted pseudouridine sy... 83 7e-15
gi|465610|sp|P33918|RSUA_ECOLI RIBOSOMAL SMALL SUBUNIT PSEUDOUR... 83 7e-15
gi|2145990|pir||S72955 u0247g protein - Mycobacterium leprae >g... 82 9e-15
gi|4377181|gb|AAD19002| (AE001667) predicted pseudouridine synt... 82 1e-14
gi|3322747 (AE001223) conserved hypothetical protein [Treponema... 78 2e-13
gi|1175857|sp|P45124|RSUA_HAEIN RIBOSOMAL SMALL SUBUNIT PSEUDOU... 74 3e-12
gi|3845303 (AE001423) pseudouridine synthetase (RsuA fam.); 1st... 67 3e-10
gi|1175289|sp|P44827|YMFC_HAEIN HYPOTHETICAL PROTEIN HI0694 >gi... 67 4e-10
gi|1787380 (AE000213) orf, hypothetical protein [Escherichia coli] 58 2e-07
gi|3916025|sp|P75966|YMFC_ECOLI HYPOTHETICAL 24.9 KD PROTEIN IN... 58 2e-07
gi|3402689|gb|AAC28992.1| (AC004697) unknown protein [Arabidops... 54 2e-06
gi|3915559|sp|O32068|YTZF_BACSU HYPOTHETICAL 17.7 KD PROTEIN IN... 54 3e-06
gi|3402691|gb|AAC28994.1| (AC004697) unknown protein [Arabidops... 52 2e-05
gi|6689331|emb|CAB65436.1| (AJ250721) hypothetical protein [Syn... 49 8e-05
gi|3915395|sp|P72581|Y612_SYNY3 HYPOTHETICAL 21.0 KD PROTEIN SL... 48 1e-04
gi|1651652|dbj|BAA16580| (D90899) hypothetical protein [Synecho... 42 0.010
gi|1175470|sp|Q09801|YAAA_SCHPO HYPOTHETICAL 37.3 KD PROTEIN C2... 42 0.014
gi|294136|gb|AAA29551.1| (M83173) circumsporozoite protein [Pla... 41 0.018
gi|117587|sp|P08307|CSP_PLAFW CIRCUMSPOROZOITE PROTEIN PRECURSO... 41 0.018
gi|48921|emb|CAA42462| (X59794) traC-4 [Escherichia coli] >gi|1... 41 0.024
gi|48920|emb|CAA42461| (X59794) traC-3 [Escherichia coli] >gi|1... 41 0.024
gi|294138|gb|AAA29552.1| (M83174) circumsporozoite protein [Pla... 41 0.024
gi|136173|sp|P27190|TRC5_ECOLI DNA PRIMASE TRAC (REPLICATION PR... 41 0.024
gi|117591|sp|P26694|CSP_PLARE CIRCUMSPOROZOITE PROTEIN PRECURSO... 41 0.031
gi|117584|sp|P05691|CSP_PLAFL CIRCUMSPOROZOITE PROTEIN (CS) >gi... 41 0.031
gi|294119|gb|AAA29542.1| (M83164) circumsporozoite protein [Pla... 40 0.040
gi|294125|gb|AAA29545.1| (M83167) circumsporozoite protein [Pla... 40 0.040
gi|552191 (M57499) circumsporozoite protein [Plasmodium falcipa... 40 0.040
gi|294129|gb|AAA29547.1| (M83169) circumsporozoite protein [Pla... 40 0.040
gi|117586|sp|P13814|CSP_PLAFT CIRCUMSPOROZOITE PROTEIN PRECURSO... 40 0.040
gi|84198|pir||S05428 circumsporozoite protein - Plasmodium falc... 40 0.040
gi|294121|gb|AAA29543.1| (M83165) circumsporozoite protein [Pla... 40 0.040
gi|294123|gb|AAA29544.1| (M83166) circumsporozoite protein [Pla... 40 0.040
gi|294134|gb|AAA29550.1| (M83172) circumsporozoite protein [Pla... 40 0.040
gi|117583|sp|P02893|CSP_PLAFA CIRCUMSPOROZOITE PROTEIN PRECURSO... 40 0.040
gi|294151|gb|AAA29569.1| (M83156) circumsporozoite protein [Pla... 40 0.040
gi|4493889|emb|CAB38998.1| (AL034558) predicted using hexExon; ... 40 0.040
gi|6226848|sp|P19597|CSP_PLAFO CIRCUMSPOROZOITE PROTEIN PRECURS... 40 0.040
gi|552190 (M57498) circumsporozoite protein [Plasmodium falcipa... 40 0.040
gi|224051|prf||1008275A protein,circumsporozoite [Plasmodium sp.] 39 0.069
gi|294158|gb|AAA29574.1| (M83161) circumsporozoite protein [Pla... 39 0.12
gi|2462758 (AC002292) putative RNA-binding protein [Arabidopsis... 39 0.12
gi|283464|pir||A60540 sporozoite surface protein 2 - Plasmodium... 39 0.12
gi|548996|sp|Q01443|SSP2_PLAYO SPOROZOITE SURFACE PROTEIN 2 PRE... 39 0.12
gi|1582641|prf||2119210A mucin [Homo sapiens] 38 0.16
gi|2135764|pir||I53641 mucin - human (fragment) >gi|945219 (L46... 38 0.16
gi|6580291|emb|CAB63354.1| (AL032627) cDNA EST EMBL:D65921 come... 38 0.20
gi|677949 (U20969) Plasmodium falciparum circumsporozoite prote... 38 0.20
gi|93140|pir||A40505 early protein EP0 - suid herpesvirus 1 (st... 38 0.27
gi|124138|sp|P29129|ICP0_PRVIF TRANS-ACTING TRANSCRIPTIONAL PRO... 38 0.27
gi|3875920|emb|CAB04111.1| (Z81503) predicted using Genefinder;... 38 0.27
gi|6681425|dbj|BAA88672.1| (AB030233) CiMsi [Ciona intestinalis] 38 0.27
gi|2498734|sp|Q60865|P137_MOUSE GPI-ANCHORED PROTEIN P137 >gi|1... 37 0.35
gi|6322611|ref|NP_012685.1|YJR151C| Yjr151cp >gi|1352944|sp|P47... 37 0.46
gi|4758666|ref|NP_004681.1|| LATS (large tumor suppressor, Dros... 36 0.60
gi|3172134 (U90209) RNA polymerase II largest subunit [Bonnemai... 36 0.60
gi|2689219|emb|CAA67983.1| (X99669) dynamin like protein [Dicty... 36 0.60
gi|3064231|gb|AAC14254.1| (AF036460) mucin-like protein [Trypan... 36 0.60
gi|112945|sp|P14196|AAC2_DICDI AAC-RICH MRNA CLONE AAC11 PROTEI... 36 0.60
gi|969095 (U31961) no-on transient A-like protein [Drosophila m... 36 0.60
gi|1082604|pir||S53363 mucin 5AC (clone JER58) - human (fragmen... 36 0.60
gi|4324420|gb|AAD16881| (AF104350) prespore-specific protein [D... 36 0.60
gi|2497729|sp|Q49537|VLPE_MYCHR VARIANT SURFACE ANTIGEN E PRECU... 36 0.60
gi|2114108|dbj|BAA20059| (AB003911) OX40 precursor [Oryctolagus... 36 0.79
gi|82698|pir||JQ0985 hydroxyproline-rich glycoprotein precursor... 36 0.79
gi|4220540|emb|CAA23013| (AL035356) hypothetical protein [Arabi... 36 0.79
gi|5114426|gb|AAD40313.1|AF157503_1 (AF157503) chitinase 1 [Pen... 36 0.79
gi|228937|prf||1814452B Hyp-rich glycoprotein [Zea mays] 36 0.79
gi|106291|pir||S16681 homeotic protein - human 36 1.0
gi|1170758|sp|P38486|LEG3_CANFA GALECTIN-3 (GALACTOSE-SPECIFIC ... 36 1.0
gi|1724014|sp|P54604|YHCT_BACSU HYPOTHETICAL 33.7 KD PROTEIN IN... 36 1.0
gi|3242649|dbj|BAA29028.1| (AB015440) alpha 1 type I collagen [... 35 1.4
gi|4808164|emb|CAB42795.1| (Y18878) largest subunit of the RNA ... 35 1.4
gi|4808166|emb|CAB42797.1| (Y18879) largest subunit of the RNA ... 35 1.4
gi|1589837 (U68729) cuticle preprocollagen [Meloidogyne incognita] 35 1.4
gi|563375|emb|CAA84031| (Z34277) mucin [Homo sapiens] 35 1.4
gi|1519696 (U67956) coded for by C. elegans cDNA yk126f9.5; cod... 35 1.4
gi|1184072 (U40766) COL-1 [Meloidogyne incognita] 35 1.4
gi|2245693|gb|AAB62577.1| (AF010328) short form of CHIP [Drosop... 35 1.8
gi|4324436|gb|AAD16883| (AF104414) large tumor suppressor 1 [Mu... 35 1.8
gi|3877422|emb|CAB05195.1| (Z82268) predicted using Genefinder;... 35 1.8
gi|1280114|gb|AAC25888.1| (U55373) Similar to cuticular collage... 35 1.8
gi|2245687|gb|AAB62574.1| (AF010325) CHIP [Drosophila melanogas... 35 1.8
gi|119712|sp|P14918|EXTN_MAIZE EXTENSIN PRECURSOR (PROLINE-RICH... 34 2.3
gi|227614|prf||1707318A Thr rich extensin [Zea mays] 34 2.3
gi|3810839|emb|CAA21800| (AL032684) conserved hypothetical zinc... 34 2.3
gi|228938|prf||1814452C Hyp-rich glycoprotein [Zea diploperennis] 34 2.3
gi|1082938|pir||A49688 lactose-binding lectin L-29 - dog 34 2.3
gi|3851592 (AF093575) surface antigen ariel1 [Entamoeba histoly... 34 2.3
gi|71823|pir||FGHUA fibrinogen alpha chain precursor, short spl... 34 2.3
gi|437331 (L23429) beta-galactosides-binding lectin [Canis fami... 34 2.3
gi|120943|sp|P13816|GARP_PLAFF GLUTAMIC ACID-RICH PROTEIN PRECU... 34 2.3
gi|283032|pir||S22456 hydroxyproline-rich glycoprotein - perenn... 34 2.3
gi|3834294 (U80846) No definition line found [Caenorhabditis el... 34 2.3
gi|168457 (M36913) cell wall protein (put.); putative [Zea mays] 34 2.3
gi|4503689|ref|NP_000499.1|| fibrinogen, A alpha polypeptide >g... 34 2.3
gi|3834293 (U80846) No definition line found [Caenorhabditis el... 34 2.3
gi|3924913|emb|CAB03474.1| (Z81138) predicted using Genefinder;... 34 3.0
gi|3924914|emb|CAB03475.1| (Z81138) predicted using Genefinder;... 34 3.0
gi|6580246|emb|CAB63321.1| (Z81138) cDNA EST EMBL:D65528 comes ... 34 3.0
gi|1280073 (U55366) Similar to cuticle collagen [Caenorhabditis... 34 3.0
gi|5901942|ref|NP_008971.1|| E1B-55kDa-associated protein 5 >gi... 34 3.0
gi|2501678|sp|Q45480|YLYB_BACSU HYPOTHETICAL 33.7 KD PROTEIN IN... 34 3.0
gi|2633919|emb|CAB13420| (Z99112) alternate gene name: ylmL; si... 34 3.0
gi|2135766|pir||S53362 mucin 5AC (clone JER47) - human (fragment) 34 3.0
gi|1139597 (U43400) H1 gene product [Human herpesvirus 7] >gi|1... 34 3.0
gi|628112|pir||S32988 hypothetical protein - human herpesvirus 4 34 4.0
gi|628185|pir||S42447 nuclear protein EBNA2 - Epstein-Barr viru... 34 4.0
gi|5833486|gb|AAD53528.1| (AF162149) variable surface lipoprote... 34 4.0
gi|283045|pir||S28264 hydroxyproline-rich glycoprotein - maize ... 34 4.0
gi|3880306|emb|CAA90995.1| (Z54238) similar to cuticle collagen... 34 4.0
gi|2833307|sp|Q21184|YXWK_CAEEL PUTATIVE CUTICLE COLLAGEN F55C1... 34 4.0
gi|3877661|emb|CAA98486.1| (Z74036) predicted using Genefinder;... 34 4.0
gi|543543|pir||JN0690 glutenin, high-molecular-weight Bx7 chain... 34 4.0
gi|102059|pir||D41710 promastigote surface antigen-2 (clone 4.6... 34 4.0
gi|119111|sp|P12978|EBN2_EBV EBNA-2 NUCLEAR PROTEIN >gi|1083964... 34 4.0
gi|100864|pir||S08315 cell wall protein - maize (fragment) >gi|... 33 5.2
gi|2135765|pir||A43932 mucin 2 precursor, intestinal - human (f... 33 5.2
gi|5852937|gb|AAD54275.1|AF169793_1 (AF169793) HET-C protein [P... 33 5.2
gi|4503493|ref|NP_001955.1|| early growth response 1 >gi|119242... 33 5.2
gi|2833328|sp|Q27200|FBRL_TETTH FIBRILLARIN >gi|2654298|emb|CAA... 33 5.2
gi|2707270 (AF036171) homeobox-containing protein [Dictyosteliu... 33 5.2
gi|3879844|emb|CAB04725.1| (Z81592) predicted using Genefinder;... 33 5.2
gi|418972|pir||S31035 retrovirus-related gag polyprotein - mous... 33 5.2
gi|6594249|emb|CAB63561.1| (AL031746) hypothetical garp protein... 33 5.2
gi|6601502|gb|AAF19004.1|AF151366_1 (AF151366) arginine/serine-... 33 5.2
gi|476822|pir||A42893 penicillin-binding protein 1A - Streptoco... 33 5.2
gi|282331|pir||S28037 penicillin-binding protein 1a - Streptoco... 33 5.2
gi|585647|sp|Q04707|PBPA_STRPN PENICILLIN-BINDING PROTEIN 1A (P... 33 5.2
gi|4140029|dbj|BAA36973.1| (AB015438) alpha 1 type I collagen [... 33 5.2
gi|5410459|gb|AAD43067.1|AF139884_1 (AF139884) penicillin-bindi... 33 5.2
gi|6563337|gb|AAF17255.1|AF210745_1 (AF210745) penicillin-bindi... 33 5.2
gi|6563341|gb|AAF17257.1|AF210747_1 (AF210747) penicillin-bindi... 33 5.2
gi|102058|pir||C41710 promastigote surface antigen-2 (clone 2.5... 33 5.2
gi|186396 (M94131) mucin [Homo sapiens] 33 5.2
gi|3319463 (AF077544) unknown [Caenorhabditis elegans] 33 5.2
gi|5815245|gb|AAD52614.1|AF175223_1 (AF175223) SANT domain prot... 33 5.2
gi|4505285|ref|NP_002448.1|| mucin 2, intestinal/tracheal >gi|2... 33 5.2
gi|82601|pir||A30843 glutenin high molecular weight chain Bx7 p... 33 6.8
gi|2388676 (AF015539) precollagen P [Mytilus edulis] 33 6.8
gi|1085433|pir||S55316 mucin (clone PGM-2B) - pig >gi|915207 (U... 33 6.8
gi|2314597|gb|AAD08465.1| (AE000642) conserved hypothetical pro... 33 6.8
gi|330361 (M10593) major outer envelope glycoprotein gp220 [Eps... 33 6.8
gi|1841851 (U86876) chitinase-like protein [Bombyx mori] 33 6.8
gi|2854193 (AF045645) Similar to cuticular collagen; coded for ... 33 6.8
gi|3875708|emb|CAA84658.1| (Z35598) Asparagine, Serine and Glyc... 33 6.8
gi|2119159|pir||I50694 alpha-1 collagen type III - chicken (fra... 33 6.8
gi|543541|pir||JC2099 glutenin, high molecular weight chain Bx1... 33 6.8
gi|1118137 (U41746) coded for by C. elegans cDNA yk68a8.5 [Caen... 32 9.0
gi|3876265|emb|CAB02976.1| (Z81067) predicted using Genefinder;... 32 9.0
gi|3873739|emb|CAA86059.1| (Z37983) weak similarity with putati... 32 9.0
gi|1870123|emb|CAB06788| (Z86109) unknown [Saccharomyces pastor... 32 9.0
gi|46691|emb|CAA43604| (X61307) protein A [Staphylococcus aureu... 32 9.0
gi|1019435 (U32447) mucin-like protein [Trypanosoma cruzi] 32 9.0
gi|6324130|ref|NP_014200.1|GCR2| Transcription factor; Gcr2p >g... 32 9.0
gi|1136289 (U42597) histidine kinase A [Dictyostelium discoideum] 32 9.0
gi|2193933|emb|CAB09584| (Z96800) hypothetical protein Rv0312 [... 32 9.0
gi|2493778|sp|Q09456|YQ35_CAEEL PUTATIVE CUTICLE COLLAGEN C09G5... 32 9.0
gi|182424 (J00127) alpha-fibrinogen precursor [Homo sapiens] 32 9.0
gi|1707117 (U80453) C23H3.9 [Caenorhabditis elegans] 32 9.0
gi|5901828|gb|AAD55422.1|AF181636_1 (AF181636) BcDNA.GH07269 [D... 32 9.0
gi|4033468|sp|P92965|RS40_ARATH ARGININE/SERINE-RICH SPLICING F... 32 9.0
gi|2981221 (AF053091) eyelid [Drosophila melanogaster] 32 9.0
gi|5824601|emb|CAA82571.2| (Z29443) similar to Annexin; cDNA ES... 32 9.0
gi|6273747|gb|AAF06358.1|AF102865_1 (AF102865) calymmin [Danio ... 32 9.0
gi|1523876|emb|CAA99996| (Z75666) BRCA2 [Chlorocebus aethiops] 32 9.0
gi|1448962 (L79944) OFR1 of non-LTR retrotransposon; putative [... 32 9.0
gi|188864 (M74027) mucin [Homo sapiens] 32 9.0
gi|1256180|dbj|BAA12287| (D84250) chitinase [Penaeus japonicus] 32 9.0
gi|4996369|dbj|BAA78427.1| (AB021267) polyprotein [Arabidopsis ... 32 9.0
gi|223918|prf||1004351A fibrinogen alphaA [Homo sapiens] 32 9.0
gi|2952545 (AF051898) coronin binding protein [Dictyostelium di... 32 9.0
>gi|1175678|sp|P37765|YCIL_ECOLI HYPOTHETICAL 32.7 KD PROTEIN IN
TRPL-BTUR INTERGENIC REGION (ORF4)
>gi|1742064|dbj|BAA14806| (D90764) ORF_ID:o253#9;
similar to [SwissProt Accession Number P37765]
[Escherichia coli] >gi|1742080|dbj|BAA14821| (D90765)
ORF_ID:o253#9; similar to [SwissProt Accession Number
P37765] [Escherichia coli] >gi|1787524 (AE000225) orf,
hypothetical protein [Escherichia coli]
Length = 291
Score = 239 bits (604), Expect = 4e-62
Identities = 136/270 (50%), Positives = 168/270 (61%), Gaps = 3/270 (1%)
Frame = +1
Query: 73 LEERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLG--MSVKSGDKIELDGRSF-VA 243
+ E+L KVLA+AG GSRR +E I G + V+G IA+LG + V G KI +DG V
Sbjct: 1 MSEKLQKVLARAGHGSRREIESIIEAGRVSVDGKIAKLGDRVEVTPGLKIRIDGHLISVR 60
Query: 244 SALTEPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXX 423
+ + RVL Y KPEGE+ TR DPEGRPTVF+ LP L+GARWIA+GRLD+N
Sbjct: 61 ESAEQICRVLAYYKPEGELCTRNDPEGRPTVFDRLPKLRGARWIAVGRLDVNTCGLLLFT 120
Query: 424 XDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERI 603
DGELAN +MHPS E+EREY VRV V D L L+RGV LEDG A F TI+
Sbjct: 121 TDGELANRLMHPSREVEREYAVRVFG-----QVDDAKLRDLSRGVQLEDGPAAFKTIKFS 175
Query: 604 GNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKT 783
G + W+ V + EGRNREVRRLWE+ G QVSRL R RYG + LP+ L RG TEL
Sbjct: 176 GGEGINQWYNVTLTEGRNREVRRLWEAVGVQVSRLIRVRYGDIPLPKGLPRGGWTELDLA 235
Query: 784 QVEALRTQLKLEKDMPLALTLQPIIGQRRSAKA 882
Q LR ++L + + ++ RR KA
Sbjct: 236 QTNYLRELVELPPETSSKVAVEK---DRRRMKA 265
>gi|602963 (U18111) ORF4 [Escherichia coli]
Length = 243
Score = 238 bits (600), Expect = 1e-61
Identities = 131/243 (53%), Positives = 157/243 (63%), Gaps = 3/243 (1%)
Frame = +1
Query: 73 LEERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLG--MSVKSGDKIELDGRSF-VA 243
+ E+L KVLA+AG GSRR +E I G + V+G IA+LG + V G KI +DG V
Sbjct: 1 MSEKLQKVLARAGHGSRREIESIIEAGRVSVDGKIAKLGDRVEVTPGLKIRIDGHLISVR 60
Query: 244 SALTEPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXX 423
+ + RVL Y KPEGE+ TR DPEGRPTVF+ LP L+GARWIA+GRLD+N
Sbjct: 61 ESAEQICRVLAYYKPEGELCTRNDPEGRPTVFDRLPKLRGARWIAVGRLDVNTCGLLLFT 120
Query: 424 XDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERI 603
DGELAN +MHPS E+EREY VRV V D L L+RGV LEDG A F TI+
Sbjct: 121 TDGELANRLMHPSREVEREYAVRVFG-----QVDDAKLRDLSRGVQLEDGPAAFKTIKFS 175
Query: 604 GNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKT 783
G + W+ V + EGRNREVRRLWE+ G QVSRL R RYG + LP+ L RG TEL
Sbjct: 176 GGEGINQWYNVTLTEGRNREVRRLWEAVGVQVSRLIRVRYGDIPLPKGLPRGGWTELDLA 235
Query: 784 QVEALR 801
Q LR
Sbjct: 236 QTNYLR 241
>gi|1175679|sp|P45104|YCIL_HAEIN HYPOTHETICAL PROTEIN HI1199
>gi|1074680|pir||A64169 hypothetical protein HI1199 -
Haemophilus influenzae (strain Rd KW20) >gi|1574128
(U32799) conserved hypothetical protein [Haemophilus
influenzae Rd]
Length = 357
Score = 234 bits (590), Expect = 2e-60
Identities = 137/290 (47%), Positives = 174/290 (59%), Gaps = 4/290 (1%)
Frame = +1
Query: 55 ATEAPKLE-ERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLG--MSVKSGDKIELD 225
A+ PK E E+L KVLA+AG GSRR +E I+ G + V G IA LG + V SG K+ +D
Sbjct: 67 ASNQPKAEGEKLQKVLARAGQGSRREIETMIAAGRVSVEGKIATLGDRIDVHSGVKVRID 126
Query: 226 GRSF-VASALTEPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINX 402
G+ ++ E RVL+Y KPEGE+ TR DPEGR TVF+ LP L G+RWIA+GRLDIN
Sbjct: 127 GQIINLSHTQKEICRVLMYYKPEGELCTRSDPEGRATVFDRLPRLTGSRWIAVGRLDINT 186
Query: 403 XXXXXXXXDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAK 582
DGELAN +MHPS E+EREY VRV V D +L +L +GV LEDG A
Sbjct: 187 SGLLLFTTDGELANRLMHPSREVEREYSVRVFG-----QVDDAMLARLRKGVQLEDGLAN 241
Query: 583 FDTIERIGNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQ 762
F I+ G + W+ V + EGRNREVRRLWESQG QVSRL R RYG++ L + L RG
Sbjct: 242 FKEIKFTGGVGINQWYDVTLMEGRNREVRRLWESQGIQVSRLIRIRYGNIKLMKGLPRGG 301
Query: 763 STELPKTQVEALRTQLKLEKDMPLALTLQPIIGQRRSAKATLHVNRNDNNKHAY 924
E+ V LR + L + L ++ + +S + V R Y
Sbjct: 302 WEEMDLENVNYLRELVGLPAETETKLDVKQASRRPKSGQIRKAVKRYSEMNKRY 355
>gi|1175677|sp|P42395|YCIL_BUCAP HYPOTHETICAL 30.2 KD PROTEIN IN
TRPA 3'REGION >gi|480102|pir||S36431 hypothetical
protein - Buchnera aphidicola >gi|396661|emb|CAA79503|
(Z19055) unknown open reading frame [Buchnera
aphidicola]
Length = 258
Score = 159 bits (397), Expect = 7e-38
Identities = 92/249 (36%), Positives = 144/249 (56%), Gaps = 5/249 (2%)
Frame = +1
Query: 82 RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLG--MSVKSGDKIELDGRSFVASALT 255
++ K+L+ G GSRR +E I G I +NG+ A +G ++ K+ +I +D + +
Sbjct: 4 KIQKILSDLGYGSRRFIECMIKCGKISINGEKAIIGQYLNKKNPGEILIDKKKIIVKRNK 63
Query: 256 EPARVLIYN-KPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDG 432
+VLIYN KP GEV TR+D + R TVF+ LP L RW+++GRLDIN DG
Sbjct: 64 NLPKVLIYNNKPIGEVCTRDDFQKRLTVFDKLPKLNLNRWVSVGRLDINTKGLLLFTNDG 123
Query: 433 ELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERI--G 606
LAN +MHP S+IEREY +R+ + + L +GV + G F I +
Sbjct: 124 TLANKLMHPRSQIEREYNIRIFG-----EMNKNKINILRKGVKIIHGYVSFKEIVPLYDK 178
Query: 607 NTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQ 786
+ WF+ ++ EG+NRE+R +++S CQV++L R RYG+++LP+ L GQ L
Sbjct: 179 KEGKNKWFKGILCEGKNREIRLMFKSIQCQVNQLIRVRYGNIILPKNLKEGQWMMLNSIF 238
Query: 787 VEALRTQLKLEKDM 828
++ L + +K++
Sbjct: 239 LKKLYNLINFDKEI 252
>gi|466190|sp|P35159|RLUB_BACSU RIBOSOMAL LARGE SUBUNIT
PSEUDOURIDINE SYNTHASE B (PSEUDOURIDYLATE SYNTHASE)
(URACIL HYDROLYASE) >gi|629120|pir||S45555 hypothetical
protein X13 - Bacillus subtilis >gi|410137 (L09228)
ORFX13 [Bacillus subtilis] >gi|2634751|emb|CAB14248|
(Z99116) similar to hypothetical proteins [Bacillus
subtilis]
Length = 229
Score = 129 bits (321), Expect = 6e-29
Identities = 90/236 (38%), Positives = 133/236 (56%), Gaps = 9/236 (3%)
Frame = +1
Query: 79 ERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIA-QLGMSVKSGDKIELDGRSFVASALT 255
ERL KV+A AG+ SR E+ I G +KVNG + +LG+ V D+IE++G
Sbjct: 2 ERLQKVIAHAGVASRSKAEELIKEGKVKVNGKVVTELGVKVTGSDQIEVNGLKVERE--- 58
Query: 256 EPARVLIYNKPEGEVTTREDPEGRPTV---FETLPVLKGARWIAIGRLDINXXXXXXXXX 426
EP L+Y KP G ++ +D +GR V F+ +P R IGRLD +
Sbjct: 59 EPVYFLLY-KPRGVISAAQDDKGRKVVTDFFKNIP----QRIYPIGRLDYDTSGLLLLTN 113
Query: 427 DGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDG-----TAKFDT 591
DGE AN +MHP EI++ YV +V+ + ELL +L RG+ LE+G AK +
Sbjct: 114 DGEFANKLMHPKYEIDKTYVAKVKGIPPK-----ELLRKLERGIRLEEGKTAPAKAKLLS 168
Query: 592 IERIGNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTE 771
+++ T ++ + EGRNR+VRR++E+ G +V +LKR Y + L R L G + E
Sbjct: 169 LDKKKQTSI---IQLTIHEGRNRQVRRMFEAIGHEVIKLKREEYAFLNL-RGLHTGDARE 224
Query: 772 LPKTQ 786
L T+
Sbjct: 225 LRLTK 229
>gi|6458615|gb|AAF10472.1|AE001942_4 (AE001942) ribosomal large
subunit pseudouridine synthase B [Deinococcus
radiodurans]
Length = 257
Score = 123 bits (306), Expect = 3e-27
Identities = 89/240 (37%), Positives = 119/240 (49%)
Frame = +1
Query: 79 ERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSVKSGDKIELDGRSFVASALTE 258
ERLHK LA+AG+ SRRA E+ I G + VNG A LG V D + +DGR V E
Sbjct: 4 ERLHKRLARAGIASRRAAEELIRAGRVTVNGQTAGLGQGVNDTDDVRVDGR-LVELTRPE 62
Query: 259 PARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGEL 438
+Y KP G VTT D GR V + +P + G +GRLD + DG+L
Sbjct: 63 TVTYALY-KPVGFVTTAHDEYGRRNVLDAMPDVPGLH--PVGRLDKDSEGLLLLTNDGDL 119
Query: 439 ANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTDS 618
+ HP E+ Y EG E L+ L RG+ ++DG A + + +
Sbjct: 120 TLTLTHPRYGHEKAYRAWT---EGREPPTQAELDVLVRGIAMDDGPA-----QALSAAPA 171
Query: 619 HDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQVEAL 798
D VV+ EGRNR+VRR+ E+ G V RL R R G + L +L G+ EL +E L
Sbjct: 172 EDGAYVVLGEGRNRQVRRMLEALGHPVGRLVRYRVGGLWL-GDLNPGEYRELGPRDLEQL 230
>gi|4980760|gb|AAD35352.1|AE001708_20 (AE001708) 16S pseudouridylate
synthase [Thermotoga maritima]
Length = 239
Score = 120 bits (298), Expect = 3e-26
Identities = 80/250 (32%), Positives = 136/250 (54%), Gaps = 2/250 (0%)
Frame = +1
Query: 82 RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIA-QLGMSVKSGDKIELDGRSFVASALTE 258
RL + L+ +G+G+R+ +++ I G + VNG + G V D + LDG +
Sbjct: 2 RLDRYLSNSGVGTRKEVKKLIKQGRVTVNGRVVLDPGHPVLENDAVALDGE---VVRFHK 58
Query: 259 PARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGEL 438
+L Y KP G VT+ +DP T+ E LP LKG +GRLD + DG+
Sbjct: 59 KVYILFY-KPSGYVTSTKDPHSE-TIMEFLPPLKGI--FPVGRLDKDAEGLLIITNDGDF 114
Query: 439 ANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGT-AKFDTIERIGNTD 615
A+ ++ P +E+EY+V+V EGE V ++ +E+L GV L DG AK +E++ N
Sbjct: 115 AHRVISPKWSVEKEYIVKV---EGE--VTEDKIEKLKNGVTLRDGFFAKAKRVEKLSN-- 167
Query: 616 SHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQVEA 795
D ++V+ EG+ +++R+ + G + LKRTR G ++LP ++ G+ L + +V+
Sbjct: 168 --DTLKIVITEGKYHQIKRMTAAVGLKTVHLKRTRIGGLVLPDDMKPGEYRFLSEEEVKK 225
Query: 796 LRTQLKLEKDMP 831
+ + ++D P
Sbjct: 226 VFEREDQKEDTP 237
>gi|3915347|sp|O51155|Y129_BORBU HYPOTHETICAL PROTEIN BB0129
>gi|2688006 (AE001124) conserved hypothetical protein
[Borrelia burgdorferi]
Length = 249
Score = 118 bits (292), Expect = 1e-25
Identities = 74/248 (29%), Positives = 129/248 (51%), Gaps = 1/248 (0%)
Frame = +1
Query: 82 RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSVKSGDKIELDGRSFVASALTEP 261
R+H LA+ G+GSRR E+ I L++VN IA+LG V GD+I + FV
Sbjct: 8 RVHVFLAEKGVGSRRFCEELIRKKLVRVNNTIAKLGDKVTLGDRIIYKKQIFVFKDFQIN 67
Query: 262 ARV-LIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGEL 438
R+ L NKP + + D +GR + L R +IGRLD DG+
Sbjct: 68 NRIYLALNKPRNYLCSNFDVDGRKLAISLVQPLFKERVFSIGRLDFKSSGLLLFTNDGKF 127
Query: 439 ANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTDS 618
AN ++HP ++EREY++ E ++ + + LL G+ ++ K + E + +
Sbjct: 128 ANDIIHPRQKVEREYII-----ESKKDIDENLLISFKSGIKVKKEFFKLKSYEILNKNSA 182
Query: 619 HDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQVEAL 798
R+++ EG+NRE+R+++ S+ + ++ R R G++ L L GQ +P +++ +L
Sbjct: 183 ----RLILDEGKNREIRKVFLSKNIFLKKIHRIRIGNINLD-SLKEGQVKIVPLSKINSL 237
Query: 799 RTQLKLEKD 825
+++L+ D
Sbjct: 238 KSRLEKLND 246
>gi|6647946|sp|Q9ZD06|Y544_RICPR HYPOTHETICAL PROTEIN RP544
>gi|3861093|emb|CAA14993| (AJ235272) unknown [Rickettsia
prowazekii]
Length = 235
Score = 116 bits (288), Expect = 4e-25
Identities = 78/217 (35%), Positives = 123/217 (55%), Gaps = 1/217 (0%)
Frame = +1
Query: 82 RLHKVLAQAGLGSRRALEQRISNGLIKVNG-DIAQLGMSVKSGDKIELDGRSFVASALTE 258
RL K+++ AG+ SRR E+ I G +K++G I +V ++IE+ GR T+
Sbjct: 3 RLAKIISNAGVCSRRNAEKLIVGGKVKIDGITILSPATNVDMSNQIEVSGRLINN---TQ 59
Query: 259 PARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGEL 438
R+ IY KP G +TT +DP R TVFE L L R I+IGRLD+N G+L
Sbjct: 60 KPRLWIYYKPVGLITTHKDPLSRKTVFEQLIGLP--RVISIGRLDLNSEGLLLLTNSGDL 117
Query: 439 ANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTDS 618
A+ P+S+++R Y VR G ++ LL+ + + ++ +I+ + S
Sbjct: 118 AHQFEMPASKLKRVYNVRAY---GNPNI---LLKNNYKNLKIDGIFYNPHSIKLLRQNKS 171
Query: 619 HDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSV 732
+ WF VV+ EG+NRE+RR++E G QV++L R +YG++
Sbjct: 172 NSWFEVVLFEGKNREIRRIFEYFGLQVNKLIRIQYGAL 209
>gi|5738484|emb|CAB52832.1| (AL109848) putative pseudouridine
synthase [Streptomyces coelicolor A3(2)]
Length = 371
Score = 115 bits (284), Expect = 1e-24
Identities = 80/246 (32%), Positives = 123/246 (49%), Gaps = 2/246 (0%)
Frame = +1
Query: 79 ERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIA-QLGMSVK-SGDKIELDGRSFVASAL 252
ERL KVLA+AG GSRRA E+ I +++NG+I + G V D++++DG +
Sbjct: 135 ERLQKVLARAGYGSRRACEELIEQARVEINGEIVLEQGRRVDPEKDEVKVDG----LTVA 190
Query: 253 TEPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDG 432
T+ + NKP G V+T EDPEGR + + + + R +GRLD G
Sbjct: 191 TQSYQFFSLNKPAGVVSTMEDPEGRQCLGDYV-TNRETRLFHVGRLDTETEGVILLTNHG 249
Query: 433 ELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNT 612
ELA+ + HP +++ Y+ + P + +L ++L G+ LEDG A+ D + T
Sbjct: 250 ELAHRLTHPRYGVKKTYLAHIVGP-----IPRDLGKRLKDGIQLEDGYARADHFRVVEQT 304
Query: 613 DSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQVE 792
+ V + EGR VRR+ G V L RT +G + L + G L T+V
Sbjct: 305 GKNYLVEVTLHEGRKHIVRRMLAEAGFPVDNLVRTAFGPITL-GDQKSGWLRRLSNTEVG 363
Query: 793 ALRTQLKL 816
L ++ L
Sbjct: 364 MLMQEVDL 371
>gi|3915377|sp|Q55578|Y361_SYNY3 HYPOTHETICAL 28.2 KD PROTEIN
SLR0361 >gi|1001457|dbj|BAA10082| (D63999) hypothetical
protein [Synechocystis sp.]
Length = 249
Score = 113 bits (281), Expect = 3e-24
Identities = 83/248 (33%), Positives = 122/248 (48%), Gaps = 6/248 (2%)
Frame = +1
Query: 73 LEERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSVK-SGDKIELDGRSFVASA 249
+ ER+ K+L+Q G+ SRR E+ I G + VNG +A LG D + +DG+ A
Sbjct: 1 MAERIQKLLSQWGIASRRHAEEMILAGRVSVNGKVANLGDKADPQQDFLSVDGKQIKADN 60
Query: 250 LTEPARVLIYNKPEGEVTTREDPEGRPTVFETLP--VLKGARWIAIGRLDINXXXXXXXX 423
+L+ NKP ++T +DP GR TV + LP + +G +GRLD N
Sbjct: 61 RPRDIYLLV-NKPRDVLSTCDDPRGRKTVLDLLPQDLQRGKGLHPVGRLDRNSTGALLLT 119
Query: 424 XDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERI 603
DGEL + HP + + Y V + E + DE LE+ G+ML+ T+E I
Sbjct: 120 NDGELTLRLTHPRYHLPKTYDVWL-----EGNPSDEDLEKWRSGMMLDGKKTLPATLEVI 174
Query: 604 GNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLL---PRELLRGQSTEL 774
V + EGRNR++RRL E G V +L R G + L + L GQ L
Sbjct: 175 SENKDQIHLLVTLTEGRNRQIRRLAEELGLTVLKLHRRTIGPLQLHTRGKVLGSGQFRFL 234
Query: 775 PKTQVEALRTQLKL 816
++ L+ Q+ L
Sbjct: 235 SPAEIRLLKKQVNL 248
>gi|3915445|sp|O67444|YE64_AQUAE HYPOTHETICAL PROTEIN AQ_1464
>gi|2983856 (AE000741) hypothetical protein [Aquifex
aeolicus]
Length = 249
Score = 110 bits (273), Expect = 2e-23
Identities = 81/236 (34%), Positives = 126/236 (53%), Gaps = 10/236 (4%)
Frame = +1
Query: 73 LEERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQ-LGMSVKSG-DKIELDGRSFVAS 246
+E R++K L++AG+ SRR E+ I G +KVNG++ + LG+ V D +E+DG+
Sbjct: 2 MEVRINKFLSEAGVASRRKAEKLILEGRVKVNGEVVRSLGVKVNPEVDIVEVDGKP---- 57
Query: 247 ALTEPARVLIYNKPEGEVTTR-EDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXX 423
+ R +I NKP +T P+GR T+ E + + R +GRLD N
Sbjct: 58 VKPQRKRYIILNKPCCYLTQLGRSPDGRKTIEELIKDIP-ERVFPVGRLDYNTEGLLILT 116
Query: 424 XDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERI 603
DGELAN ++HP ++ + Y+ V E V + L+++ +G+ LEDG AK D I +
Sbjct: 117 NDGELANRILHPRYKLPKVYLALV-----EGKVDQKTLKRMKQGIELEDGFAKPDNIRIV 171
Query: 604 GNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLP-------RELLRGQ 762
+ + EGR V+R + G +V RLKR G + L REL +G+
Sbjct: 172 RYEGKNTLLEITFHEGRKHLVKRFLGAFGHKVKRLKRIAIGPIKLGKLSPGKWRELNQGE 231
Query: 763 STELPK 780
+L K
Sbjct: 232 LAQLFK 237
>gi|3915547|sp|O33210|YH11_MYCTU HYPOTHETICAL 27.6 KD PROTEIN RV1711
>gi|2326754|emb|CAB10968| (Z98268) hypothetical protein
Rv1711 [Mycobacterium tuberculosis]
Length = 254
Score = 108 bits (266), Expect = 2e-22
Identities = 79/222 (35%), Positives = 111/222 (49%), Gaps = 4/222 (1%)
Frame = +1
Query: 82 RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIA-QLGMSVKSGDKI-ELDGRSFVASALT 255
RL KVL+QAG+ SRRA E+ I +G ++V+G + +LG V + +DG V L
Sbjct: 15 RLQKVLSQAGIASRRAAEKMIVDGRVEVDGHVVTELGTRVDPQVAVVRVDGARVV---LD 71
Query: 256 EPARVLIYNKPEGEVTTREDPEGRPTVFETLP--VLKGARWIAIGRLDINXXXXXXXXXD 429
+ L NKP G +T D GRP + + + V + +GRLD + D
Sbjct: 72 DSLVYLALNKPRGMHSTMSDDRGRPCIGDLIERKVRGTKKLFHVGRLDADTEGLMLLTND 131
Query: 430 GELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGN 609
GELA+ +MHPS E+ + Y+ V V L L G+ L+DG A D +
Sbjct: 132 GELAHRLMHPSHEVPKTYLATVTGS-----VPRGLGRTLRAGIELDDGPAFVDDFAVVDA 186
Query: 610 TDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRE 747
RV + EGRNR VRRL + G V L RT G+V L ++
Sbjct: 187 IPGKTLVRVTLHEGRNRIVRRLLAAAGFPVEALVRTDIGAVSLGKQ 232
>gi|3915542|sp|O05668|YH11_MYCLE HYPOTHETICAL 28.1 KD PROTEIN
CB1351.03C >gi|2065214|emb|CAB08276| (Z95117)
MLC1351.03c, unknown, len: 256 aa, similar to eg.
YCIL_ECOLI P37765 hypothetical 32.7 kd protein in trpl-
(291 aa), fasta clones, opt: 481 z-score: 570.9 E():
8.5e-25, (42.4% identity in 229 aa overlap); contains
PS011...
Length = 256
Score = 106 bits (263), Expect = 4e-22
Identities = 81/233 (34%), Positives = 119/233 (50%), Gaps = 11/233 (4%)
Frame = +1
Query: 82 RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIA-QLGMSVKSG-DKIELDGRSFVASALT 255
RL K+L++AG+ SRRA E+ I G ++V+G + +LG V + +DG V +
Sbjct: 17 RLQKILSRAGIASRRAAEKLIIEGRVEVDGQLVRELGTRVDPDVSVVRVDG---VKVVVD 73
Query: 256 EPARVLIYNKPEGEVTTREDPEGRPTVFETLP--VLKGARWIAIGRLDINXXXXXXXXXD 429
+ L NKP G +T D GRP V + + V + +GRLD + D
Sbjct: 74 DSLVYLALNKPRGMYSTMSDDRGRPCVGDLIERRVRGNKKLFHVGRLDADTEGLILLTND 133
Query: 430 GELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGN 609
GELA+ +MHPS E+ + Y+ V+ V L ++L+ G+ L+DG A D +
Sbjct: 134 GELAHRLMHPSHEVSKTYLATVKGA-----VPRGLGKKLSVGLELDDGPAHVDDFAVVDA 188
Query: 610 TDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLP-------RELLRGQST 768
R+ + EGR R VRRL + G V L RT G+V L R LLR +
Sbjct: 189 IPGKTLVRLTLHEGRKRIVRRLLTAAGFPVEMLVRTDIGAVSLGDQRPGCLRALLRDEIR 248
Query: 769 ELPK 780
+L K
Sbjct: 249 QLYK 252
>gi|3915394|sp|O66829|Y554_AQUAE HYPOTHETICAL PROTEIN AQ_554
>gi|2983194 (AE000695) hypothetical protein [Aquifex
aeolicus]
Length = 238
Score = 105 bits (259), Expect = 1e-21
Identities = 74/244 (30%), Positives = 135/244 (55%), Gaps = 4/244 (1%)
Frame = +1
Query: 82 RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIA-QLGMSVKSGDKIELDGRSFVASALTE 258
RL K L+++ SR+ ++ I G +KV+G + Q VK G+++E++G+S +
Sbjct: 2 RLDKYLSKSLHISRKEAKELIREGRVKVSGKVVKQAEYRVKEGEEVEVEGKS------VK 55
Query: 259 PAR--VLIYNKPEGEVTTREDPEGRPTVFETLPV-LKGARWIAIGRLDINXXXXXXXXXD 429
P + L+ KP+G ++T E+ + P+ E + + + GRLD++ D
Sbjct: 56 PKKNVYLMLYKPKGYLSTTEEDKKYPSFLELIREHFPSRKLFSAGRLDVDAEGLLLITDD 115
Query: 430 GELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGN 609
GELA+ + HP ++E+EY+VR+ + + DE L++L V LE+ + E++
Sbjct: 116 GELAHRLTHPKWKVEKEYIVRL-----DRDIGDEELKKLYE-VKLEEKPVQLVKAEKL-- 167
Query: 610 TDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQV 789
S D + ++ EGR+ V+RL+++ G V LKRTR G++ L + G+ EL + +V
Sbjct: 168 --SGDTVKAILTEGRHHVVKRLFKAVGHNVVYLKRTRVGNLRLDENMEPGEWRELTEEEV 225
Query: 790 EALRTQLK 813
+ L+ +K
Sbjct: 226 KELKRLVK 233
>gi|4155973 (AE001558) putative [Helicobacter pylori J99]
Length = 262
Score = 103 bits (255), Expect = 3e-21
Identities = 74/219 (33%), Positives = 119/219 (53%), Gaps = 9/219 (4%)
Frame = +1
Query: 82 RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSVKSGDKIELDGRSFVASALTEP 261
R+++ LA SRR E+ + G +K+N + A+L VK DK+ LD R + +
Sbjct: 7 RINQFLAHYTKHSRREAEKLVLEGRVKINHEHAKLASVVKENDKVFLDKR-LIKPLKNKK 65
Query: 262 ARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGELA 441
VL+Y+KP+GE+ ++ DP R ++E+L K A + +GRLD +
Sbjct: 66 FSVLVYHKPKGELVSKADPLKRHVIYESLEK-KYAHFAPVGRLDFASEGVLLLSDSKAVV 124
Query: 442 NAMMHPSSEIEREYVVRVR---SPEGEEHVQDEL-LEQLTRGVMLEDGT-----AKFDTI 594
+A+MH + +E+EY+V+++ + E E +Q+ L LE T+G + A F
Sbjct: 125 SALMH--ANLEKEYLVKIQGFVTREMENAMQEGLKLENATKGAHQKTPIKSMEFAPFIGY 182
Query: 595 ERIGNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLL 738
E I N + RV++ EG+NRE+RR + V L+R RYG V L
Sbjct: 183 EIIKNHAKYSKLRVIINEGKNRELRRFFAFFNAGVLDLRRVRYGFVNL 230
>gi|2501526|sp|P55986|YE59_HELPY HYPOTHETICAL PROTEIN HP1459
>gi|2314637|gb|AAD08501.1| (AE000646) conserved
hypothetical protein [Helicobacter pylori 26695]
Length = 262
Score = 102 bits (253), Expect = 5e-21
Identities = 72/219 (32%), Positives = 120/219 (53%), Gaps = 9/219 (4%)
Frame = +1
Query: 82 RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSVKSGDKIELDGRSFVASALTEP 261
R+++ LA SRR E+ + G +K+N + A+L VK D++ LD R + +
Sbjct: 7 RINQFLAHYTKHSRREAEKLVLEGRVKINHEHAKLASVVKENDRVFLDKR-LIKPLKNKK 65
Query: 262 ARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGELA 441
VL+Y+KP+GE+ ++ DP R ++E+L K A + +GRLD +
Sbjct: 66 FSVLVYHKPKGELVSKADPLKRRVIYESLEK-KYAHFAPVGRLDFASEGVLLLSDSKAVV 124
Query: 442 NAMMHPSSEIEREYVVRVR---SPEGEEHVQDEL-LEQLTRGVMLEDGT-----AKFDTI 594
+A+MH +++E+EY+++++ + E E +Q+ L LE T+G + A F
Sbjct: 125 SALMH--ADLEKEYLIKIQGFVTREMENAMQEGLKLENATKGAHQKTPIKSMEFAPFIGY 182
Query: 595 ERIGNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLL 738
E I N + RV++ EG+NRE+RR + V L+R RYG V L
Sbjct: 183 EIIKNHAKYSKLRVIINEGKNRELRRFFAFFNAGVLDLRRVRYGFVNL 230
>gi|418534|sp|P32684|YJBC_ECOLI HYPOTHETICAL 32.5 KD PROTEIN IN
PEPE-LYSC INTERGENIC REGION >gi|396357 (U00006) No
definition line found [Escherichia coli] >gi|1790453
(AE000475) orf, hypothetical protein [Escherichia coli]
Length = 290
Score = 95.6 bits (234), Expect = 9e-19
Identities = 66/224 (29%), Positives = 117/224 (51%)
Frame = +1
Query: 67 PKLEERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSVKSGDKIELDGRSFVAS 246
P RL+K ++++G+ SRR ++ I G + +NG A +G VK GD ++++G+ +
Sbjct: 3 PDSSVRLNKYISESGICSRREADRYIEQGNVFLNGKRATIGDQVKPGDVVKVNGQ-LIEP 61
Query: 247 ALTEPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXX 426
E ++ NKP G V+T ED E R + + V R IGRLD +
Sbjct: 62 REAEDLVLIALNKPVGIVSTTEDGE-RDNIVDF--VNHSKRVFPIGRLDKDSQGLIFLTN 118
Query: 427 DGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIG 606
G+L N ++ ++ E+EY+V V P + +E + ++ GV + K +++
Sbjct: 119 HGDLVNKILRAGNDHEKEYLVTVDKP-----ITEEFIRGMSAGVPILGTVTKKCKVKK-- 171
Query: 607 NTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLL 738
++ FR+ + +G NR++RR+ E G +V +L+RTR +V L
Sbjct: 172 --EAPFVFRITLVQGLNRQIRRMCEHFGYEVKKLERTRIMNVSL 213
>gi|3329180|gb|AAC68318.1| (AE001343) predicted pseudouridine
synthase [Chlamydia trachomatis]
Length = 241
Score = 82.7 bits (201), Expect = 7e-15
Identities = 71/238 (29%), Positives = 117/238 (48%), Gaps = 4/238 (1%)
Frame = +1
Query: 82 RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSV---KSGDKIELDGRSFVASAL 252
RL+K LA AG+ SRR ++ I G + VNG +A G V + D +E+ G+ A
Sbjct: 5 RLNKFLASAGVASRRKCDEIIFAGSVTVNGRVAA-GPFVTVDEEFDSVEVGGQRIGA--- 60
Query: 253 TEPARVLIYNKPEGEVTTREDP-EGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXD 429
E + +KP G + + E G V + L R +GRLD D
Sbjct: 61 -EKKVYFMVHKPLGYLCSSERKFPGSKLVIDLLSHCP-YRLFTVGRLDKETSGLILVTND 118
Query: 430 GELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGN 609
GE AN ++HPS I +EY+++V V LE L G +++ + +++++
Sbjct: 119 GEFANRVIHPSFGITKEYLLKV-----SRDVTARDLETLMAGTVIDGKVVRPVSVKKV-- 171
Query: 610 TDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQV 789
+++V EG+ E+R E+ G Q+ LKR R GS++L L G+ EL +++
Sbjct: 172 --RRGTIKIIVNEGKKHEIRLFAEAAGLQLLELKRIRIGSLVL-GGLPYGKYRELTDSEL 228
Query: 790 EA 795
++
Sbjct: 229 DS 230
>gi|465610|sp|P33918|RSUA_ECOLI RIBOSOMAL SMALL SUBUNIT
PSEUDOURIDINE SYNTHASE A (16S PSEUDOURIDYLATE 516
SYNTHASE) (16S PSEUDOURIDINE 516 SYNTHASE) (URACIL
HYDROLYASE) >gi|1042177|bbs|169371 16S RNA pseudouridine
516 synthase, 16S RNA psi 516 synthase=rsuA gene product
[Escherichia coli, Peptide, 231 aa] >gi|405907 (U00008)
yejD [Escherichia coli] >gi|1788510 (AE000308) 16S
pseudouridylate 516 synthase [Escherichia coli]
Length = 231
Score = 82.7 bits (201), Expect = 7e-15
Identities = 71/239 (29%), Positives = 111/239 (45%), Gaps = 4/239 (1%)
Frame = +1
Query: 82 RLHKVLAQAGLGSRRALEQR-ISNGLIKVNGDIAQ-LGMSVKSGDKIELDGRSFVASALT 255
RL K +AQ LG RA+ R I + V+G+I + + + DG A
Sbjct: 2 RLDKFIAQQ-LGVSRAIAGREIRGNRVTVDGEIVRNAAFKLLPEHDVAYDGNPL---AQQ 57
Query: 256 EPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGE 435
R + NKP+G V + +DP+ PTV L + A GRLDI+ DG+
Sbjct: 58 HGPRYFMLNKPQGYVCSTDDPD-HPTVLYFLDEPVAWKLHAAGRLDIDTTGLVLMTDDGQ 116
Query: 436 LANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVML--EDGTAKFDTIERIGN 609
++ + P E+ Y+V + SP V D+ EQ +GV L E K +E I
Sbjct: 117 WSHRITSPRHHCEKTYLVTLESP-----VADDTAEQFAKGVQLHNEKDLTKPAVLEVITP 171
Query: 610 TDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQV 789
T R+ + EGR +V+R++ + G V L R R G + L +L G+ L + ++
Sbjct: 172 TQ----VRLTISEGRYHQVKRMFAAVGNHVVELHRERIGGITLDADLAPGEYRPLTEEEI 227
Query: 790 EAL 798
++
Sbjct: 228 ASV 230
>gi|2145990|pir||S72955 u0247g protein - Mycobacterium leprae
>gi|467162 (U00021) u0247g [Mycobacterium leprae]
Length = 186
Score = 82.3 bits (200), Expect = 9e-15
Identities = 60/170 (35%), Positives = 84/170 (49%), Gaps = 9/170 (5%)
Frame = +1
Query: 271 LIYNKPEGEVTTREDPEGRPTVFETLP--VLKGARWIAIGRLDINXXXXXXXXXDGELAN 444
L NKP G +T D GRP V + + V + +GRLD + DGELA+
Sbjct: 9 LALNKPRGMYSTMSDDRGRPCVGDLIERRVRGNKKLFHVGRLDADTEGLILLTNDGELAH 68
Query: 445 AMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTDSHD 624
+MHPS E+ + Y+ V+ V L ++L+ G+ L+DG A D +
Sbjct: 69 RLMHPSHEVSKTYLATVKGA-----VPRGLGKKLSVGLELDDGPAHVDDFAVVDAIPGKT 123
Query: 625 WFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLP-------RELLRGQSTELPK 780
R+ + EGR R VRRL + G V L RT G+V L R LLR + +L K
Sbjct: 124 LVRLTLHEGRKRIVRRLLTAAGFPVEMLVRTDIGAVSLGDQRPGCLRALLRDEIRQLYK 182
>gi|4377181|gb|AAD19002| (AE001667) predicted pseudouridine synthase
[Chlamydia pneumoniae]
Length = 235
Score = 81.9 bits (199), Expect = 1e-14
Identities = 71/245 (28%), Positives = 118/245 (47%), Gaps = 1/245 (0%)
Frame = +1
Query: 82 RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLG-MSVKSGDKIELDGRSFVASALTE 258
RL+K LA AG+ SRR ++ I +G + VNG +A+ + V DK+++ G S LT+
Sbjct: 5 RLNKFLASAGVASRRKCDEIIFSGSVTVNGRVAEGPFVLVDPEDKVQVGGTSV---HLTK 61
Query: 259 PARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGEL 438
+++ ++ + G V + L R +GRLD DGE
Sbjct: 62 KVYFMVHKAIGYLCSSEKKFPGTKLVIDLFAHLP-YRVFTVGRLDKETSGLILVTNDGEF 120
Query: 439 ANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTDS 618
AN ++HPSS I +EY+++V V + L +L G ++ + ++ +I
Sbjct: 121 ANKIIHPSSGITKEYLLKV-----SRDVSAKDLGKLMEGTFIDGKHVRPVSVTKI----R 171
Query: 619 HDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQVEAL 798
++VV EG+ E+R ++ G + LKR R GS++L L G+ EL + L
Sbjct: 172 RGTVKIVVSEGKKHEIRLFADAAGFPILELKRIRIGSLVL-GGLRYGEYRELTDAE---L 227
Query: 799 RTQLKL 816
T +KL
Sbjct: 228 GTYMKL 233
>gi|3322747 (AE001223) conserved hypothetical protein [Treponema
pallidum]
Length = 261
Score = 78.0 bits (189), Expect = 2e-13
Identities = 70/224 (31%), Positives = 108/224 (47%), Gaps = 17/224 (7%)
Frame = +1
Query: 67 PKLEERLHKVLAQAGLGSRRALEQRISNGLIKVNGD-IAQLGMSVKSGDKIELDGRSFVA 243
P RL LA++G SRRA E I++G + V+G + G +V + + + +DG
Sbjct: 6 PFFRLRLQVYLARSGCASRRACEALIASGRVTVDGQTVTTQGRTVCAQNVVCVDG---TV 62
Query: 244 SALTEPARVLIYNKPEGEVTTR--EDPEG-----------RPTVFETLPVLKGA---RWI 375
L R ++ KP G + + + P G + + +++ A R
Sbjct: 63 VQLERVQRYVLLYKPVGYICSLAPQFPAGYAHTQVRAGPSKQEYARAIDLVQPAYQERLY 122
Query: 376 AIGRLDINXXXXXXXXXDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRG 555
IGRLD+ DG A A+ HP S IE+EY+V R P V LL RG
Sbjct: 123 HIGRLDVRSEGALLFTNDGSFAQALGHPRSGIEKEYIVETREP-----VPAALLSSFVRG 177
Query: 556 VMLEDGTAKFDTIERIGNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVL 735
V +E + + + ++V+ EG+ RE+R ++E+ G V RL R R G V
Sbjct: 178 VWVEGCRYRCVRARHL----AAQCVQLVLVEGKKREIRVVFEAWGQDVVRLVRVRIGRVR 233
Query: 736 L 738
L
Sbjct: 234 L 234
>gi|1175857|sp|P45124|RSUA_HAEIN RIBOSOMAL SMALL SUBUNIT
PSEUDOURIDINE SYNTHASE A (16S PSEUDOURIDYLATE 516
SYNTHASE) (16S PSEUDOURIDINE 516 SYNTHASE) (URACIL
HYDROLYASE) >gi|1074693|pir||F64169 hypothetical protein
HI1243 - Haemophilus influenzae (strain Rd KW20)
>gi|1574175 (U32804) 16s pseudouridylate 516 synthase
(rsuA) [Haemophilus influenzae Rd]
Length = 232
Score = 73.7 bits (178), Expect = 3e-12
Identities = 60/239 (25%), Positives = 110/239 (45%), Gaps = 2/239 (0%)
Frame = +1
Query: 82 RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSVKSGDKIELDGRSFVASALT-- 255
RL K +A+ +R + I +K+NG+I + G SV+ + E+ F LT
Sbjct: 2 RLDKFIAENVGLTRSQATKAIRQSAVKINGEIVKSG-SVQISQEDEI---YFEDELLTWI 57
Query: 256 EPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGE 435
E + + NKP+G V + +D + PT+++ + + GRLD++ DG+
Sbjct: 58 EEGQYFMLNKPQGCVCSNDDGD-YPTIYQFFDYPLAGKLHSAGRLDVDTTGLVLLTDDGQ 116
Query: 436 LANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTD 615
++ + P E+ Y+V + P E + L RG AK + ++
Sbjct: 117 WSHRITSPKHHCEKTYLVTLADPVEENYSAACAEGILLRGEKEPTKPAKLEILDDYN--- 173
Query: 616 SHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQVEA 795
+ + EGR +V+R++ + G +V L R + G V+L L G+ L ++++E
Sbjct: 174 ----VNLTISEGRYHQVKRMFAALGNKVVGLHRWKIGDVVLDESLEEGEYRPLTQSEIEK 229
Query: 796 L 798
L
Sbjct: 230 L 230
>gi|3845303 (AE001423) pseudouridine synthetase (RsuA fam.); 1st
euk. member (OO) [Plasmodium falciparum]
Length = 338
Score = 67.1 bits (161), Expect = 3e-10
Identities = 65/240 (27%), Positives = 120/240 (49%), Gaps = 53/240 (22%)
Frame = +1
Query: 82 RLHKVLAQAGLGSRRALEQRISNGLIKVNGDI-AQLGMSVKSGD--------KIELDGR- 231
RL+K+++ SRR ++ I +G +K+N I G V G KI+L
Sbjct: 47 RLNKLISMKRNISRRKSDEFIKDGKVKINNKIITNPGTHVHIGKDSLRIYDKKIKLTNII 106
Query: 232 SFVASALTEPARVLIYNKPEGEVTTREDPEGRPTVFETLP--VLKGARWIAIGRLDINXX 405
+ + + + ++ +KP+G + T D + R +++ P +L+ R + +GRLD N
Sbjct: 107 NMIKQNENKLHKWIVLHKPKGLLCTSNDEKNRKSIYTLFPEEMLQKYRLVTVGRLDRNTS 166
Query: 406 XXXXXXXDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDG---- 573
D N + HP + R Y V + P V+ L++L RG+ LE+
Sbjct: 167 GVLLLTNDYAWVNKLTHPKYQRIRTYRVHIEGP-----VKMNALKELARGIYLEEDEKTQ 221
Query: 574 -------------------------------------TAKFDTIERIGNTDSHDWFRVVV 642
+ + I+ +T + +
Sbjct: 222 PKKIYNYKESREKSNIDDKKKKKMSKMKKKTNPAFIEILREEKIKIKEDTKKITVLNISI 281
Query: 643 KEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQVEALR 801
KEGRNR++R++++ V ++KRT + ++ L Q EL + +V L+
Sbjct: 282 KEGRNRQIRKMFQQINQPVIKIKRTSFENITLKNIYFPKQYRELNQKEVNDLK 334
>gi|1175289|sp|P44827|YMFC_HAEIN HYPOTHETICAL PROTEIN HI0694
>gi|1074484|pir||I64156 hypothetical protein HI0694 -
Haemophilus influenzae (strain Rd KW20) >gi|1573697
(U32752) conserved hypothetical protein [Haemophilus
influenzae Rd]
Length = 240
Score = 66.7 bits (160), Expect = 4e-10
Identities = 57/187 (30%), Positives = 89/187 (47%), Gaps = 14/187 (7%)
Frame = +1
Query: 256 EPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGE 435
+ +V+++NKP +T D +GR T+ + + + A GRLD + +GE
Sbjct: 49 DETKVVLFNKPFDVLTQFTDEQGRATLKDFISI---PNVYAAGRLDRDSEGLLILTNNGE 105
Query: 436 LANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTD 615
L + + P + E+ Y V+V EG D L QL +GV L+DG K + I +
Sbjct: 106 LQHRLADPKFKTEKTYWVQV---EGIPEETD--LAQLRKGVELKDGVTKSAKVRLISEPN 160
Query: 616 SHD--------------WFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELL 753
+ W + + EGRNR+VRR+ G RL R G +L L
Sbjct: 161 LWERNPPIRERKNIPTSWLEIKISEGRNRQVRRMTAHIGFPTLRLVRVSMG-LLSINGLE 219
Query: 754 RGQSTELPKTQVEALRTQLKL 816
G L +++AL +KL
Sbjct: 220 NGSFRLLSLDEIKALFQTVKL 240
>gi|1787380 (AE000213) orf, hypothetical protein [Escherichia coli]
Length = 207
Score = 57.8 bits (137), Expect = 2e-07
Identities = 51/161 (31%), Positives = 74/161 (45%), Gaps = 15/161 (9%)
Frame = +1
Query: 256 EPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGE 435
+P RV+++NKP + D GR T+ E +PV +G A GRLD + +G
Sbjct: 27 QPTRVILFNKPYDVLPQFTDEAGRKTLKEFIPV-QGV--YAAGRLDRDSEGLLVLTNNGA 83
Query: 436 LANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGT---AKFDTIE--- 597
L + P + Y V+V E + LE L GV L DG A + ++
Sbjct: 84 LQARLTQPGKRTGKIYYVQV-----EGIPTQDALEALRNGVTLNDGPTLPAGAELVDEPA 138
Query: 598 ---------RIGNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLL 738
R + W ++ + EGRNR+VRR+ G RL R G L
Sbjct: 139 WLWPRNPPIRERKSIPTSWLKITLYEGRNRQVRRMTAHVGFPTLRLIRYAMGDYSL 194
>gi|3916025|sp|P75966|YMFC_ECOLI HYPOTHETICAL 24.9 KD PROTEIN IN
TRMU-ICDA INTERGENIC REGION >gi|4062699|dbj|BAA35957|
(D90748) Hypothetical protein HI0694 [Escherichia coli]
>gi|4062717|dbj|BAA35966| (D90749) Hypothetical protein
HI0694 [Escherichia coli]
Length = 217
Score = 57.8 bits (137), Expect = 2e-07
Identities = 51/161 (31%), Positives = 74/161 (45%), Gaps = 15/161 (9%)
Frame = +1
Query: 256 EPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGE 435
+P RV+++NKP + D GR T+ E +PV +G A GRLD + +G
Sbjct: 37 QPTRVILFNKPYDVLPQFTDEAGRKTLKEFIPV-QGV--YAAGRLDRDSEGLLVLTNNGA 93
Query: 436 LANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGT---AKFDTIE--- 597
L + P + Y V+V E + LE L GV L DG A + ++
Sbjct: 94 LQARLTQPGKRTGKIYYVQV-----EGIPTQDALEALRNGVTLNDGPTLPAGAELVDEPA 148
Query: 598 ---------RIGNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLL 738
R + W ++ + EGRNR+VRR+ G RL R G L
Sbjct: 149 WLWPRNPPIRERKSIPTSWLKITLYEGRNRQVRRMTAHVGFPTLRLIRYAMGDYSL 204
>gi|3402689|gb|AAC28992.1| (AC004697) unknown protein [Arabidopsis
thaliana]
Length = 303
Score = 54.3 bits (128), Expect = 2e-06
Identities = 41/134 (30%), Positives = 63/134 (46%), Gaps = 12/134 (8%)
Frame = +1
Query: 79 ERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMS--VKSGDKIELDGRSFVASAL 252
+RL KVLA AG+ SRR E+ I +G + VNG + + S D I ++G
Sbjct: 160 QRLSKVLAAAGVASRRTSEELIFDGKVTVNGILCNTPQTRVDPSRDIIYVNGNRIPKK-- 217
Query: 253 TEPARVLIYNKPEGEVTTREDPEGRPTVF----------ETLPVLKGARWIAIGRLDINX 402
P NKP+G + + + E + + + P R +GRLD+
Sbjct: 218 LPPKVYFALNKPKGYICSSGEKEIKSAISLFDEYLSSWDKRNPGTPKPRLFTVGRLDVAT 277
Query: 403 XXXXXXXXDGELANAMMHPSSEIERE 480
DG+ A + HPSS + +E
Sbjct: 278 TGLIVVTNDGDFAQKLSHPSSSLPKE 303
>gi|3915559|sp|O32068|YTZF_BACSU HYPOTHETICAL 17.7 KD PROTEIN IN
AMYX-OPUD INTERGENIC REGION >gi|2635487|emb|CAB14981|
(Z99119) similar to hypothetical proteins [Bacillus
subtilis]
Length = 157
Score = 53.9 bits (127), Expect = 3e-06
Identities = 38/139 (27%), Positives = 64/139 (45%), Gaps = 1/139 (0%)
Frame = +1
Query: 382 GRLDINXXXXXXXXXDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVM 561
GRLD + DG+LA+ ++ P + + Y V ++S E + D L GV
Sbjct: 18 GRLDKDTEGFLLLTNDGQLAHRLLSPKKHVPKTYEVHLKSQISREDISD-----LETGVY 72
Query: 562 LEDGTAKFDTIERIGNTDS-HDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLL 738
+E G I DS + + + EG+ +V+++ ++ G +V LKR G V L
Sbjct: 73 IEGGYKTKPAKAEIKTNDSGNTVIYLTITEGKYHQVKQMAKAVGNEVVYLKRLSMGRVSL 132
Query: 739 PRELLRGQSTELPKTQVEAL 798
L G+ EL + ++ L
Sbjct: 133 DPALAPGEYRELTEEELHLL 152
>gi|3402691|gb|AAC28994.1| (AC004697) unknown protein [Arabidopsis
thaliana]
Length = 86
Score = 51.5 bits (121), Expect = 2e-05
Identities = 26/56 (46%), Positives = 37/56 (65%)
Frame = +1
Query: 631 RVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQVEAL 798
R+VV EGRN EVR L ++ G +V LKR R G LP +L G+ EL +++++AL
Sbjct: 27 RIVVHEGRNHEVRELVKNAGLEVHSLKRVRIGGFRLPSDLGLGKHVELKQSELKAL 82
>gi|6689331|emb|CAB65436.1| (AJ250721) hypothetical protein
[Synechococcus leopoliensis]
Length = 199
Score = 49.2 bits (115), Expect = 8e-05
Identities = 40/150 (26%), Positives = 69/150 (45%), Gaps = 14/150 (9%)
Frame = +1
Query: 265 RVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGELAN 444
R L+++KP V + P RP + +GRLD + +G L +
Sbjct: 4 RYLLFHKPYDAVC-QFSPSDRPDQQTLKDYIDVPEVYPVGRLDRDSEGLLLLTNNGALQH 62
Query: 445 AMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTDSHD 624
+ HP +R Y V+V E + L+ L +GV ++D + ++R+ + +
Sbjct: 63 RLCHPRFGHDRTYWVQV-----EREPTEAALQALRQGVQIQDYRTRPAKVQRLDDPQIPE 117
Query: 625 --------------WFRVVVKEGRNREVRRLWESQGCQVSRLKR 714
W + ++EGRNR+VRR+ + G RL R
Sbjct: 118 RDPPIRFRKTVPTAWLALTLQEGRNRQVRRMTAAVGHPTLRLIR 161
>gi|3915395|sp|P72581|Y612_SYNY3 HYPOTHETICAL 21.0 KD PROTEIN
SLR0612
Length = 261
Score = 48.4 bits (113), Expect = 1e-04
Identities = 46/158 (29%), Positives = 71/158 (44%), Gaps = 18/158 (11%)
Frame = +1
Query: 247 ALTEPARVLIYNKPEGEVT--TREDPEGRPTV--FETLPVLKGARWIAIGRLDINXXXXX 414
AL + + +++ KP G + T RPT+ + LP L +GRLD +
Sbjct: 34 ALNKTPQTIVFYKPYGVLCQFTDNSAHPRPTLKDYINLPDL-----YPVGRLDQDSEGLL 88
Query: 415 XXXXDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTI 594
+G+L + + H +R Y +V E DE LE L RG+ D +
Sbjct: 89 LLTSNGKLQHRLAHREFAHQRTYFAQV-----EGSPTDEDLEPLRRGITFADYPTRPAIA 143
Query: 595 ERIGNTD--------------SHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTR 720
+ I D W + + EGRNR+VRR+ + G RL R +
Sbjct: 144 KIITEPDFPPRNPPIRYRASIPTSWLSITLTEGRNRQVRRMTAAVGFPTLRLVRVQ 199
>gi|1651652|dbj|BAA16580| (D90899) hypothetical protein
[Synechocystis sp.]
Length = 185
Score = 42.2 bits (97), Expect = 0.010
Identities = 34/114 (29%), Positives = 51/114 (43%), Gaps = 14/114 (12%)
Frame = +1
Query: 379 IGRLDINXXXXXXXXXDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGV 558
+GRLD + +G+L + + H +R Y +V E DE LE L RG+
Sbjct: 1 MGRLDQDSEGLLLLTSNGKLQHRLAHREFAHQRTYFAQV-----EGSPTDEDLEPLRRGI 55
Query: 559 MLEDGTAKFDTIERIGNTD--------------SHDWFRVVVKEGRNREVRRLWESQGCQ 696
D + + I D W + + EGRNR+VRR+ + G
Sbjct: 56 TFADYPTRPAIAKIITEPDFPPRNPPIRYRASIPTSWLSITLTEGRNRQVRRMTAAVGFP 115
Query: 697 VSRLKRTR 720
RL R +
Sbjct: 116 TLRLVRVQ 123
>gi|1175470|sp|Q09801|YAAA_SCHPO HYPOTHETICAL 37.3 KD PROTEIN C22G7.10
IN CHROMOSOME I >gi|2130331|pir||S62454 hypothetical
protein SPAC22G7.10 - fission yeast (Schizosaccharomyces
pombe) >gi|1009460|emb|CAA91134.1| (Z54328) hypothetical
protein [Schizosaccharomyces pombe]
Length = 344
Score = 41.8 bits (96), Expect = 0.014
Identities = 26/75 (34%), Positives = 34/75 (44%), Gaps = 4/75 (5%)
Frame = +1
Query: 1195 AGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSD----HATPTFNPYGNPGQK 1362
+G G H +PN N P R+S + P N+PN S HA PT NP + G
Sbjct: 237 SGGGVHSGAATPNAYVNNNPSSSRRES--ESPANSPNITSSAGMTHAQPTHNPTSSYG-- 292
Query: 1363 TGAGRPNNSGGKYNRNRGP 1419
N + YN +R P
Sbjct: 293 ------NGASTNYNASRPP 305
Score = 32.8 bits (73), Expect = 6.8
Identities = 25/84 (29%), Positives = 37/84 (43%), Gaps = 4/84 (4%)
Frame = +1
Query: 1162 STGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAP---NFP-SDHATP 1329
S P N N + G + + NP + G T + + P+N P N+P S P
Sbjct: 263 SESPANSPNITSSAGMTHAQPTHNPTSSYGNGASTNYNASRPPSNHPHSSNYPSSSRRKP 322
Query: 1330 TFNPYGNPGQKTGAGRPNNSGGKYNRNR 1413
+ + Y N + SGG+Y RNR
Sbjct: 323 SPDRYSNYSSR-------GSGGRYRRNR 343
>gi|294136|gb|AAA29551.1| (M83173) circumsporozoite protein
[Plasmodium falciparum]
Length = 442
Score = 41.4 bits (95), Expect = 0.018
Identities = 38/172 (22%), Positives = 56/172 (32%)
Frame = +1
Query: 895 NRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKP 1074
N N NN + +NN E ++ + D +D + + H K + G
Sbjct: 80 NDNGNNNNGNNNNGDNGREGKDEDKRDGNNEDNEKLRKPKHKKLKQPGDGNPDPNANPNV 139
Query: 1075 FKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTP 1254
P D + + + + + N A A+PN +PN N P
Sbjct: 140 DPNANPNVDPNANPNANPNANP-----NANPNANPNANPNANPNANPNA-NPNANPNANP 193
Query: 1255 GQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
+P PN PN +A P NP NP A N N N
Sbjct: 194 NANPNANPNANPNANPNV-DPNANPNANPNANPNANPNANPNANPNANPNAN 244
Score = 40.2 bits (92), Expect = 0.040
Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 248 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 305
Query: 1366 GA---GRPNNSGGKYNRNRG 1416
A PN + K N+ G
Sbjct: 306 NANPNANPNANPNKNNQGNG 325
>gi|117587|sp|P08307|CSP_PLAFW CIRCUMSPOROZOITE PROTEIN PRECURSOR (CS)
>gi|627052|pir||A54529 circumsporozoite protein -
Plasmodium falciparum (strain Wellcome) >gi|160215
(M15505) circumsporozoite protein [Plasmodium falciparum]
Length = 442
Score = 41.4 bits (95), Expect = 0.018
Identities = 38/172 (22%), Positives = 56/172 (32%)
Frame = +1
Query: 895 NRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKP 1074
N N NN + +NN E ++ + D +D + + H K + G
Sbjct: 80 NDNGNNNNGNNNNGDNGREGKDEDKRDGNNEDNEKLRKPKHKKLKQPGDGNPDPNANPNV 139
Query: 1075 FKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTP 1254
P D + + + + + N A A+PN +PN N P
Sbjct: 140 DPNANPNVDPNANPNANPNANP-----NANPNANPNANPNANPNANPNA-NPNANPNANP 193
Query: 1255 GQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
+P PN PN +A P NP NP A N N N
Sbjct: 194 NANPNANPNANPNANPNV-DPNANPNANPNANPNANPNANPNANPNANPNAN 244
Score = 40.2 bits (92), Expect = 0.040
Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 248 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 305
Query: 1366 GA---GRPNNSGGKYNRNRG 1416
A PN + K N+ G
Sbjct: 306 NANPNANPNANPNKNNQGNG 325
>gi|48921|emb|CAA42462| (X59794) traC-4 [Escherichia coli] >gi|1572574
(U67194) TraC4 [Enterobacter aerogenes]
Length = 747
Score = 41.0 bits (94), Expect = 0.024
Identities = 43/146 (29%), Positives = 65/146 (44%), Gaps = 16/146 (10%)
Frame = +1
Query: 862 QRRSAKATLHVNRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVS 1041
+R + A + R + +A + S A E+R+ + +D Q + +R
Sbjct: 137 ERAAKLARVQEERVRRDPNATEEDISAAKEARKTAEASAMLNDSD-AQRRAAELERQERD 195
Query: 1042 GEAAAKQAHKPFKQY-----KPKND-RSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGN 1203
+ A QA KP +QY K K++ +SL QSWYVP G P GA
Sbjct: 196 RQQAQPQAEKPERQYINVPYKEKDEAKSLGARWDRQQQSWYVPPGTDAAPFAKWAQGAAT 255
Query: 1204 GA-HPNKKSPNPNTR---NTPGQQTRKS------PYKYPNNA 1299
A P P P + P QQ +++ PY+ N A
Sbjct: 256 AAVEPRSAQPAPEAQGEAQKPAQQAQQARQYLAVPYEQRNAA 297
>gi|48920|emb|CAA42461| (X59794) traC-3 [Escherichia coli] >gi|1572575
(U67194) TraC3 [Enterobacter aerogenes]
Length = 1230
Score = 41.0 bits (94), Expect = 0.024
Identities = 43/146 (29%), Positives = 65/146 (44%), Gaps = 16/146 (10%)
Frame = +1
Query: 862 QRRSAKATLHVNRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVS 1041
+R + A + R + +A + S A E+R+ + +D Q + +R
Sbjct: 620 ERAAKLARVQEERVRRDPNATEEDISAAKEARKTAEASAMLNDSD-AQRRAAELERQERD 678
Query: 1042 GEAAAKQAHKPFKQY-----KPKND-RSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGN 1203
+ A QA KP +QY K K++ +SL QSWYVP G P GA
Sbjct: 679 RQQAQPQAEKPERQYINVPYKEKDEAKSLGARWDRQQQSWYVPPGTDAAPFAKWAQGAAT 738
Query: 1204 GA-HPNKKSPNPNTR---NTPGQQTRKS------PYKYPNNA 1299
A P P P + P QQ +++ PY+ N A
Sbjct: 739 AAVEPRSAQPAPEAQGEAQKPAQQAQQARQYLAVPYEQRNAA 780
>gi|294138|gb|AAA29552.1| (M83174) circumsporozoite protein
[Plasmodium falciparum]
Length = 420
Score = 41.0 bits (94), Expect = 0.024
Identities = 43/161 (26%), Positives = 56/161 (34%), Gaps = 3/161 (1%)
Frame = +1
Query: 928 NNHSTADESRELRRFDTLRDDRGRG--QGKHHFK-DRLTVSGEAAAKQAHKPFKQYKPKN 1098
N +S SR L D ++ G +GK K D E K HK KQ N
Sbjct: 61 NWYSLKKNSRSLGENDDGNNNNGDNGREGKDEDKRDGNNEDNEKLRKPKHKKLKQPGDGN 120
Query: 1099 DRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSP 1278
+ + + V + + N A A+PN +PN N P +P
Sbjct: 121 PDPNANPNVDPNANPNVDPNANPNVDPNANPNANPNANPNA-NPNANPNANPNANPNANP 179
Query: 1279 YKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
PN PN +A P NP NP A N N N
Sbjct: 180 NANPNANPNV-DPNANPNANPNANPNANPNANPNANPNANPNAN 222
Score = 40.2 bits (92), Expect = 0.040
Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 226 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 283
Query: 1366 GA---GRPNNSGGKYNRNRG 1416
A PN + K N+ G
Sbjct: 284 NANPNANPNANPNKNNQGNG 303
>gi|136173|sp|P27190|TRC5_ECOLI DNA PRIMASE TRAC (REPLICATION PRIMASE)
>gi|481041|pir||S37669 traC-2 protein - Escherichia coli
>gi|48919|emb|CAA42460| (X59794) traC-2 [Escherichia
coli] >gi|1572573 (U67194) TraC2 [Enterobacter aerogenes]
Length = 1448
Score = 41.0 bits (94), Expect = 0.024
Identities = 43/146 (29%), Positives = 65/146 (44%), Gaps = 16/146 (10%)
Frame = +1
Query: 862 QRRSAKATLHVNRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVS 1041
+R + A + R + +A + S A E+R+ + +D Q + +R
Sbjct: 838 ERAAKLARVQEERVRRDPNATEEDISAAKEARKTAEASAMLNDSD-AQRRAAELERQERD 896
Query: 1042 GEAAAKQAHKPFKQY-----KPKND-RSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGN 1203
+ A QA KP +QY K K++ +SL QSWYVP G P GA
Sbjct: 897 RQQAQPQAEKPERQYINVPYKEKDEAKSLGARWDRQQQSWYVPPGTDAAPFAKWAQGAAT 956
Query: 1204 GA-HPNKKSPNPNTR---NTPGQQTRKS------PYKYPNNA 1299
A P P P + P QQ +++ PY+ N A
Sbjct: 957 AAVEPRSAQPAPEAQGEAQKPAQQAQQARQYLAVPYEQRNAA 998
>gi|117591|sp|P26694|CSP_PLARE CIRCUMSPOROZOITE PROTEIN PRECURSOR (CS)
>gi|102373|pir||A39756 circumsporozoite protein -
Plasmodium reichenowi >gi|160229 (M60972)
circumsporozoite protein [Plasmodium reichenowi]
Length = 388
Score = 40.6 bits (93), Expect = 0.031
Identities = 25/75 (33%), Positives = 30/75 (39%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 194 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 251
Query: 1366 GAGRPNNSGGKYNRN 1410
A N NRN
Sbjct: 252 NANPNANPNANPNRN 266
Score = 40.6 bits (93), Expect = 0.031
Identities = 41/167 (24%), Positives = 56/167 (32%), Gaps = 3/167 (1%)
Frame = +1
Query: 910 NKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYK 1089
N ++ N + E+ + D D G + + H R E K H KQ
Sbjct: 61 NWYSLKKNSRSLGENDDADNGDADNGDEGIDENRRH---RNKEGKEKLKKPKHNKLKQ-- 115
Query: 1090 PKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPN---KKSPNPNTRNTPGQ 1260
P ND +P V + + N A+PN +PN N P
Sbjct: 116 PGNDNVDPNANPN------VDPNANPNVDPNANPNVDPNANPNVDPNANPNVNPNANPNV 169
Query: 1261 QTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
+P PN PN + +A P NP NP A N N N
Sbjct: 170 DPNANPNVNPNANPNV-NPNANPNVNPNANPNANPNANPNANPNANPNAN 218
>gi|117584|sp|P05691|CSP_PLAFL CIRCUMSPOROZOITE PROTEIN (CS)
>gi|552195 (M17802) circumsporozoite protein [Plasmodium
falciparum]
Length = 315
Score = 40.6 bits (93), Expect = 0.031
Identities = 43/161 (26%), Positives = 56/161 (34%), Gaps = 6/161 (3%)
Frame = +1
Query: 928 NNHSTADESRELRRFDTLRDDRGRG--QGKHHFK-DRLTVSGEAAAKQAHKPFKQYKPKN 1098
N +S SR L D ++ G +GK K D E K HK KQ N
Sbjct: 45 NWYSLKKNSRSLGENDDGNNNNGDNGREGKDEDKRDGNNEDNEKLRKPKHKKLKQPADGN 104
Query: 1099 DRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKK---SPNPNTRNTPGQQTR 1269
+ + + V + + N A A+PN +PN N P
Sbjct: 105 PDPNANPNVDPNANPNVDPNANPNVDPNANPNANPNANPNANPNANPNANPNANPNANPN 164
Query: 1270 KSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
+P PN PN +A P NP NP A N N N
Sbjct: 165 ANPNANPNANPNV-DPNANPNANPNANPNANPNANPNANPNANPNAN 210
Score = 40.2 bits (92), Expect = 0.040
Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 198 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 255
Query: 1366 GA---GRPNNSGGKYNRNRG 1416
A PN + K N+ G
Sbjct: 256 NANPNANPNANPNKNNQGNG 275
>gi|294119|gb|AAA29542.1| (M83164) circumsporozoite protein
[Plasmodium falciparum] >gi|294142|gb|AAA29563.1|
(M83150) circumsporozoite protein [Plasmodium falciparum]
>gi|294161|gb|AAA29576.1| (M83163) circumsporozoite
protein [Plasmodium falciparum]
Length = 436
Score = 40.2 bits (92), Expect = 0.040
Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 242 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 299
Query: 1366 GA---GRPNNSGGKYNRNRG 1416
A PN + K N+ G
Sbjct: 300 NANPNANPNANPNKNNQGNG 319
Score = 38.3 bits (87), Expect = 0.16
Identities = 43/161 (26%), Positives = 56/161 (34%), Gaps = 14/161 (8%)
Frame = +1
Query: 928 NNHSTADESRELRRFDTLRDDRGRG--QGKHHFK-DRLTVSGEAAAKQAHKPFKQYKPKN 1098
N +S SR L D ++ G +GK K D E K HK KQ N
Sbjct: 61 NWYSLKKNSRSLGENDDGNNNNGDNGREGKDEDKRDGNNEDNEKLRKPKHKKLKQPGDGN 120
Query: 1099 DRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKK-----------SPNPNTR 1245
+ + + V + + N A A+PN +PN N
Sbjct: 121 PDPNANPNVDPNANPNVDPNANPNANPNANPNANPNANPNANPNANPNANPNANPNANPN 180
Query: 1246 NTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
P +P PN PN +A P NP NP A N N N
Sbjct: 181 ANPNANPNANPNANPNANPNV-DPNANPNANPNANPNANPNANPNANPNANPNAN 234
>gi|294125|gb|AAA29545.1| (M83167) circumsporozoite protein
[Plasmodium falciparum]
Length = 436
Score = 40.2 bits (92), Expect = 0.040
Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 242 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 299
Query: 1366 GA---GRPNNSGGKYNRNRG 1416
A PN + K N+ G
Sbjct: 300 NANPNANPNANPNKNNQGNG 319
Score = 33.6 bits (75), Expect = 4.0
Identities = 21/80 (26%), Positives = 30/80 (37%)
Frame = +1
Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
P++ + G+G +PN + P +P PN PN + +A P NP N
Sbjct: 108 PKHKKLKQPGDGNPDPNANPNVDPNANPNVDPNANPNVDPNANPN-ANPNANPNANPNAN 166
Query: 1351 PGQKTGAGRPNNSGGKYNRN 1410
P A N N N
Sbjct: 167 PNANPNANPNANPNANPNAN 186
>gi|552191 (M57499) circumsporozoite protein [Plasmodium falciparum]
Length = 424
Score = 40.2 bits (92), Expect = 0.040
Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 230 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 287
Query: 1366 GA---GRPNNSGGKYNRNRG 1416
A PN + K N+ G
Sbjct: 288 NANPNANPNANPNKNNQGNG 307
Score = 33.2 bits (74), Expect = 5.2
Identities = 21/80 (26%), Positives = 29/80 (36%)
Frame = +1
Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
P++ + G+G +PN + P +P PN PN +A P NP N
Sbjct: 108 PKHKKLKQPGDGNPDPNANPNVDPNANPNVDPNANPNVDPNANPNV-DPNANPNANPNAN 166
Query: 1351 PGQKTGAGRPNNSGGKYNRN 1410
P A N N N
Sbjct: 167 PNANPNANPNANPNANPNAN 186
>gi|294129|gb|AAA29547.1| (M83169) circumsporozoite protein
[Plasmodium falciparum] >gi|294140|gb|AAA29562.1|
(M83149) circumsporozoite protein [Plasmodium falciparum]
Length = 424
Score = 40.2 bits (92), Expect = 0.040
Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 230 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 287
Query: 1366 GA---GRPNNSGGKYNRNRG 1416
A PN + K N+ G
Sbjct: 288 NANPNANPNANPNKNNQGNG 307
Score = 33.6 bits (75), Expect = 4.0
Identities = 21/80 (26%), Positives = 30/80 (37%)
Frame = +1
Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
P++ + G+G +PN + P +P PN PN + +A P NP N
Sbjct: 108 PKHKKLKQPGDGNPDPNANPNVDPNANPNVDPNANPNVDPNANPN-ANPNANPNANPNAN 166
Query: 1351 PGQKTGAGRPNNSGGKYNRN 1410
P A N N N
Sbjct: 167 PNANPNANPNANPNANPNAN 186
>gi|117586|sp|P13814|CSP_PLAFT CIRCUMSPOROZOITE PROTEIN PRECURSOR (CS)
>gi|627051|pir||A54533 circumsporozoite protein -
Plasmodium falciparum (strain T4, Thailand) >gi|160217
(M19752) circumsporozoite protein [Plasmodium falciparum]
Length = 424
Score = 40.2 bits (92), Expect = 0.040
Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 230 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 287
Query: 1366 GA---GRPNNSGGKYNRNRG 1416
A PN + K N+ G
Sbjct: 288 NANPNANPNANPNKNNQGNG 307
Score = 33.6 bits (75), Expect = 4.0
Identities = 21/80 (26%), Positives = 30/80 (37%)
Frame = +1
Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
P++ + G+G +PN + P +P PN PN + +A P NP N
Sbjct: 108 PKHKKLKQPGDGNPDPNANPNVDPNANPNVDPNANPNVDPNANPN-ANPNANPNANPNAN 166
Query: 1351 PGQKTGAGRPNNSGGKYNRN 1410
P A N N N
Sbjct: 167 PNANPNANPNANPNANPNAN 186
>gi|84198|pir||S05428 circumsporozoite protein - Plasmodium falciparum
(isolate NF54) >gi|160169 (M22982) circumsporozoite
protein [Plasmodium falciparum]
Length = 405
Score = 40.2 bits (92), Expect = 0.040
Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 211 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 268
Query: 1366 GA---GRPNNSGGKYNRNRG 1416
A PN + K N+ G
Sbjct: 269 NANPNANPNANPNKNNQGNG 288
Score = 37.9 bits (86), Expect = 0.20
Identities = 24/75 (32%), Positives = 28/75 (37%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN +A P NP NP
Sbjct: 167 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNV-DPNANPNANPNANPNANP 224
Query: 1366 GAGRPNNSGGKYNRN 1410
A N N N
Sbjct: 225 NANPNANPNANPNAN 239
>gi|294121|gb|AAA29543.1| (M83165) circumsporozoite protein
[Plasmodium falciparum]
Length = 432
Score = 40.2 bits (92), Expect = 0.040
Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 238 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 295
Query: 1366 GA---GRPNNSGGKYNRNRG 1416
A PN + K N+ G
Sbjct: 296 NANPNANPNANPNKNNQGNG 315
Score = 33.6 bits (75), Expect = 4.0
Identities = 21/80 (26%), Positives = 30/80 (37%)
Frame = +1
Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
P++ + G+G +PN + P +P PN PN + +A P NP N
Sbjct: 108 PKHKKLKQPGDGNPDPNANPNVDPNANPNVDPNANPNVDPNANPN-ANPNANPNANPNAN 166
Query: 1351 PGQKTGAGRPNNSGGKYNRN 1410
P A N N N
Sbjct: 167 PNANPNANPNANPNANPNAN 186
>gi|294123|gb|AAA29544.1| (M83166) circumsporozoite protein
[Plasmodium falciparum] >gi|294127|gb|AAA29546.1|
(M83168) circumsporozoite protein [Plasmodium falciparum]
>gi|294131|gb|AAA29548.1| (M83170) circumsporozoite
protein [Plasmodium falciparum] >gi|294145|gb|AAA29565.1|
(M83152) circumsporozoite protein [Plasmodium falciparum]
>gi|294149|gb|AAA29568.1| (M83155) circumsporozoite
protein [Plasmodium falciparum] >gi|294154|gb|AAA29571.1|
(M83158) circumsporozoite protein [Plasmodium falciparum]
Length = 432
Score = 40.2 bits (92), Expect = 0.040
Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 238 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 295
Query: 1366 GA---GRPNNSGGKYNRNRG 1416
A PN + K N+ G
Sbjct: 296 NANPNANPNANPNKNNQGNG 315
Score = 33.2 bits (74), Expect = 5.2
Identities = 21/80 (26%), Positives = 29/80 (36%)
Frame = +1
Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
P++ + G+G +PN + P +P PN PN +A P NP N
Sbjct: 108 PKHKKLKQPGDGNPDPNANPNVDPNANPNVDPNANPNVDPNANPNV-DPNANPNANPNAN 166
Query: 1351 PGQKTGAGRPNNSGGKYNRN 1410
P A N N N
Sbjct: 167 PNANPNANPNANPNANPNAN 186
>gi|294134|gb|AAA29550.1| (M83172) circumsporozoite protein
[Plasmodium falciparum]
Length = 416
Score = 40.2 bits (92), Expect = 0.040
Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 222 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 279
Query: 1366 GA---GRPNNSGGKYNRNRG 1416
A PN + K N+ G
Sbjct: 280 NANPNANPNANPNKNNQGNG 299
Score = 37.9 bits (86), Expect = 0.20
Identities = 24/75 (32%), Positives = 28/75 (37%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN +A P NP NP
Sbjct: 174 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNV-DPNANPNANPNANPNANP 231
Query: 1366 GAGRPNNSGGKYNRN 1410
A N N N
Sbjct: 232 NANPNANPNANPNAN 246
Score = 33.6 bits (75), Expect = 4.0
Identities = 21/80 (26%), Positives = 30/80 (37%)
Frame = +1
Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
P++ + G+G +PN + P +P PN PN + +A P NP N
Sbjct: 108 PKHKKLKQPGDGNPDPNANPNVDPNANPNVDPNANPNVDPNANPN-ANPNANPNANPNAN 166
Query: 1351 PGQKTGAGRPNNSGGKYNRN 1410
P A N N N
Sbjct: 167 PNANPNANPNANPNANPNAN 186
>gi|117583|sp|P02893|CSP_PLAFA CIRCUMSPOROZOITE PROTEIN PRECURSOR (CS)
>gi|72381|pir||OZZQAF circumsporozoite protein -
Plasmodium falciparum (isolate IMTM22) >gi|160161
(K02194) circumsporozoite protein [Plasmodium falciparum]
Length = 412
Score = 40.2 bits (92), Expect = 0.040
Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 218 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 275
Query: 1366 GA---GRPNNSGGKYNRNRG 1416
A PN + K N+ G
Sbjct: 276 NANPNANPNANPNKNNQGNG 295
Score = 37.9 bits (86), Expect = 0.20
Identities = 24/75 (32%), Positives = 28/75 (37%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN +A P NP NP
Sbjct: 170 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNV-DPNANPNANPNANPNANP 227
Query: 1366 GAGRPNNSGGKYNRN 1410
A N N N
Sbjct: 228 NANPNANPNANPNAN 242
Score = 33.6 bits (75), Expect = 4.0
Identities = 21/80 (26%), Positives = 30/80 (37%)
Frame = +1
Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
P++ + G+G +PN + P +P PN PN + +A P NP N
Sbjct: 108 PKHKKLKQPGDGNPDPNANPNVDPNANPNVDPNANPNVDPNANPN-ANPNANPNANPNAN 166
Query: 1351 PGQKTGAGRPNNSGGKYNRN 1410
P A N N N
Sbjct: 167 PNANPNANPNANPNANPNAN 186
>gi|294151|gb|AAA29569.1| (M83156) circumsporozoite protein
[Plasmodium falciparum]
Length = 452
Score = 40.2 bits (92), Expect = 0.040
Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 258 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 315
Query: 1366 GA---GRPNNSGGKYNRNRG 1416
A PN + K N+ G
Sbjct: 316 NANPNANPNANPNKNNQGNG 335
Score = 34.4 bits (77), Expect = 2.3
Identities = 21/80 (26%), Positives = 30/80 (37%)
Frame = +1
Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
P++ + G+G +PN + P +P PN PN + +A P NP N
Sbjct: 108 PKHKKLKQPGDGNPDPNANPNVDPNANPNVDPNANPNANPNANPN-ANPNANPNANPNAN 166
Query: 1351 PGQKTGAGRPNNSGGKYNRN 1410
P A N N N
Sbjct: 167 PNANPNANPNANPNANPNAN 186
>gi|4493889|emb|CAB38998.1| (AL034558) predicted using hexExon;
MAL3P2.11 (PFC0210c), Circumsporozoite (CS) protein, len:
397 aa; Similarity to many Plasmodium CS proteins.
[Plasmodium falciparum]
Length = 396
Score = 40.2 bits (92), Expect = 0.040
Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 202 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 259
Query: 1366 GA---GRPNNSGGKYNRNRG 1416
A PN + K N+ G
Sbjct: 260 NANPNANPNANPNKNNQGNG 279
Score = 37.9 bits (86), Expect = 0.20
Identities = 24/75 (32%), Positives = 28/75 (37%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN +A P NP NP
Sbjct: 158 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNV-DPNANPNANPNANPNANP 215
Query: 1366 GAGRPNNSGGKYNRN 1410
A N N N
Sbjct: 216 NANPNANPNANPNAN 230
>gi|6226848|sp|P19597|CSP_PLAFO CIRCUMSPOROZOITE PROTEIN PRECURSOR
(CS) >gi|160153 (M83886) circumsporozoite protein
[Plasmodium falciparum] >gi|2276342|emb|CAA33421|
(X15363) circumsporozoite protein (AA 1 - 405)
[Plasmodium falciparum]
Length = 397
Score = 40.2 bits (92), Expect = 0.040
Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 203 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 260
Query: 1366 GA---GRPNNSGGKYNRNRG 1416
A PN + K N+ G
Sbjct: 261 NANPNANPNANPNKNNQGNG 280
Score = 37.9 bits (86), Expect = 0.20
Identities = 24/75 (32%), Positives = 28/75 (37%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN +A P NP NP
Sbjct: 159 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNV-DPNANPNANPNANPNANP 216
Query: 1366 GAGRPNNSGGKYNRN 1410
A N N N
Sbjct: 217 NANPNANPNANPNAN 231
>gi|552190 (M57498) circumsporozoite protein [Plasmodium falciparum]
Length = 393
Score = 40.2 bits (92), Expect = 0.040
Identities = 26/77 (33%), Positives = 33/77 (42%), Gaps = 3/77 (3%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 199 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 256
Query: 1366 GA---GRPNNSGGKYNRNRG 1416
A PN + K N+ G
Sbjct: 257 NANPNANPNANPNKNNQGNG 276
Score = 37.9 bits (86), Expect = 0.20
Identities = 24/75 (32%), Positives = 28/75 (37%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN +A P NP NP
Sbjct: 151 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNV-DPNANPNANPNANPNANP 208
Query: 1366 GAGRPNNSGGKYNRN 1410
A N N N
Sbjct: 209 NANPNANPNANPNAN 223
>gi|224051|prf||1008275A protein,circumsporozoite [Plasmodium sp.]
Length = 100
Score = 39.5 bits (90), Expect = 0.069
Identities = 24/75 (32%), Positives = 30/75 (40%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 24 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 81
Query: 1366 GAGRPNNSGGKYNRN 1410
A N N+N
Sbjct: 82 NANPNANPNANPNKN 96
>gi|294158|gb|AAA29574.1| (M83161) circumsporozoite protein
[Plasmodium falciparum]
Length = 420
Score = 38.7 bits (88), Expect = 0.12
Identities = 26/77 (33%), Positives = 31/77 (39%), Gaps = 3/77 (3%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN A P NP NP
Sbjct: 230 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-----ANPNANPNANPNANP 283
Query: 1366 GA---GRPNNSGGKYNRNRG 1416
A PN + K N+ G
Sbjct: 284 NANPNANPNANPNKNNQGNG 303
Score = 37.9 bits (86), Expect = 0.20
Identities = 24/75 (32%), Positives = 28/75 (37%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN +A P NP NP
Sbjct: 186 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNV-DPNANPNANPNANPNANP 243
Query: 1366 GAGRPNNSGGKYNRN 1410
A N N N
Sbjct: 244 NANPNANPNANPNAN 258
Score = 33.6 bits (75), Expect = 4.0
Identities = 21/80 (26%), Positives = 30/80 (37%)
Frame = +1
Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
P++ + G+G +PN + P +P PN PN + +A P NP N
Sbjct: 108 PKHKKLKQPGDGNPDPNANPNVDPNANPNVDPNANPNVDPNANPN-ANPNANPNANPNAN 166
Query: 1351 PGQKTGAGRPNNSGGKYNRN 1410
P A N N N
Sbjct: 167 PNANPNANPNANPNANPNAN 186
>gi|2462758 (AC002292) putative RNA-binding protein [Arabidopsis
thaliana]
Length = 292
Score = 38.7 bits (88), Expect = 0.12
Identities = 23/71 (32%), Positives = 37/71 (51%), Gaps = 1/71 (1%)
Frame = +1
Query: 904 DNNKHAYHNNHSTADESRELRRFDTLRDDR-GRGQGKHHFKDRLTVSGEAAAKQAHKPFK 1080
D ++ Y + + RE R FD D R R G++ ++DR SG+ + H PF+
Sbjct: 154 DGHRDRYGDRDLERERERE-REFDRYMDGRRDRDGGRYSYRDRFD-SGDKYEPRDHYPFE 211
Query: 1081 QYKPKNDRSLSE 1116
+Y P DR +S+
Sbjct: 212 RYAPPGDRFVSD 223
>gi|283464|pir||A60540 sporozoite surface protein 2 - Plasmodium
yoelii (fragment)
Length = 477
Score = 38.7 bits (88), Expect = 0.12
Identities = 24/84 (28%), Positives = 29/84 (33%)
Frame = +1
Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
P N N N + NPN N P + PNN PN P++ P N
Sbjct: 93 PNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNN-PNNPNNPNNP-----NN 146
Query: 1351 PGQKTGAGRPNNSGGKYNRNRGPR 1422
P PNN N N P+
Sbjct: 147 PNNPNNPNNPNNPNDPSNPNNHPK 170
Score = 37.9 bits (86), Expect = 0.20
Identities = 27/86 (31%), Positives = 33/86 (37%), Gaps = 3/86 (3%)
Frame = +1
Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNA--PNFPSDHATPTFNPY 1344
P N N N + NPN N P + PNN PN P++ P NP
Sbjct: 96 PNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPN-NP- 153
Query: 1345 GNPGQKTGAGRPNNSGGKYN-RNRGPRYP 1428
NP PNN + N + R P P
Sbjct: 154 NNPNNPNDPSNPNNHPKRRNPKRRNPNKP 182
Score = 37.1 bits (84), Expect = 0.35
Identities = 25/88 (28%), Positives = 31/88 (34%)
Frame = +1
Query: 1147 VPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHAT 1326
+P + P N N + NPN N P + PNN PN P++
Sbjct: 67 IPNKIPEKPSNPEEPVNPNDPNDPNNPNNPNNPNNPNNPNNPNNPNNPNN-PNNPNNPNN 125
Query: 1327 PTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
P NP NP PNN N N
Sbjct: 126 PN-NP-NNPNNPNNPNNPNNPNNPNNPN 151
Score = 37.1 bits (84), Expect = 0.35
Identities = 21/68 (30%), Positives = 29/68 (41%)
Frame = +1
Query: 1207 AHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNN 1386
++PNK PNPN + P + P PN PS+ P N NP + + P+N
Sbjct: 198 SNPNK--PNPNEPSNPNKPNPNEPSNPNKPNPNEPSNPNKPNPNEPLNPNEPSNPNEPSN 255
Query: 1387 SGGKYNRN 1410
N N
Sbjct: 256 PNAPSNPN 263
Score = 36.4 bits (82), Expect = 0.60
Identities = 26/82 (31%), Positives = 30/82 (35%), Gaps = 8/82 (9%)
Frame = +1
Query: 1165 TGPRNH--------RNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDH 1320
+ P NH RN PN PNPN + P + P PN PS+
Sbjct: 163 SNPNNHPKRRNPKRRNPNKPKPNKPNPNKPNPNEPSNPNKPNPNEPSNPNKPNPNEPSNP 222
Query: 1321 ATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
P N NP K P N N N
Sbjct: 223 NKPNPNEPSNP-NKPNPNEPLNPNEPSNPN 251
Score = 36.4 bits (82), Expect = 0.60
Identities = 23/80 (28%), Positives = 38/80 (46%), Gaps = 9/80 (11%)
Frame = +1
Query: 1171 PRNHRNAGAGNGAHPNKKSPN----PNTRNTPGQQTRKSPYKYPN-----NAPNFPSDHA 1323
P N ++PNK +PN PN + P + + + PN N P+ P++ +
Sbjct: 219 PSNPNKPNPNEPSNPNKPNPNEPLNPNEPSNPNEPSNPNAPSNPNEPSNPNEPSNPNEPS 278
Query: 1324 TPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
P N NP + + +P+N N N
Sbjct: 279 NP--NEPSNPNEPSNPKKPSNPNEPSNPN 305
Score = 35.6 bits (80), Expect = 1.0
Identities = 24/70 (34%), Positives = 29/70 (41%)
Frame = +1
Query: 1201 NGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRP 1380
N +PN + NPN N P + PNN PN P++ P NP NP P
Sbjct: 92 NPNNPNNPN-NPNNPNNPNNPNNPNNPNNPNN-PNNPNNPNNPN-NP-NNPNNPNNPNNP 147
Query: 1381 NNSGGKYNRN 1410
NN N N
Sbjct: 148 NNPNNPNNPN 157
Score = 35.6 bits (80), Expect = 1.0
Identities = 27/88 (30%), Positives = 32/88 (35%)
Frame = +1
Query: 1147 VPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHAT 1326
+PE S P N N + NPN N P + PNN PN P++
Sbjct: 71 IPEKPSN-PEEPVNPNDPNDPNNPNNPNNPNNPNNPNNPNNPNNPNNPNN-PNNPNNPNN 128
Query: 1327 PTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
P NP NP PNN N N
Sbjct: 129 PN-NP-NNPNNPNNPNNPNNPNNPNNPN 154
Score = 33.2 bits (74), Expect = 5.2
Identities = 24/80 (30%), Positives = 34/80 (42%), Gaps = 1/80 (1%)
Frame = +1
Query: 1171 PRNHRNAGAGNGAHPNKKSPN-PNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYG 1347
P N ++PNK +PN P+ N P +P K PN PN P + P+
Sbjct: 197 PSNPNKPNPNEPSNPNKPNPNEPSNPNKPNPNEPSNPNK-PN--PNEPLNPNEPS----- 248
Query: 1348 NPGQKTGAGRPNNSGGKYNRN 1410
NP + + P+N N N
Sbjct: 249 NPNEPSNPNAPSNPNEPSNPN 269
>gi|548996|sp|Q01443|SSP2_PLAYO SPOROZOITE SURFACE PROTEIN 2 PRECURSOR
>gi|323142|pir||A45559 sporozoite surface protein 2 -
Plasmodium yoelii >gi|160693 (M84732) sporozoite surface
protein [Plasmodium yoelii]
Length = 826
Score = 38.7 bits (88), Expect = 0.12
Identities = 24/84 (28%), Positives = 29/84 (33%)
Frame = +1
Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
P N N N + NPN N P + PNN PN P++ P N
Sbjct: 315 PNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNN-PNNPNNPNNP-----NN 368
Query: 1351 PGQKTGAGRPNNSGGKYNRNRGPR 1422
P PNN N N P+
Sbjct: 369 PNNPNNPNNPNNPNDPSNPNNHPK 392
Score = 37.9 bits (86), Expect = 0.20
Identities = 27/86 (31%), Positives = 33/86 (37%), Gaps = 3/86 (3%)
Frame = +1
Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNA--PNFPSDHATPTFNPY 1344
P N N N + NPN N P + PNN PN P++ P NP
Sbjct: 318 PNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPN-NP- 375
Query: 1345 GNPGQKTGAGRPNNSGGKYN-RNRGPRYP 1428
NP PNN + N + R P P
Sbjct: 376 NNPNNPNDPSNPNNHPKRRNPKRRNPNKP 404
Score = 37.1 bits (84), Expect = 0.35
Identities = 25/88 (28%), Positives = 31/88 (34%)
Frame = +1
Query: 1147 VPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHAT 1326
+P + P N N + NPN N P + PNN PN P++
Sbjct: 289 IPNKIPEKPSNPEEPVNPNDPNDPNNPNNPNNPNNPNNPNNPNNPNNPNN-PNNPNNPNN 347
Query: 1327 PTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
P NP NP PNN N N
Sbjct: 348 PN-NP-NNPNNPNNPNNPNNPNNPNNPN 373
Score = 37.1 bits (84), Expect = 0.35
Identities = 21/68 (30%), Positives = 29/68 (41%)
Frame = +1
Query: 1207 AHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNN 1386
++PNK PNPN + P + P PN PS+ P N NP + + P+N
Sbjct: 420 SNPNK--PNPNEPSNPNKPNPNEPSNPNKPNPNEPSNPNKPNPNEPLNPNEPSNPNEPSN 477
Query: 1387 SGGKYNRN 1410
N N
Sbjct: 478 PNAPSNPN 485
Score = 36.4 bits (82), Expect = 0.60
Identities = 26/82 (31%), Positives = 30/82 (35%), Gaps = 8/82 (9%)
Frame = +1
Query: 1165 TGPRNH--------RNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDH 1320
+ P NH RN PN PNPN + P + P PN PS+
Sbjct: 385 SNPNNHPKRRNPKRRNPNKPKPNKPNPNKPNPNEPSNPNKPNPNEPSNPNKPNPNEPSNP 444
Query: 1321 ATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
P N NP K P N N N
Sbjct: 445 NKPNPNEPSNP-NKPNPNEPLNPNEPSNPN 473
Score = 36.4 bits (82), Expect = 0.60
Identities = 23/80 (28%), Positives = 38/80 (46%), Gaps = 9/80 (11%)
Frame = +1
Query: 1171 PRNHRNAGAGNGAHPNKKSPN----PNTRNTPGQQTRKSPYKYPN-----NAPNFPSDHA 1323
P N ++PNK +PN PN + P + + + PN N P+ P++ +
Sbjct: 441 PSNPNKPNPNEPSNPNKPNPNEPLNPNEPSNPNEPSNPNAPSNPNEPSNPNEPSNPNEPS 500
Query: 1324 TPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
P N NP + + +P+N N N
Sbjct: 501 NP--NEPSNPNEPSNPKKPSNPNEPSNPN 527
Score = 35.6 bits (80), Expect = 1.0
Identities = 24/70 (34%), Positives = 29/70 (41%)
Frame = +1
Query: 1201 NGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRP 1380
N +PN + NPN N P + PNN PN P++ P NP NP P
Sbjct: 314 NPNNPNNPN-NPNNPNNPNNPNNPNNPNNPNN-PNNPNNPNNPN-NP-NNPNNPNNPNNP 369
Query: 1381 NNSGGKYNRN 1410
NN N N
Sbjct: 370 NNPNNPNNPN 379
Score = 35.6 bits (80), Expect = 1.0
Identities = 27/88 (30%), Positives = 32/88 (35%)
Frame = +1
Query: 1147 VPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHAT 1326
+PE S P N N + NPN N P + PNN PN P++
Sbjct: 293 IPEKPSN-PEEPVNPNDPNDPNNPNNPNNPNNPNNPNNPNNPNNPNNPNN-PNNPNNPNN 350
Query: 1327 PTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
P NP NP PNN N N
Sbjct: 351 PN-NP-NNPNNPNNPNNPNNPNNPNNPN 376
Score = 33.2 bits (74), Expect = 5.2
Identities = 24/80 (30%), Positives = 34/80 (42%), Gaps = 1/80 (1%)
Frame = +1
Query: 1171 PRNHRNAGAGNGAHPNKKSPN-PNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYG 1347
P N ++PNK +PN P+ N P +P K PN PN P + P+
Sbjct: 419 PSNPNKPNPNEPSNPNKPNPNEPSNPNKPNPNEPSNPNK-PN--PNEPLNPNEPS----- 470
Query: 1348 NPGQKTGAGRPNNSGGKYNRN 1410
NP + + P+N N N
Sbjct: 471 NPNEPSNPNAPSNPNEPSNPN 491
>gi|1582641|prf||2119210A mucin [Homo sapiens]
Length = 164
Score = 38.3 bits (87), Expect = 0.16
Identities = 26/76 (34%), Positives = 38/76 (49%), Gaps = 4/76 (5%)
Frame = +3
Query: 663 STSAMGIARLPSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHA---- 830
ST++ PS P + TL PTT TT+ P TT P+ STT +T A
Sbjct: 62 STTSAPTTSTPSAPTTSTTL---APTTSTTSAPTTSTTSTPTSSTTSTPQTSTTSASTTS 118
Query: 831 VSTHPATDYRPTPFSQSHTA 890
+++ P T P P + + +A
Sbjct: 119 ITSGPGTTPSPVPTTSTTSA 138
>gi|2135764|pir||I53641 mucin - human (fragment) >gi|945219 (L46721)
mucin [Homo sapiens]
Length = 164
Score = 38.3 bits (87), Expect = 0.16
Identities = 27/76 (35%), Positives = 37/76 (48%), Gaps = 4/76 (5%)
Frame = +3
Query: 663 STSAMGIARLPSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVSTH 842
ST++ PS P + TL PTT TT+ P TT P+ STT +T A +T
Sbjct: 62 STTSAPTTSTPSAPTTSTTL---APTTSTTSAPTTSTTSTPTSSTTSTPQTSTTSASTTS 118
Query: 843 ----PATDYRPTPFSQSHTA 890
P T P P + + +A
Sbjct: 119 ITCGPGTTPSPVPTTSTTSA 138
>gi|6580291|emb|CAB63354.1| (AL032627) cDNA EST EMBL:D65921 comes from
this gene; cDNA EST yk385f3.3 comes from this gene; cDNA
EST yk385f3.5 comes from this gene; cDNA EST EMBL:D66141
comes from this gene; cDNA EST EMBL:D69818 comes from
this gene; cDN...
Length = 373
Score = 37.9 bits (86), Expect = 0.20
Identities = 32/87 (36%), Positives = 41/87 (46%), Gaps = 20/87 (22%)
Frame = +1
Query: 1156 GVSTGPRNHRNAGAGNGAHPNKKSPNPNTRN-TPGQQTRKSPYKYPNNAPNFPSDHATPT 1332
G S + N GNG++ N N N N G + PY P + +P + P
Sbjct: 249 GNSGNGNGNSNGNNGNGSNGNGNGNNGNGNNGNNGNGHPQYPYPVPPHG-YYPPGYPYPP 307
Query: 1333 FNPYGNPGQ---------------KTGAGRPN----NSGGKYNRNRG 1416
PY PG + G G+PN NSGG RNRG
Sbjct: 308 GYPYPPPGAFYYPPGGIPQNGMNGQNGNGQPNIIVINSGGNKKRNRG 354
Score = 32.8 bits (73), Expect = 6.8
Identities = 30/89 (33%), Positives = 40/89 (44%), Gaps = 5/89 (5%)
Frame = +1
Query: 1162 STGPRNH--RNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTF 1335
S+G N+ N+G+GNG N S N N+ N G NNA N + +
Sbjct: 215 SSGNSNYGSNNSGSGNG---NSNSGNGNSGNGNG-----------NNAGNSGNGNG---- 256
Query: 1336 NPYGNPGQKTGAGRPNNSGGKYNRNRG---PRYP 1428
N GN G + N+G N N G P+YP
Sbjct: 257 NSNGNNGNGSNGNGNGNNGNGNNGNNGNGHPQYP 290
>gi|677949 (U20969) Plasmodium falciparum circumsporozoite protein
(CS) gene, complete cds. [Plasmodium falciparum]
Length = 408
Score = 37.9 bits (86), Expect = 0.20
Identities = 24/75 (32%), Positives = 28/75 (37%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN +A P NP NP
Sbjct: 170 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNV-DPNANPNANPNANPNANP 227
Query: 1366 GAGRPNNSGGKYNRN 1410
A N N N
Sbjct: 228 NANPNANPNANPNAN 242
Score = 33.6 bits (75), Expect = 4.0
Identities = 22/78 (28%), Positives = 28/78 (35%)
Frame = +1
Query: 1186 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
N A A+PN +PN N P +P PN PN + N NP +
Sbjct: 246 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNKNNQGNGQGHNMPNNPNRNV 304
Query: 1366 GAGRPNNSGGKYNRNRGP 1419
N+ K N N P
Sbjct: 305 DENANANNAVKNNNNEEP 322
>gi|93140|pir||A40505 early protein EP0 - suid herpesvirus 1 (strain
Indiana-Funkhuser or Becker)
Length = 410
Score = 37.5 bits (85), Expect = 0.27
Identities = 44/176 (25%), Positives = 70/176 (39%), Gaps = 12/176 (6%)
Frame = +1
Query: 862 QRRSAKATLHVNRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVS 1041
QRR+ + VN D ++ H++ + E D G H +D LT +
Sbjct: 228 QRRAPMSHQGVNYIDTSESEAHSDSEVSSPDEE---------DSGASSSGVHTED-LTEA 277
Query: 1042 GEAAAKQAHKPFK-----------QYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRN 1188
E+A Q P + + + + R L G PE S+G +
Sbjct: 278 SESADDQRPAPRRSPRRARRAAVLRREQRRTRCLRRGRTGGQAQGETPEAPSSGEGSSAQ 337
Query: 1189 AGA-GNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
GA G GA P + + R++P S + P+ PS A T P G P +
Sbjct: 338 HGASGAGAGPGSANTAASARSSPSSSPSSS-MRRPS-----PSASAPETAAPRGGPPASS 391
Query: 1366 GAGRPNNS 1389
+G P ++
Sbjct: 392 SSGSPRSA 399
>gi|124138|sp|P29129|ICP0_PRVIF TRANS-ACTING TRANSCRIPTIONAL PROTEIN
ICP0 (EARLY PROTEIN 0) (EP0) >gi|334048 (M57504) EPO
[Pseudorabies virus]
Length = 410
Score = 37.5 bits (85), Expect = 0.27
Identities = 44/176 (25%), Positives = 70/176 (39%), Gaps = 12/176 (6%)
Frame = +1
Query: 862 QRRSAKATLHVNRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVS 1041
QRR+ + VN D ++ H++ + E D G H +D LT +
Sbjct: 228 QRRAPMSHQGVNYIDTSESEAHSDSEVSSPDEE---------DSGASSSGVHTED-LTEA 277
Query: 1042 GEAAAKQAHKPFK-----------QYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRN 1188
E+A Q P + + + + R L G PE S+G +
Sbjct: 278 SESADDQRPAPRRSPRRARRAAVLRREQRRTRCLRRGRTGGQAQGETPEAPSSGEGSSAQ 337
Query: 1189 AGA-GNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
GA G GA P + + R++P S + P+ PS A T P G P +
Sbjct: 338 HGASGAGAGPGSANTAASARSSPSSSPSSS-MRRPS-----PSASAPETAAPRGGPPASS 391
Query: 1366 GAGRPNNS 1389
+G P ++
Sbjct: 392 SSGSPRSA 399
>gi|3875920|emb|CAB04111.1| (Z81503) predicted using Genefinder;
similar to collagen; cDNA EST EMBL:D65450 comes from this
gene; cDNA EST EMBL:D68888 comes from this gene
[Caenorhabditis elegans]
Length = 305
Score = 37.5 bits (85), Expect = 0.27
Identities = 36/92 (39%), Positives = 39/92 (42%), Gaps = 15/92 (16%)
Frame = +1
Query: 1150 PEGVSTGPRNHRNAGA----GNGAH--------PNKKSPN--PNTRNTPGQQTRKSPYKY 1287
P+G P N AGA GN AH P P P PG P
Sbjct: 190 PKGPRGAPGNSGRAGAPGQPGNDAHGYGGGVGAPGPAGPRGAPGPAGHPGSSGGGRPGPA 249
Query: 1288 -PNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRY 1425
P AP P P G+PGQ GRP SGG NR P+Y
Sbjct: 250 GPKGAPGQPGRPG-----PDGHPGQP---GRPGQSGGSGNRGVCPKY 288
>gi|6681425|dbj|BAA88672.1| (AB030233) CiMsi [Ciona intestinalis]
Length = 393
Score = 37.5 bits (85), Expect = 0.27
Identities = 30/97 (30%), Positives = 36/97 (36%), Gaps = 1/97 (1%)
Frame = +1
Query: 1117 GSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPG-QQTRKSPYKYPN 1293
G+ Q Y P + R N GNGA PN S P + G T Y N
Sbjct: 299 GNMGNMQGGYQPGMMGMQGRGVNN---GNGAQPNAASTYPQNPTSYGPMPTSGGGYNQGN 355
Query: 1294 NAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNR 1407
N S A G+ GQK+G G N+ Y R
Sbjct: 356 TGSNNSSGQANTGNTGGGSYGQKSGGGSNNSGYHPYRR 393
>gi|2498734|sp|Q60865|P137_MOUSE GPI-ANCHORED PROTEIN P137 >gi|1098569
(U27838) glycosyl-phosphatidyl-inositol-anchored protein
homolog [Mus musculus]
Length = 656
Score = 37.1 bits (84), Expect = 0.35
Identities = 30/112 (26%), Positives = 38/112 (33%), Gaps = 2/112 (1%)
Frame = +1
Query: 1081 QYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNA--GAGNGAHPNKKSPNPNTRNTP 1254
Q P+ + S + S V G S G R N G NG P+ NTP
Sbjct: 535 QQPPQQNTGFPRSSQPYYNSRGVSRGGSRGARGLMNGYRGPANGFRGGYDGYRPSFSNTP 594
Query: 1255 GQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRG 1416
+S + P + + D F GQ G P GG NRG
Sbjct: 595 NSGYSQSQFTAPRDYSGYQRDGYQQNFK--RGSGQSGPRGAPRGRGGPPRPNRG 646
>gi|6322611|ref|NP_012685.1|YJR151C| Yjr151cp
>gi|1352944|sp|P47179|YJ9P_YEAST HYPOTHETICAL 118.4 KD
PROTEIN IN BAT2-DAL5 INTERGENIC REGION PRECURSOR
>gi|1078284|pir||S57180 probable membrane protein
YJR151c - yeast (Saccharomyces cerevisiae)
>gi|1015903|emb|CAA89684| (Z49651) ORF YJR151c
[Saccharomyces cerevisiae]
Length = 1161
Score = 36.7 bits (83), Expect = 0.46
Identities = 28/83 (33%), Positives = 35/83 (41%), Gaps = 1/83 (1%)
Frame = +3
Query: 651 PQPRSTSAMGIARLPSQPPQTYTLWFRPPTTRTTAWPIDRTTQD-PSRSTTHPTETRKRH 827
P +TS S P T T P T+ T+ P TT P+ STT T T
Sbjct: 162 PTTSTTSTTPTTSTTSTTPTTSTTSTTPTTSTTSTTPTTSTTSTTPTTSTTSTTPTTS-- 219
Query: 828 AVSTHPATDYRPTPFSQSHTARES 899
ST P T PT + S T++ S
Sbjct: 220 TTSTTPTTSTTPTTSTTSTTSQTS 243
Score = 32.8 bits (73), Expect = 6.8
Identities = 25/79 (31%), Positives = 33/79 (41%), Gaps = 5/79 (6%)
Frame = +3
Query: 663 STSAMGIARLPSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTE-----TRKRH 827
+TS S QT T P T+ T+ P TT S ++T PT T
Sbjct: 226 TTSTTPTTSTTSTTSQTSTKSTTPTTSSTSTTPTTSTTPTTSTTSTAPTTSTTSTTSTTS 285
Query: 828 AVSTHPATDYRPTPFSQSHTARES 899
+ST P T + FS S + S
Sbjct: 286 TISTAPTTSTTSSTFSTSSASASS 309
>gi|4758666|ref|NP_004681.1|| LATS (large tumor suppressor,
Drosophila) homolog 1 >gi|4324434|gb|AAD16882| (AF104413)
large tumor suppressor 1 [Homo sapiens]
>gi|5738136|gb|AAD50272.1|AF164041_1 (AF164041) WARTS
protein kinase [Homo sapiens]
Length = 1130
Score = 36.4 bits (82), Expect = 0.60
Identities = 20/88 (22%), Positives = 40/88 (44%)
Frame = +1
Query: 1120 SPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNA 1299
+P+++ + +P+ + RN N N + P ++ P + + P Q + S ++ P
Sbjct: 396 APSSYTNGSIPQSMMVPNRNSHNMELYNISVPGLQTNWPQSSSAPAQSSPSSGHEIPTWQ 455
Query: 1300 PNFPSDHATPTFNPYGNPGQKTGAGRPN 1383
PN P + NP GN + +P+
Sbjct: 456 PNIPV-RSNSFNNPLGNRASHSANSQPS 482
>gi|3172134 (U90209) RNA polymerase II largest subunit [Bonnemaisonia
hamifera]
Length = 1732
Score = 36.4 bits (82), Expect = 0.60
Identities = 33/127 (25%), Positives = 45/127 (34%), Gaps = 4/127 (3%)
Frame = +1
Query: 1048 AAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKS 1227
AA A P QY P R + +P T S GA P+ S
Sbjct: 1591 AAQSPAQSPGVQYSPDKSRVQVQRAPPTAPS-------------AAGGGASRSYSPSSPS 1637
Query: 1228 PNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPY----GNPGQKTGAGRPNNSGG 1395
N +PG + Y ++P S + F+P G T A N +
Sbjct: 1638 YNGRGAASPGANYVAASPGYSPSSPGAYSPSSPAAFSPSSPAAGGYSPSTPAYTANAAAN 1697
Query: 1396 KYNRNRGPRYP 1428
+Y+ R PRYP
Sbjct: 1698 QYSYARSPRYP 1708
>gi|2689219|emb|CAA67983.1| (X99669) dynamin like protein
[Dictyostelium discoideum]
Length = 853
Score = 36.4 bits (82), Expect = 0.60
Identities = 28/115 (24%), Positives = 44/115 (37%), Gaps = 13/115 (11%)
Frame = +1
Query: 1060 QAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPN-- 1233
Q+ PF Q + + G PA Q P ++ GP+N PN+ P+
Sbjct: 569 QSTNPFLQQQQQGQNKYPGGPPAQQQPNQQPNQLNKGPQN---------MPPNQSKPSSI 619
Query: 1234 ----PNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRP------- 1380
PN N + ++ + +F P+ YG + P
Sbjct: 620 PQNGPNNNNNNNNNNNRQDHQQGSFFSSFFRASPDPSLGQYGGANNSNNSNNPTSPINSS 679
Query: 1381 NNSGGKYN 1404
+NSG YN
Sbjct: 680 SNSGNNYN 687
>gi|3064231|gb|AAC14254.1| (AF036460) mucin-like protein
[Trypanosoma cruzi]
Length = 119
Score = 36.4 bits (82), Expect = 0.60
Identities = 30/77 (38%), Positives = 36/77 (45%), Gaps = 6/77 (7%)
Frame = +3
Query: 633 CRCQRRPQPRSTSAMGIARLPSQPPQTYTLWF-RPPTTRTTAWPIDRTTQDPSRSTT--- 800
C +P P +S ++PP T T RPPTT TT TTQ P+ STT
Sbjct: 13 CVADAQPVPEGSSNTTTTTTTTKPPTTTTTTTTRPPTTTTTT-----TTQAPTTSTTTAP 67
Query: 801 --HPTETRKRHAVSTHPATDYRP 863
T T + AVST A P
Sbjct: 68 EAPSTTTTEAPAVSTTRAPSRLP 90
>gi|112945|sp|P14196|AAC2_DICDI AAC-RICH MRNA CLONE AAC11 PROTEIN
>gi|84120|pir||S05355 hypothetical protein (clone AAC11)
- slime mold (Dictyostelium discoideum) (fragment)
>gi|7174|emb|CAA34529| (X16522) coding region (AA 448)
[Dictyostelium discoideum]
Length = 448
Score = 36.4 bits (82), Expect = 0.60
Identities = 51/210 (24%), Positives = 79/210 (37%), Gaps = 4/210 (1%)
Frame = +1
Query: 781 TQVEALRTQLKLEKDMPLALTLQPIIGQRRSAKATLHVNRNDNNKHAYHNNHST--ADES 954
T + L ++ + +P QPI + ++N N+NN + +NN+++ S
Sbjct: 95 TNLNGLSLAIQNQSSLP-----QPINNNNNNNNNNSNINNNNNNSNNNNNNNNSNLGINS 149
Query: 955 RELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATF 1134
+ D R RG + R E K + PK D EG+P
Sbjct: 150 SPTQSSANSADKRSRG------RPRKNPPSEPKDTSGPKRKRGRPPKMD---EEGNP--- 197
Query: 1135 QSWYVPEGVSTGPRNH--RNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNF 1308
Q VP+ S R + + N + NT TP ++ R P K +P+
Sbjct: 198 QPKPVPQPGSNKKRGRPKKPKDENESDYNNTSFSDSNTDGTPKKRGR--PPKAKGESPS- 254
Query: 1309 PSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
A+PT N GN G NN+ N N
Sbjct: 255 ----ASPTHNTLGN-----GILNSNNNNNNNNNN 279
>gi|969095 (U31961) no-on transient A-like protein [Drosophila
melanogaster]
Length = 642
Score = 36.4 bits (82), Expect = 0.60
Identities = 26/82 (31%), Positives = 37/82 (44%), Gaps = 4/82 (4%)
Frame = +1
Query: 1177 NHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSP-YKYPNNAPNFPSDHAT-PTFNPY-G 1347
N +AG G PN + + G+Q + P ++ PN P+ +A N Y G
Sbjct: 139 NELSAGGGGQNQPNHSNKGQGNQGDQGEQGNQGPNFRGRGGGPNQPNQNANQEQSNGYPG 198
Query: 1348 NPG-QKTGAGRPNNSGGKYNRNRGPR 1422
N G K G G+ GGK+ R R
Sbjct: 199 NQGDNKGGQGQRGAGGGKHQRGNRSR 224
>gi|1082604|pir||S53363 mucin 5AC (clone JER58) - human (fragment)
>gi|563377|emb|CAA84032| (Z34278) mucin [Homo sapiens]
Length = 279
Score = 36.4 bits (82), Expect = 0.60
Identities = 27/75 (36%), Positives = 35/75 (46%)
Frame = +3
Query: 657 PRSTSAMGIARLPSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVS 836
P +++ G PS P T T PTTRTT+ P TT + STT ET R +
Sbjct: 4 PTTSTTSGPGTTPSPVPTTSTT--SAPTTRTTSAPKSSTTSAATTSTTSGPETTPRPVPT 61
Query: 837 THPATDYRPTPFSQS 881
T +T PT + S
Sbjct: 62 T--STTSSPTTSTTS 74
Score = 34.4 bits (77), Expect = 2.3
Identities = 20/52 (38%), Positives = 27/52 (51%)
Frame = +3
Query: 735 PTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVSTHPATDYRPTPFSQSHTA 890
PTT TT+ P RTT P STT T T + ++ P T P P + + +A
Sbjct: 100 PTTSTTSAPTTRTTSAPISSTTSATTT----STTSGPGTTPSPVPTTSTTSA 147
Score = 33.2 bits (74), Expect = 5.2
Identities = 24/78 (30%), Positives = 36/78 (45%), Gaps = 1/78 (1%)
Frame = +3
Query: 657 PRSTSAMGIARLPSQPPQTYTLWFRP-PTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAV 833
P+S++ + P+T RP PTT TT+ P TT P+ STT + T
Sbjct: 36 PKSSTTSAATTSTTSGPETTP---RPVPTTSTTSSPTTSTTSAPTTSTTSASTTSTTSGA 92
Query: 834 STHPATDYRPTPFSQSHTA 890
T P+ P P + + +A
Sbjct: 93 GTTPS----PVPTTSTTSA 107
>gi|4324420|gb|AAD16881| (AF104350) prespore-specific protein
[Dictyostelium discoideum]
Length = 1231
Score = 36.4 bits (82), Expect = 0.60
Identities = 32/149 (21%), Positives = 54/149 (35%), Gaps = 1/149 (0%)
Frame = +1
Query: 964 RRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKND-RSLSEGSPATFQS 1140
R F +LRDD G KH+++ ++ + A + K K N+ +L+ +P
Sbjct: 781 RLFGSLRDDIG----KHNYQQNASLFFDFATFLSKKSNKNLGDINNLNNLNNNNP----- 831
Query: 1141 WYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDH 1320
+ P N+ PN + NPN N + NN N +++
Sbjct: 832 -------NNNPNNN----------PNNNNNNPNNNNNNNNNNNNNNNNNNNNNNNNNNNN 874
Query: 1321 ATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
+N + N NN+ N N
Sbjct: 875 NNTNYNNFNNTNNNNNNSNKNNNNNNNNNN 904
>gi|2497729|sp|Q49537|VLPE_MYCHR VARIANT SURFACE ANTIGEN E PRECURSOR
(VLPE PROLIPOPROTEIN) >gi|1039437 (U35016) VlpE
prolipoprotein [Mycoplasma hyorhinis]
>gi|1583723|prf||2121355B Vlp surface protein [Mycoplasma
hyorhinis]
Length = 243
Score = 36.4 bits (82), Expect = 0.60
Identities = 22/108 (20%), Positives = 40/108 (36%)
Frame = +1
Query: 1096 NDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKS 1275
N + G+ ++ S P+G + P N + N + +P N T
Sbjct: 75 NQSGSASGNGSSNSSVSTPDGQHSNPSNPTTSDPKESNPSNPTTSDPKESNPSNPTTSDG 134
Query: 1276 PYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGP 1419
+ P+N + P+ NP + GQ + P S G+++ P
Sbjct: 135 QHSNPSNPTTSDPKESNPS-NPTTSDGQHSNPSNPTTSDGQHSNPSNP 181
>gi|2114108|dbj|BAA20059| (AB003911) OX40 precursor [Oryctolagus
cuniculus]
Length = 267
Score = 36.0 bits (81), Expect = 0.79
Identities = 24/52 (46%), Positives = 32/52 (61%), Gaps = 5/52 (9%)
Frame = +3
Query: 642 QRRPQPRSTSAMGIAR----LPSQPPQTYTLWFRPPTTRT-TAWPIDRTTQDPSRST 797
+R QP S+ + + L +QP +T + +RPPT RT TAWP RT Q PS T
Sbjct: 146 KRTLQPASSISDAVCEDRSSLATQPWETPSAPYRPPTARTSTAWP--RTAQGPSTPT 200
>gi|82698|pir||JQ0985 hydroxyproline-rich glycoprotein precursor -
maize >gi|257041|bbs|115226 (S45164) hydroxyproline-rich
glycoprotein, HRGP [maize, Peptide, 328 aa] [Zea mays]
>gi|4007865|emb|CAA10387| (AJ131535) Hydroxyproline-rich
Glycoprotein (HRGP) [Zea mays]
Length = 328
Score = 36.0 bits (81), Expect = 0.79
Identities = 26/80 (32%), Positives = 35/80 (43%)
Frame = +3
Query: 648 RPQPRSTSAMGIARLPSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRH 827
+P P + + P P TYT +PPT + T + + P+ T PT T
Sbjct: 243 KPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPTPK 302
Query: 828 AVSTHPATDYRPTPFSQSHT 887
+T P T Y PTP SHT
Sbjct: 303 PPATKPPT-YTPTP-PVSHT 320
Score = 32.8 bits (73), Expect = 6.8
Identities = 20/59 (33%), Positives = 27/59 (44%)
Frame = +3
Query: 693 PSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVSTHPATDYRPTP 869
P P TYT +PPT + T + + P+ T PT T +T P T +PTP
Sbjct: 138 PKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPATKPPTP-KPTP 195
>gi|4220540|emb|CAA23013| (AL035356) hypothetical protein [Arabidopsis
thaliana]
Length = 319
Score = 36.0 bits (81), Expect = 0.79
Identities = 44/170 (25%), Positives = 68/170 (39%), Gaps = 20/170 (11%)
Frame = +1
Query: 907 NNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQY 1086
+N A +NH +S E +RFD D FK T + + +H+
Sbjct: 40 SNPLAETSNHQ--QDSFETQRFDYYTDPMAAYSS---FKKNKTPKQQYISSPSHQGSSPV 94
Query: 1087 KPKNDRSLSEGSPAT-----------FQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPN 1233
P+ S+ GS + + Y P G++ +HR AG N P
Sbjct: 95 PPQFPPSVPPGSLCSEYQAQTNHGGFHAAHYEPRGMAHLSPSHRGPPAGWN---NNFRPP 151
Query: 1234 PNTRNTPGQQTRKSPYKYPNNAPNFPSD---------HATPTFNPYGNPGQKTGAGRPNN 1386
P + P Q + P+ + PN ++ + P F+ YG G N
Sbjct: 152 PVNHSGPPQWVPR-PFPFSQEMPNMGNNRFGGRGSYNNTPPQFSNYGRQNANWGGNTYPN 210
Query: 1387 SGGKYNRNRG 1416
SG +R RG
Sbjct: 211 SGRGRSRGRG 220
>gi|5114426|gb|AAD40313.1|AF157503_1 (AF157503) chitinase 1 [Penaeus
monodon]
Length = 620
Score = 36.0 bits (81), Expect = 0.79
Identities = 18/36 (50%), Positives = 23/36 (63%), Gaps = 2/36 (5%)
Frame = +3
Query: 693 PSQPPQTYTLWFRPPTTRTTAW--PIDRTTQDPSRSTT 800
P+ PP T + W+ PPTT TT I TT+DP+ TT
Sbjct: 423 PTLPPTTTSPWWTPPTTTTTTRDPSITTTTRDPNLPTT 460
>gi|228937|prf||1814452B Hyp-rich glycoprotein [Zea mays]
Length = 327
Score = 36.0 bits (81), Expect = 0.79
Identities = 26/80 (32%), Positives = 35/80 (43%)
Frame = +3
Query: 648 RPQPRSTSAMGIARLPSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRH 827
+P P + + P P TYT +PPT + T + + P+ T PT T
Sbjct: 242 KPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPTPK 301
Query: 828 AVSTHPATDYRPTPFSQSHT 887
+T P T Y PTP SHT
Sbjct: 302 PPATKPPT-YTPTP-PVSHT 319
Score = 32.8 bits (73), Expect = 6.8
Identities = 20/59 (33%), Positives = 27/59 (44%)
Frame = +3
Query: 693 PSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVSTHPATDYRPTP 869
P P TYT +PPT + T + + P+ T PT T +T P T +PTP
Sbjct: 137 PKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPATKPPTP-KPTP 194
>gi|106291|pir||S16681 homeotic protein - human
Length = 316
Score = 35.6 bits (80), Expect = 1.0
Identities = 36/143 (25%), Positives = 60/143 (41%), Gaps = 3/143 (2%)
Frame = +1
Query: 988 DRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVST 1167
D+G G+ +D E + H P +Q +P S + + ++ P G +T
Sbjct: 169 DKGSGRRLRTLRDSDPEEDEDEDDEDHFPLQQRRPW---STASSDCSVGRTGIAPRGPAT 225
Query: 1168 GPRNHRNAGAGNGAHPNKKSPNP-NTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPY 1344
PR R+ A + + P + +P P + PG T P P + A P P+
Sbjct: 226 SPRPSRSPAAQDRSRPARSAPGPAASPGGPGAWTH----------PARPREQARPP--PH 273
Query: 1345 GNPGQKTGAG--RPNNSGGKYNRNRG 1416
G P + GAG R + G++ +G
Sbjct: 274 G-PLAQAGAGGIRRGSGPGRFPFKQG 298
>gi|1170758|sp|P38486|LEG3_CANFA GALECTIN-3 (GALACTOSE-SPECIFIC LECTIN
3) (MAC-2 ANTIGEN) (IGE-BINDING PROTEIN) (35 KD LECTIN)
(CARBOHYDRATE BINDING PROTEIN 35) (CBP 35)
(LAMININ-BINDING PROTEIN) (LECTIN L-29)
Length = 296
Score = 35.6 bits (80), Expect = 1.0
Identities = 36/111 (32%), Positives = 43/111 (38%), Gaps = 10/111 (9%)
Frame = +1
Query: 1096 NDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGN-------GAHPNKKSPN--PNTRN 1248
ND G+P Q W GP ++ AGAG GA+P + P P
Sbjct: 8 NDALSGSGNPNP-QGW-------PGPWGNQPAGAGGYPGASYPGAYPGQAPPGGYPGQAP 59
Query: 1249 TPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNS-GGKYNRNRGPRY 1425
G + P YP AP P P G PGQ G P + G Y P Y
Sbjct: 60 PGGYPGQAPPGGYPGQAPPGGYPGQAP---PGGYPGQAPPGGYPGQAPPGTYPGPTAPAY 116
Query: 1426 P 1428
P
Sbjct: 117 P 117
>gi|1724014|sp|P54604|YHCT_BACSU HYPOTHETICAL 33.7 KD PROTEIN IN
CSPB-GLPP INTERGENIC REGION >gi|1239996|emb|CAA65704.1|
(X96983) hypothetical protein [Bacillus subtilis]
>gi|2633244|emb|CAB12749| (Z99108) similar to
hypothetical proteins [Bacillus subtilis]
Length = 302
Score = 35.6 bits (80), Expect = 1.0
Identities = 28/81 (34%), Positives = 44/81 (53%), Gaps = 10/81 (12%)
Frame = +1
Query: 85 LHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSVKSGDKIELDGRSFVASA----- 249
L VL A S+ ++ +S+ IKVN + M VK GD++ +D + AS+
Sbjct: 21 LFSVLKTALKASKPVIQDWMSHQQIKVNHESVLNNMIVKKGDRVFIDLQESEASSVIPEY 80
Query: 250 -----LTEPARVLIYNKPEGEVTTREDPEGR 327
L E +LI NKP G + T + +G+
Sbjct: 81 GELDILFEDNHMLIINKPAG-IATHPNEDGQ 110
>gi|3242649|dbj|BAA29028.1| (AB015440) alpha 1 type I collagen [Rana
catesbeiana]
Length = 1445
Score = 35.2 bits (79), Expect = 1.4
Identities = 31/83 (37%), Positives = 35/83 (41%), Gaps = 7/83 (8%)
Frame = +1
Query: 1150 PEGVSTGPRNHRNA----GAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPN--NAPNFP 1311
P+G S GP + A GA A P + NP T PG K P AP FP
Sbjct: 342 PQG-SRGPDGPQGARGEPGAPGQAGPAGSAGNPGTDGQPGA---KGATGAPGIAGAPGFP 397
Query: 1312 SDHATP-TFNPYGNPGQKTGAGRPNNSGGK 1398
P P G+PG K G P G K
Sbjct: 398 GARGAPGPQGPGGSPGPKGNNGEPGAQGNK 427
>gi|4808164|emb|CAB42795.1| (Y18878) largest subunit of the RNA
polymerase II complex [Drosophila guanche]
Length = 1889
Score = 35.2 bits (79), Expect = 1.4
Identities = 41/127 (32%), Positives = 55/127 (43%), Gaps = 24/127 (18%)
Frame = +1
Query: 1048 AAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGP-----RNHRNAGAGNGAH 1212
+AA A + P + S S SPA S Y P S P + A + GA
Sbjct: 1537 SAASDASGMSPSWSPAHPGS-SPSSPAPSMSPYYPASPSVSPSYSPTSPNYTASSPGGAS 1595
Query: 1213 PNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNF--------PS----DHATPTFNPYGN-P 1353
PN +PN T SP +Y + PNF PS +P ++P N P
Sbjct: 1596 PNYSPSSPNYSPTSPLYAAPSP-RYASTTPNFNPQSTGYSPSASGYSPTSPVYSPTSNFP 1654
Query: 1354 GQKTGAG------RPNNSGGKYNRNRGPRYP 1428
+ AG P N+ + N P P
Sbjct: 1655 SSPSFAGSGSNMYSPGNAYSPSSTNYSPNSP 1685
>gi|4808166|emb|CAB42797.1| (Y18879) largest subunit of the RNA
polymerase II complex [Drosophila pseudoobscura]
Length = 1811
Score = 35.2 bits (79), Expect = 1.4
Identities = 41/127 (32%), Positives = 55/127 (43%), Gaps = 24/127 (18%)
Frame = +1
Query: 1048 AAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGP-----RNHRNAGAGNGAH 1212
+AA A + P + S S SPA S Y P S P + A + GA
Sbjct: 1459 SAASDASGMSPSWSPAHPGS-SPSSPAPSMSPYYPASPSVSPSYSPTSPNYTASSPGGAS 1517
Query: 1213 PNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNF--------PS----DHATPTFNPYGN-P 1353
PN +PN T SP +Y + PNF PS +P ++P N P
Sbjct: 1518 PNYSPSSPNYSPTSPLYAAASP-RYASTTPNFNPQSTGYSPSASGYSPTSPVYSPTSNFP 1576
Query: 1354 GQKTGAG------RPNNSGGKYNRNRGPRYP 1428
+ AG P N+ + N P P
Sbjct: 1577 SSPSFAGSGSNMYSPGNAYSPSSTNYSPNSP 1607
>gi|1589837 (U68729) cuticle preprocollagen [Meloidogyne incognita]
Length = 308
Score = 35.2 bits (79), Expect = 1.4
Identities = 24/72 (33%), Positives = 35/72 (48%), Gaps = 4/72 (5%)
Frame = +1
Query: 1213 PNKKSPNPNTRNTPGQ--QTRKSPYKYPNNAPNFPSDHATPTF-NPYGNPGQKTGAGRPN 1383
P +S NP +PGQ +++++P + P P P P G PGQ G G+P
Sbjct: 121 PPGRSGNPGAPGSPGQPGKSQQAPCQQVTTPPCKPCPGGQPGQPGPPGPPGQPGGPGQPG 180
Query: 1384 NSGGKYNRNR-GPRYP 1428
GG+ + GP P
Sbjct: 181 QPGGQASPGEPGPAGP 196
>gi|563375|emb|CAA84031| (Z34277) mucin [Homo sapiens]
Length = 477
Score = 35.2 bits (79), Expect = 1.4
Identities = 28/97 (28%), Positives = 41/97 (41%)
Frame = +3
Query: 603 RQHRLTRLVSCRCQRRPQPRSTSAMGIARLPSQPPQTYTLWFRPPTTRTTAWPIDRTTQD 782
++ R T LV+ PQ +TSA PTT TT+ P TT
Sbjct: 154 QKSRTTTLVTTSTTSTPQTSTTSA--------------------PTTSTTSAPTTSTTSA 193
Query: 783 PSRSTTHPTETRKRHAVSTHPATDYRPTPFSQSHTAR 893
P+ STT +T ++S+ P + P S + +AR
Sbjct: 194 PTTSTTSTPQT----SISSAPTSSTTSAPTSSTISAR 226
Score = 34.0 bits (76), Expect = 3.0
Identities = 25/65 (38%), Positives = 32/65 (48%), Gaps = 2/65 (3%)
Frame = +3
Query: 705 PQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAV--STHPATDYRPTPFSQ 878
P T T F PTT TT+ TT P+ STT +T K A ST + P+P +
Sbjct: 234 PTTSTTSF--PTTSTTSATTTSTTSAPTSSTTSTPQTSKTSAATSSTTSGSGTTPSPVTT 291
Query: 879 SHTARES 899
+ TA S
Sbjct: 292 TSTASVS 298
>gi|1519696 (U67956) coded for by C. elegans cDNA yk126f9.5; coded
for by C. elegans cDNA yk159h6.3; coded for by C.
elegans cDNA yk126f9.3; coded for by C. elegans cDNA
yk159h6.5 [Caenorhabditis elegans]
Length = 1229
Score = 35.2 bits (79), Expect = 1.4
Identities = 23/78 (29%), Positives = 32/78 (40%)
Frame = +3
Query: 654 QPRSTSAMGIARLPSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAV 833
QP ST+ LP QT T PTT + + T +T T T K+ +
Sbjct: 554 QPTSTAESTTTALPFTTEQTVTT--EEPTTAEKSTATQKPTTTQESVSTEKTSTTKKAST 611
Query: 834 STHPATDYRPTPFSQSHT 887
+ P T PT ++S T
Sbjct: 612 TEEPTTTDEPTTTTESST 629
>gi|1184072 (U40766) COL-1 [Meloidogyne incognita]
Length = 309
Score = 35.2 bits (79), Expect = 1.4
Identities = 24/72 (33%), Positives = 35/72 (48%), Gaps = 4/72 (5%)
Frame = +1
Query: 1213 PNKKSPNPNTRNTPGQ--QTRKSPYKYPNNAPNFPSDHATPTF-NPYGNPGQKTGAGRPN 1383
P +S NP +PGQ +++++P + P P P P G PGQ G G+P
Sbjct: 122 PPGRSGNPGAPGSPGQPGKSQQAPCQQVTTPPCKPCPGGQPGQPGPPGPPGQPGGPGQPG 181
Query: 1384 NSGGKYNRNR-GPRYP 1428
GG+ + GP P
Sbjct: 182 QPGGQASPGEPGPAGP 197
>gi|2245693|gb|AAB62577.1| (AF010328) short form of CHIP [Drosophila
melanogaster]
Length = 365
Score = 34.8 bits (78), Expect = 1.8
Identities = 24/94 (25%), Positives = 38/94 (39%), Gaps = 6/94 (6%)
Frame = +1
Query: 1144 YVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHA 1323
Y P G S+ P +++ N P +P + P + P N N+P
Sbjct: 65 YPPAGQSS-PAGNQSIVFQNSNQPGSNTPQYTSSPAPSGSSTPGPVGAQNIPGNYPQSAT 123
Query: 1324 TPTFN-----PYGNPGQKTGA-GRPNNSGGKYNRNRGPRY 1425
FN P+G+P G RP +SG +N + +
Sbjct: 124 AGNFNGPVGGPFGSPSSGLGQFSRPASSGTPFNSGQAGHF 163
>gi|4324436|gb|AAD16883| (AF104414) large tumor suppressor 1 [Mus
musculus]
Length = 962
Score = 34.8 bits (78), Expect = 1.8
Identities = 30/128 (23%), Positives = 52/128 (40%), Gaps = 6/128 (4%)
Frame = +1
Query: 1000 GQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLS------EGSPATFQSWYVPEGV 1161
G G+ F V + +Q P+ P N +S S +P +F + VP+ +
Sbjct: 183 GGGQSDFIVHQNVPTGSVTRQPPPPYP-LTPANGQSPSALQTGASAAPPSFANGNVPQSM 241
Query: 1162 STGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNP 1341
RN N N P ++ P + + P Q + ++ P PN P + NP
Sbjct: 242 MVPNRNSHNMELYNINVPGLQTAWPQSSSAPAQSSPSGGHEIPTWQPNIPV-RSNSFNNP 300
Query: 1342 YGNPGQKTGAGRPN 1383
G+ + +P+
Sbjct: 301 LGSRASHSANSQPS 314
>gi|3877422|emb|CAB05195.1| (Z82268) predicted using Genefinder;
similar to CUTICLE COLLAGEN 34; cDNA EST EMBL:D65629
comes from this gene; cDNA EST EMBL:D68754 comes from
this gene; cDNA EST EMBL:D68791 comes from this gene;
cDNA EST EMBL:D68988 comes ...
Length = 304
Score = 34.8 bits (78), Expect = 1.8
Identities = 26/85 (30%), Positives = 33/85 (38%)
Frame = +1
Query: 1150 PEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATP 1329
P+G S P N AGA P + + + PGQ + P P +P P P
Sbjct: 190 PKGASGAPGNPGQAGAPG--QPGADAQSESIPGAPGQAGPQGP-PGPAGSPGAPGGPGQP 246
Query: 1330 TFNPYGNPGQKTGAGRPNNSGGKYN 1404
G PGQK +G P G N
Sbjct: 247 -----GAPGQKGPSGAPGQPGADGN 266
>gi|1280114|gb|AAC25888.1| (U55373) Similar to cuticular collagen;
coded for by C. elegans cDNA yk92h9.3; coded for by C.
elegans cDNA yk100f8.5; coded for by C. elegans cDNA
yk123h6.5; coded for by C. elegans cDNA yk125b5.5; coded
for by C. elegans cDN...
Length = 289
Score = 34.8 bits (78), Expect = 1.8
Identities = 28/81 (34%), Positives = 33/81 (40%), Gaps = 3/81 (3%)
Frame = +1
Query: 1150 PEGVSTGPRNHRNAGA-GNGAHPNKKSPN--PNTRNTPGQQTRKSPYKYPNNAPNFPSDH 1320
P G GP + G GN P P P T PGQ R P P P
Sbjct: 176 PPGGPGGPGEGGDGGRPGNPGRPGPAGPRGEPGTEYKPGQPGRPGP-------PG-PRGE 227
Query: 1321 ATPTFNPYGNPGQKTGAGRPNNSG 1392
A P P G+PG +G+P N+G
Sbjct: 228 AGPAGQP-GSPGNDGESGKPGNAG 250
>gi|2245687|gb|AAB62574.1| (AF010325) CHIP [Drosophila melanogaster]
Length = 577
Score = 34.8 bits (78), Expect = 1.8
Identities = 24/94 (25%), Positives = 38/94 (39%), Gaps = 6/94 (6%)
Frame = +1
Query: 1144 YVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHA 1323
Y P G S+ P +++ N P +P + P + P N N+P
Sbjct: 65 YPPAGQSS-PAGNQSIVFQNSNQPGSNTPQYTSSPAPSGSSTPGPVGAQNIPGNYPQSAT 123
Query: 1324 TPTFN-----PYGNPGQKTGA-GRPNNSGGKYNRNRGPRY 1425
FN P+G+P G RP +SG +N + +
Sbjct: 124 AGNFNGPVGGPFGSPSSGLGQFSRPASSGTPFNSGQAGHF 163
>gi|119712|sp|P14918|EXTN_MAIZE EXTENSIN PRECURSOR (PROLINE-RICH
GLYCOPROTEIN) >gi|100863|pir||S08314 cell wall
glycoprotein - maize >gi|22508|emb|CAA31854| (X13499)
cell wall protein (AA 1-267) [Zea mays] >gi|168455
(M36912) cell wall protein (put.); putative [Zea mays]
>gi|226756|prf||1604465A cell wall protein [Zea mays]
Length = 267
Score = 34.4 bits (77), Expect = 2.3
Identities = 28/80 (35%), Positives = 36/80 (45%), Gaps = 5/80 (6%)
Frame = +3
Query: 648 RPQPRSTSAMGIARLPSQPPQTYTLWFRPPTTRT-----TAWPIDRTTQDPSRSTTHPTE 812
+P P + + P P TYT +PPT + T P T+ P+ T PT
Sbjct: 177 KPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPATKPPTPKPTPPTY 236
Query: 813 TRKRHAVSTHPATDYRPTPFSQSHT 887
T +T P T Y PTP SHT
Sbjct: 237 TPTPKPPATKPPT-YTPTP-PVSHT 259
Score = 33.2 bits (74), Expect = 5.2
Identities = 21/67 (31%), Positives = 31/67 (45%)
Frame = +3
Query: 693 PSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVSTHPATDYRPTPF 872
P P TYT +PPT + T + + P+ T PT T +T P T +PTP
Sbjct: 176 PKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPATKPPTP-KPTPP 234
Query: 873 SQSHTAR 893
+ + T +
Sbjct: 235 TYTPTPK 241
>gi|227614|prf||1707318A Thr rich extensin [Zea mays]
Length = 251
Score = 34.4 bits (77), Expect = 2.3
Identities = 28/80 (35%), Positives = 36/80 (45%), Gaps = 5/80 (6%)
Frame = +3
Query: 648 RPQPRSTSAMGIARLPSQPPQTYTLWFRPPTTRT-----TAWPIDRTTQDPSRSTTHPTE 812
+P P + + P P TYT +PPT + T P T+ P+ T PT
Sbjct: 161 KPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPATKPPTPKPTPPTY 220
Query: 813 TRKRHAVSTHPATDYRPTPFSQSHT 887
T +T P T Y PTP SHT
Sbjct: 221 TPTPKPPATKPPT-YTPTP-PVSHT 243
Score = 33.2 bits (74), Expect = 5.2
Identities = 21/67 (31%), Positives = 31/67 (45%)
Frame = +3
Query: 693 PSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVSTHPATDYRPTPF 872
P P TYT +PPT + T + + P+ T PT T +T P T +PTP
Sbjct: 160 PKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPATKPPTP-KPTPP 218
Query: 873 SQSHTAR 893
+ + T +
Sbjct: 219 TYTPTPK 225
>gi|3810839|emb|CAA21800| (AL032684) conserved hypothetical
zinc-finger protein [Schizosaccharomyces pombe]
Length = 482
Score = 34.4 bits (77), Expect = 2.3
Identities = 39/154 (25%), Positives = 62/154 (39%), Gaps = 1/154 (0%)
Frame = +1
Query: 841 TLQPIIGQRRSAKATLHVNRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHF 1020
TL P ++R +A + N+K++ + T+D++ R+D G + F
Sbjct: 330 TLNPDYQKQREIEAVVKSVLGSNSKNS--DKVGTSDDNNTPMSEKRKREDDD-ANGPNKF 386
Query: 1021 KDRLT-VSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGA 1197
R + V +A A+ A K +G PA F + +P G+ P NA A
Sbjct: 387 AARSSAVFSKATAEPAFKSAMAIPDMPSMPHVQGFPAPFPPFMMP-GLPQMPPMMMNAIA 445
Query: 1198 GNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAP 1302
G H N+ P N+R P + P N P
Sbjct: 446 GQVYHNNRNPPRTNSR--PSNASVPPPSSLHKNPP 478
>gi|228938|prf||1814452C Hyp-rich glycoprotein [Zea diploperennis]
Length = 349
Score = 34.4 bits (77), Expect = 2.3
Identities = 28/80 (35%), Positives = 36/80 (45%), Gaps = 5/80 (6%)
Frame = +3
Query: 648 RPQPRSTSAMGIARLPSQPPQTYTLWFRPPTTRT-----TAWPIDRTTQDPSRSTTHPTE 812
+P P + + P P TYT +PPT + T P T+ P+ T PT
Sbjct: 259 KPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPATKPPTPKPTPPTY 318
Query: 813 TRKRHAVSTHPATDYRPTPFSQSHT 887
T +T P T Y PTP SHT
Sbjct: 319 TPTPKPPATKPPT-YTPTP-PVSHT 341
>gi|1082938|pir||A49688 lactose-binding lectin L-29 - dog
Length = 294
Score = 34.4 bits (77), Expect = 2.3
Identities = 30/87 (34%), Positives = 36/87 (40%), Gaps = 10/87 (11%)
Frame = +1
Query: 1168 GPRNHRNAGAGN-------GAHPNKKSPN--PNTRNTPGQQTRKSPYKYPNNAPNFPSDH 1320
GP ++ AGAG GA+P + P P G + P YP AP
Sbjct: 22 GPWGNQPAGAGGYPGASYPGAYPGQAPPGGYPGQAPPGGYPGQAPPGGYPGQAPPGGYPG 81
Query: 1321 ATPTFNPYGNPGQKTGAGRPNNS-GGKYNRNRGPRYP 1428
P P G PGQ G P + G Y P YP
Sbjct: 82 QAP---PGGYPGQAPPGGYPGQAPPGTYPGPTAPAYP 115
>gi|3851592 (AF093575) surface antigen ariel1 [Entamoeba histolytica]
Length = 215
Score = 34.4 bits (77), Expect = 2.3
Identities = 25/102 (24%), Positives = 40/102 (38%)
Frame = +1
Query: 1093 KNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRK 1272
+ D+ S S S P+ S N + N + NK PN ++ N P + +
Sbjct: 47 EEDKKSSSNSELDENSNNQPDESSNNKPNESSDNKPNESSDNK--PNESSNNKPSESSNN 104
Query: 1273 SPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGK 1398
P + NN PN SD+ P + P + + +S K
Sbjct: 105 KPDESSNNKPNESSDN-KPNESSNNKPNESSNNKPSESSNNK 145
Score = 33.2 bits (74), Expect = 5.2
Identities = 22/80 (27%), Positives = 35/80 (43%)
Frame = +1
Query: 1150 PEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATP 1329
P+ S N + N + NK PN ++ N P + + P + NN PN SD+ P
Sbjct: 106 PDESSNNKPNESSDNKPNESSNNK--PNESSNNKPSESSNNKPDESSNNKPNESSDN-KP 162
Query: 1330 TFNPYGNPGQKTGAGRPNNS 1389
+ P + + +PN S
Sbjct: 163 NESSNNKPNESSD-NKPNES 181
>gi|71823|pir||FGHUA fibrinogen alpha chain precursor, short splice
form - human >gi|182426 (J00128) A-alpha fibrinogen [Homo
sapiens] >gi|458554 (M64982) common fibrinogen alpha
chain [Homo sapiens] >gi|4033511 (M58569) fibrinogen
alpha subunit [Homo sapiens]
Length = 644
Score = 34.4 bits (77), Expect = 2.3
Identities = 32/109 (29%), Positives = 48/109 (43%), Gaps = 5/109 (4%)
Frame = +1
Query: 1093 KNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNT--PGQQT 1266
+N S G AT++ G STG N ++G G+ + N SP P + T PG
Sbjct: 308 RNPGSSGTGGTATWKPGSSGPG-STGSWNSGSSGTGSTGNQNPGSPRPGSTGTWNPGSSE 366
Query: 1267 RKSPYKYPNNAPNFPSDHATPTFNPYGNPGQ---KTGAGRPNNSGGKYNRNRGP 1419
R S + H T + G+ GQ ++G+ RP++ G R P
Sbjct: 367 RGS------------AGHWTSESSVSGSTGQWHSESGSFRPDSPGSGNARPNNP 408
>gi|437331 (L23429) beta-galactosides-binding lectin [Canis
familiaris]
Length = 285
Score = 34.4 bits (77), Expect = 2.3
Identities = 30/87 (34%), Positives = 36/87 (40%), Gaps = 10/87 (11%)
Frame = +1
Query: 1168 GPRNHRNAGAGN-------GAHPNKKSPN--PNTRNTPGQQTRKSPYKYPNNAPNFPSDH 1320
GP ++ AGAG GA+P + P P G + P YP AP
Sbjct: 13 GPWGNQPAGAGGYPGASYPGAYPGQAPPGGYPGQAPPGGYPGQAPPGGYPGQAPPGGYPG 72
Query: 1321 ATPTFNPYGNPGQKTGAGRPNNS-GGKYNRNRGPRYP 1428
P P G PGQ G P + G Y P YP
Sbjct: 73 QAP---PGGYPGQAPPGGYPGQAPPGTYPGPTAPAYP 106
>gi|120943|sp|P13816|GARP_PLAFF GLUTAMIC ACID-RICH PROTEIN PRECURSOR
>gi|627054|pir||A54514 glutamic acid-rich protein
precursor - Plasmodium falciparum
>gi|160299|gb|AAA29605.1| (J03998) glutamic acid-rich
protein [Plasmodium falciparum]
Length = 678
Score = 34.4 bits (77), Expect = 2.3
Identities = 41/164 (25%), Positives = 69/164 (42%), Gaps = 11/164 (6%)
Frame = +1
Query: 796 LRTQLKLEKDMPLALTLQPIIGQRRSAKATLHVNRNDNNKHAYHNNH--STADESRELRR 969
L + +LEK+ + ++ + + K + NDN K+A++NN S+ D + +
Sbjct: 49 LLNETELEKNKDDNSKSETLLKEEKDEKDDVPTTSNDNLKNAHNNNEISSSTDPTNIINV 108
Query: 970 FD----TLRDDRGRGQGKHHFKDRLTV-----SGEAAAKQAHKPFKQYKPKNDRSLSEGS 1122
D D + + K H KD+ E K+ K K+ K K D+ E S
Sbjct: 109 NDKDNENSVDKKKDKKEKKHKKDKKEKKEKKDKKEKKDKKEKKHKKEKKHKKDKKKKENS 168
Query: 1123 PATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKY 1287
S Y TG +NA + +++ + N G SPY+Y
Sbjct: 169 EV--MSLY-----KTGQHKPKNATEHGEENLDEEMVSEINNNAQGGLLLSSPYQY 216
>gi|283032|pir||S22456 hydroxyproline-rich glycoprotein - perennial
teosinte >gi|22092|emb|CAA45514| (X64173)
hydroxyproline-rich glycoprotein [Zea diploperennis]
Length = 350
Score = 34.4 bits (77), Expect = 2.3
Identities = 28/80 (35%), Positives = 36/80 (45%), Gaps = 5/80 (6%)
Frame = +3
Query: 648 RPQPRSTSAMGIARLPSQPPQTYTLWFRPPTTRT-----TAWPIDRTTQDPSRSTTHPTE 812
+P P + + P P TYT +PPT + T P T+ P+ T PT
Sbjct: 260 KPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPATKPPTPKPTPPTY 319
Query: 813 TRKRHAVSTHPATDYRPTPFSQSHT 887
T +T P T Y PTP SHT
Sbjct: 320 TPTPKPPATKPPT-YTPTP-PVSHT 342
>gi|3834294 (U80846) No definition line found [Caenorhabditis elegans]
Length = 2232
Score = 34.4 bits (77), Expect = 2.3
Identities = 27/97 (27%), Positives = 40/97 (40%), Gaps = 1/97 (1%)
Frame = +1
Query: 1105 SLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNT-RNTPGQQTRKSPY 1281
S GS T QS + G + + + P+ +SP PNT TP Q + +SP
Sbjct: 564 STVSGSTGTSQSTLASSTATPGSSS--TVPSSSSPQPSSQSPAPNTGSTTPSQTSSQSPS 621
Query: 1282 KYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGG 1395
N + + P+ + T P G+ A P S G
Sbjct: 622 PSMNPSSSTPTGSSQSTITPEGST-----ASSPTGSTG 654
>gi|168457 (M36913) cell wall protein (put.); putative [Zea mays]
Length = 109
Score = 34.4 bits (77), Expect = 2.3
Identities = 28/80 (35%), Positives = 36/80 (45%), Gaps = 5/80 (6%)
Frame = +3
Query: 648 RPQPRSTSAMGIARLPSQPPQTYTLWFRPPTTRT-----TAWPIDRTTQDPSRSTTHPTE 812
+P P + + P P TYT +PPT + T P T+ P+ T PT
Sbjct: 19 KPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPATKPPTPKPTPPTY 78
Query: 813 TRKRHAVSTHPATDYRPTPFSQSHT 887
T +T P T Y PTP SHT
Sbjct: 79 TPTPKPPATKPPT-YTPTP-PVSHT 101
Score = 33.2 bits (74), Expect = 5.2
Identities = 21/67 (31%), Positives = 31/67 (45%)
Frame = +3
Query: 693 PSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVSTHPATDYRPTPF 872
P P TYT +PPT + T + + P+ T PT T +T P T +PTP
Sbjct: 18 PKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPATKPPTP-KPTPP 76
Query: 873 SQSHTAR 893
+ + T +
Sbjct: 77 TYTPTPK 83
>gi|4503689|ref|NP_000499.1|| fibrinogen, A alpha polypeptide
>gi|1706799|sp|P02671|FIBA_HUMAN FIBRINOGEN ALPHA/ALPHA-E
CHAIN PRECURSOR >gi|2135107|pir||D44234 fibrinogen alpha
chain precursor, extended splice form - human >gi|182407
(M58569) fibrinogen alpha subunit precursor [Homo
sapiens] >gi|458555 (M64982) fibrinogen alpha-E chain
[Homo sapiens]
Length = 866
Score = 34.4 bits (77), Expect = 2.3
Identities = 32/109 (29%), Positives = 48/109 (43%), Gaps = 5/109 (4%)
Frame = +1
Query: 1093 KNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNT--PGQQT 1266
+N S G AT++ G STG N ++G G+ + N SP P + T PG
Sbjct: 308 RNPGSSGTGGTATWKPGSSGPG-STGSWNSGSSGTGSTGNQNPGSPRPGSTGTWNPGSSE 366
Query: 1267 RKSPYKYPNNAPNFPSDHATPTFNPYGNPGQ---KTGAGRPNNSGGKYNRNRGP 1419
R S + H T + G+ GQ ++G+ RP++ G R P
Sbjct: 367 RGS------------AGHWTSESSVSGSTGQWHSESGSFRPDSPGSGNARPNNP 408
>gi|3834293 (U80846) No definition line found [Caenorhabditis elegans]
Length = 1032
Score = 34.4 bits (77), Expect = 2.3
Identities = 27/97 (27%), Positives = 40/97 (40%), Gaps = 1/97 (1%)
Frame = +1
Query: 1105 SLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNT-RNTPGQQTRKSPY 1281
S GS T QS + G + + + P+ +SP PNT TP Q + +SP
Sbjct: 564 STVSGSTGTSQSTLASSTATPGSSS--TVPSSSSPQPSSQSPAPNTGSTTPSQTSSQSPS 621
Query: 1282 KYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGG 1395
N + + P+ + T P G+ A P S G
Sbjct: 622 PSMNPSSSTPTGSSQSTITPEGST-----ASSPTGSTG 654
>gi|3924913|emb|CAB03474.1| (Z81138) predicted using Genefinder; cDNA
EST EMBL:D65543 comes from this gene [Caenorhabditis
elegans]
Length = 304
Score = 34.0 bits (76), Expect = 3.0
Identities = 30/92 (32%), Positives = 40/92 (42%), Gaps = 12/92 (13%)
Frame = +1
Query: 1150 PEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPN-----NAPNFPS 1314
P+G P N AGA P + + + ++PG + P P AP P
Sbjct: 190 PKGAPGAPGNPGQAGA-----PGQPGSDAQSESSPGAPGQAGPQGPPGPAGSPGAPGGPG 244
Query: 1315 DHATP-TFNPYGNPGQ-----KTGA-GRPNNSGGKYNRNRGPRY 1425
P P G PGQ GA G+P SGG + P+Y
Sbjct: 245 QAGAPGPKGPSGAPGQPGADGNPGAPGQPGQSGGAGEKGICPKY 288
>gi|3924914|emb|CAB03475.1| (Z81138) predicted using Genefinder; cDNA
EST EMBL:D69494 comes from this gene; cDNA EST
EMBL:D69317 comes from this gene [Caenorhabditis elegans]
Length = 304
Score = 34.0 bits (76), Expect = 3.0
Identities = 30/92 (32%), Positives = 40/92 (42%), Gaps = 12/92 (13%)
Frame = +1
Query: 1150 PEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPN-----NAPNFPS 1314
P+G P N AGA P + + + ++PG + P P AP P
Sbjct: 190 PKGAPGAPGNPGQAGA-----PGQPGSDAQSESSPGAPGQAGPQGPPGPAGSPGAPGGPG 244
Query: 1315 DHATP-TFNPYGNPGQ-----KTGA-GRPNNSGGKYNRNRGPRY 1425
P P G PGQ GA G+P SGG + P+Y
Sbjct: 245 QAGAPGPKGPSGAPGQPGADGNPGAPGQPGQSGGAGEKGICPKY 288
>gi|6580246|emb|CAB63321.1| (Z81138) cDNA EST EMBL:D65528 comes from
this gene [Caenorhabditis elegans]
Length = 304
Score = 34.0 bits (76), Expect = 3.0
Identities = 30/92 (32%), Positives = 40/92 (42%), Gaps = 12/92 (13%)
Frame = +1
Query: 1150 PEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPN-----NAPNFPS 1314
P+G P N AGA P + + + ++PG + P P AP P
Sbjct: 190 PKGAPGAPGNPGQAGA-----PGQPGSDAQSESSPGAPGQAGPQGPPGPAGSPGAPGGPG 244
Query: 1315 DHATP-TFNPYGNPGQ-----KTGA-GRPNNSGGKYNRNRGPRY 1425
P P G PGQ GA G+P SGG + P+Y
Sbjct: 245 QAGAPGPKGPSGAPGQPGADGNPGAPGQPGQSGGAGEKGICPKY 288
>gi|1280073 (U55366) Similar to cuticle collagen [Caenorhabditis
elegans]
Length = 310
Score = 34.0 bits (76), Expect = 3.0
Identities = 29/92 (31%), Positives = 39/92 (41%), Gaps = 10/92 (10%)
Frame = +1
Query: 1150 PEGVSTGP----RNHRNAGAG----NGAHPNKKSPN--PNTRNTPGQQTRKSPYKYPNNA 1299
P+G GP R+ + AG + + P + PN P R PGQ P
Sbjct: 195 PKGAPGGPGQPGRDGQPGQAGQPGSSSSEPGQPGPNGQPGPRGPPGQAGSPGGNGQPGG- 253
Query: 1300 PNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRY 1425
P P + GN GQ G+P SGG + P+Y
Sbjct: 254 PGQPGQRGSD--GQPGNDGQPGAPGQPGQSGGSGEKGICPKY 293
>gi|5901942|ref|NP_008971.1|| E1B-55kDa-associated protein 5
>gi|3319956|emb|CAA07548| (AJ007509) E1B-55kDa-associated
protein [Homo sapiens]
Length = 856
Score = 34.0 bits (76), Expect = 3.0
Identities = 24/72 (33%), Positives = 34/72 (46%), Gaps = 3/72 (4%)
Frame = +1
Query: 1213 PNKKSPNPN---TRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPN 1383
P + P P+ RN PG T Y +N P ++ +TPT + Y +P Q + + P
Sbjct: 708 PQQPPPPPSYSPARNPPGAST----YNKNSNIPGSSANTSTPTVSSY-SPPQPSYSQPPY 762
Query: 1384 NSGGKYNRNRGPRYP 1428
N GG GP P
Sbjct: 763 NQGGYSQGYTGPPPP 777
>gi|2501678|sp|Q45480|YLYB_BACSU HYPOTHETICAL 33.7 KD PROTEIN IN
LSP-PYRR INTERGENIC REGION (ORF-X) >gi|1373157 (U48870)
orf-X; hypothetical protein; Method: conceptual
translation supplied by author
Length = 303
Score = 34.0 bits (76), Expect = 3.0
Identities = 28/83 (33%), Positives = 44/83 (52%), Gaps = 11/83 (13%)
Frame = +1
Query: 52 TATEAPKLEERLHKVLAQAGLG-SRRALEQRISNGLIKVNGDIAQLGMSVKSGDKI---- 216
TA+E K ER+ K LA SR ++Q + +G + VNG + ++ GD++
Sbjct: 7 TASEEQK-SERIDKFLASTENDWSRTQVQQWVKDGQVVVNGSAVKANYKIQPGDQVTVTV 65
Query: 217 -ELDGRSFVASALT-----EPARVLIYNKPEGEV 300
E + +A + E VL+ NKP G V
Sbjct: 66 PEPEALDVLAEPMDLDIYYEDQDVLVVNKPRGMV 99
>gi|2633919|emb|CAB13420| (Z99112) alternate gene name: ylmL;
similar to hypothetical proteins [Bacillus subtilis]
Length = 273
Score = 34.0 bits (76), Expect = 3.0
Identities = 28/83 (33%), Positives = 44/83 (52%), Gaps = 11/83 (13%)
Frame = +1
Query: 52 TATEAPKLEERLHKVLAQAGLG-SRRALEQRISNGLIKVNGDIAQLGMSVKSGDKI---- 216
TA+E K ER+ K LA SR ++Q + +G + VNG + ++ GD++
Sbjct: 7 TASEEQK-SERIDKFLASTENDWSRTQVQQWVKDGQVVVNGSAVKANYKIQPGDQVTVTV 65
Query: 217 -ELDGRSFVASALT-----EPARVLIYNKPEGEV 300
E + +A + E VL+ NKP G V
Sbjct: 66 PEPEALDVLAEPMDLDIYYEDQDVLVVNKPRGMV 99
>gi|2135766|pir||S53362 mucin 5AC (clone JER47) - human (fragment)
Length = 477
Score = 34.0 bits (76), Expect = 3.0
Identities = 25/65 (38%), Positives = 32/65 (48%), Gaps = 2/65 (3%)
Frame = +3
Query: 705 PQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAV--STHPATDYRPTPFSQ 878
P T T F PTT TT+ TT P+ STT +T K A ST + P+P +
Sbjct: 234 PTTSTTSF--PTTSTTSATTTSTTSAPTTSTTSTPQTSKTSAATSSTTSGSGTTPSPVTT 291
Query: 879 SHTARES 899
+ TA S
Sbjct: 292 TSTASVS 298
Score = 34.0 bits (76), Expect = 3.0
Identities = 27/93 (29%), Positives = 38/93 (40%)
Frame = +3
Query: 603 RQHRLTRLVSCRCQRRPQPRSTSAMGIARLPSQPPQTYTLWFRPPTTRTTAWPIDRTTQD 782
++ R T LV+ PQ +TSA PTT TT+ P TT
Sbjct: 154 QKSRTTTLVTTSTTSTPQTSTTSA--------------------PTTSTTSAPTTSTTSA 193
Query: 783 PSRSTTHPTETRKRHAVSTHPATDYRPTPFSQS 881
P+ STT +T ++S+ P T P S +
Sbjct: 194 PTTSTTSTPQT----SISSAPTTSTTSAPTSST 222
>gi|1139597 (U43400) H1 gene product [Human herpesvirus 7] >gi|1139696
(U43400) H1' gene product [Human herpesvirus 7]
Length = 169
Score = 34.0 bits (76), Expect = 3.0
Identities = 18/42 (42%), Positives = 21/42 (49%)
Frame = +1
Query: 1204 GAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATP 1329
GA+PN PNPN + P +P PN PN PS H P
Sbjct: 9 GANPN---PNPNPSSKPNPSPNPNPSSKPNPNPN-PSSHCHP 46
>gi|628112|pir||S32988 hypothetical protein - human herpesvirus 4
Length = 512
Score = 33.6 bits (75), Expect = 4.0
Identities = 35/122 (28%), Positives = 56/122 (45%), Gaps = 6/122 (4%)
Frame = +1
Query: 976 TLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPE 1155
T RG+ +G+ + R G++ KQ KP ++P+ + S SP+ +PE
Sbjct: 361 TQGQSRGQSRGRGRGRGRGRGKGKSRDKQ-RKPGGPWRPEPNTS----SPS------MPE 409
Query: 1156 GVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYP--NNAPN----FPSD 1317
+S H+ GAG+ P + P RN+ SP P +N+P FP D
Sbjct: 410 -LSPVLGLHQGQGAGDSPTPGPSNAAPVCRNSHTATPNVSPIHEPESHNSPEAPILFPDD 468
Query: 1318 HATPTFNP 1341
P+ +P
Sbjct: 469 WYPPSIDP 476
>gi|628185|pir||S42447 nuclear protein EBNA2 - Epstein-Barr virus
>gi|330444 (K03333) nuclear protein EBNA2 [Epstein-Barr
virus]
Length = 490
Score = 33.6 bits (75), Expect = 4.0
Identities = 35/122 (28%), Positives = 56/122 (45%), Gaps = 6/122 (4%)
Frame = +1
Query: 976 TLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPE 1155
T RG+ +G+ + R G++ KQ KP ++P+ + S SP+ +PE
Sbjct: 339 TQGQSRGQSRGRGRGRGRGRGKGKSRDKQ-RKPGGPWRPEPNTS----SPS------MPE 387
Query: 1156 GVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYP--NNAPN----FPSD 1317
+S H+ GAG+ P + P RN+ SP P +N+P FP D
Sbjct: 388 -LSPVLGLHQGQGAGDSPTPGPSNAAPVCRNSHTATPNVSPIHEPESHNSPEAPILFPDD 446
Query: 1318 HATPTFNP 1341
P+ +P
Sbjct: 447 WYPPSIDP 454
>gi|5833486|gb|AAD53528.1| (AF162149) variable surface lipoprotein
[Mycoplasma bovis]
Length = 202
Score = 33.6 bits (75), Expect = 4.0
Identities = 27/87 (31%), Positives = 39/87 (44%), Gaps = 2/87 (2%)
Frame = +1
Query: 1153 EGVSTGPRNHR--NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHAT 1326
+G T P N G G A+P++ +P + TP + +P P P D T
Sbjct: 105 QGTPTNPDQGTPANPGQGTPANPDQGTPTNPDQGTPANPGQGTPANPDQGTPANP-DQGT 163
Query: 1327 PTFNPYGNPGQKTGAGRPNNSGGKYNRNR 1413
PT NPGQ T A +P+ S + N +
Sbjct: 164 PT-----NPGQGTPA-KPHFSPEEENAEK 186
>gi|283045|pir||S28264 hydroxyproline-rich glycoprotein - maize
>gi|22333|emb|CAA44844| (X63134) hydroxyproline-rich
glycoprotein [Zea mays] >gi|228936|prf||1814452A
Hyp-rich glycoprotein [Zea mays]
Length = 303
Score = 33.6 bits (75), Expect = 4.0
Identities = 23/65 (35%), Positives = 30/65 (45%)
Frame = +3
Query: 693 PSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVSTHPATDYRPTPF 872
P P TYT +PPT + + + + P+ T PT T +T P T Y PTP
Sbjct: 233 PKPTPPTYTPSPKPPTPKPSPPTYTPSPKPPTPKPTPPTYTPTPKPPATKPPT-YTPTP- 290
Query: 873 SQSHT 887
SHT
Sbjct: 291 PVSHT 295
Score = 33.6 bits (75), Expect = 4.0
Identities = 22/74 (29%), Positives = 32/74 (42%)
Frame = +3
Query: 648 RPQPRSTSAMGIARLPSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRH 827
+P P + + P P TYT +PPT + T + + P+ T PT T
Sbjct: 165 KPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPK 224
Query: 828 AVSTHPATDYRPTP 869
+T P T +PTP
Sbjct: 225 PPATKPPTP-KPTP 237
>gi|3880306|emb|CAA90995.1| (Z54238) similar to cuticle collagen; cDNA
EST EMBL:T01150 comes from this gene; cDNA EST
EMBL:D33882 comes from this gene; cDNA EST EMBL:D65956
comes from this gene; cDNA EST EMBL:D66123 comes from
this gene; cDNA EST EMBL:D... >gi|3880308|emb|CAA90997.1|
(Z54238) similar to cuticle collagen [Caenorhabditis
elegans]
Length = 299
Score = 33.6 bits (75), Expect = 4.0
Identities = 23/76 (30%), Positives = 30/76 (39%)
Frame = +1
Query: 1171 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 1350
P N GA P+ + NP PGQ + +P + P+ A P P G
Sbjct: 174 PGNDGQPGAPGNKGPSGPNGNPGAPGAPGQPGQDAPSEPITPGAPGPAGPAGPQ-GPPGA 232
Query: 1351 PGQKTGAGRPNNSGGK 1398
PGQ G+P G K
Sbjct: 233 PGQPGHDGQPGAPGPK 248
>gi|2833307|sp|Q21184|YXWK_CAEEL PUTATIVE CUTICLE COLLAGEN F55C10.3
>gi|3877662|emb|CAA98487.1| (Z74036) similar to collagen
[Caenorhabditis elegans]
Length = 266
Score = 33.6 bits (75), Expect = 4.0
Identities = 31/93 (33%), Positives = 37/93 (39%), Gaps = 7/93 (7%)
Frame = +1
Query: 1150 PEGVSTGPRNHRNAGA-GNGAHPNKKSP-----NPNTRNTPGQQTRKSPYKYPNNAPNFP 1311
P G + G N + G G P K P NP PGQ + +P + P P
Sbjct: 128 PSGDAGGNGNPGSPGQDGQPGAPGNKGPSGPNGNPGAPGAPGQPGQDAPSE--PITPGAP 185
Query: 1312 SDHATP-TFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRYP 1428
TP P G PGQ G+P G K N P P
Sbjct: 186 GPQGTPGPQGPPGQPGQPGHDGQPGAPGPK-GPNGNPGQP 224
>gi|3877661|emb|CAA98486.1| (Z74036) predicted using Genefinder;
similar to collagen [Caenorhabditis elegans]
Length = 299
Score = 33.6 bits (75), Expect = 4.0
Identities = 31/93 (33%), Positives = 37/93 (39%), Gaps = 7/93 (7%)
Frame = +1
Query: 1150 PEGVSTGPRNHRNAGA-GNGAHPNKKSP-----NPNTRNTPGQQTRKSPYKYPNNAPNFP 1311
P G + G N + G G P K P NP PGQ + +P + P P
Sbjct: 161 PSGDAGGNGNPGSPGQDGQPGAPGNKGPSGPNGNPGAPGAPGQPGQDAPSE--PITPGAP 218
Query: 1312 SDHATP-TFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRYP 1428
TP P G PGQ G+P G K N P P
Sbjct: 219 GPQGTPGPQGPPGQPGQPGHDGQPGAPGPK-GPNGNPGQP 257
>gi|543543|pir||JN0690 glutenin, high-molecular-weight Bx7 chain
precursor - wheat
Length = 791
Score = 33.6 bits (75), Expect = 4.0
Identities = 28/136 (20%), Positives = 45/136 (32%)
Frame = +1
Query: 1000 GQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRN 1179
GQG+ H + +Q + P +P + L +G P Y P G +
Sbjct: 139 GQGQQHQQP-------GQRQQGYYPTSPQQPGQGQQLGQGQPG-----YYPTSQQPGQKQ 186
Query: 1180 HRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQ 1359
G +G P ++ GQQ + Y +P P GQ
Sbjct: 187 QAGQGQQSGQGQQGYYPTSPQQSGQGQQPGQGQPGYYPTSPQQSGQWQQPGQGQQPGQGQ 246
Query: 1360 KTGAGRPNNSGGKYNR 1407
++G G+ G+ R
Sbjct: 247 QSGQGQQGQQPGQGQR 262
>gi|102059|pir||D41710 promastigote surface antigen-2 (clone 4.6) -
Leishmania major (fragment) >gi|9583|emb|CAA40414|
(X57135) surface antigen P2 [Leishmania major]
Length = 327
Score = 33.6 bits (75), Expect = 4.0
Identities = 21/66 (31%), Positives = 32/66 (47%)
Frame = +3
Query: 696 SQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVSTHPATDYRPTPFS 875
++PP T T +PPTT TT + TT + +T PT T +T T +P P +
Sbjct: 151 TKPPTTTTTTTKPPTTTTTTTKLPTTT---TTTTKPPTTTTTTTTTTTTTTTTTKP-PIT 206
Query: 876 QSHTAR 893
+ T +
Sbjct: 207 TATTTK 212
Score = 33.2 bits (74), Expect = 5.2
Identities = 18/40 (45%), Positives = 23/40 (57%), Gaps = 2/40 (5%)
Frame = +3
Query: 696 SQPPQTYTLWFRPPT--TRTTAWPIDRTTQDPSRSTTHPTET 815
++PP T T +PPT T TT P TT+ P+ TT T T
Sbjct: 211 TKPPTTTTTTTKPPTTITSTTKLPTTTTTEAPAEPTTTATPT 252
>gi|119111|sp|P12978|EBN2_EBV EBNA-2 NUCLEAR PROTEIN
>gi|1083964|pir||S42442 EBNA2 protein - human herpesvirus
4 >gi|1632787|emb|CAA24877.1| (V01555) BYRF1, encodes
EBNA-2 (Dambaugh et al, 1984; Dillner et al, 1984) [Human
herpesvirus 4]
Length = 487
Score = 33.6 bits (75), Expect = 4.0
Identities = 35/122 (28%), Positives = 56/122 (45%), Gaps = 6/122 (4%)
Frame = +1
Query: 976 TLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPE 1155
T RG+ +G+ + R G++ KQ KP ++P+ + S SP+ +PE
Sbjct: 336 TQGQSRGQSRGRGRGRGRGRGKGKSRDKQ-RKPGGPWRPEPNTS----SPS------MPE 384
Query: 1156 GVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYP--NNAPN----FPSD 1317
+S H+ GAG+ P + P RN+ SP P +N+P FP D
Sbjct: 385 -LSPVLGLHQGQGAGDSPTPGPSNAAPVCRNSHTATPNVSPIHEPESHNSPEAPILFPDD 443
Query: 1318 HATPTFNP 1341
P+ +P
Sbjct: 444 WYPPSIDP 451
>gi|100864|pir||S08315 cell wall protein - maize (fragment)
>gi|22269|emb|CAA31860| (X13506) cell wall protein (108
AA) [Zea mays] >gi|168459 (M36914) cell wall protein
(put.); putative [Zea mays]
Length = 108
Score = 33.2 bits (74), Expect = 5.2
Identities = 21/67 (31%), Positives = 31/67 (45%)
Frame = +3
Query: 693 PSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVSTHPATDYRPTPF 872
P P TYT +PPT + T + + P+ T PT T +T P T +PTP
Sbjct: 18 PKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPATKPPTP-KPTPP 76
Query: 873 SQSHTAR 893
+ + T +
Sbjct: 77 TYTPTPK 83
>gi|2135765|pir||A43932 mucin 2 precursor, intestinal - human
(fragments)
Length = 3020
Score = 33.2 bits (74), Expect = 5.2
Identities = 28/79 (35%), Positives = 35/79 (43%), Gaps = 9/79 (11%)
Frame = +3
Query: 651 PQPRSTSAMGIARLPSQPPQTYTLWFRPPTT------RTTAWPIDRTTQDPSRSTT---H 803
P P +T+ + PS P T T PPTT TT P+ TT P STT
Sbjct: 1408 PPPTTTTTLPPTTTPSPPTTTTTT--PPPTTTPSPPITTTTTPLPTTTPSPPISTTTTPP 1465
Query: 804 PTETRKRHAVSTHPATDYRPTPFSQSHT 887
PT T + P T P + + T
Sbjct: 1466 PTTTPSPPTTTPSPPTTTPSPPTTTTTT 1493
>gi|5852937|gb|AAD54275.1|AF169793_1 (AF169793) HET-C protein
[Podospora anserina]
Length = 735
Score = 33.2 bits (74), Expect = 5.2
Identities = 34/146 (23%), Positives = 54/146 (36%), Gaps = 7/146 (4%)
Frame = +1
Query: 925 HNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDR 1104
H+NH A S R + G G H + ++ + +QA +Y P+
Sbjct: 576 HHNHILASNSSSSSRSVSAPSHGGNGCDHGHGRPAGSLWEQVKKQQADA---RYSPRPGS 632
Query: 1105 SLSEGSPATFQSWYVPEGVST----GPRNHRNAGAGNGAHPNKKSP---NPNTRNTPGQQ 1263
S S Y G + P+ + G G+GA+P ++ P G Q
Sbjct: 633 SGGGYGQRPGSSGYGSGGGGSYGRPSPQPGYSGGGGSGAYPPQQQPQYGGGGYGGPGGYQ 692
Query: 1264 TRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQK 1362
P+ + +P H P G PGQ+
Sbjct: 693 QPPPPHHHGQYGGGYPGQHPPPPPQGGGYPGQQ 725
>gi|4503493|ref|NP_001955.1|| early growth response 1
>gi|119242|sp|P18146|EGR1_HUMAN EARLY GROWTH RESPONSE
PROTEIN 1 (EGR-1) (KROX24) (ZIF268) (TRANSCRIPTION FACTOR
ETR103) (ZINC FINGER PROTEIN 225) (AT225)
>gi|87347|pir||A41211 early growth response protein 1 -
human >gi|31130|emb|CAA36777| (X52541) early growth
response protein 1 (AA 1-543) [Homo sapiens] >gi|182263
(M62829) ETR103 [Homo sapiens]
>gi|5420379|emb|CAB46678.1| (AJ243425) early growth
response protein 1 [Homo sapiens]
Length = 543
Score = 33.2 bits (74), Expect = 5.2
Identities = 24/82 (29%), Positives = 37/82 (44%), Gaps = 2/82 (2%)
Frame = +1
Query: 1066 HKPFKQYKPKNDRS--LSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPN 1239
H P PK + LS G+P + PEG + + + G G G + S + +
Sbjct: 25 HSPTMDNYPKLEEMMLLSNGAPQFLGAAGAPEGSGSNSSSSSSGGGGGGGGGSNSSSSSS 84
Query: 1240 TRNTPGQQTRKSPYKYPNNAPNFP 1311
T N P T + PY++ A +FP
Sbjct: 85 TFN-PQADTGEQPYEH-LTAESFP 106
>gi|2833328|sp|Q27200|FBRL_TETTH FIBRILLARIN >gi|2654298|emb|CAA54924|
(X77962) fibrillarin [Tetrahymena thermophila]
Length = 294
Score = 33.2 bits (74), Expect = 5.2
Identities = 15/28 (53%), Positives = 17/28 (60%)
Frame = +1
Query: 1345 GNPGQKTGAGRPNNSGGKYNRNRGPRYP 1428
G PG K G GRP GGK+ +GPR P
Sbjct: 37 GGPGGKFGGGRPGGPGGKFGA-KGPRGP 63
>gi|2707270 (AF036171) homeobox-containing protein [Dictyostelium
discoideum]
Length = 534
Score = 33.2 bits (74), Expect = 5.2
Identities = 31/174 (17%), Positives = 61/174 (34%), Gaps = 5/174 (2%)
Frame = +1
Query: 889 HVNRNDNNKHAYHNNHSTADESRELRRFDTL-----RDDRGRGQGKHHFKDRLTVSGEAA 1053
H N N+NN + Y+N +S ++ +R ++ + H D
Sbjct: 285 HNNNNNNNSNNYNNGNSNSNNNRNNNNNYNYNNYINNNNYNNNNNRQHCDDE-------- 336
Query: 1054 AKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPN 1233
++ + F N+ + + + Y + + N+ N + N + N
Sbjct: 337 -EEDEQYFNNNNNNNNNNNNNRISDSSDDQYFSDDTNNNNDNYNNGNNNYNNNNNNNNFN 395
Query: 1234 PNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 1410
N N + Y NN ++ + + FN N + NN+ +YN N
Sbjct: 396 NNYMNNYNNNYNNNNY---NNNNSYNNSNGNNNFNNNNNNNNQN--NNNNNNNNQYNNN 449
>gi|3879844|emb|CAB04725.1| (Z81592) predicted using Genefinder; cDNA
EST EMBL:M89005 comes from this gene [Caenorhabditis
elegans]
Length = 695
Score = 33.2 bits (74), Expect = 5.2
Identities = 25/91 (27%), Positives = 34/91 (36%), Gaps = 1/91 (1%)
Frame = +1
Query: 1150 PEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNT-PGQQTRKSPYKYPNNAPNFPSDHAT 1326
P S+G +RN G GN + NK S N N N G Y N+ +F +
Sbjct: 539 PPPRSSGANGNRNGGGGNRRNNNKNSSNSNNNNNFNGNGNGDGSYNNNNDNCDFENRCGG 598
Query: 1327 PTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPR 1422
GN Q+ + + N N G R
Sbjct: 599 QGGFENGNENQRFSSRKQPPPKPSANNNNGDR 630
Score = 32.8 bits (73), Expect = 6.8
Identities = 26/79 (32%), Positives = 33/79 (40%), Gaps = 4/79 (5%)
Frame = +1
Query: 1192 GAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGA 1371
G GNG P P P + ++T NN N + + P P N G G
Sbjct: 458 GGGNGPVPPIPEPKPLCKGLKFKKTANGGGGNNNNNNNNNNRNNGP---PPRNNGNNNGN 514
Query: 1372 GR----PNNSGGKYNRNRGPRYP 1428
GR P++SG NR GP P
Sbjct: 515 GRPMKPPSSSGSGSNRRSGPPPP 537
>gi|418972|pir||S31035 retrovirus-related gag polyprotein - mouse
intracisternal A-particle MIAD8 (fragment)
Length = 717
Score = 33.2 bits (74), Expect = 5.2
Identities = 19/52 (36%), Positives = 27/52 (51%)
Frame = +1
Query: 1123 PATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSP 1278
PA QS Y+P+ S+GPR+ GN + P + R+ PG+ TR P
Sbjct: 402 PADSQSAYMPKNGSSGPRSQGPQRYGN---QFVEDPGSSQRDDPGRPTRVEP 450
>gi|6594249|emb|CAB63561.1| (AL031746) hypothetical garp protein
[Plasmodium falciparum]
Length = 673
Score = 33.2 bits (74), Expect = 5.2
Identities = 41/164 (25%), Positives = 68/164 (41%), Gaps = 11/164 (6%)
Frame = +1
Query: 796 LRTQLKLEKDMPLALTLQPIIGQRRSAKATLHVNRNDNNKHAYHNNH--STADESRELRR 969
L + +LEK+ + ++ + + K + NDN K+A++NN S+ D + +
Sbjct: 49 LLNETELEKNKDDNSKSETLLKEEKDEKDDVPTTSNDNLKNAHNNNEISSSTDPTNIINV 108
Query: 970 FD----TLRDDRGRGQGKHHFKDRLTV-----SGEAAAKQAHKPFKQYKPKNDRSLSEGS 1122
D D + + K H KD+ E K+ K K+ K K D+ E S
Sbjct: 109 NDKDNENSVDKKKDKKEKKHKKDKKEKKEKKDKKEKKDKKEKKHKKEKKHKKDKKKEENS 168
Query: 1123 PATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKY 1287
S Y TG +NA + ++ + N G SPY+Y
Sbjct: 169 EV--MSLY-----KTGQHKPKNATEHGEENLYEEMVSEINNNAQGGLLLSSPYQY 216
>gi|6601502|gb|AAF19004.1|AF151366_1 (AF151366) arginine/serine-rich
protein [Arabidopsis thaliana]
Length = 414
Score = 33.2 bits (74), Expect = 5.2
Identities = 31/114 (27%), Positives = 46/114 (40%), Gaps = 7/114 (6%)
Frame = +1
Query: 1078 KQYKPKNDRSLSE----GSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTR 1245
K+ PK+D + ++ G P + PR R+ G P ++SP+ R
Sbjct: 193 KRDAPKSDNAAADAEKDGGPRRPRETSPQRKTGLSPRR-RSPLPRRGLSPRRRSPDSPHR 251
Query: 1246 NTPGQQTRKSPYKYPNNAPNFPSDHATPTFNP---YGNPGQKTGAGRPNNSGGKYNRNRG 1416
PG R+ P P PS +P+ P Y +P + G P G R R
Sbjct: 252 RRPGSPIRRRGDTPPRRRPASPSRGRSPSSPPPRRYRSPPR----GSPRRIRGSPVRRRS 307
Query: 1417 P 1419
P
Sbjct: 308 P 308
>gi|476822|pir||A42893 penicillin-binding protein 1A - Streptococcus
pneumoniae >gi|153768 (M90527) penicillin-binding protein
[Streptococcus pneumoniae]
Length = 719
Score = 33.2 bits (74), Expect = 5.2
Identities = 30/107 (28%), Positives = 43/107 (40%), Gaps = 2/107 (1%)
Frame = +1
Query: 1108 LSEGSPATFQSWYVPEGVSTGPRNHRNAGA--GNGAHPNKKSPNPNTRNTPGQQTRKSPY 1281
LSEGS + W +PEG+ +RN NGA SP P + + S
Sbjct: 625 LSEGSNP--EDWNIPEGL------YRNGEFVFKNGARSTWSSPAPQQPPSTESSSSSSDS 676
Query: 1282 KYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRYP 1428
++ PS + + T NP N Q N + + N+N P P
Sbjct: 677 STSQSSSTTPSTNNSTTTNPNNNTQQS------NTTPDQQNQNPQPAQP 719
>gi|282331|pir||S28037 penicillin-binding protein 1a - Streptococcus
pneumoniae (strain 63915) (fragment)
>gi|47418|emb|CAA48072| (X67872) penicillin-binding
protein 1a [Streptococcus pneumoniae]
Length = 719
Score = 33.2 bits (74), Expect = 5.2
Identities = 30/107 (28%), Positives = 43/107 (40%), Gaps = 2/107 (1%)
Frame = +1
Query: 1108 LSEGSPATFQSWYVPEGVSTGPRNHRNAGA--GNGAHPNKKSPNPNTRNTPGQQTRKSPY 1281
LSEGS + W +PEG+ +RN NGA SP P + + S
Sbjct: 625 LSEGSNP--EDWNIPEGL------YRNGEFVFKNGARSTWNSPAPQQPPSTESSSSSSDS 676
Query: 1282 KYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRYP 1428
++ PS + + T NP N Q N + + N+N P P
Sbjct: 677 STSQSSSTTPSTNNSTTTNPNNNTQQS------NTTPDQQNQNPQPAQP 719
>gi|585647|sp|Q04707|PBPA_STRPN PENICILLIN-BINDING PROTEIN 1A (PBP-1A)
(EXPORTED PROTEIN 2) >gi|282329|pir||S28038
penicillin-binding protein 1a - Streptococcus pneumoniae
(strain 45607) (fragment) >gi|47420|emb|CAA48073|
(X67873) penicillin-binding protein 1a [Streptococcus
pneumoniae]
Length = 719
Score = 33.2 bits (74), Expect = 5.2
Identities = 30/107 (28%), Positives = 43/107 (40%), Gaps = 2/107 (1%)
Frame = +1
Query: 1108 LSEGSPATFQSWYVPEGVSTGPRNHRNAGA--GNGAHPNKKSPNPNTRNTPGQQTRKSPY 1281
LSEGS + W +PEG+ +RN NGA SP P + + S
Sbjct: 625 LSEGSNP--EDWNIPEGL------YRNGEFVFKNGARSTWSSPAPQQPPSTESSSSSSDS 676
Query: 1282 KYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRYP 1428
++ PS + + T NP N Q N + + N+N P P
Sbjct: 677 STSQSSSTTPSTNNSTTTNPNNNTQQS------NTTPDQQNQNPQPAQP 719
>gi|4140029|dbj|BAA36973.1| (AB015438) alpha 1 type I collagen [Cynops
pyrrhogaster]
Length = 1450
Score = 33.2 bits (74), Expect = 5.2
Identities = 26/77 (33%), Positives = 29/77 (36%), Gaps = 10/77 (12%)
Frame = +1
Query: 1168 GPRNHRNAGAGNGAHPNKKSPNP-------NTRNTPGQQTRKSPYKYPN--NAPNFPSDH 1320
GP+ R + GA +P P T GQ K P AP FP
Sbjct: 342 GPQGSRGSEGPQGARGEPGAPGPAGAAGPSGNPGTDGQPGGKGATGSPGIAGAPGFPGAR 401
Query: 1321 ATP-TFNPYGNPGQKTGAGRPNNSGGK 1398
P P G PG K G P G K
Sbjct: 402 GAPGPQGPAGAPGPKGNNGEPGAQGNK 428
>gi|5410459|gb|AAD43067.1|AF139884_1 (AF139884) penicillin-binding
protein 1a [Streptococcus pneumoniae]
>gi|5410461|gb|AAD43068.1|AF139885_1 (AF139885)
penicillin-binding protein 1a [Streptococcus pneumoniae]
>gi|5410463|gb|AAD43069.1|AF139886_1 (AF139886)
penicillin-binding protein 1a [Streptococcus pneumoniae]
Length = 719
Score = 33.2 bits (74), Expect = 5.2
Identities = 30/107 (28%), Positives = 43/107 (40%), Gaps = 2/107 (1%)
Frame = +1
Query: 1108 LSEGSPATFQSWYVPEGVSTGPRNHRNAGA--GNGAHPNKKSPNPNTRNTPGQQTRKSPY 1281
LSEGS + W +PEG+ +RN NGA SP P + + S
Sbjct: 625 LSEGSNP--EDWNIPEGL------YRNGEFVFKNGARSTWSSPAPQQPPSTESSSSSSDS 676
Query: 1282 KYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRYP 1428
++ PS + + T NP N Q N + + N+N P P
Sbjct: 677 STSQSSSTTPSTNNSTTTNPNNNTQQS------NTTPDQQNQNPQPAQP 719
>gi|6563337|gb|AAF17255.1|AF210745_1 (AF210745) penicillin-binding
protein 1A [Streptococcus pneumoniae]
Length = 719
Score = 33.2 bits (74), Expect = 5.2
Identities = 30/107 (28%), Positives = 43/107 (40%), Gaps = 2/107 (1%)
Frame = +1
Query: 1108 LSEGSPATFQSWYVPEGVSTGPRNHRNAGA--GNGAHPNKKSPNPNTRNTPGQQTRKSPY 1281
LSEGS + W +PEG+ +RN NGA SP P + + S
Sbjct: 625 LSEGSNP--EDWNIPEGL------YRNGEFVFKNGARSTWNSPAPQQPPSTESSSSSSDS 676
Query: 1282 KYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRYP 1428
++ PS + + T NP N Q N + + N+N P P
Sbjct: 677 STSQSSSTTPSTNNSTTTNPNNNTQQS------NTTPDQQNQNPQPAQP 719
>gi|6563341|gb|AAF17257.1|AF210747_1 (AF210747) penicillin-binding
protein 1A [Streptococcus pneumoniae]
Length = 719
Score = 33.2 bits (74), Expect = 5.2
Identities = 30/107 (28%), Positives = 43/107 (40%), Gaps = 2/107 (1%)
Frame = +1
Query: 1108 LSEGSPATFQSWYVPEGVSTGPRNHRNAGA--GNGAHPNKKSPNPNTRNTPGQQTRKSPY 1281
LSEGS + W +PEG+ +RN NGA SP P + + S
Sbjct: 625 LSEGSNP--EDWNIPEGL------YRNGEFVFKNGARSTWSSPAPQQPPSTESSSSSSDS 676
Query: 1282 KYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRYP 1428
++ PS + + T NP N Q N + + N+N P P
Sbjct: 677 STSQSSSTTPSTNNSTTTNPNNNTQQS------NTTPDQQNQNPQPAQP 719
>gi|102058|pir||C41710 promastigote surface antigen-2 (clone 2.5) -
Leishmania major (fragment) >gi|9581|emb|CAA40413|
(X57134) surface antigen P2 [Leishmania major]
Length = 371
Score = 33.2 bits (74), Expect = 5.2
Identities = 18/40 (45%), Positives = 23/40 (57%), Gaps = 2/40 (5%)
Frame = +3
Query: 696 SQPPQTYTLWFRPPT--TRTTAWPIDRTTQDPSRSTTHPTET 815
++PP T T +PPT T TT P TT+ P+ TT T T
Sbjct: 255 TKPPTTTTTTTKPPTTITSTTKLPTTTTTEAPAEPTTTATPT 296
Score = 32.5 bits (72), Expect = 9.0
Identities = 20/52 (38%), Positives = 26/52 (49%), Gaps = 2/52 (3%)
Frame = +3
Query: 696 SQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRK--RHAVSTHPAT 851
++PP T T +PPTT TT TT ++ T T T K A +T P T
Sbjct: 206 TKPPTTTTTTTKPPTTTTTTTKPPTTTTTTTKPPTTTTTTTKPLTTATTTKPPT 259
>gi|186396 (M94131) mucin [Homo sapiens]
Length = 1270
Score = 33.2 bits (74), Expect = 5.2
Identities = 28/79 (35%), Positives = 35/79 (43%), Gaps = 9/79 (11%)
Frame = +3
Query: 651 PQPRSTSAMGIARLPSQPPQTYTLWFRPPTT------RTTAWPIDRTTQDPSRSTT---H 803
P P +T+ + PS P T T PPTT TT P+ TT P STT
Sbjct: 783 PPPTTTTTLPPTTTPSPPTTTTTT--PPPTTTPSPPITTTTTPLPTTTPSPPISTTTTPP 840
Query: 804 PTETRKRHAVSTHPATDYRPTPFSQSHT 887
PT T + P T P + + T
Sbjct: 841 PTTTPSPPTTTPSPPTTTPSPPTTTTTT 868
>gi|3319463 (AF077544) unknown [Caenorhabditis elegans]
Length = 235
Score = 33.2 bits (74), Expect = 5.2
Identities = 22/75 (29%), Positives = 26/75 (34%)
Frame = +1
Query: 1189 AGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTG 1368
+G NG HPN PN N N + PT +PY N G G
Sbjct: 132 SGYNNGPHPNGNFPNLNGYNNGPSSFNGGNTNVDDGIKGSVGAAVEPTKSPYPNNGY--G 189
Query: 1369 AGRPNNSGGKYNRNR 1413
G N G + NR
Sbjct: 190 YGNRNGYGNNFGFNR 204
>gi|5815245|gb|AAD52614.1|AF175223_1 (AF175223) SANT domain protein
SMRTER [Drosophila melanogaster]
Length = 3469
Score = 33.2 bits (74), Expect = 5.2
Identities = 32/113 (28%), Positives = 52/113 (45%), Gaps = 13/113 (11%)
Frame = +1
Query: 1060 QAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPN 1239
Q + +Q + + R++S GS A+ G G G +K+SP+P
Sbjct: 2462 QGQQQQQQQQQQQQRNMSRGSSAS--------------------GGGGGGGSDKESPSP- 2500
Query: 1240 TRNTPGQQTRKSPYKY-------PNNAPNF-----PSDHATPTFNPYGN-PGQKTGAGRP 1380
RN+ G S + Y P P + P+DH T +P+ P Q+ G +
Sbjct: 2501 -RNSVGS---ASGFAYGGDKESAPRGRPEYSSRASPADHVNSTPSPHRTPPPQRQGVIQR 2556
Query: 1381 NNSGGK 1398
+N+G K
Sbjct: 2557 HNTGSK 2562
>gi|4505285|ref|NP_002448.1|| mucin 2, intestinal/tracheal
>gi|2506877|sp|Q02817|MUC2_HUMAN MUCIN 2 PRECURSOR
(INTESTINAL MUCIN 2) >gi|454154 (L21998) mucin [Homo
sapiens]
Length = 5179
Score = 33.2 bits (74), Expect = 5.2
Identities = 28/79 (35%), Positives = 35/79 (43%), Gaps = 9/79 (11%)
Frame = +3
Query: 651 PQPRSTSAMGIARLPSQPPQTYTLWFRPPTT------RTTAWPIDRTTQDPSRSTT---H 803
P P +T+ + PS P T T PPTT TT P+ TT P STT
Sbjct: 1408 PPPTTTTTLPPTTTPSPPTTTTTT--PPPTTTPSPPITTTTTPLPTTTPSPPISTTTTPP 1465
Query: 804 PTETRKRHAVSTHPATDYRPTPFSQSHT 887
PT T + P T P + + T
Sbjct: 1466 PTTTPSPPTTTPSPPTTTPSPPTTTTTT 1493
>gi|82601|pir||A30843 glutenin high molecular weight chain Bx7
precursor - wheat >gi|21749|emb|CAA32115| (X13927) HMW
glutenin subunit (AA 1-789) [Triticum aestivum]
>gi|170745 (M22209) high MW glutenin subunit (Bx7)
[Triticum aestivum]
Length = 789
Score = 32.8 bits (73), Expect = 6.8
Identities = 28/136 (20%), Positives = 48/136 (34%), Gaps = 5/136 (3%)
Frame = +1
Query: 1000 GQGKHHFKDRLTVSGE-----AAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVS 1164
GQG+ +++ G+ +Q + P +P + L +G P Y P
Sbjct: 127 GQGQQPGQEQQPGQGQQDQQPGQRQQGYYPTSPQQPGQGQQLGQGQPG-----YYPTSQQ 181
Query: 1165 TGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPY 1344
G + G +G P ++ GQQ + Y +P P
Sbjct: 182 PGQKQQAGQGQQSGQGQQGYYPTSPQQSGQGQQPGQGQPGYYPTSPQQSGQWQQPGQGQQ 241
Query: 1345 GNPGQKTGAGRPNNSGGKYNR 1407
GQ++G G+ G+ R
Sbjct: 242 PGQGQQSGQGQQGQQPGQGQR 262
>gi|2388676 (AF015539) precollagen P [Mytilus edulis]
Length = 902
Score = 32.8 bits (73), Expect = 6.8
Identities = 25/75 (33%), Positives = 32/75 (42%), Gaps = 5/75 (6%)
Frame = +1
Query: 1204 GAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNA--PNFPSDHATPTF-NPYGNPGQKTGAG 1374
G+ P + NP PG R P + P P TP G PGQ G G
Sbjct: 256 GSTPPGRLGNPGPPGQPGNPGRPGSSGRPGGSGQPGGPGRPGTPGKPGNRGQPGQPGGPG 315
Query: 1375 RPNN--SGGKYNRNRGPRYP 1428
+P + +GG+ RN P P
Sbjct: 316 QPGHPGAGGQPGRNGNPGNP 335
>gi|1085433|pir||S55316 mucin (clone PGM-2B) - pig >gi|915207
(U10281) gastric mucin [Sus scrofa]
Length = 317
Score = 32.8 bits (73), Expect = 6.8
Identities = 44/201 (21%), Positives = 84/201 (40%), Gaps = 1/201 (0%)
Frame = +2
Query: 134 NNASPTGSSKSMETLHSSACPLKVATKSNWTAAASSPAPSLNRRAY*STTNQKAK*PHAK 313
++++PT S+ S++ S + P AT +++ S+P S S+++ P +
Sbjct: 71 SSSAPTTSATSVQPSSSGSAPTTSATSVQSSSSGSAPTTSATSVQPSSSSSP----PISS 126
Query: 314 TQRVAPRCSKHSP-SSKVHVGSPSAAWISTPLAYYCSPQTANLPMQ*CTPHRK*NANMSY 490
T V P S +P +S V S S+ T A P +++ P T + +++ S
Sbjct: 127 TISVQPSSSSSAPTTSATSVQSSSSGSAPTTSATSVQPSSSSSPPISSTISVQPSSSSS- 185
Query: 491 VYAPPKEKNMCRMSYSSN*RAASCWKTAPQNSTRLNASATPTHTTGFVSLSKKAATAKYV 670
AP + S SS+ S P +S ++ T + T+ S S + +
Sbjct: 186 --APTTSATSVQSSSSSSAPTTSATSVQPSSS---GSAPTTSATSVQSSSSSSPPISSTI 240
Query: 671 GYGNRKAAKSAASNVHVMVPSS 736
++ S ++ + PSS
Sbjct: 241 SVQTSSSSSSPTTSTTSVQPSS 262
>gi|2314597|gb|AAD08465.1| (AE000642) conserved hypothetical protein
[Helicobacter pylori 26695]
Length = 84
Score = 32.8 bits (73), Expect = 6.8
Identities = 19/47 (40%), Positives = 25/47 (52%), Gaps = 1/47 (2%)
Frame = +1
Query: 82 RLHKVLAQAGLGSRRALEQRISN-GLIKVNGDIAQLGMSVKSGDKIEL 222
R+ K L GL RR L + N G + +NG A+ VK+GD I L
Sbjct: 2 RIDKFLQSVGLVKRRVLATDMCNVGAVWLNGSCAKASKEVKAGDTISL 49
>gi|330361 (M10593) major outer envelope glycoprotein gp220
[Epstein-Barr virus]
Length = 658
Score = 32.8 bits (73), Expect = 6.8
Identities = 19/67 (28%), Positives = 28/67 (41%)
Frame = +1
Query: 1225 SPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYN 1404
SP P + SP + N + P +AT +P GQKT ++GGK N
Sbjct: 475 SPTPAGTTSGASPVTPSPSPWDNGTESTPPQNAT---SPQAPSGQKTAVPTVTSTGGKAN 531
Query: 1405 RNRGPRY 1425
G ++
Sbjct: 532 STTGGKH 538
>gi|1841851 (U86876) chitinase-like protein [Bombyx mori]
Length = 565
Score = 32.8 bits (73), Expect = 6.8
Identities = 23/54 (42%), Positives = 27/54 (49%), Gaps = 9/54 (16%)
Frame = +3
Query: 684 ARLPSQP---------PQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVS 836
AR PS P P T T +P TTRTTA P TT+ P +T + R V
Sbjct: 416 ARPPSTPSDPSEGDPIPTTTTTTVKPTTTRTTARPTTTTTKVPHGTTEEDFDINVRPEVE 475
Query: 837 THP 845
P
Sbjct: 476 ELP 478
>gi|2854193 (AF045645) Similar to cuticular collagen; coded for by C.
elegans cDNA yk69e12.3; coded for by C. elegans cDNA
yk69e12.5; coded for by C. elegans cDNA yk307b3.5; coded
for by C. elegans cDNA yk307b3.3 [Caenorhabditis elegans]
Length = 314
Score = 32.8 bits (73), Expect = 6.8
Identities = 31/82 (37%), Positives = 36/82 (43%), Gaps = 9/82 (10%)
Frame = +1
Query: 1150 PEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSD--HA 1323
P G S P ++ NAGA P + TPG P P AP P HA
Sbjct: 211 PPGPSGQPGSNGNAGA-----PGAPGHVVDVPGTPGPAGPPGPAG-PAGAPGQPGQAGHA 264
Query: 1324 TPTF-NPYGN------PGQKTGAGRPNNSGG 1395
P P G+ PGQ AG+P N GG
Sbjct: 265 QPGQPGPQGDAGAPGAPGQPGSAGQPGNDGG 295
>gi|3875708|emb|CAA84658.1| (Z35598) Asparagine, Serine and Glycine
rich predicted protein [Caenorhabditis elegans]
>gi|3880108|emb|CAA86461.1| (Z46343) Asparagine, Serine
and Glycine rich predicted protein [Caenorhabditis
elegans]
Length = 549
Score = 32.8 bits (73), Expect = 6.8
Identities = 24/83 (28%), Positives = 32/83 (37%)
Frame = +1
Query: 1168 GPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYG 1347
G N+RN G G+ + N + N N G Y NN + S++ N G
Sbjct: 424 GSNNNRNDGWGSSSSNNNNNNNNNNNGGTGG--------YSNNGGGWGSNN-----NNNG 470
Query: 1348 NPGQKTGAGRPNNSGGKYNRNRG 1416
N G + N GG N N G
Sbjct: 471 NDGNNWESNNGGNGGGGDNWNNG 493
>gi|2119159|pir||I50694 alpha-1 collagen type III - chicken (fragment)
>gi|537432 (U07973) alpha-1 collagen type III [Gallus
gallus]
Length = 886
Score = 32.8 bits (73), Expect = 6.8
Identities = 24/83 (28%), Positives = 27/83 (31%), Gaps = 1/83 (1%)
Frame = +1
Query: 1150 PEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATP 1329
P G S P G A P P +PG + P P P P P
Sbjct: 357 PAGASGNPGERGEPGPQGQAGPPGPQGPPGRAGSPGGKGEMGPSGIPGG-PGPPGGRGLP 415
Query: 1330 -TFNPYGNPGQKTGAGRPNNSGGK 1398
GNPG K G P +G K
Sbjct: 416 GPPGTSGNPGAKGTPGEPGKNGAK 439
>gi|543541|pir||JC2099 glutenin, high molecular weight chain Bx17 -
wheat
Length = 753
Score = 32.8 bits (73), Expect = 6.8
Identities = 28/136 (20%), Positives = 48/136 (34%), Gaps = 5/136 (3%)
Frame = +1
Query: 1000 GQGKHHFKDRLTVSGE-----AAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVS 1164
GQG+ +++ G+ +Q + P +P + L +G P Y P
Sbjct: 127 GQGQQPGQEQQPGQGQQDQQPGQRQQGYYPTSPQQPGQGQQLGQGQPG-----YYPTSQQ 181
Query: 1165 TGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPY 1344
G + G +G P ++ GQQ + Y +P P
Sbjct: 182 PGQKQQAGQGQQSGQGQQGYYPTSPQQSGQGQQPGQGQPGYYPTSPQQSGQWQQPGQGQQ 241
Query: 1345 GNPGQKTGAGRPNNSGGKYNR 1407
GQ++G G+ G+ R
Sbjct: 242 PGQGQQSGQGQQGQQPGQGQR 262
>gi|1118137 (U41746) coded for by C. elegans cDNA yk68a8.5
[Caenorhabditis elegans]
Length = 586
Score = 32.5 bits (72), Expect = 9.0
Identities = 22/68 (32%), Positives = 32/68 (46%), Gaps = 3/68 (4%)
Frame = +3
Query: 645 RRPQPRSTSAMGIARLP---SQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTET 815
R P +T+ + R P S PP+T + P T+ P TT+ P+ +T PT
Sbjct: 344 RATTPLATTPLATTRAPLPPSPPPRTS----KRPVTQAPTTPRATTTRRPTTTTPRPTPR 399
Query: 816 RKRHAVSTHPA 848
R R +T A
Sbjct: 400 RTRRPKTTTAA 410
>gi|3876265|emb|CAB02976.1| (Z81067) predicted using Genefinder; cDNA
EST yk488h9.3 comes from this gene [Caenorhabditis
elegans]
Length = 1307
Score = 32.5 bits (72), Expect = 9.0
Identities = 38/153 (24%), Positives = 55/153 (35%)
Frame = +1
Query: 907 NNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQY 1086
+ H NN+ A R RD R G + TVS E + +H P
Sbjct: 70 HGNHQLQNNYGGASSRGAQSRGSPPRDPRRHANGSSSHRRDKTVSDELQHENSHTP---- 125
Query: 1087 KPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQT 1266
E S +TF S + P S+ R+ R +G+ +KSP+ P QQ
Sbjct: 126 -------RQEESQSTFGSSFRPSQYSSILRDPRLSGSCPPG--QEKSPSNGHNLLPHQQ- 175
Query: 1267 RKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 1365
K+ + P + + T N P Q T
Sbjct: 176 -----KFGGSIPVSSTLSDSHTSNGGSTPNQDT 203
>gi|3873739|emb|CAA86059.1| (Z37983) weak similarity with putative
zinc finger transcription factors. Possesses the
prosite motif for a C3HC4 type Zinc finger (Prosite
accession number PS00518) [Caenorhabditis elegans]
Length = 417
Score = 32.5 bits (72), Expect = 9.0
Identities = 37/151 (24%), Positives = 56/151 (36%)
Frame = +2
Query: 302 PHAKTQRVAPRCSKHSPSSKVHVGSPSAAWISTPLAYYCSPQTANLPMQ*CTPHRK*NAN 481
PH+ VAPR S SS + S +STP S +TA L H ++
Sbjct: 271 PHSLPTNVAPRIPPSSRSSFTQHSNDSGVVLSTP-PTSSSAKTAGL----SPTHNFSRSS 325
Query: 482 MSYVYAPPKEKNMCRMSYSSN*RAASCWKTAPQNSTRLNASATPTHTTGFVSLSKKAATA 661
S P K ++ RA + + +N A PT+ T V ++ K
Sbjct: 326 TSLRIPTPTTKIQKIQNFFETTRAPRISRIRMGVTDLVNTYAPPTYATSPVHINTKQFEC 385
Query: 662 KYVGYGNRKAAKSAASNVHVMVPSSYHANYC 754
VG +++ H ++ H NYC
Sbjct: 386 CSVG------EFGVSTDTHKSPTTAIHLNYC 410
>gi|1870123|emb|CAB06788| (Z86109) unknown [Saccharomyces pastorianus]
Length = 193
Score = 32.5 bits (72), Expect = 9.0
Identities = 22/67 (32%), Positives = 29/67 (42%)
Frame = +1
Query: 1225 SPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYN 1404
+P+P + TPG+ K+P K P AP A P P PG+ G P + GK
Sbjct: 98 TPSPQGKKTPGKAPGKAPGKAPGKAPGKAPGKA-PGKAPGKAPGKAPGKA-PGKAPGKAP 155
Query: 1405 RNRGPRY 1425
G Y
Sbjct: 156 GKAGRSY 162
>gi|46691|emb|CAA43604| (X61307) protein A [Staphylococcus aureus]
>gi|384170|prf||1905280A protein A [Staphylococcus
aureus]
Length = 454
Score = 32.5 bits (72), Expect = 9.0
Identities = 21/69 (30%), Positives = 28/69 (40%), Gaps = 3/69 (4%)
Frame = +1
Query: 1168 GPRNHRNAGAGNGAHPNK---KSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFN 1338
G ++ G +G P K K P N PG++ K P K N P D P
Sbjct: 289 GKEDNNKPGKEDGNKPGKEDNKKPGKEDGNKPGKEDNKKPGKEDGNKPG-KEDGNKPGKE 347
Query: 1339 PYGNPGQKTGAG 1374
PG++ G G
Sbjct: 348 DGNKPGKEDGNG 359
>gi|1019435 (U32447) mucin-like protein [Trypanosoma cruzi]
Length = 197
Score = 32.5 bits (72), Expect = 9.0
Identities = 21/64 (32%), Positives = 30/64 (46%), Gaps = 2/64 (3%)
Frame = +3
Query: 696 SQPPQTYTLWFRPPTTRTTAW--PIDRTTQDPSRSTTHPTETRKRHAVSTHPATDYRPTP 869
++PP T T +PPTT TT P TT ++ T T T + +T T PT
Sbjct: 76 TKPPTTTTTTTKPPTTTTTTTTKPPTTTTTTTTKPPTTTTTTTTKPPTTTTTTTTKPPTT 135
Query: 870 FSQSHT 887
+ + T
Sbjct: 136 TTTTTT 141
>gi|6324130|ref|NP_014200.1|GCR2| Transcription factor; Gcr2p
>gi|417039|sp|Q01722|GCR2_YEAST GLYCOLYTIC GENES
TRANSCRIPTIONAL ACTIVATOR GCR2 >gi|320841|pir||S31300
regulatory protein GCR2 - yeast (Saccharomyces
cerevisiae) >gi|218427|dbj|BAA00985| (D10104) GCR2
protein [Saccharomyces cerevisiae]
>gi|600066|emb|CAA55509| (X78898) Gcr2; acc.#:D10104
[Saccharomyces cerevisiae] >gi|1302197|emb|CAA96097|
(Z71475) ORF YNL199c [Saccharomyces cerevisiae]
Length = 534
Score = 32.5 bits (72), Expect = 9.0
Identities = 19/65 (29%), Positives = 30/65 (45%)
Frame = +1
Query: 1111 SEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYP 1290
++GSP+ Q G++ G N N GNG++ N +N P +T+K +
Sbjct: 244 TKGSPSDLQ------GINNGNNNGNNGNIGNGSNIK----NYGNKNMPNNRTKKRGTRVA 293
Query: 1291 NNAPN 1305
NA N
Sbjct: 294 KNAKN 298
>gi|1136289 (U42597) histidine kinase A [Dictyostelium discoideum]
Length = 2150
Score = 32.5 bits (72), Expect = 9.0
Identities = 29/145 (20%), Positives = 54/145 (37%), Gaps = 1/145 (0%)
Frame = +1
Query: 895 NRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKP 1074
+ N NN + +NN++ D + +R T + G Q + + + A+
Sbjct: 229 SNNSNNNNNGNNNNNITDSPTKSKRHSTYETNIGSHQRRKSIQSLI----------ANSA 278
Query: 1075 FKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGN-GAHPNKKSPNPNTRNT 1251
+ ++ LS +P+T + S N+ N G+ GA P +S + N
Sbjct: 279 IHSFSKLKNKPLSSSTPSTVNTCGAVNNNSNNNNNNNNNSTGSLGAIPMDRSFDGNINTI 338
Query: 1252 PGQQTRKSPYKYPNNAPNFPSDHATP 1329
+ T + N N S+ P
Sbjct: 339 TEESTGGNNSPRSNCGSNCGSNGGIP 364
>gi|2193933|emb|CAB09584| (Z96800) hypothetical protein Rv0312
[Mycobacterium tuberculosis]
Length = 620
Score = 32.5 bits (72), Expect = 9.0
Identities = 20/57 (35%), Positives = 25/57 (43%)
Frame = +3
Query: 702 PPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHAVSTHPATDYRPTPF 872
PP T T P TT TTA P TT +P +TT T + T++ PF
Sbjct: 536 PPVTTTPRPSPTTTTTTAPPSTTTTTEPPVTTTSTIPTIPTTTTTVKMTTEWLHVPF 592
>gi|2493778|sp|Q09456|YQ35_CAEEL PUTATIVE CUTICLE COLLAGEN C09G5.5
>gi|3874102|emb|CAA86758.1| (Z46791) similar to collagen
[Caenorhabditis elegans]
Length = 317
Score = 32.5 bits (72), Expect = 9.0
Identities = 26/81 (32%), Positives = 34/81 (41%), Gaps = 1/81 (1%)
Frame = +1
Query: 1150 PEGVSTGPRNHRNAGA-GNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHAT 1326
P G + + AGA GN P + +R PG R P + P AP P +T
Sbjct: 171 PPGPAGDAGSPGQAGAPGNPGRPGQSGQR--SRGLPGPSGRPGP-QGPPGAPGQPGSGST 227
Query: 1327 PTFNPYGNPGQKTGAGRPNNSG 1392
P P G PG G+P + G
Sbjct: 228 P--GPAGPPGPPGPNGQPGHPG 247
>gi|182424 (J00127) alpha-fibrinogen precursor [Homo sapiens]
Length = 644
Score = 32.5 bits (72), Expect = 9.0
Identities = 31/109 (28%), Positives = 47/109 (42%), Gaps = 5/109 (4%)
Frame = +1
Query: 1093 KNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNT--PGQQT 1266
+N S G AT++ G S G N ++G G+ + N SP P + T PG
Sbjct: 308 RNPGSSGTGGTATWKPGSSGPG-SAGSWNSGSSGTGSTGNQNPGSPRPGSTGTWNPGSSE 366
Query: 1267 RKSPYKYPNNAPNFPSDHATPTFNPYGNPGQ---KTGAGRPNNSGGKYNRNRGP 1419
R S + H T + G+ GQ ++G+ RP++ G R P
Sbjct: 367 RGS------------AGHWTSESSVSGSTGQWHSESGSFRPDSPGSGNARPNNP 408
>gi|1707117 (U80453) C23H3.9 [Caenorhabditis elegans]
Length = 339
Score = 32.5 bits (72), Expect = 9.0
Identities = 23/73 (31%), Positives = 28/73 (37%)
Frame = +3
Query: 651 PQPRSTSAMGIARLPSQPPQTYTLWFRPPTTRTTAWPIDRTTQDPSRSTTHPTETRKRHA 830
P SA + S +T T PTT TTA TT P+ +TT PT T
Sbjct: 179 PTYNRISAEKALQKTSTTQETTTSTTAQPTTTTTATTTTTTTPLPTTTTTQPTTTTTEPT 238
Query: 831 VSTHPATDYRPTP 869
+T TP
Sbjct: 239 TTTTTTEPTTTTP 251
>gi|5901828|gb|AAD55422.1|AF181636_1 (AF181636) BcDNA.GH07269
[Drosophila melanogaster]
Length = 682
Score = 32.5 bits (72), Expect = 9.0
Identities = 35/128 (27%), Positives = 57/128 (44%), Gaps = 14/128 (10%)
Frame = +1
Query: 1012 HHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNA 1191
HH RL+ G + ++ P + S + +PA+ W VP P + +
Sbjct: 70 HHDNVRLSYGGGSHSQ----PVSKVSSSQTHSAAPSAPASPIGWNVPAKPQGPPPAYSAS 125
Query: 1192 GAGNGAHPN--KKSP--NPNTRNTP-----GQQTR-----KSPYKYPNNAPNFPSDHATP 1329
GAH N ++ P NP + P QT +SPY+ P A + + ++
Sbjct: 126 NPVGGAHTNIHERPPAYNPAYKPAPPSYSAATQTHSNTNLQSPYR-PAGAASPGASSSSS 184
Query: 1330 TFNPYGNPGQKTGAGRPNNSGG 1395
+ YG GR N++GG
Sbjct: 185 GSHYYGGAHNTAYRGRNNSTGG 206
>gi|4033468|sp|P92965|RS40_ARATH ARGININE/SERINE-RICH SPLICING FACTOR
RSP40 >gi|2582641|emb|CAA67800| (X99437) splicing factor
[Arabidopsis thaliana] >gi|2980800|emb|CAA18176.1|
(AL022197) splicing factor At-SRp40 [Arabidopsis
thaliana]
Length = 350
Score = 32.5 bits (72), Expect = 9.0
Identities = 28/145 (19%), Positives = 58/145 (39%), Gaps = 2/145 (1%)
Frame = +1
Query: 979 LRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPE- 1155
++DD RG G H +R +++ P+K+ + D A ++
Sbjct: 167 VKDDDARGNG--HSPERRRDRSPERRRRSPSPYKRERGSPDYGRGASPVAAYRKERTSPD 224
Query: 1156 -GVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPT 1332
G P ++ + G+ + + N + R ++ SP KY + +PN + +P
Sbjct: 225 YGRRRSPSPYKKSRRGSPEYGRDRRGNDSPRR---RERVASPTKY-SRSPNNKRERMSPN 280
Query: 1333 FNPYGNPGQKTGAGRPNNSGGKYNRNR 1413
+P+ + G G + + R+R
Sbjct: 281 HSPFKKESPRNGVGEVESPIERRERSR 307
>gi|2981221 (AF053091) eyelid [Drosophila melanogaster]
Length = 2715
Score = 32.5 bits (72), Expect = 9.0
Identities = 28/89 (31%), Positives = 40/89 (44%), Gaps = 13/89 (14%)
Frame = +1
Query: 1150 PEGVSTGPRNHRNAGAGNG---AHPNKKSPNP--NTRNTPGQQTRKSPYKYPNNAPNFPS 1314
P G GP + AG +P ++ P P + P QQ ++ PY+ P
Sbjct: 1537 PPGAPHGPPIQQPAGVAQWDQHRYPPQQGPPPPPQQQQQPQQQQQQPPYQQVAGPPGQQP 1596
Query: 1315 DHATPTFNPYGNPGQKTGAG--------RPNNSGGKYNRNRG 1416
A P + NPGQ +G RP + G+ NR G
Sbjct: 1597 PQAPPQWAQM-NPGQTAQSGIAPPGSPLRPPSGPGQQNRMPG 1637
Score = 32.5 bits (72), Expect = 9.0
Identities = 32/103 (31%), Positives = 42/103 (40%), Gaps = 3/103 (2%)
Frame = +1
Query: 1117 GSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNN 1296
G P + P V P+ H AG +P S P TP +T SP YP+
Sbjct: 1311 GGPPPAPQQHGPGQVPPSPQQHVRPAAG-APYPPGGSGYP----TPVSRTPGSP--YPSQ 1363
Query: 1297 APNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKY---NRNRGPRY 1425
+ ++ +N G PGQ G G G+Y NRN P Y
Sbjct: 1364 PGAYGQYGSSDQYNATGPPGQPFGQG-----PGQYPPQNRNMYPPY 1404
>gi|5824601|emb|CAA82571.2| (Z29443) similar to Annexin; cDNA EST
EMBL:C10640 comes from this gene; cDNA EST EMBL:C12433
comes from this gene; cDNA EST yk192f7.5 comes from this
gene; cDNA EST yk318c1.5 comes from this gene; cDNA EST
yk494a12.3 comes fr...
Length = 497
Score = 32.5 bits (72), Expect = 9.0
Identities = 25/75 (33%), Positives = 38/75 (50%), Gaps = 19/75 (25%)
Frame = +1
Query: 1204 GAHPNKKSPNPNTRNTPGQQTR-KSPYKYPNNAPNFPSDHATP--------TFNPYGNPG 1356
G N K P+++ + + + S YPN P++ P +++PYG P
Sbjct: 21 GLGGNNKQQQPSSQQSSQEPSNMNSGGGYPNQQPSYGGYGQPPQQPGYGNGSYDPYGQPQ 80
Query: 1357 QKT---GAGRP-------NNSGGKYNRNRGPRYP 1428
Q+ G G+P N GG Y G YP
Sbjct: 81 QQPYPGGGGQPPYPGSNSNQGGGGYPGQGGAPYP 114
>gi|6273747|gb|AAF06358.1|AF102865_1 (AF102865) calymmin [Danio rerio]
Length = 1207
Score = 32.5 bits (72), Expect = 9.0
Identities = 36/124 (29%), Positives = 49/124 (39%), Gaps = 20/124 (16%)
Frame = +1
Query: 1051 AAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNA----GAGNGAHPN 1218
+A Q KP N + ++ + FQ+ P G + GP+ A AG+ A PN
Sbjct: 288 SAGQVAKPNGYGGYPNAGATNQPNGGPFQNMGYPNGGTKGPKPGYGAKAGPSAGHVAKPN 347
Query: 1219 KKSPNPNTRNTPGQQTRKSPYK-YPNNAPNFPSDH------------ATPTFN---PYGN 1350
PN T S + YPN P A P N P G
Sbjct: 348 GNGGYPNGGATSQHNGGSSQFMGYPNGGTKGPKSGYGANAGPSAGQVAKPNGNGRYPIGG 407
Query: 1351 PGQKTGAGRPNNSGGKYNRNRGPR 1422
+ G N G R +GP+
Sbjct: 408 VANQPNRGSSQNMGYPNGRTKGPK 431
>gi|1523876|emb|CAA99996| (Z75666) BRCA2 [Chlorocebus aethiops]
Length = 1548
Score = 32.5 bits (72), Expect = 9.0
Identities = 39/134 (29%), Positives = 62/134 (46%), Gaps = 10/134 (7%)
Frame = +1
Query: 739 PRELLRGQSTELPKTQVEALRTQLKLEKDMPLALTLQPIIGQRRSAK---ATLHVN---- 897
P +L+ + E+P+ QV L T + +D L + P IGQ S+K T+ +
Sbjct: 463 PSYILQKNTFEVPENQVTILNTTTEENRDAGLVIMNAPSIGQVNSSKQFEGTVGIKQKFA 522
Query: 898 ---RNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAH 1068
++D NK A + + T + E R F H K L VS EA K A
Sbjct: 523 GLLKSDCNKSA--SGYLTDENEVEFRGF----------YSAHGVK--LNVSTEALQK-AV 567
Query: 1069 KPFKQYKPKNDRSLSEGSPATFQS 1140
K F + ++++ +E P + S
Sbjct: 568 KLFSDIENISEKTSAEVDPISLSS 591
>gi|1448962 (L79944) OFR1 of non-LTR retrotransposon; putative
[Chironomus tentans]
Length = 165
Score = 32.5 bits (72), Expect = 9.0
Identities = 19/81 (23%), Positives = 28/81 (34%)
Frame = +1
Query: 1033 TVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAH 1212
T SG A Q ++ KP+N P+ +G +N+R G G G
Sbjct: 13 TTSGRAIHSQQNRATVNQKPQNTNPPPNNKPSN-------QGNENNQQNNRGKGRGKGKR 65
Query: 1213 PNKKSPNPNTRNTPGQQTRKS 1275
+ P +N Q S
Sbjct: 66 RRQNKSKPRNKNNKNQNKNSS 86
>gi|188864 (M74027) mucin [Homo sapiens]
Length = 573
Score = 32.5 bits (72), Expect = 9.0
Identities = 26/73 (35%), Positives = 32/73 (43%), Gaps = 14/73 (19%)
Frame = +3
Query: 651 PQPRSTSAMGIARLPSQPPQT-------------YTLWFRPPTTRTTAWPIDRTTQDPSR 791
P P ST+ + PS P T T PP T T + PI TT P
Sbjct: 66 PPPTSTTTLPPTTTPSPPTTTTTTPPPTTTPSPPITTTTTPPPTTTPSPPISTTTTPPPT 125
Query: 792 ST-THPTETRKRHAVSTHPATDYRPTP 869
+T + PT T + P T TP
Sbjct: 126 TTPSPPTTTPSPPTTTPSPPTTTTTTP 152
>gi|1256180|dbj|BAA12287| (D84250) chitinase [Penaeus japonicus]
Length = 572
Score = 32.5 bits (72), Expect = 9.0
Identities = 18/36 (50%), Positives = 22/36 (61%), Gaps = 2/36 (5%)
Frame = +3
Query: 693 PSQPPQTYTLWFRPPTTRTTAW--PIDRTTQDPSRSTT 800
P+ PP T T + PPTT TT I TT+DP+ TT
Sbjct: 423 PTLPPTTTTPHWTPPTTTTTTRDPSITTTTRDPNLPTT 460
>gi|4996369|dbj|BAA78427.1| (AB021267) polyprotein [Arabidopsis
thaliana]
Length = 1421
Score = 32.5 bits (72), Expect = 9.0
Identities = 26/81 (32%), Positives = 39/81 (48%), Gaps = 9/81 (11%)
Frame = +1
Query: 1090 PKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAG----NGAHPNKKSPNPNTRNTPG 1257
P + S S T S+ P+ +T P +N+ + N +PN SPN +N+P
Sbjct: 817 PSSSISSPSSSEPTAPSYNGPQP-TTQPHQTQNSNSNSPILNNPNPNSPSPNSPNQNSPL 875
Query: 1258 QQTRKSPYKYPN-----NAPNFPSDHATPT 1332
Q+ S P + PN PS +T T
Sbjct: 876 PQSPISSPHIPTPSTSISEPNSPSSSSTST 905
>gi|223918|prf||1004351A fibrinogen alphaA [Homo sapiens]
Length = 462
Score = 32.5 bits (72), Expect = 9.0
Identities = 31/109 (28%), Positives = 47/109 (42%), Gaps = 5/109 (4%)
Frame = +1
Query: 1093 KNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNT--PGQQT 1266
+N S G AT++ G S G N ++G G+ + N SP P + T PG
Sbjct: 237 RNPGSSGTGGTATWKPGSSGPG-SXGSWNSGSSGTGSTGNQNPGSPRPGSTGTWNPGSSE 295
Query: 1267 RKSPYKYPNNAPNFPSDHATPTFNPYGNPGQ---KTGAGRPNNSGGKYNRNRGP 1419
R S + H T + G+ GQ ++G+ RP++ G R P
Sbjct: 296 RGS------------AGHWTSESSVSGSTGQWHSESGSFRPDSPGSGNARPNNP 337
>gi|2952545 (AF051898) coronin binding protein [Dictyostelium
discoideum]
Length = 560
Score = 32.5 bits (72), Expect = 9.0
Identities = 28/170 (16%), Positives = 63/170 (36%), Gaps = 13/170 (7%)
Frame = +1
Query: 901 NDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFK 1080
N+NN + N++S + + R ++ + + + ++ + A +++ P
Sbjct: 316 NNNNSNNNSNSNSNNNNNGINNRNNSNNNSNNNSNNNSNNSNNRNITNGSNANKSNSPNN 375
Query: 1081 QYKPKNDR-------------SLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNK 1221
ND + + G+ + + ++ ++ N+ + + N+
Sbjct: 376 NLNTNNDNKNNNSNNNNNSNNNSNNGNSNNNNNNNIINNNNSNSNSNNNSNNNSNNNSNR 435
Query: 1222 KSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKY 1401
SPN N T + NN N +++ N N A NN+
Sbjct: 436 NSPNHNNNGDNDNNTNNNTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNYADNSNNNSSNS 495
Query: 1402 NRN 1410
N N
Sbjct: 496 NNN 498
Database: nr
Posted date: Feb 13, 2000 1:18 AM
Number of letters in database: 140,124,617
Number of sequences in database: 455,460
Lambda K H
0.313 0.132 0.387
Gapped
Lambda K H
0.270 0.0470 0.230
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 349565694
Number of Sequences: 455460
Number of extensions: 7757399
Number of successful extensions: 26863
Number of sequences better than 10.0: 384
Number of HSP's better than 10.0 without gapping: 35
Number of HSP's successfully gapped in prelim test: 159
Number of HSP's that attempted gapping in prelim test: 26197
Number of HSP's gapped (non-prelim): 605
length of query: 477
length of database: 140,124,617
effective HSP length: 58
effective length of query: 418
effective length of database: 113,707,937
effective search space: 47529917666
effective search space used: 47529917666
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 42 (21.9 bits)
S2: 72 (32.5 bits)