BLASTP 2.0.10 [Aug-26-1999]
Query= XF-02A11-GL38
(476 letters)
Database: nr
455,460 sequences; 140,124,617 total letters
Sequences producing significant alignments: Score (bits) E Value
gi|1175678|sp|P37765|YCIL_ECOLI HYPOTHETICAL 32.7 KD PROTEIN IN... 239 3e-62
gi|602963 (U18111) ORF4 [Escherichia coli] 238 1e-61
gi|1175679|sp|P45104|YCIL_HAEIN HYPOTHETICAL PROTEIN HI1199 >gi... 234 1e-60
gi|1175677|sp|P42395|YCIL_BUCAP HYPOTHETICAL 30.2 KD PROTEIN IN... 159 6e-38
gi|466190|sp|P35159|RLUB_BACSU RIBOSOMAL LARGE SUBUNIT PSEUDOUR... 129 5e-29
gi|6458615|gb|AAF10472.1|AE001942_4 (AE001942) ribosomal large ... 123 3e-27
gi|4980760|gb|AAD35352.1|AE001708_20 (AE001708) 16S pseudouridy... 120 3e-26
gi|3915347|sp|O51155|Y129_BORBU HYPOTHETICAL PROTEIN BB0129 >gi... 118 1e-25
gi|6647946|sp|Q9ZD06|Y544_RICPR HYPOTHETICAL PROTEIN RP544 >gi|... 116 4e-25
gi|5738484|emb|CAB52832.1| (AL109848) putative pseudouridine sy... 115 1e-24
gi|3915377|sp|Q55578|Y361_SYNY3 HYPOTHETICAL 28.2 KD PROTEIN SL... 113 3e-24
gi|3915445|sp|O67444|YE64_AQUAE HYPOTHETICAL PROTEIN AQ_1464 >g... 110 2e-23
gi|3915547|sp|O33210|YH11_MYCTU HYPOTHETICAL 27.6 KD PROTEIN RV... 108 1e-22
gi|3915542|sp|O05668|YH11_MYCLE HYPOTHETICAL 28.1 KD PROTEIN CB... 106 3e-22
gi|3915394|sp|O66829|Y554_AQUAE HYPOTHETICAL PROTEIN AQ_554 >gi... 105 1e-21
gi|4155973 (AE001558) putative [Helicobacter pylori J99] 103 3e-21
gi|2501526|sp|P55986|YE59_HELPY HYPOTHETICAL PROTEIN HP1459 >gi... 102 5e-21
gi|418534|sp|P32684|YJBC_ECOLI HYPOTHETICAL 32.5 KD PROTEIN IN ... 96 8e-19
gi|3329180|gb|AAC68318.1| (AE001343) predicted pseudouridine sy... 83 6e-15
gi|465610|sp|P33918|RSUA_ECOLI RIBOSOMAL SMALL SUBUNIT PSEUDOUR... 83 6e-15
gi|2145990|pir||S72955 u0247g protein - Mycobacterium leprae >g... 82 8e-15
gi|4377181|gb|AAD19002| (AE001667) predicted pseudouridine synt... 82 1e-14
gi|3322747 (AE001223) conserved hypothetical protein [Treponema... 78 2e-13
gi|1175857|sp|P45124|RSUA_HAEIN RIBOSOMAL SMALL SUBUNIT PSEUDOU... 74 3e-12
gi|3845303 (AE001423) pseudouridine synthetase (RsuA fam.); 1st... 67 3e-10
gi|1175289|sp|P44827|YMFC_HAEIN HYPOTHETICAL PROTEIN HI0694 >gi... 67 4e-10
gi|1787380 (AE000213) orf, hypothetical protein [Escherichia coli] 58 2e-07
gi|3916025|sp|P75966|YMFC_ECOLI HYPOTHETICAL 24.9 KD PROTEIN IN... 58 2e-07
gi|3402689|gb|AAC28992.1| (AC004697) unknown protein [Arabidops... 54 2e-06
gi|3915559|sp|O32068|YTZF_BACSU HYPOTHETICAL 17.7 KD PROTEIN IN... 54 3e-06
gi|3402691|gb|AAC28994.1| (AC004697) unknown protein [Arabidops... 52 1e-05
gi|6689331|emb|CAB65436.1| (AJ250721) hypothetical protein [Syn... 49 7e-05
gi|3915395|sp|P72581|Y612_SYNY3 HYPOTHETICAL 21.0 KD PROTEIN SL... 48 1e-04
gi|677949 (U20969) Plasmodium falciparum circumsporozoite prote... 43 0.006
gi|294151|gb|AAA29569.1| (M83156) circumsporozoite protein [Pla... 43 0.007
gi|1651652|dbj|BAA16580| (D90899) hypothetical protein [Synecho... 42 0.009
gi|117587|sp|P08307|CSP_PLAFW CIRCUMSPOROZOITE PROTEIN PRECURSO... 42 0.009
gi|294136|gb|AAA29551.1| (M83173) circumsporozoite protein [Pla... 42 0.009
gi|1175470|sp|Q09801|YAAA_SCHPO HYPOTHETICAL 37.3 KD PROTEIN C2... 42 0.012
gi|117586|sp|P13814|CSP_PLAFT CIRCUMSPOROZOITE PROTEIN PRECURSO... 42 0.012
gi|294129|gb|AAA29547.1| (M83169) circumsporozoite protein [Pla... 41 0.016
gi|294121|gb|AAA29543.1| (M83165) circumsporozoite protein [Pla... 41 0.016
gi|294158|gb|AAA29574.1| (M83161) circumsporozoite protein [Pla... 41 0.016
gi|136173|sp|P27190|TRC5_ECOLI DNA PRIMASE TRAC (REPLICATION PR... 41 0.021
gi|48920|emb|CAA42461| (X59794) traC-3 [Escherichia coli] >gi|1... 41 0.021
gi|48921|emb|CAA42462| (X59794) traC-4 [Escherichia coli] >gi|1... 41 0.021
gi|117584|sp|P05691|CSP_PLAFL CIRCUMSPOROZOITE PROTEIN (CS) >gi... 41 0.028
gi|294125|gb|AAA29545.1| (M83167) circumsporozoite protein [Pla... 41 0.028
gi|117591|sp|P26694|CSP_PLARE CIRCUMSPOROZOITE PROTEIN PRECURSO... 41 0.028
gi|294119|gb|AAA29542.1| (M83164) circumsporozoite protein [Pla... 40 0.036
gi|552190 (M57498) circumsporozoite protein [Plasmodium falcipa... 40 0.048
gi|552191 (M57499) circumsporozoite protein [Plasmodium falcipa... 40 0.048
gi|117583|sp|P02893|CSP_PLAFA CIRCUMSPOROZOITE PROTEIN PRECURSO... 40 0.048
gi|84198|pir||S05428 circumsporozoite protein - Plasmodium falc... 40 0.048
gi|294123|gb|AAA29544.1| (M83166) circumsporozoite protein [Pla... 40 0.048
gi|294134|gb|AAA29550.1| (M83172) circumsporozoite protein [Pla... 40 0.048
gi|294138|gb|AAA29552.1| (M83174) circumsporozoite protein [Pla... 40 0.048
gi|6226848|sp|P19597|CSP_PLAFO CIRCUMSPOROZOITE PROTEIN PRECURS... 40 0.048
gi|4493889|emb|CAB38998.1| (AL034558) predicted using hexExon; ... 40 0.048
gi|224051|prf||1008275A protein,circumsporozoite [Plasmodium sp.] 39 0.062
gi|283464|pir||A60540 sporozoite surface protein 2 - Plasmodium... 39 0.11
gi|6679637|ref|NP_031951.1|| elastin >gi|1706636|sp|P54320|ELS_... 39 0.11
gi|2462758 (AC002292) putative RNA-binding protein [Arabidopsis... 39 0.11
gi|548996|sp|Q01443|SSP2_PLAYO SPOROZOITE SURFACE PROTEIN 2 PRE... 39 0.11
gi|6580291|emb|CAB63354.1| (AL032627) cDNA EST EMBL:D65921 come... 38 0.18
gi|2497274|sp|Q17107|AV71_ACAVI MUSCLE CELL INTERMEDIATE FILAME... 38 0.24
gi|93140|pir||A40505 early protein EP0 - suid herpesvirus 1 (st... 38 0.24
gi|124138|sp|P29129|ICP0_PRVIF TRANS-ACTING TRANSCRIPTIONAL PRO... 38 0.24
gi|3875920|emb|CAB04111.1| (Z81503) predicted using Genefinder;... 38 0.24
gi|6681425|dbj|BAA88672.1| (AB030233) CiMsi [Ciona intestinalis] 38 0.24
gi|2498734|sp|Q60865|P137_MOUSE GPI-ANCHORED PROTEIN P137 >gi|1... 37 0.32
gi|3879921|emb|CAA98545.1| (Z74043) predicted using Genefinder;... 37 0.41
gi|969095 (U31961) no-on transient A-like protein [Drosophila m... 36 0.54
gi|2497729|sp|Q49537|VLPE_MYCHR VARIANT SURFACE ANTIGEN E PRECU... 36 0.54
gi|2689219|emb|CAA67983.1| (X99669) dynamin like protein [Dicty... 36 0.54
gi|4758666|ref|NP_004681.1|| LATS (large tumor suppressor, Dros... 36 0.54
gi|3172134 (U90209) RNA polymerase II largest subunit [Bonnemai... 36 0.54
gi|112945|sp|P14196|AAC2_DICDI AAC-RICH MRNA CLONE AAC11 PROTEI... 36 0.54
gi|4324420|gb|AAD16881| (AF104350) prespore-specific protein [D... 36 0.54
gi|1706637|sp|Q99372|ELS_RAT ELASTIN PRECURSOR (TROPOELASTIN) >... 36 0.71
gi|4220540|emb|CAA23013| (AL035356) hypothetical protein [Arabi... 36 0.71
gi|106291|pir||S16681 homeotic protein - human 36 0.93
gi|1078825|pir||S52850 cytoplasmic intermediate filament protei... 36 0.93
gi|1170758|sp|P38486|LEG3_CANFA GALECTIN-3 (GALACTOSE-SPECIFIC ... 36 0.93
gi|1724014|sp|P54604|YHCT_BACSU HYPOTHETICAL 33.7 KD PROTEIN IN... 36 0.93
gi|1184072 (U40766) COL-1 [Meloidogyne incognita] 35 1.2
gi|1589837 (U68729) cuticle preprocollagen [Meloidogyne incognita] 35 1.2
gi|2981221 (AF053091) eyelid [Drosophila melanogaster] 35 1.2
gi|3242649|dbj|BAA29028.1| (AB015440) alpha 1 type I collagen [... 35 1.2
gi|3879922|emb|CAA98546.1| (Z74043) cDNA EST yk310e2.3 comes fr... 35 1.2
gi|4808164|emb|CAB42795.1| (Y18878) largest subunit of the RNA ... 35 1.2
gi|4808166|emb|CAB42797.1| (Y18879) largest subunit of the RNA ... 35 1.2
gi|735898 (L40992) core-binding factor, runt domain, alpha subu... 35 1.6
gi|1280114|gb|AAC25888.1| (U55373) Similar to cuticular collage... 35 1.6
gi|2245687|gb|AAB62574.1| (AF010325) CHIP [Drosophila melanogas... 35 1.6
gi|2290720|gb|AAB65158.1| (AF001450) core binding factor alpha1... 35 1.6
gi|2245693|gb|AAB62577.1| (AF010328) short form of CHIP [Drosop... 35 1.6
gi|2293472 (AF010284) Osf2 [Mus musculus] 35 1.6
gi|2580612|gb|AAB82419.1| (AF005936) PEBP2alphaA major til-1 is... 35 1.6
gi|3877422|emb|CAB05195.1| (Z82268) predicted using Genefinder;... 35 1.6
gi|4324436|gb|AAD16883| (AF104414) large tumor suppressor 1 [Mu... 35 1.6
gi|539914|pir||A48233 polyomavirus enhancer-binding protein 2 a... 35 1.6
gi|5724787|gb|AAB65159.2| (AF001450) core binding factor alpha1... 35 1.6
gi|120943|sp|P13816|GARP_PLAFF GLUTAMIC ACID-RICH PROTEIN PRECU... 34 2.1
gi|437331 (L23429) beta-galactosides-binding lectin [Canis fami... 34 2.1
gi|163002 (M19372) elastin [Bos taurus] 34 2.1
gi|163004 (M19372) elastin-cBEL2 [Bos taurus] 34 2.1
gi|71823|pir||FGHUA fibrinogen alpha chain precursor, short spl... 34 2.1
gi|4503689|ref|NP_000499.1|| fibrinogen, A alpha polypeptide >g... 34 2.1
gi|1082938|pir||A49688 lactose-binding lectin L-29 - dog 34 2.1
gi|3810839|emb|CAA21800| (AL032684) conserved hypothetical zinc... 34 2.1
gi|3834293 (U80846) No definition line found [Caenorhabditis el... 34 2.1
gi|3834294 (U80846) No definition line found [Caenorhabditis el... 34 2.1
gi|3851592 (AF093575) surface antigen ariel1 [Entamoeba histoly... 34 2.1
gi|1280073 (U55366) Similar to cuticle collagen [Caenorhabditis... 34 2.7
gi|2501678|sp|Q45480|YLYB_BACSU HYPOTHETICAL 33.7 KD PROTEIN IN... 34 2.7
gi|2633919|emb|CAB13420| (Z99112) alternate gene name: ylmL; si... 34 2.7
gi|5901942|ref|NP_008971.1|| E1B-55kDa-associated protein 5 >gi... 34 2.7
gi|3924913|emb|CAB03474.1| (Z81138) predicted using Genefinder;... 34 2.7
gi|3924914|emb|CAB03475.1| (Z81138) predicted using Genefinder;... 34 2.7
gi|6580246|emb|CAB63321.1| (Z81138) cDNA EST EMBL:D65528 comes ... 34 2.7
gi|628112|pir||S32988 hypothetical protein - human herpesvirus 4 34 3.6
gi|3880306|emb|CAA90995.1| (Z54238) similar to cuticle collagen... 34 3.6
gi|2833307|sp|Q21184|YXWK_CAEEL PUTATIVE CUTICLE COLLAGEN F55C1... 34 3.6
gi|2144807|pir||EABO elastin precursor, splice form a - bovine 34 3.6
gi|3877661|emb|CAA98486.1| (Z74036) predicted using Genefinder;... 34 3.6
gi|5833486|gb|AAD53528.1| (AF162149) variable surface lipoprote... 34 3.6
gi|119293|sp|P04985|ELS_BOVIN ELASTIN PRECURSOR (TROPOELASTIN) ... 34 3.6
gi|543543|pir||JN0690 glutenin, high-molecular-weight Bx7 chain... 34 3.6
gi|119111|sp|P12978|EBN2_EBV EBNA-2 NUCLEAR PROTEIN >gi|1083964... 34 3.6
gi|628185|pir||S42447 nuclear protein EBNA2 - Epstein-Barr viru... 34 3.6
gi|282331|pir||S28037 penicillin-binding protein 1a - Streptoco... 33 4.7
gi|585647|sp|Q04707|PBPA_STRPN PENICILLIN-BINDING PROTEIN 1A (P... 33 4.7
gi|538257 (L23872) intermediate filament protein [Onchocerca vo... 33 4.7
gi|556343 (L31528) intermediate filament protein [Onchocerca vo... 33 4.7
gi|400698|sp|P31732|OV71_ONCVO MUSCLE CELL INTERMEDIATE FILAMEN... 33 4.7
gi|418972|pir||S31035 retrovirus-related gag polyprotein - mous... 33 4.7
gi|4503493|ref|NP_001955.1|| early growth response 1 >gi|119242... 33 4.7
gi|2833328|sp|Q27200|FBRL_TETTH FIBRILLARIN >gi|2654298|emb|CAA... 33 4.7
gi|2707270 (AF036171) homeobox-containing protein [Dictyosteliu... 33 4.7
gi|3319463 (AF077544) unknown [Caenorhabditis elegans] 33 4.7
gi|3879844|emb|CAB04725.1| (Z81592) predicted using Genefinder;... 33 4.7
gi|4140029|dbj|BAA36973.1| (AB015438) alpha 1 type I collagen [... 33 4.7
gi|5410459|gb|AAD43067.1|AF139884_1 (AF139884) penicillin-bindi... 33 4.7
gi|5815245|gb|AAD52614.1|AF175223_1 (AF175223) SANT domain prot... 33 4.7
gi|5852937|gb|AAD54275.1|AF169793_1 (AF169793) HET-C protein [P... 33 4.7
gi|6563337|gb|AAF17255.1|AF210745_1 (AF210745) penicillin-bindi... 33 4.7
gi|6563341|gb|AAF17257.1|AF210747_1 (AF210747) penicillin-bindi... 33 4.7
gi|6594249|emb|CAB63561.1| (AL031746) hypothetical garp protein... 33 4.7
gi|476822|pir||A42893 penicillin-binding protein 1A - Streptoco... 33 4.7
gi|6601502|gb|AAF19004.1|AF151366_1 (AF151366) arginine/serine-... 33 4.7
gi|82601|pir||A30843 glutenin high molecular weight chain Bx7 p... 33 6.2
gi|543541|pir||JC2099 glutenin, high molecular weight chain Bx1... 33 6.2
gi|330361 (M10593) major outer envelope glycoprotein gp220 [Eps... 33 6.2
gi|2119159|pir||I50694 alpha-1 collagen type III - chicken (fra... 33 6.2
gi|2314597|gb|AAD08465.1| (AE000642) conserved hypothetical pro... 33 6.2
gi|2388676 (AF015539) precollagen P [Mytilus edulis] 33 6.2
gi|2854193 (AF045645) Similar to cuticular collagen; coded for ... 33 6.2
gi|3875708|emb|CAA84658.1| (Z35598) Asparagine, Serine and Glyc... 33 6.2
gi|4028688 (U91649) merozoite surface antigen 2 [Plasmodium fal... 33 6.2
gi|4028690 (U91650) merozoite surface antigen 2 [Plasmodium fal... 33 6.2
gi|6324130|ref|NP_014200.1|GCR2| Transcription factor; Gcr2p >g... 32 8.1
gi|182424 (J00127) alpha-fibrinogen precursor [Homo sapiens] 32 8.1
gi|1136289 (U42597) histidine kinase A [Dictyostelium discoideum] 32 8.1
gi|1723450|sp|Q10268|YD34_SCHPO HYPOTHETICAL 81.7 KD PROTEIN C1... 32 8.1
gi|1448962 (L79944) OFR1 of non-LTR retrotransposon; putative [... 32 8.1
gi|1523876|emb|CAA99996| (Z75666) BRCA2 [Chlorocebus aethiops] 32 8.1
gi|1870123|emb|CAB06788| (Z86109) unknown [Saccharomyces pastor... 32 8.1
gi|4033468|sp|P92965|RS40_ARATH ARGININE/SERINE-RICH SPLICING F... 32 8.1
gi|2952545 (AF051898) coronin binding protein [Dictyostelium di... 32 8.1
gi|3876265|emb|CAB02976.1| (Z81067) predicted using Genefinder;... 32 8.1
gi|4996369|dbj|BAA78427.1| (AB021267) polyprotein [Arabidopsis ... 32 8.1
gi|5824601|emb|CAA82571.2| (Z29443) similar to Annexin; cDNA ES... 32 8.1
gi|223918|prf||1004351A fibrinogen alphaA [Homo sapiens] 32 8.1
gi|5901828|gb|AAD55422.1|AF181636_1 (AF181636) BcDNA.GH07269 [D... 32 8.1
gi|6103602|gb|AAF03681.1| (AF160252) KIAA0553 protein [Homo sap... 32 8.1
gi|6273747|gb|AAF06358.1|AF102865_1 (AF102865) calymmin [Danio ... 32 8.1
>gi|1175678|sp|P37765|YCIL_ECOLI HYPOTHETICAL 32.7 KD PROTEIN IN
TRPL-BTUR INTERGENIC REGION (ORF4)
>gi|1742064|dbj|BAA14806| (D90764) ORF_ID:o253#9;
similar to [SwissProt Accession Number P37765]
[Escherichia coli] >gi|1742080|dbj|BAA14821| (D90765)
ORF_ID:o253#9; similar to [SwissProt Accession Number
P37765] [Escherichia coli] >gi|1787524 (AE000225) orf,
hypothetical protein [Escherichia coli]
Length = 291
Score = 239 bits (604), Expect = 3e-62
Identities = 136/273 (49%), Positives = 168/273 (60%), Gaps = 11/273 (4%)
Query: 25 LEERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLG--MSVKSGDKIELDGRSF-VA 81
+ E+L KVLA+AG GSRR +E I G + V+G IA+LG + V G KI +DG V
Sbjct: 1 MSEKLQKVLARAGHGSRREIESIIEAGRVSVDGKIAKLGDRVEVTPGLKIRIDGHLISVR 60
Query: 82 SALTEPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXX 141
+ + RVL Y KPEGE+ TR DPEGRPTVF+ LP L+GARWIA+GRLD+N
Sbjct: 61 ESAEQICRVLAYYKPEGELCTRNDPEGRPTVFDRLPKLRGARWIAVGRLDVNTCGLLLFT 120
Query: 142 XDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERI 201
DGELAN +MHPS E+EREY VRV V D L L+RGV LEDG A F TI+
Sbjct: 121 TDGELANRLMHPSREVEREYAVRVFG-----QVDDAKLRDLSRGVQLEDGPAAFKTIKFS 175
Query: 202 GNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKT 261
G + W+ V + EGRNREVRRLWE+ G QVSRL R RYG + LP+ L RG TEL
Sbjct: 176 GGEGINQWYNVTLTEGRNREVRRLWEAVGVQVSRLIRVRYGDIPLPKGLPRGGWTELDLA 235
Query: 262 QVEALRTQLKLEKDMPLALTLQPIIGQRRSAKA 294
Q LR ++L + + ++ RR KA
Sbjct: 236 QTNYLRELVELPPETSSKVAVEK---DRRRMKA 265
>gi|602963 (U18111) ORF4 [Escherichia coli]
Length = 243
Score = 238 bits (600), Expect = 1e-61
Identities = 131/246 (53%), Positives = 157/246 (63%), Gaps = 8/246 (3%)
Query: 25 LEERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLG--MSVKSGDKIELDGRSF-VA 81
+ E+L KVLA+AG GSRR +E I G + V+G IA+LG + V G KI +DG V
Sbjct: 1 MSEKLQKVLARAGHGSRREIESIIEAGRVSVDGKIAKLGDRVEVTPGLKIRIDGHLISVR 60
Query: 82 SALTEPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXX 141
+ + RVL Y KPEGE+ TR DPEGRPTVF+ LP L+GARWIA+GRLD+N
Sbjct: 61 ESAEQICRVLAYYKPEGELCTRNDPEGRPTVFDRLPKLRGARWIAVGRLDVNTCGLLLFT 120
Query: 142 XDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERI 201
DGELAN +MHPS E+EREY VRV V D L L+RGV LEDG A F TI+
Sbjct: 121 TDGELANRLMHPSREVEREYAVRVFG-----QVDDAKLRDLSRGVQLEDGPAAFKTIKFS 175
Query: 202 GNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKT 261
G + W+ V + EGRNREVRRLWE+ G QVSRL R RYG + LP+ L RG TEL
Sbjct: 176 GGEGINQWYNVTLTEGRNREVRRLWEAVGVQVSRLIRVRYGDIPLPKGLPRGGWTELDLA 235
Query: 262 QVEALR 267
Q LR
Sbjct: 236 QTNYLR 241
>gi|1175679|sp|P45104|YCIL_HAEIN HYPOTHETICAL PROTEIN HI1199
>gi|1074680|pir||A64169 hypothetical protein HI1199 -
Haemophilus influenzae (strain Rd KW20) >gi|1574128
(U32799) conserved hypothetical protein [Haemophilus
influenzae Rd]
Length = 357
Score = 234 bits (590), Expect = 1e-60
Identities = 137/294 (46%), Positives = 174/294 (58%), Gaps = 9/294 (3%)
Query: 19 ATEAPKLE-ERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLG--MSVKSGDKIELD 75
A+ PK E E+L KVLA+AG GSRR +E I+ G + V G IA LG + V SG K+ +D
Sbjct: 67 ASNQPKAEGEKLQKVLARAGQGSRREIETMIAAGRVSVEGKIATLGDRIDVHSGVKVRID 126
Query: 76 GRSF-VASALTEPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINX 134
G+ ++ E RVL+Y KPEGE+ TR DPEGR TVF+ LP L G+RWIA+GRLDIN
Sbjct: 127 GQIINLSHTQKEICRVLMYYKPEGELCTRSDPEGRATVFDRLPRLTGSRWIAVGRLDINT 186
Query: 135 XXXXXXXXDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAK 194
DGELAN +MHPS E+EREY VRV V D +L +L +GV LEDG A
Sbjct: 187 SGLLLFTTDGELANRLMHPSREVEREYSVRVFG-----QVDDAMLARLRKGVQLEDGLAN 241
Query: 195 FDTIERIGNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQ 254
F I+ G + W+ V + EGRNREVRRLWESQG QVSRL R RYG++ L + L RG
Sbjct: 242 FKEIKFTGGVGINQWYDVTLMEGRNREVRRLWESQGIQVSRLIRIRYGNIKLMKGLPRGG 301
Query: 255 STELPKTQVEALRTQLKLEKDMPLALTLQPIIGQRRSAKATLHVNRNDNNKHAY 308
E+ V LR + L + L ++ + +S + V R Y
Sbjct: 302 WEEMDLENVNYLRELVGLPAETETKLDVKQASRRPKSGQIRKAVKRYSEMNKRY 355
>gi|1175677|sp|P42395|YCIL_BUCAP HYPOTHETICAL 30.2 KD PROTEIN IN
TRPA 3'REGION >gi|480102|pir||S36431 hypothetical
protein - Buchnera aphidicola >gi|396661|emb|CAA79503|
(Z19055) unknown open reading frame [Buchnera
aphidicola]
Length = 258
Score = 159 bits (397), Expect = 6e-38
Identities = 92/254 (36%), Positives = 144/254 (56%), Gaps = 10/254 (3%)
Query: 28 RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLG--MSVKSGDKIELDGRSFVASALT 85
++ K+L+ G GSRR +E I G I +NG+ A +G ++ K+ +I +D + +
Sbjct: 4 KIQKILSDLGYGSRRFIECMIKCGKISINGEKAIIGQYLNKKNPGEILIDKKKIIVKRNK 63
Query: 86 EPARVLIYN-KPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDG 144
+VLIYN KP GEV TR+D + R TVF+ LP L RW+++GRLDIN DG
Sbjct: 64 NLPKVLIYNNKPIGEVCTRDDFQKRLTVFDKLPKLNLNRWVSVGRLDINTKGLLLFTNDG 123
Query: 145 ELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERI--G 202
LAN +MHP S+IEREY +R+ + + L +GV + G F I +
Sbjct: 124 TLANKLMHPRSQIEREYNIRIFG-----EMNKNKINILRKGVKIIHGYVSFKEIVPLYDK 178
Query: 203 NTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQ 262
+ WF+ ++ EG+NRE+R +++S CQV++L R RYG+++LP+ L GQ L
Sbjct: 179 KEGKNKWFKGILCEGKNREIRLMFKSIQCQVNQLIRVRYGNIILPKNLKEGQWMMLNSIF 238
Query: 263 VEALRTQLKLEKDM 276
++ L + +K++
Sbjct: 239 LKKLYNLINFDKEI 252
>gi|466190|sp|P35159|RLUB_BACSU RIBOSOMAL LARGE SUBUNIT
PSEUDOURIDINE SYNTHASE B (PSEUDOURIDYLATE SYNTHASE)
(URACIL HYDROLYASE) >gi|629120|pir||S45555 hypothetical
protein X13 - Bacillus subtilis >gi|410137 (L09228)
ORFX13 [Bacillus subtilis] >gi|2634751|emb|CAB14248|
(Z99116) similar to hypothetical proteins [Bacillus
subtilis]
Length = 229
Score = 129 bits (321), Expect = 5e-29
Identities = 90/245 (36%), Positives = 133/245 (53%), Gaps = 26/245 (10%)
Query: 27 ERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIA-QLGMSVKSGDKIELDGRSFVASALT 85
ERL KV+A AG+ SR E+ I G +KVNG + +LG+ V D+IE++G
Sbjct: 2 ERLQKVIAHAGVASRSKAEELIKEGKVKVNGKVVTELGVKVTGSDQIEVNGLKVERE--- 58
Query: 86 EPARVLIYNKPEGEVTTREDPEGRPTV---FETLPVLKGARWIAIGRLDINXXXXXXXXX 142
EP L+Y KP G ++ +D +GR V F+ +P R IGRLD +
Sbjct: 59 EPVYFLLY-KPRGVISAAQDDKGRKVVTDFFKNIP----QRIYPIGRLDYDTSGLLLLTN 113
Query: 143 DGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDG-----TAKFDT 197
DGE AN +MHP EI++ YV +V+ + ELL +L RG+ LE+G AK +
Sbjct: 114 DGEFANKLMHPKYEIDKTYVAKVKGIPPK-----ELLRKLERGIRLEEGKTAPAKAKLLS 168
Query: 198 IERIGNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTE 257
+++ T ++ + EGRNR+VRR++E+ G +V +LKR Y + L R L G + E
Sbjct: 169 LDKKKQTSI---IQLTIHEGRNRQVRRMFEAIGHEVIKLKREEYAFLNL-RGLHTGDARE 224
Query: 258 LPKTQ 262
L T+
Sbjct: 225 LRLTK 229
>gi|6458615|gb|AAF10472.1|AE001942_4 (AE001942) ribosomal large
subunit pseudouridine synthase B [Deinococcus
radiodurans]
Length = 257
Score = 123 bits (306), Expect = 3e-27
Identities = 89/240 (37%), Positives = 119/240 (49%), Gaps = 13/240 (5%)
Query: 27 ERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSVKSGDKIELDGRSFVASALTE 86
ERLHK LA+AG+ SRRA E+ I G + VNG A LG V D + +DGR V E
Sbjct: 4 ERLHKRLARAGIASRRAAEELIRAGRVTVNGQTAGLGQGVNDTDDVRVDGR-LVELTRPE 62
Query: 87 PARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGEL 146
+Y KP G VTT D GR V + +P + G +GRLD + DG+L
Sbjct: 63 TVTYALY-KPVGFVTTAHDEYGRRNVLDAMPDVPGLH--PVGRLDKDSEGLLLLTNDGDL 119
Query: 147 ANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTDS 206
+ HP E+ Y EG E L+ L RG+ ++DG A + + +
Sbjct: 120 TLTLTHPRYGHEKAYRAWT---EGREPPTQAELDVLVRGIAMDDGPA-----QALSAAPA 171
Query: 207 HDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQVEAL 266
D VV+ EGRNR+VRR+ E+ G V RL R R G + L +L G+ EL +E L
Sbjct: 172 EDGAYVVLGEGRNRQVRRMLEALGHPVGRLVRYRVGGLWL-GDLNPGEYRELGPRDLEQL 230
>gi|4980760|gb|AAD35352.1|AE001708_20 (AE001708) 16S pseudouridylate
synthase [Thermotoga maritima]
Length = 239
Score = 120 bits (298), Expect = 3e-26
Identities = 80/252 (31%), Positives = 136/252 (53%), Gaps = 18/252 (7%)
Query: 28 RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIA-QLGMSVKSGDKIELDGRSFVASALTE 86
RL + L+ +G+G+R+ +++ I G + VNG + G V D + LDG +
Sbjct: 2 RLDRYLSNSGVGTRKEVKKLIKQGRVTVNGRVVLDPGHPVLENDAVALDGE---VVRFHK 58
Query: 87 PARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGEL 146
+L Y KP G VT+ +DP T+ E LP LKG +GRLD + DG+
Sbjct: 59 KVYILFY-KPSGYVTSTKDPHSE-TIMEFLPPLKGI--FPVGRLDKDAEGLLIITNDGDF 114
Query: 147 ANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGT-AKFDTIERIGNTD 205
A+ ++ P +E+EY+V+V EGE V ++ +E+L GV L DG AK +E++ N
Sbjct: 115 AHRVISPKWSVEKEYIVKV---EGE--VTEDKIEKLKNGVTLRDGFFAKAKRVEKLSN-- 167
Query: 206 SHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQVEA 265
D ++V+ EG+ +++R+ + G + LKRTR G ++LP ++ G+ L + +V+
Sbjct: 168 --DTLKIVITEGKYHQIKRMTAAVGLKTVHLKRTRIGGLVLPDDMKPGEYRFLSEEEVKK 225
Query: 266 LRTQLKLEKDMP 277
+ + ++D P
Sbjct: 226 VFEREDQKEDTP 237
>gi|3915347|sp|O51155|Y129_BORBU HYPOTHETICAL PROTEIN BB0129
>gi|2688006 (AE001124) conserved hypothetical protein
[Borrelia burgdorferi]
Length = 249
Score = 118 bits (292), Expect = 1e-25
Identities = 74/249 (29%), Positives = 129/249 (51%), Gaps = 11/249 (4%)
Query: 28 RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSVKSGDKIELDGRSFVASALTEP 87
R+H LA+ G+GSRR E+ I L++VN IA+LG V GD+I + FV
Sbjct: 8 RVHVFLAEKGVGSRRFCEELIRKKLVRVNNTIAKLGDKVTLGDRIIYKKQIFVFKDFQIN 67
Query: 88 ARV-LIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGEL 146
R+ L NKP + + D +GR + L R +IGRLD DG+
Sbjct: 68 NRIYLALNKPRNYLCSNFDVDGRKLAISLVQPLFKERVFSIGRLDFKSSGLLLFTNDGKF 127
Query: 147 ANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTDS 206
AN ++HP ++EREY++ E ++ + + LL G+ ++ K + E + +
Sbjct: 128 ANDIIHPRQKVEREYII-----ESKKDIDENLLISFKSGIKVKKEFFKLKSYEILNKNSA 182
Query: 207 HDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQVEAL 266
R+++ EG+NRE+R+++ S+ + ++ R R G++ L L GQ +P +++ +L
Sbjct: 183 ----RLILDEGKNREIRKVFLSKNIFLKKIHRIRIGNINLD-SLKEGQVKIVPLSKINSL 237
Query: 267 RTQLKLEKD 275
+++L+ D
Sbjct: 238 KSRLEKLND 246
>gi|6647946|sp|Q9ZD06|Y544_RICPR HYPOTHETICAL PROTEIN RP544
>gi|3861093|emb|CAA14993| (AJ235272) unknown [Rickettsia
prowazekii]
Length = 235
Score = 116 bits (288), Expect = 4e-25
Identities = 78/218 (35%), Positives = 123/218 (55%), Gaps = 12/218 (5%)
Query: 28 RLHKVLAQAGLGSRRALEQRISNGLIKVNG-DIAQLGMSVKSGDKIELDGRSFVASALTE 86
RL K+++ AG+ SRR E+ I G +K++G I +V ++IE+ GR T+
Sbjct: 3 RLAKIISNAGVCSRRNAEKLIVGGKVKIDGITILSPATNVDMSNQIEVSGRLINN---TQ 59
Query: 87 PARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGEL 146
R+ IY KP G +TT +DP R TVFE L L R I+IGRLD+N G+L
Sbjct: 60 KPRLWIYYKPVGLITTHKDPLSRKTVFEQLIGLP--RVISIGRLDLNSEGLLLLTNSGDL 117
Query: 147 ANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTDS 206
A+ P+S+++R Y VR G ++ LL+ + + ++ +I+ + S
Sbjct: 118 AHQFEMPASKLKRVYNVRAY---GNPNI---LLKNNYKNLKIDGIFYNPHSIKLLRQNKS 171
Query: 207 HDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSV 244
+ WF VV+ EG+NRE+RR++E G QV++L R +YG++
Sbjct: 172 NSWFEVVLFEGKNREIRRIFEYFGLQVNKLIRIQYGAL 209
>gi|5738484|emb|CAB52832.1| (AL109848) putative pseudouridine
synthase [Streptomyces coelicolor A3(2)]
Length = 371
Score = 115 bits (284), Expect = 1e-24
Identities = 80/248 (32%), Positives = 123/248 (49%), Gaps = 13/248 (5%)
Query: 27 ERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIA-QLGMSVK-SGDKIELDGRSFVASAL 84
ERL KVLA+AG GSRRA E+ I +++NG+I + G V D++++DG +
Sbjct: 135 ERLQKVLARAGYGSRRACEELIEQARVEINGEIVLEQGRRVDPEKDEVKVDG----LTVA 190
Query: 85 TEPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDG 144
T+ + NKP G V+T EDPEGR + + + + R +GRLD G
Sbjct: 191 TQSYQFFSLNKPAGVVSTMEDPEGRQCLGDYV-TNRETRLFHVGRLDTETEGVILLTNHG 249
Query: 145 ELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNT 204
ELA+ + HP +++ Y+ + P + +L ++L G+ LEDG A+ D + T
Sbjct: 250 ELAHRLTHPRYGVKKTYLAHIVGP-----IPRDLGKRLKDGIQLEDGYARADHFRVVEQT 304
Query: 205 DSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQVE 264
+ V + EGR VRR+ G V L RT +G + L + G L T+V
Sbjct: 305 GKNYLVEVTLHEGRKHIVRRMLAEAGFPVDNLVRTAFGPITL-GDQKSGWLRRLSNTEVG 363
Query: 265 ALRTQLKL 272
L ++ L
Sbjct: 364 MLMQEVDL 371
>gi|3915377|sp|Q55578|Y361_SYNY3 HYPOTHETICAL 28.2 KD PROTEIN
SLR0361 >gi|1001457|dbj|BAA10082| (D63999) hypothetical
protein [Synechocystis sp.]
Length = 249
Score = 113 bits (281), Expect = 3e-24
Identities = 83/254 (32%), Positives = 122/254 (47%), Gaps = 12/254 (4%)
Query: 25 LEERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSVK-SGDKIELDGRSFVASA 83
+ ER+ K+L+Q G+ SRR E+ I G + VNG +A LG D + +DG+ A
Sbjct: 1 MAERIQKLLSQWGIASRRHAEEMILAGRVSVNGKVANLGDKADPQQDFLSVDGKQIKADN 60
Query: 84 LTEPARVLIYNKPEGEVTTREDPEGRPTVFETLP--VLKGARWIAIGRLDINXXXXXXXX 141
+L+ NKP ++T +DP GR TV + LP + +G +GRLD N
Sbjct: 61 RPRDIYLLV-NKPRDVLSTCDDPRGRKTVLDLLPQDLQRGKGLHPVGRLDRNSTGALLLT 119
Query: 142 XDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERI 201
DGEL + HP + + Y V + E + DE LE+ G+ML+ T+E I
Sbjct: 120 NDGELTLRLTHPRYHLPKTYDVWL-----EGNPSDEDLEKWRSGMMLDGKKTLPATLEVI 174
Query: 202 GNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLL---PRELLRGQSTEL 258
V + EGRNR++RRL E G V +L R G + L + L GQ L
Sbjct: 175 SENKDQIHLLVTLTEGRNRQIRRLAEELGLTVLKLHRRTIGPLQLHTRGKVLGSGQFRFL 234
Query: 259 PKTQVEALRTQLKL 272
++ L+ Q+ L
Sbjct: 235 SPAEIRLLKKQVNL 248
>gi|3915445|sp|O67444|YE64_AQUAE HYPOTHETICAL PROTEIN AQ_1464
>gi|2983856 (AE000741) hypothetical protein [Aquifex
aeolicus]
Length = 249
Score = 110 bits (273), Expect = 2e-23
Identities = 81/246 (32%), Positives = 126/246 (50%), Gaps = 20/246 (8%)
Query: 25 LEERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQ-LGMSVKSG-DKIELDGRSFVAS 82
+E R++K L++AG+ SRR E+ I G +KVNG++ + LG+ V D +E+DG+
Sbjct: 2 MEVRINKFLSEAGVASRRKAEKLILEGRVKVNGEVVRSLGVKVNPEVDIVEVDGKP---- 57
Query: 83 ALTEPARVLIYNKPEGEVTTR-EDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXX 141
+ R +I NKP +T P+GR T+ E + + R +GRLD N
Sbjct: 58 VKPQRKRYIILNKPCCYLTQLGRSPDGRKTIEELIKDIP-ERVFPVGRLDYNTEGLLILT 116
Query: 142 XDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERI 201
DGELAN ++HP ++ + Y+ V E V + L+++ +G+ LEDG AK D I +
Sbjct: 117 NDGELANRILHPRYKLPKVYLALV-----EGKVDQKTLKRMKQGIELEDGFAKPDNIRIV 171
Query: 202 GNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLP-------RELLRGQ 254
+ + EGR V+R + G +V RLKR G + L REL +G+
Sbjct: 172 RYEGKNTLLEITFHEGRKHLVKRFLGAFGHKVKRLKRIAIGPIKLGKLSPGKWRELNQGE 231
Query: 255 STELPK 260
+L K
Sbjct: 232 LAQLFK 237
>gi|3915547|sp|O33210|YH11_MYCTU HYPOTHETICAL 27.6 KD PROTEIN RV1711
>gi|2326754|emb|CAB10968| (Z98268) hypothetical protein
Rv1711 [Mycobacterium tuberculosis]
Length = 254
Score = 108 bits (266), Expect = 1e-22
Identities = 79/226 (34%), Positives = 111/226 (48%), Gaps = 12/226 (5%)
Query: 28 RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIA-QLGMSVKSGDKI-ELDGRSFVASALT 85
RL KVL+QAG+ SRRA E+ I +G ++V+G + +LG V + +DG V L
Sbjct: 15 RLQKVLSQAGIASRRAAEKMIVDGRVEVDGHVVTELGTRVDPQVAVVRVDGARVV---LD 71
Query: 86 EPARVLIYNKPEGEVTTREDPEGRPTVFETLP--VLKGARWIAIGRLDINXXXXXXXXXD 143
+ L NKP G +T D GRP + + + V + +GRLD + D
Sbjct: 72 DSLVYLALNKPRGMHSTMSDDRGRPCIGDLIERKVRGTKKLFHVGRLDADTEGLMLLTND 131
Query: 144 GELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGN 203
GELA+ +MHPS E+ + Y+ V V L L G+ L+DG A D +
Sbjct: 132 GELAHRLMHPSHEVPKTYLATVTGS-----VPRGLGRTLRAGIELDDGPAFVDDFAVVDA 186
Query: 204 TDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRE 249
RV + EGRNR VRRL + G V L RT G+V L ++
Sbjct: 187 IPGKTLVRVTLHEGRNRIVRRLLAAAGFPVEALVRTDIGAVSLGKQ 232
>gi|3915542|sp|O05668|YH11_MYCLE HYPOTHETICAL 28.1 KD PROTEIN
CB1351.03C >gi|2065214|emb|CAB08276| (Z95117)
MLC1351.03c, unknown, len: 256 aa, similar to eg.
YCIL_ECOLI P37765 hypothetical 32.7 kd protein in trpl-
(291 aa), fasta clones, opt: 481 z-score: 570.9 E():
8.5e-25, (42.4% identity in 229 aa overlap); contains
PS011...
Length = 256
Score = 106 bits (263), Expect = 3e-22
Identities = 81/244 (33%), Positives = 119/244 (48%), Gaps = 19/244 (7%)
Query: 28 RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIA-QLGMSVKSG-DKIELDGRSFVASALT 85
RL K+L++AG+ SRRA E+ I G ++V+G + +LG V + +DG V +
Sbjct: 17 RLQKILSRAGIASRRAAEKLIIEGRVEVDGQLVRELGTRVDPDVSVVRVDG---VKVVVD 73
Query: 86 EPARVLIYNKPEGEVTTREDPEGRPTVFETLP--VLKGARWIAIGRLDINXXXXXXXXXD 143
+ L NKP G +T D GRP V + + V + +GRLD + D
Sbjct: 74 DSLVYLALNKPRGMYSTMSDDRGRPCVGDLIERRVRGNKKLFHVGRLDADTEGLILLTND 133
Query: 144 GELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGN 203
GELA+ +MHPS E+ + Y+ V+ V L ++L+ G+ L+DG A D +
Sbjct: 134 GELAHRLMHPSHEVSKTYLATVKGA-----VPRGLGKKLSVGLELDDGPAHVDDFAVVDA 188
Query: 204 TDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLP-------RELLRGQST 256
R+ + EGR R VRRL + G V L RT G+V L R LLR +
Sbjct: 189 IPGKTLVRLTLHEGRKRIVRRLLTAAGFPVEMLVRTDIGAVSLGDQRPGCLRALLRDEIR 248
Query: 257 ELPK 260
+L K
Sbjct: 249 QLYK 252
>gi|3915394|sp|O66829|Y554_AQUAE HYPOTHETICAL PROTEIN AQ_554
>gi|2983194 (AE000695) hypothetical protein [Aquifex
aeolicus]
Length = 238
Score = 105 bits (259), Expect = 1e-21
Identities = 74/248 (29%), Positives = 135/248 (53%), Gaps = 20/248 (8%)
Query: 28 RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIA-QLGMSVKSGDKIELDGRSFVASALTE 86
RL K L+++ SR+ ++ I G +KV+G + Q VK G+++E++G+S +
Sbjct: 2 RLDKYLSKSLHISRKEAKELIREGRVKVSGKVVKQAEYRVKEGEEVEVEGKS------VK 55
Query: 87 PAR--VLIYNKPEGEVTTREDPEGRPTVFETLPV-LKGARWIAIGRLDINXXXXXXXXXD 143
P + L+ KP+G ++T E+ + P+ E + + + GRLD++ D
Sbjct: 56 PKKNVYLMLYKPKGYLSTTEEDKKYPSFLELIREHFPSRKLFSAGRLDVDAEGLLLITDD 115
Query: 144 GELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGN 203
GELA+ + HP ++E+EY+VR+ + + DE L++L V LE+ + E++
Sbjct: 116 GELAHRLTHPKWKVEKEYIVRL-----DRDIGDEELKKLYE-VKLEEKPVQLVKAEKL-- 167
Query: 204 TDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQV 263
S D + ++ EGR+ V+RL+++ G V LKRTR G++ L + G+ EL + +V
Sbjct: 168 --SGDTVKAILTEGRHHVVKRLFKAVGHNVVYLKRTRVGNLRLDENMEPGEWRELTEEEV 225
Query: 264 EALRTQLK 271
+ L+ +K
Sbjct: 226 KELKRLVK 233
>gi|4155973 (AE001558) putative [Helicobacter pylori J99]
Length = 262
Score = 103 bits (255), Expect = 3e-21
Identities = 74/228 (32%), Positives = 119/228 (51%), Gaps = 13/228 (5%)
Query: 28 RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSVKSGDKIELDGRSFVASALTEP 87
R+++ LA SRR E+ + G +K+N + A+L VK DK+ LD R + +
Sbjct: 7 RINQFLAHYTKHSRREAEKLVLEGRVKINHEHAKLASVVKENDKVFLDKR-LIKPLKNKK 65
Query: 88 ARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGELA 147
VL+Y+KP+GE+ ++ DP R ++E+L K A + +GRLD +
Sbjct: 66 FSVLVYHKPKGELVSKADPLKRHVIYESLEK-KYAHFAPVGRLDFASEGVLLLSDSKAVV 124
Query: 148 NAMMHPSSEIEREYVVRVR---SPEGEEHVQDEL-LEQLTRGVMLEDGT-----AKFDTI 198
+A+MH + +E+EY+V+++ + E E +Q+ L LE T+G + A F
Sbjct: 125 SALMH--ANLEKEYLVKIQGFVTREMENAMQEGLKLENATKGAHQKTPIKSMEFAPFIGY 182
Query: 199 ERIGNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLL 246
E I N + RV++ EG+NRE+RR + V L+R RYG V L
Sbjct: 183 EIIKNHAKYSKLRVIINEGKNRELRRFFAFFNAGVLDLRRVRYGFVNL 230
>gi|2501526|sp|P55986|YE59_HELPY HYPOTHETICAL PROTEIN HP1459
>gi|2314637|gb|AAD08501.1| (AE000646) conserved
hypothetical protein [Helicobacter pylori 26695]
Length = 262
Score = 102 bits (253), Expect = 5e-21
Identities = 72/228 (31%), Positives = 120/228 (52%), Gaps = 13/228 (5%)
Query: 28 RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSVKSGDKIELDGRSFVASALTEP 87
R+++ LA SRR E+ + G +K+N + A+L VK D++ LD R + +
Sbjct: 7 RINQFLAHYTKHSRREAEKLVLEGRVKINHEHAKLASVVKENDRVFLDKR-LIKPLKNKK 65
Query: 88 ARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGELA 147
VL+Y+KP+GE+ ++ DP R ++E+L K A + +GRLD +
Sbjct: 66 FSVLVYHKPKGELVSKADPLKRRVIYESLEK-KYAHFAPVGRLDFASEGVLLLSDSKAVV 124
Query: 148 NAMMHPSSEIEREYVVRVR---SPEGEEHVQDEL-LEQLTRGVMLEDGT-----AKFDTI 198
+A+MH +++E+EY+++++ + E E +Q+ L LE T+G + A F
Sbjct: 125 SALMH--ADLEKEYLIKIQGFVTREMENAMQEGLKLENATKGAHQKTPIKSMEFAPFIGY 182
Query: 199 ERIGNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLL 246
E I N + RV++ EG+NRE+RR + V L+R RYG V L
Sbjct: 183 EIIKNHAKYSKLRVIINEGKNRELRRFFAFFNAGVLDLRRVRYGFVNL 230
>gi|418534|sp|P32684|YJBC_ECOLI HYPOTHETICAL 32.5 KD PROTEIN IN
PEPE-LYSC INTERGENIC REGION >gi|396357 (U00006) No
definition line found [Escherichia coli] >gi|1790453
(AE000475) orf, hypothetical protein [Escherichia coli]
Length = 290
Score = 95.6 bits (234), Expect = 8e-19
Identities = 66/224 (29%), Positives = 117/224 (51%), Gaps = 13/224 (5%)
Query: 23 PKLEERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSVKSGDKIELDGRSFVAS 82
P RL+K ++++G+ SRR ++ I G + +NG A +G VK GD ++++G+ +
Sbjct: 3 PDSSVRLNKYISESGICSRREADRYIEQGNVFLNGKRATIGDQVKPGDVVKVNGQ-LIEP 61
Query: 83 ALTEPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXX 142
E ++ NKP G V+T ED E R + + V R IGRLD +
Sbjct: 62 REAEDLVLIALNKPVGIVSTTEDGE-RDNIVDF--VNHSKRVFPIGRLDKDSQGLIFLTN 118
Query: 143 DGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIG 202
G+L N ++ ++ E+EY+V V P + +E + ++ GV + K +++
Sbjct: 119 HGDLVNKILRAGNDHEKEYLVTVDKP-----ITEEFIRGMSAGVPILGTVTKKCKVKK-- 171
Query: 203 NTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLL 246
++ FR+ + +G NR++RR+ E G +V +L+RTR +V L
Sbjct: 172 --EAPFVFRITLVQGLNRQIRRMCEHFGYEVKKLERTRIMNVSL 213
>gi|3329180|gb|AAC68318.1| (AE001343) predicted pseudouridine
synthase [Chlamydia trachomatis]
Length = 241
Score = 82.7 bits (201), Expect = 6e-15
Identities = 71/242 (29%), Positives = 117/242 (48%), Gaps = 20/242 (8%)
Query: 28 RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSV---KSGDKIELDGRSFVASAL 84
RL+K LA AG+ SRR ++ I G + VNG +A G V + D +E+ G+ A
Sbjct: 5 RLNKFLASAGVASRRKCDEIIFAGSVTVNGRVAA-GPFVTVDEEFDSVEVGGQRIGA--- 60
Query: 85 TEPARVLIYNKPEGEVTTREDP-EGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXD 143
E + +KP G + + E G V + L R +GRLD D
Sbjct: 61 -EKKVYFMVHKPLGYLCSSERKFPGSKLVIDLLSHCP-YRLFTVGRLDKETSGLILVTND 118
Query: 144 GELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGN 203
GE AN ++HPS I +EY+++V V LE L G +++ + +++++
Sbjct: 119 GEFANRVIHPSFGITKEYLLKV-----SRDVTARDLETLMAGTVIDGKVVRPVSVKKV-- 171
Query: 204 TDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQV 263
+++V EG+ E+R E+ G Q+ LKR R GS++L L G+ EL +++
Sbjct: 172 --RRGTIKIIVNEGKKHEIRLFAEAAGLQLLELKRIRIGSLVL-GGLPYGKYRELTDSEL 228
Query: 264 EA 265
++
Sbjct: 229 DS 230
>gi|465610|sp|P33918|RSUA_ECOLI RIBOSOMAL SMALL SUBUNIT
PSEUDOURIDINE SYNTHASE A (16S PSEUDOURIDYLATE 516
SYNTHASE) (16S PSEUDOURIDINE 516 SYNTHASE) (URACIL
HYDROLYASE) >gi|1042177|bbs|169371 16S RNA pseudouridine
516 synthase, 16S RNA psi 516 synthase=rsuA gene product
[Escherichia coli, Peptide, 231 aa] >gi|405907 (U00008)
yejD [Escherichia coli] >gi|1788510 (AE000308) 16S
pseudouridylate 516 synthase [Escherichia coli]
Length = 231
Score = 82.7 bits (201), Expect = 6e-15
Identities = 71/243 (29%), Positives = 111/243 (45%), Gaps = 18/243 (7%)
Query: 28 RLHKVLAQAGLGSRRALEQR-ISNGLIKVNGDIAQ-LGMSVKSGDKIELDGRSFVASALT 85
RL K +AQ LG RA+ R I + V+G+I + + + DG A
Sbjct: 2 RLDKFIAQQ-LGVSRAIAGREIRGNRVTVDGEIVRNAAFKLLPEHDVAYDGNPL---AQQ 57
Query: 86 EPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGE 145
R + NKP+G V + +DP+ PTV L + A GRLDI+ DG+
Sbjct: 58 HGPRYFMLNKPQGYVCSTDDPD-HPTVLYFLDEPVAWKLHAAGRLDIDTTGLVLMTDDGQ 116
Query: 146 LANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVML--EDGTAKFDTIERIGN 203
++ + P E+ Y+V + SP V D+ EQ +GV L E K +E I
Sbjct: 117 WSHRITSPRHHCEKTYLVTLESP-----VADDTAEQFAKGVQLHNEKDLTKPAVLEVITP 171
Query: 204 TDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQV 263
T R+ + EGR +V+R++ + G V L R R G + L +L G+ L + ++
Sbjct: 172 TQ----VRLTISEGRYHQVKRMFAAVGNHVVELHRERIGGITLDADLAPGEYRPLTEEEI 227
Query: 264 EAL 266
++
Sbjct: 228 ASV 230
>gi|2145990|pir||S72955 u0247g protein - Mycobacterium leprae
>gi|467162 (U00021) u0247g [Mycobacterium leprae]
Length = 186
Score = 82.3 bits (200), Expect = 8e-15
Identities = 60/179 (33%), Positives = 84/179 (46%), Gaps = 14/179 (7%)
Query: 91 LIYNKPEGEVTTREDPEGRPTVFETLP--VLKGARWIAIGRLDINXXXXXXXXXDGELAN 148
L NKP G +T D GRP V + + V + +GRLD + DGELA+
Sbjct: 9 LALNKPRGMYSTMSDDRGRPCVGDLIERRVRGNKKLFHVGRLDADTEGLILLTNDGELAH 68
Query: 149 AMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTDSHD 208
+MHPS E+ + Y+ V+ V L ++L+ G+ L+DG A D +
Sbjct: 69 RLMHPSHEVSKTYLATVKGA-----VPRGLGKKLSVGLELDDGPAHVDDFAVVDAIPGKT 123
Query: 209 WFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLP-------RELLRGQSTELPK 260
R+ + EGR R VRRL + G V L RT G+V L R LLR + +L K
Sbjct: 124 LVRLTLHEGRKRIVRRLLTAAGFPVEMLVRTDIGAVSLGDQRPGCLRALLRDEIRQLYK 182
>gi|4377181|gb|AAD19002| (AE001667) predicted pseudouridine synthase
[Chlamydia pneumoniae]
Length = 235
Score = 81.9 bits (199), Expect = 1e-14
Identities = 71/246 (28%), Positives = 118/246 (47%), Gaps = 18/246 (7%)
Query: 28 RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLG-MSVKSGDKIELDGRSFVASALTE 86
RL+K LA AG+ SRR ++ I +G + VNG +A+ + V DK+++ G S LT+
Sbjct: 5 RLNKFLASAGVASRRKCDEIIFSGSVTVNGRVAEGPFVLVDPEDKVQVGGTSV---HLTK 61
Query: 87 PARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGEL 146
+++ ++ + G V + L R +GRLD DGE
Sbjct: 62 KVYFMVHKAIGYLCSSEKKFPGTKLVIDLFAHLP-YRVFTVGRLDKETSGLILVTNDGEF 120
Query: 147 ANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTDS 206
AN ++HPSS I +EY+++V V + L +L G ++ + ++ +I
Sbjct: 121 ANKIIHPSSGITKEYLLKV-----SRDVSAKDLGKLMEGTFIDGKHVRPVSVTKI----R 171
Query: 207 HDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQVEAL 266
++VV EG+ E+R ++ G + LKR R GS++L L G+ EL + L
Sbjct: 172 RGTVKIVVSEGKKHEIRLFADAAGFPILELKRIRIGSLVL-GGLRYGEYRELTDAE---L 227
Query: 267 RTQLKL 272
T +KL
Sbjct: 228 GTYMKL 233
>gi|3322747 (AE001223) conserved hypothetical protein [Treponema
pallidum]
Length = 261
Score = 78.0 bits (189), Expect = 2e-13
Identities = 70/241 (29%), Positives = 108/241 (44%), Gaps = 29/241 (12%)
Query: 23 PKLEERLHKVLAQAGLGSRRALEQRISNGLIKVNGD-IAQLGMSVKSGDKIELDGRSFVA 81
P RL LA++G SRRA E I++G + V+G + G +V + + + +DG
Sbjct: 6 PFFRLRLQVYLARSGCASRRACEALIASGRVTVDGQTVTTQGRTVCAQNVVCVDG---TV 62
Query: 82 SALTEPARVLIYNKPEGEVTTR--EDPEG-----------RPTVFETLPVLKGA---RWI 125
L R ++ KP G + + + P G + + +++ A R
Sbjct: 63 VQLERVQRYVLLYKPVGYICSLAPQFPAGYAHTQVRAGPSKQEYARAIDLVQPAYQERLY 122
Query: 126 AIGRLDINXXXXXXXXXDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRG 185
IGRLD+ DG A A+ HP S IE+EY+V R P V LL RG
Sbjct: 123 HIGRLDVRSEGALLFTNDGSFAQALGHPRSGIEKEYIVETREP-----VPAALLSSFVRG 177
Query: 186 VMLEDGTAKFDTIERIGNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVL 245
V +E + + + ++V+ EG+ RE+R ++E+ G V RL R R G V
Sbjct: 178 VWVEGCRYRCVRARHL----AAQCVQLVLVEGKKREIRVVFEAWGQDVVRLVRVRIGRVR 233
Query: 246 L 246
L
Sbjct: 234 L 234
>gi|1175857|sp|P45124|RSUA_HAEIN RIBOSOMAL SMALL SUBUNIT
PSEUDOURIDINE SYNTHASE A (16S PSEUDOURIDYLATE 516
SYNTHASE) (16S PSEUDOURIDINE 516 SYNTHASE) (URACIL
HYDROLYASE) >gi|1074693|pir||F64169 hypothetical protein
HI1243 - Haemophilus influenzae (strain Rd KW20)
>gi|1574175 (U32804) 16s pseudouridylate 516 synthase
(rsuA) [Haemophilus influenzae Rd]
Length = 232
Score = 73.7 bits (178), Expect = 3e-12
Identities = 60/241 (24%), Positives = 110/241 (44%), Gaps = 14/241 (5%)
Query: 28 RLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSVKSGDKIELDGRSFVASALT-- 85
RL K +A+ +R + I +K+NG+I + G SV+ + E+ F LT
Sbjct: 2 RLDKFIAENVGLTRSQATKAIRQSAVKINGEIVKSG-SVQISQEDEI---YFEDELLTWI 57
Query: 86 EPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGE 145
E + + NKP+G V + +D + PT+++ + + GRLD++ DG+
Sbjct: 58 EEGQYFMLNKPQGCVCSNDDGD-YPTIYQFFDYPLAGKLHSAGRLDVDTTGLVLLTDDGQ 116
Query: 146 LANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTD 205
++ + P E+ Y+V + P E + L RG AK + ++
Sbjct: 117 WSHRITSPKHHCEKTYLVTLADPVEENYSAACAEGILLRGEKEPTKPAKLEILDDYN--- 173
Query: 206 SHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQVEA 265
+ + EGR +V+R++ + G +V L R + G V+L L G+ L ++++E
Sbjct: 174 ----VNLTISEGRYHQVKRMFAALGNKVVGLHRWKIGDVVLDESLEEGEYRPLTQSEIEK 229
Query: 266 L 266
L
Sbjct: 230 L 230
>gi|3845303 (AE001423) pseudouridine synthetase (RsuA fam.); 1st
euk. member (OO) [Plasmodium falciparum]
Length = 338
Score = 67.1 bits (161), Expect = 3e-10
Identities = 65/293 (22%), Positives = 120/293 (40%), Gaps = 58/293 (19%)
Query: 28 RLHKVLAQAGLGSRRALEQRISNGLIKVNGDI-AQLGMSVKSGD--------KIELDGR- 77
RL+K+++ SRR ++ I +G +K+N I G V G KI+L
Sbjct: 47 RLNKLISMKRNISRRKSDEFIKDGKVKINNKIITNPGTHVHIGKDSLRIYDKKIKLTNII 106
Query: 78 SFVASALTEPARVLIYNKPEGEVTTREDPEGRPTVFETLP--VLKGARWIAIGRLDINXX 135
+ + + + ++ +KP+G + T D + R +++ P +L+ R + +GRLD N
Sbjct: 107 NMIKQNENKLHKWIVLHKPKGLLCTSNDEKNRKSIYTLFPEEMLQKYRLVTVGRLDRNTS 166
Query: 136 XXXXXXXDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDG---- 191
D N + HP + R Y V + P V+ L++L RG+ LE+
Sbjct: 167 GVLLLTNDYAWVNKLTHPKYQRIRTYRVHIEGP-----VKMNALKELARGIYLEEDEKTQ 221
Query: 192 -------------------------------------TAKFDTIERIGNTDSHDWFRVVV 214
+ + I+ +T + +
Sbjct: 222 PKKIYNYKESREKSNIDDKKKKKMSKMKKKTNPAFIEILREEKIKIKEDTKKITVLNISI 281
Query: 215 KEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQVEALR 267
KEGRNR++R++++ V ++KRT + ++ L Q EL + +V L+
Sbjct: 282 KEGRNRQIRKMFQQINQPVIKIKRTSFENITLKNIYFPKQYRELNQKEVNDLK 334
>gi|1175289|sp|P44827|YMFC_HAEIN HYPOTHETICAL PROTEIN HI0694
>gi|1074484|pir||I64156 hypothetical protein HI0694 -
Haemophilus influenzae (strain Rd KW20) >gi|1573697
(U32752) conserved hypothetical protein [Haemophilus
influenzae Rd]
Length = 240
Score = 66.7 bits (160), Expect = 4e-10
Identities = 57/201 (28%), Positives = 89/201 (43%), Gaps = 23/201 (11%)
Query: 86 EPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGE 145
+ +V+++NKP +T D +GR T+ + + + A GRLD + +GE
Sbjct: 49 DETKVVLFNKPFDVLTQFTDEQGRATLKDFISI---PNVYAAGRLDRDSEGLLILTNNGE 105
Query: 146 LANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTD 205
L + + P + E+ Y V+V EG D L QL +GV L+DG K + I +
Sbjct: 106 LQHRLADPKFKTEKTYWVQV---EGIPEETD--LAQLRKGVELKDGVTKSAKVRLISEPN 160
Query: 206 SHD--------------WFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELL 251
+ W + + EGRNR+VRR+ G RL R G +L L
Sbjct: 161 LWERNPPIRERKNIPTSWLEIKISEGRNRQVRRMTAHIGFPTLRLVRVSMG-LLSINGLE 219
Query: 252 RGQSTELPKTQVEALRTQLKL 272
G L +++AL +KL
Sbjct: 220 NGSFRLLSLDEIKALFQTVKL 240
>gi|1787380 (AE000213) orf, hypothetical protein [Escherichia coli]
Length = 207
Score = 57.8 bits (137), Expect = 2e-07
Identities = 51/176 (28%), Positives = 74/176 (41%), Gaps = 23/176 (13%)
Query: 86 EPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGE 145
+P RV+++NKP + D GR T+ E +PV +G A GRLD + +G
Sbjct: 27 QPTRVILFNKPYDVLPQFTDEAGRKTLKEFIPV-QGV--YAAGRLDRDSEGLLVLTNNGA 83
Query: 146 LANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGT---AKFDTIE--- 199
L + P + Y V+V E + LE L GV L DG A + ++
Sbjct: 84 LQARLTQPGKRTGKIYYVQV-----EGIPTQDALEALRNGVTLNDGPTLPAGAELVDEPA 138
Query: 200 ---------RIGNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLL 246
R + W ++ + EGRNR+VRR+ G RL R G L
Sbjct: 139 WLWPRNPPIRERKSIPTSWLKITLYEGRNRQVRRMTAHVGFPTLRLIRYAMGDYSL 194
>gi|3916025|sp|P75966|YMFC_ECOLI HYPOTHETICAL 24.9 KD PROTEIN IN
TRMU-ICDA INTERGENIC REGION >gi|4062699|dbj|BAA35957|
(D90748) Hypothetical protein HI0694 [Escherichia coli]
>gi|4062717|dbj|BAA35966| (D90749) Hypothetical protein
HI0694 [Escherichia coli]
Length = 217
Score = 57.8 bits (137), Expect = 2e-07
Identities = 51/176 (28%), Positives = 74/176 (41%), Gaps = 23/176 (13%)
Query: 86 EPARVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGE 145
+P RV+++NKP + D GR T+ E +PV +G A GRLD + +G
Sbjct: 37 QPTRVILFNKPYDVLPQFTDEAGRKTLKEFIPV-QGV--YAAGRLDRDSEGLLVLTNNGA 93
Query: 146 LANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGT---AKFDTIE--- 199
L + P + Y V+V E + LE L GV L DG A + ++
Sbjct: 94 LQARLTQPGKRTGKIYYVQV-----EGIPTQDALEALRNGVTLNDGPTLPAGAELVDEPA 148
Query: 200 ---------RIGNTDSHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLL 246
R + W ++ + EGRNR+VRR+ G RL R G L
Sbjct: 149 WLWPRNPPIRERKSIPTSWLKITLYEGRNRQVRRMTAHVGFPTLRLIRYAMGDYSL 204
>gi|3402689|gb|AAC28992.1| (AC004697) unknown protein [Arabidopsis
thaliana]
Length = 303
Score = 54.3 bits (128), Expect = 2e-06
Identities = 41/146 (28%), Positives = 63/146 (43%), Gaps = 14/146 (9%)
Query: 27 ERLHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMS--VKSGDKIELDGRSFVASAL 84
+RL KVLA AG+ SRR E+ I +G + VNG + + S D I ++G
Sbjct: 160 QRLSKVLAAAGVASRRTSEELIFDGKVTVNGILCNTPQTRVDPSRDIIYVNGNRIPKK-- 217
Query: 85 TEPARVLIYNKPEGEVTTREDPEGRPTVF----------ETLPVLKGARWIAIGRLDINX 134
P NKP+G + + + E + + + P R +GRLD+
Sbjct: 218 LPPKVYFALNKPKGYICSSGEKEIKSAISLFDEYLSSWDKRNPGTPKPRLFTVGRLDVAT 277
Query: 135 XXXXXXXXDGELANAMMHPSSEIERE 160
DG+ A + HPSS + +E
Sbjct: 278 TGLIVVTNDGDFAQKLSHPSSSLPKE 303
>gi|3915559|sp|O32068|YTZF_BACSU HYPOTHETICAL 17.7 KD PROTEIN IN
AMYX-OPUD INTERGENIC REGION >gi|2635487|emb|CAB14981|
(Z99119) similar to hypothetical proteins [Bacillus
subtilis]
Length = 157
Score = 53.9 bits (127), Expect = 3e-06
Identities = 38/140 (27%), Positives = 64/140 (45%), Gaps = 6/140 (4%)
Query: 128 GRLDINXXXXXXXXXDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVM 187
GRLD + DG+LA+ ++ P + + Y V ++S E + D L GV
Sbjct: 18 GRLDKDTEGFLLLTNDGQLAHRLLSPKKHVPKTYEVHLKSQISREDISD-----LETGVY 72
Query: 188 LEDGTAKFDTIERIGNTDS-HDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLL 246
+E G I DS + + + EG+ +V+++ ++ G +V LKR G V L
Sbjct: 73 IEGGYKTKPAKAEIKTNDSGNTVIYLTITEGKYHQVKQMAKAVGNEVVYLKRLSMGRVSL 132
Query: 247 PRELLRGQSTELPKTQVEAL 266
L G+ EL + ++ L
Sbjct: 133 DPALAPGEYRELTEEELHLL 152
>gi|3402691|gb|AAC28994.1| (AC004697) unknown protein [Arabidopsis
thaliana]
Length = 86
Score = 51.5 bits (121), Expect = 1e-05
Identities = 26/56 (46%), Positives = 37/56 (65%)
Query: 211 RVVVKEGRNREVRRLWESQGCQVSRLKRTRYGSVLLPRELLRGQSTELPKTQVEAL 266
R+VV EGRN EVR L ++ G +V LKR R G LP +L G+ EL +++++AL
Sbjct: 27 RIVVHEGRNHEVRELVKNAGLEVHSLKRVRIGGFRLPSDLGLGKHVELKQSELKAL 82
>gi|6689331|emb|CAB65436.1| (AJ250721) hypothetical protein
[Synechococcus leopoliensis]
Length = 199
Score = 49.2 bits (115), Expect = 7e-05
Identities = 40/164 (24%), Positives = 69/164 (41%), Gaps = 20/164 (12%)
Query: 89 RVLIYNKPEGEVTTREDPEGRPTVFETLPVLKGARWIAIGRLDINXXXXXXXXXDGELAN 148
R L+++KP V + P RP + +GRLD + +G L +
Sbjct: 4 RYLLFHKPYDAVC-QFSPSDRPDQQTLKDYIDVPEVYPVGRLDRDSEGLLLLTNNGALQH 62
Query: 149 AMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTDSHD 208
+ HP +R Y V+V E + L+ L +GV ++D + ++R+ + +
Sbjct: 63 RLCHPRFGHDRTYWVQV-----EREPTEAALQALRQGVQIQDYRTRPAKVQRLDDPQIPE 117
Query: 209 --------------WFRVVVKEGRNREVRRLWESQGCQVSRLKR 238
W + ++EGRNR+VRR+ + G RL R
Sbjct: 118 RDPPIRFRKTVPTAWLALTLQEGRNRQVRRMTAAVGHPTLRLIR 161
>gi|3915395|sp|P72581|Y612_SYNY3 HYPOTHETICAL 21.0 KD PROTEIN
SLR0612
Length = 261
Score = 48.4 bits (113), Expect = 1e-04
Identities = 46/176 (26%), Positives = 71/176 (40%), Gaps = 28/176 (15%)
Query: 83 ALTEPARVLIYNKPEGEVT--TREDPEGRPTV--FETLPVLKGARWIAIGRLDINXXXXX 138
AL + + +++ KP G + T RPT+ + LP L +GRLD +
Sbjct: 34 ALNKTPQTIVFYKPYGVLCQFTDNSAHPRPTLKDYINLPDL-----YPVGRLDQDSEGLL 88
Query: 139 XXXXDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTI 198
+G+L + + H +R Y +V E DE LE L RG+ D +
Sbjct: 89 LLTSNGKLQHRLAHREFAHQRTYFAQV-----EGSPTDEDLEPLRRGITFADYPTRPAIA 143
Query: 199 ERIGNTD--------------SHDWFRVVVKEGRNREVRRLWESQGCQVSRLKRTR 240
+ I D W + + EGRNR+VRR+ + G RL R +
Sbjct: 144 KIITEPDFPPRNPPIRYRASIPTSWLSITLTEGRNRQVRRMTAAVGFPTLRLVRVQ 199
>gi|677949 (U20969) Plasmodium falciparum circumsporozoite protein
(CS) gene, complete cds. [Plasmodium falciparum]
Length = 408
Score = 43.0 bits (99), Expect = 0.006
Identities = 47/173 (27%), Positives = 63/173 (36%), Gaps = 19/173 (10%)
Query: 310 NNHSTADESRELRRFDTLRDDRGR--GQGKHHFK-DRLTVSGEAAAKQAHKPFKQYKPKN 366
N +S SR L D ++ G G+GK K D E K HK KQ
Sbjct: 61 NWYSLKKNSRSLGENDDGNNNNGDNGGEGKDEDKRDGNNEDNEKLRKPKHKKLKQP---- 116
Query: 367 DRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKK---SPNPNTRNTPGQQTR 423
++G+P + V + + N A+PN +PN N P
Sbjct: 117 ----ADGNPDPNANPNVDPNANPNVDPNANPNVDPNANPNANPNANPNANPNANPNANPN 172
Query: 424 KSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPN---NSGGKYNRNRGP 473
+P PN PN + +A P NP NP A PN N+ N N P
Sbjct: 173 ANPNANPNANPN-ANPNANPNANPNANPNANPNA-NPNVDPNANPNANPNANP 223
Score = 40.2 bits (92), Expect = 0.036
Identities = 25/81 (30%), Positives = 32/81 (38%), Gaps = 2/81 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 222 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 279
Query: 456 GAGRPNNSGGKYNRNRGPRYP 476
A N+ G + P P
Sbjct: 280 NANPNKNNQGNGQGHNMPNNP 300
Score = 37.9 bits (86), Expect = 0.18
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 186 NPNANPNANPNA-NPNANPNANPNVDPNANPNANPNANPN-ANPNANPNANPNANPNANP 243
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 244 NANPNANPNANPNAN 258
Score = 37.9 bits (86), Expect = 0.18
Identities = 24/75 (32%), Positives = 28/75 (37%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN +A P NP NP
Sbjct: 170 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNV-DPNANPNANPNANPNANP 227
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 228 NANPNANPNANPNAN 242
Score = 37.5 bits (85), Expect = 0.24
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 178 NPNANPNANPNA-NPNANPNANPNANPNANPNVDPNANPN-ANPNANPNANPNANPNANP 235
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 236 NANPNANPNANPNAN 250
Score = 37.5 bits (85), Expect = 0.24
Identities = 24/75 (32%), Positives = 28/75 (37%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P P PN PN + +A P NP NP
Sbjct: 182 NPNANPNANPNA-NPNANPNANPNANPNVDPNANPNANPN-ANPNANPNANPNANPNANP 239
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 240 NANPNANPNANPNAN 254
Score = 33.6 bits (75), Expect = 3.6
Identities = 22/78 (28%), Positives = 28/78 (35%), Gaps = 1/78 (1%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + N NP +
Sbjct: 246 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNKNNQGNGQGHNMPNNPNRNV 304
Query: 456 GAGRPNNSGGKYNRNRGP 473
N+ K N N P
Sbjct: 305 DENANANNAVKNNNNEEP 322
>gi|294151|gb|AAA29569.1| (M83156) circumsporozoite protein
[Plasmodium falciparum]
Length = 452
Score = 42.6 bits (98), Expect = 0.007
Identities = 43/164 (26%), Positives = 57/164 (34%), Gaps = 5/164 (3%)
Query: 310 NNHSTADESRELRRFDTLRDDRGRG--QGKHHFK-DRLTVSGEAAAKQAHKPFKQYKPKN 366
N +S SR L D ++ G +GK K D E K HK KQ N
Sbjct: 61 NWYSLKKNSRSLGENDDGNNNNGDNGREGKDEDKRDGNNEDNEKLRKPKHKKLKQPGDGN 120
Query: 367 DRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSP 426
+ + + V + + N A A+PN +PN N P +P
Sbjct: 121 PDPNANPNVDPNANPNVDPNANPNANPNANPNANPNANPNA-NPNANPNANPNANPNANP 179
Query: 427 YKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
PN PN + +A P NP NP A N N N
Sbjct: 180 NANPNANPN-ANPNANPNANPNANPNANPNANPNANPNANPNAN 222
Score = 39.9 bits (91), Expect = 0.048
Identities = 25/81 (30%), Positives = 32/81 (38%), Gaps = 2/81 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 266 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 323
Query: 456 GAGRPNNSGGKYNRNRGPRYP 476
A N+ G + P P
Sbjct: 324 NANPNKNNQGNGQGHNMPNDP 344
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 202 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 259
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 260 NANPNANPNANPNAN 274
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 238 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 295
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 296 NANPNANPNANPNAN 310
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 198 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 255
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 256 NANPNANPNANPNAN 270
>gi|1651652|dbj|BAA16580| (D90899) hypothetical protein
[Synechocystis sp.]
Length = 185
Score = 42.2 bits (97), Expect = 0.009
Identities = 34/128 (26%), Positives = 51/128 (39%), Gaps = 19/128 (14%)
Query: 127 IGRLDINXXXXXXXXXDGELANAMMHPSSEIEREYVVRVRSPEGEEHVQDELLEQLTRGV 186
+GRLD + +G+L + + H +R Y +V E DE LE L RG+
Sbjct: 1 MGRLDQDSEGLLLLTSNGKLQHRLAHREFAHQRTYFAQV-----EGSPTDEDLEPLRRGI 55
Query: 187 MLEDGTAKFDTIERIGNTD--------------SHDWFRVVVKEGRNREVRRLWESQGCQ 232
D + + I D W + + EGRNR+VRR+ + G
Sbjct: 56 TFADYPTRPAIAKIITEPDFPPRNPPIRYRASIPTSWLSITLTEGRNRQVRRMTAAVGFP 115
Query: 233 VSRLKRTR 240
RL R +
Sbjct: 116 TLRLVRVQ 123
>gi|117587|sp|P08307|CSP_PLAFW CIRCUMSPOROZOITE PROTEIN PRECURSOR
(CS) >gi|627052|pir||A54529 circumsporozoite protein -
Plasmodium falciparum (strain Wellcome) >gi|160215
(M15505) circumsporozoite protein [Plasmodium
falciparum]
Length = 442
Score = 42.2 bits (97), Expect = 0.009
Identities = 41/172 (23%), Positives = 55/172 (31%), Gaps = 23/172 (13%)
Query: 299 NRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKP 358
N N NN + +NN E ++ + D +D E K HK
Sbjct: 80 NDNGNNNNGNNNNGDNGREGKDEDKRDGNNEDN-----------------EKLRKPKHKK 122
Query: 359 FKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTP 418
KQ N + + + V + + N A A+PN +PN N P
Sbjct: 123 LKQPGDGNPDPNANPNVDPNANPNVDPNANPNANPNANPNANPNANPNA-NPNANPNANP 181
Query: 419 GQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
+P PN PN A P NP NP A N N N
Sbjct: 182 NANPNANPNANPNANPN-----ANPNANPNANPNVDPNANPNANPNANPNAN 228
Score = 39.9 bits (91), Expect = 0.048
Identities = 25/81 (30%), Positives = 32/81 (38%), Gaps = 2/81 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 256 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 313
Query: 456 GAGRPNNSGGKYNRNRGPRYP 476
A N+ G + P P
Sbjct: 314 NANPNKNNQGNGQGHNMPNDP 334
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 216 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 273
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 274 NANPNANPNANPNAN 288
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 220 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 277
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 278 NANPNANPNANPNAN 292
>gi|294136|gb|AAA29551.1| (M83173) circumsporozoite protein
[Plasmodium falciparum]
Length = 442
Score = 42.2 bits (97), Expect = 0.009
Identities = 41/172 (23%), Positives = 55/172 (31%), Gaps = 23/172 (13%)
Query: 299 NRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKP 358
N N NN + +NN E ++ + D +D E K HK
Sbjct: 80 NDNGNNNNGNNNNGDNGREGKDEDKRDGNNEDN-----------------EKLRKPKHKK 122
Query: 359 FKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTP 418
KQ N + + + V + + N A A+PN +PN N P
Sbjct: 123 LKQPGDGNPDPNANPNVDPNANPNVDPNANPNANPNANPNANPNANPNA-NPNANPNANP 181
Query: 419 GQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
+P PN PN A P NP NP A N N N
Sbjct: 182 NANPNANPNANPNANPN-----ANPNANPNANPNVDPNANPNANPNANPNAN 228
Score = 39.9 bits (91), Expect = 0.048
Identities = 25/81 (30%), Positives = 32/81 (38%), Gaps = 2/81 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 256 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 313
Query: 456 GAGRPNNSGGKYNRNRGPRYP 476
A N+ G + P P
Sbjct: 314 NANPNKNNQGNGQGHNMPNDP 334
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 216 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 273
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 274 NANPNANPNANPNAN 288
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 220 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 277
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 278 NANPNANPNANPNAN 292
>gi|1175470|sp|Q09801|YAAA_SCHPO HYPOTHETICAL 37.3 KD PROTEIN
C22G7.10 IN CHROMOSOME I >gi|2130331|pir||S62454
hypothetical protein SPAC22G7.10 - fission yeast
(Schizosaccharomyces pombe) >gi|1009460|emb|CAA91134.1|
(Z54328) hypothetical protein [Schizosaccharomyces
pombe]
Length = 344
Score = 41.8 bits (96), Expect = 0.012
Identities = 26/79 (32%), Positives = 34/79 (42%), Gaps = 14/79 (17%)
Query: 399 AGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSD----HATPTFNPYGNPGQK 454
+G G H +PN N P R+S + P N+PN S HA PT NP + G
Sbjct: 237 SGGGVHSGAATPNAYVNNNPSSSRRES--ESPANSPNITSSAGMTHAQPTHNPTSSYG-- 292
Query: 455 TGAGRPNNSGGKYNRNRGP 473
N + YN +R P
Sbjct: 293 ------NGASTNYNASRPP 305
Score = 32.8 bits (73), Expect = 6.2
Identities = 25/88 (28%), Positives = 37/88 (41%), Gaps = 11/88 (12%)
Query: 388 STGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAP---NFP-SDHATP 443
S P N N + G + + NP + G T + + P+N P N+P S P
Sbjct: 263 SESPANSPNITSSAGMTHAQPTHNPTSSYGNGASTNYNASRPPSNHPHSSNYPSSSRRKP 322
Query: 444 TFNPYGNPGQKTGAGRPNNSGGKYNRNR 471
+ + Y N + SGG+Y RNR
Sbjct: 323 SPDRYSNYSSR-------GSGGRYRRNR 343
>gi|117586|sp|P13814|CSP_PLAFT CIRCUMSPOROZOITE PROTEIN PRECURSOR
(CS) >gi|627051|pir||A54533 circumsporozoite protein -
Plasmodium falciparum (strain T4, Thailand) >gi|160217
(M19752) circumsporozoite protein [Plasmodium
falciparum]
Length = 424
Score = 41.8 bits (96), Expect = 0.012
Identities = 43/164 (26%), Positives = 57/164 (34%), Gaps = 5/164 (3%)
Query: 310 NNHSTADESRELRRFDTLRDDRGRG--QGKHHFK-DRLTVSGEAAAKQAHKPFKQYKPKN 366
N +S SR L D ++ G +GK K D E K HK KQ N
Sbjct: 61 NWYSLKKNSRSLGENDDGNNNNGDNNREGKDEDKRDGNNEDNETLRKPKHKKLKQPGDGN 120
Query: 367 DRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSP 426
+ + + V + + N A A+PN +PN N P +P
Sbjct: 121 PDPNANPNVDPNANPNVDPNANPNVDPNANPNANPNANPNA-NPNANPNANPNANPNANP 179
Query: 427 YKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
PN PN + +A P NP NP A N N N
Sbjct: 180 NANPNANPN-ANPNANPNANPNANPNANPNANPNANPNANPNAN 222
Score = 39.9 bits (91), Expect = 0.048
Identities = 25/81 (30%), Positives = 32/81 (38%), Gaps = 2/81 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 238 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 295
Query: 456 GAGRPNNSGGKYNRNRGPRYP 476
A N+ G + P P
Sbjct: 296 NANPNKNNQGNGQGHNMPNDP 316
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 198 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 255
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 256 NANPNANPNANPNAN 270
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 190 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 247
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 248 NANPNANPNANPNAN 262
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 202 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 259
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 260 NANPNANPNANPNAN 274
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 186 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 243
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 244 NANPNANPNANPNAN 258
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 194 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 251
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 252 NANPNANPNANPNAN 266
>gi|294129|gb|AAA29547.1| (M83169) circumsporozoite protein
[Plasmodium falciparum] >gi|294140|gb|AAA29562.1|
(M83149) circumsporozoite protein [Plasmodium
falciparum]
Length = 424
Score = 41.4 bits (95), Expect = 0.016
Identities = 43/164 (26%), Positives = 57/164 (34%), Gaps = 5/164 (3%)
Query: 310 NNHSTADESRELRRFDTLRDDRGRG--QGKHHFK-DRLTVSGEAAAKQAHKPFKQYKPKN 366
N +S SR L D ++ G +GK K D E K HK KQ N
Sbjct: 61 NWYSLKKNSRSLGENDDGNNNNGDNGREGKDEDKRDGNNEDNEKLRKPKHKKLKQPGDGN 120
Query: 367 DRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSP 426
+ + + V + + N A A+PN +PN N P +P
Sbjct: 121 PDPNANPNVDPNANPNVDPNANPNVDPNANPNANPNANPNA-NPNANPNANPNANPNANP 179
Query: 427 YKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
PN PN + +A P NP NP A N N N
Sbjct: 180 NANPNANPN-ANPNANPNANPNANPNANPNANPNANPNANPNAN 222
Score = 39.9 bits (91), Expect = 0.048
Identities = 25/81 (30%), Positives = 32/81 (38%), Gaps = 2/81 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 238 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 295
Query: 456 GAGRPNNSGGKYNRNRGPRYP 476
A N+ G + P P
Sbjct: 296 NANPNKNNQGNGQGHNMPNDP 316
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 198 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 255
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 256 NANPNANPNANPNAN 270
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 190 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 247
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 248 NANPNANPNANPNAN 262
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 202 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 259
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 260 NANPNANPNANPNAN 274
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 186 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 243
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 244 NANPNANPNANPNAN 258
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 194 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 251
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 252 NANPNANPNANPNAN 266
>gi|294121|gb|AAA29543.1| (M83165) circumsporozoite protein
[Plasmodium falciparum]
Length = 432
Score = 41.4 bits (95), Expect = 0.016
Identities = 43/164 (26%), Positives = 57/164 (34%), Gaps = 5/164 (3%)
Query: 310 NNHSTADESRELRRFDTLRDDRGRG--QGKHHFK-DRLTVSGEAAAKQAHKPFKQYKPKN 366
N +S SR L D ++ G +GK K D E K HK KQ N
Sbjct: 61 NWYSLKKNSRSLGENDDGNNNNGDNGREGKDEDKRDGNNEDNEKLRKPKHKKLKQPGDGN 120
Query: 367 DRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSP 426
+ + + V + + N A A+PN +PN N P +P
Sbjct: 121 PDPNANPNVDPNANPNVDPNANPNVDPNANPNANPNANPNA-NPNANPNANPNANPNANP 179
Query: 427 YKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
PN PN + +A P NP NP A N N N
Sbjct: 180 NANPNANPN-ANPNANPNANPNANPNANPNANPNANPNANPNAN 222
Score = 39.9 bits (91), Expect = 0.048
Identities = 25/81 (30%), Positives = 32/81 (38%), Gaps = 2/81 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 246 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 303
Query: 456 GAGRPNNSGGKYNRNRGPRYP 476
A N+ G + P P
Sbjct: 304 NANPNKNNQGNGQGHNMPNDP 324
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 202 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 259
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 260 NANPNANPNANPNAN 274
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 210 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 267
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 268 NANPNANPNANPNAN 282
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 206 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 263
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 264 NANPNANPNANPNAN 278
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 194 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 251
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 252 NANPNANPNANPNAN 266
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 190 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 247
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 248 NANPNANPNANPNAN 262
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 198 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 255
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 256 NANPNANPNANPNAN 270
>gi|294158|gb|AAA29574.1| (M83161) circumsporozoite protein
[Plasmodium falciparum]
Length = 420
Score = 41.4 bits (95), Expect = 0.016
Identities = 43/164 (26%), Positives = 57/164 (34%), Gaps = 13/164 (7%)
Query: 310 NNHSTADESRELRRFDTLRDDRGRG--QGKHHFK-DRLTVSGEAAAKQAHKPFKQYKPKN 366
N +S SR L D ++ G +GK K D E K HK KQ
Sbjct: 61 NWYSLKKNSRSLGENDDGNNNNGDNGREGKDEDKRDGNNEDNEKLRKPKHKKLKQP---- 116
Query: 367 DRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSP 426
+G+P + V + + N A+PN +PN N P +P
Sbjct: 117 ----GDGNPDPNANPNVDPNANPNVDPNANPNVDPNANPNA-NPNANPNANPNANPNANP 171
Query: 427 YKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
PN PN + +A P NP NP A N N N
Sbjct: 172 NANPNANPN-ANPNANPNANPNANPNANPNANPNANPNANPNAN 214
Score = 39.9 bits (91), Expect = 0.048
Identities = 25/81 (30%), Positives = 32/81 (38%), Gaps = 2/81 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 234 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 291
Query: 456 GAGRPNNSGGKYNRNRGPRYP 476
A N+ G + P P
Sbjct: 292 NANPNKNNQGNGQGHNMPNDP 312
Score = 37.5 bits (85), Expect = 0.24
Identities = 24/75 (32%), Positives = 28/75 (37%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P P PN PN + +A P NP NP
Sbjct: 198 NPNANPNANPNA-NPNANPNANPNANPNVDPNANPNANPN-ANPNANPNANPNANPNANP 255
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 256 NANPNANPNANPNAN 270
Score = 36.7 bits (83), Expect = 0.41
Identities = 23/75 (30%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P +P NP
Sbjct: 178 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNVDPNANPNANP 235
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 236 NANPNANPNANPNAN 250
>gi|136173|sp|P27190|TRC5_ECOLI DNA PRIMASE TRAC (REPLICATION
PRIMASE) >gi|481041|pir||S37669 traC-2 protein -
Escherichia coli >gi|48919|emb|CAA42460| (X59794) traC-2
[Escherichia coli] >gi|1572573 (U67194) TraC2
[Enterobacter aerogenes]
Length = 1448
Score = 41.0 bits (94), Expect = 0.021
Identities = 43/162 (26%), Positives = 65/162 (39%), Gaps = 17/162 (10%)
Query: 288 QRRSAKATLHVNRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVS 347
+R + A + R + +A + S A E+R+ + +D Q + +R
Sbjct: 838 ERAAKLARVQEERVRRDPNATEEDISAAKEARKTAEASAMLNDSD-AQRRAAELERQERD 896
Query: 348 GEAAAKQAHKPFKQY-----KPKND-RSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGN 401
+ A QA KP +QY K K++ +SL QSWYVP G P GA
Sbjct: 897 RQQAQPQAEKPERQYINVPYKEKDEAKSLGARWDRQQQSWYVPPGTDAAPFAKWAQGAAT 956
Query: 402 GA-HPNKKSPNPNTR---NTPGQQTRKS------PYKYPNNA 433
A P P P + P QQ +++ PY+ N A
Sbjct: 957 AAVEPRSAQPAPEAQGEAQKPAQQAQQARQYLAVPYEQRNAA 998
>gi|48920|emb|CAA42461| (X59794) traC-3 [Escherichia coli]
>gi|1572575 (U67194) TraC3 [Enterobacter aerogenes]
Length = 1230
Score = 41.0 bits (94), Expect = 0.021
Identities = 43/162 (26%), Positives = 65/162 (39%), Gaps = 17/162 (10%)
Query: 288 QRRSAKATLHVNRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVS 347
+R + A + R + +A + S A E+R+ + +D Q + +R
Sbjct: 620 ERAAKLARVQEERVRRDPNATEEDISAAKEARKTAEASAMLNDSD-AQRRAAELERQERD 678
Query: 348 GEAAAKQAHKPFKQY-----KPKND-RSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGN 401
+ A QA KP +QY K K++ +SL QSWYVP G P GA
Sbjct: 679 RQQAQPQAEKPERQYINVPYKEKDEAKSLGARWDRQQQSWYVPPGTDAAPFAKWAQGAAT 738
Query: 402 GA-HPNKKSPNPNTR---NTPGQQTRKS------PYKYPNNA 433
A P P P + P QQ +++ PY+ N A
Sbjct: 739 AAVEPRSAQPAPEAQGEAQKPAQQAQQARQYLAVPYEQRNAA 780
>gi|48921|emb|CAA42462| (X59794) traC-4 [Escherichia coli]
>gi|1572574 (U67194) TraC4 [Enterobacter aerogenes]
Length = 747
Score = 41.0 bits (94), Expect = 0.021
Identities = 43/162 (26%), Positives = 65/162 (39%), Gaps = 17/162 (10%)
Query: 288 QRRSAKATLHVNRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVS 347
+R + A + R + +A + S A E+R+ + +D Q + +R
Sbjct: 137 ERAAKLARVQEERVRRDPNATEEDISAAKEARKTAEASAMLNDSD-AQRRAAELERQERD 195
Query: 348 GEAAAKQAHKPFKQY-----KPKND-RSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGN 401
+ A QA KP +QY K K++ +SL QSWYVP G P GA
Sbjct: 196 RQQAQPQAEKPERQYINVPYKEKDEAKSLGARWDRQQQSWYVPPGTDAAPFAKWAQGAAT 255
Query: 402 GA-HPNKKSPNPNTR---NTPGQQTRKS------PYKYPNNA 433
A P P P + P QQ +++ PY+ N A
Sbjct: 256 AAVEPRSAQPAPEAQGEAQKPAQQAQQARQYLAVPYEQRNAA 297
>gi|117584|sp|P05691|CSP_PLAFL CIRCUMSPOROZOITE PROTEIN (CS)
>gi|552195 (M17802) circumsporozoite protein [Plasmodium
falciparum]
Length = 315
Score = 40.6 bits (93), Expect = 0.028
Identities = 43/167 (25%), Positives = 56/167 (32%), Gaps = 7/167 (4%)
Query: 310 NNHSTADESRELRRFDTLRDDRGRG--QGKHHFK-DRLTVSGEAAAKQAHKPFKQYKPKN 366
N +S SR L D ++ G +GK K D E K HK KQ N
Sbjct: 45 NWYSLKKNSRSLGENDDGNNNNGDNGREGKDEDKRDGNNEDNEKLRKPKHKKLKQPADGN 104
Query: 367 DRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKK---SPNPNTRNTPGQQTR 423
+ + + V + + N A A+PN +PN N P
Sbjct: 105 PDPNANPNVDPNANPNVDPNANPNVDPNANPNANPNANPNANPNANPNANPNANPNANPN 164
Query: 424 KSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
+P PN PN +A P NP NP A N N N
Sbjct: 165 ANPNANPNANPNV-DPNANPNANPNANPNANPNANPNANPNANPNAN 210
Score = 39.9 bits (91), Expect = 0.048
Identities = 25/81 (30%), Positives = 32/81 (38%), Gaps = 2/81 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 206 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 263
Query: 456 GAGRPNNSGGKYNRNRGPRYP 476
A N+ G + P P
Sbjct: 264 NANPNKNNQGNGQGHNMPNDP 284
Score = 37.5 bits (85), Expect = 0.24
Identities = 24/78 (30%), Positives = 29/78 (36%), Gaps = 4/78 (5%)
Query: 396 NAGAGNGAHPN---KKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPG 452
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 166 NPNANPNANPNVDPNANPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPN 224
Query: 453 QKTGAGRPNNSGGKYNRN 470
A N N N
Sbjct: 225 ANPNANPNANPNANPNAN 242
Score = 32.5 bits (72), Expect = 8.1
Identities = 21/78 (26%), Positives = 28/78 (34%), Gaps = 1/78 (1%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + N +P +
Sbjct: 230 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNKNNQGNGQGHNMPNDPNRNV 288
Query: 456 GAGRPNNSGGKYNRNRGP 473
N+ K N N P
Sbjct: 289 DENANGNNAVKNNNNEEP 306
>gi|294125|gb|AAA29545.1| (M83167) circumsporozoite protein
[Plasmodium falciparum]
Length = 436
Score = 40.6 bits (93), Expect = 0.028
Identities = 43/167 (25%), Positives = 57/167 (33%), Gaps = 15/167 (8%)
Query: 310 NNHSTADESRELRRFDTLRDDRGRG--QGKHHFK-DRLTVSGEAAAKQAHKPFKQYKPKN 366
N +S SR L D ++ G +GK K D E K HK KQ
Sbjct: 61 NWYSLKKNSRSLGENDDGNNNNGDNGREGKDEDKRDGNNEDNEKLRKPKHKKLKQP---- 116
Query: 367 DRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKK---SPNPNTRNTPGQQTR 423
+G+P + V + + N A+PN +PN N P
Sbjct: 117 ----GDGNPDPNANPNVDPNANPNVDPNANPNVDPNANPNANPNANPNANPNANPNANPN 172
Query: 424 KSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
+P PN PN + +A P NP NP A N N N
Sbjct: 173 ANPNANPNANPN-ANPNANPNANPNANPNANPNANPNANPNANPNAN 218
Score = 39.9 bits (91), Expect = 0.048
Identities = 25/81 (30%), Positives = 32/81 (38%), Gaps = 2/81 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 250 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 307
Query: 456 GAGRPNNSGGKYNRNRGPRYP 476
A N+ G + P P
Sbjct: 308 NANPNKNNQGNGQGHNMPNDP 328
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 218 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 275
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 276 NANPNANPNANPNAN 290
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 178 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 235
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 236 NANPNANPNANPNAN 250
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 182 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 239
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 240 NANPNANPNANPNAN 254
>gi|117591|sp|P26694|CSP_PLARE CIRCUMSPOROZOITE PROTEIN PRECURSOR
(CS) >gi|102373|pir||A39756 circumsporozoite protein -
Plasmodium reichenowi >gi|160229 (M60972)
circumsporozoite protein [Plasmodium reichenowi]
Length = 388
Score = 40.6 bits (93), Expect = 0.028
Identities = 41/170 (24%), Positives = 56/170 (32%), Gaps = 15/170 (8%)
Query: 304 NKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYK 363
N ++ N + E+ + D D G + + H R E K H KQ
Sbjct: 61 NWYSLKKNSRSLGENDDADNGDADNGDEGIDENRRH---RNKEGKEKLKKPKHNKLKQ-- 115
Query: 364 PKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPN---KKSPNPNTRNTPGQ 420
P ND +P V + + N A+PN +PN N P
Sbjct: 116 PGNDNVDPNANPN------VDPNANPNVDPNANPNVDPNANPNVDPNANPNVNPNANPNV 169
Query: 421 QTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
+P PN PN + +A P NP NP A N N N
Sbjct: 170 DPNANPNVNPNANPNV-NPNANPNVNPNANPNANPNANPNANPNANPNAN 218
Score = 36.7 bits (83), Expect = 0.41
Identities = 24/75 (32%), Positives = 27/75 (36%), Gaps = 6/75 (8%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN A P NP NP
Sbjct: 194 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-----ANPNANPNANPNANP 247
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 248 NANPNANPNANPNAN 262
Score = 36.7 bits (83), Expect = 0.41
Identities = 23/75 (30%), Positives = 28/75 (36%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A +PN +PN N P +P PN PN + +A P NP NP
Sbjct: 186 NPNANPNVNPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 243
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 244 NANPNANPNANPNAN 258
Score = 32.5 bits (72), Expect = 8.1
Identities = 20/78 (25%), Positives = 29/78 (36%), Gaps = 1/78 (1%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN ++ N + +
Sbjct: 226 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNRNNEANGQGHNKPNDQNRNV 284
Query: 456 GAGRPNNSGGKYNRNRGP 473
N+ G+ N N P
Sbjct: 285 NENANANNAGRNNNNEEP 302
>gi|294119|gb|AAA29542.1| (M83164) circumsporozoite protein
[Plasmodium falciparum] >gi|294142|gb|AAA29563.1|
(M83150) circumsporozoite protein [Plasmodium
falciparum] >gi|294161|gb|AAA29576.1| (M83163)
circumsporozoite protein [Plasmodium falciparum]
Length = 436
Score = 40.2 bits (92), Expect = 0.036
Identities = 43/164 (26%), Positives = 55/164 (33%), Gaps = 9/164 (5%)
Query: 310 NNHSTADESRELRRFDTLRDDRGRG--QGKHHFK-DRLTVSGEAAAKQAHKPFKQYKPKN 366
N +S SR L D ++ G +GK K D E K HK KQ N
Sbjct: 61 NWYSLKKNSRSLGENDDGNNNNGDNGREGKDEDKRDGNNEDNEKLRKPKHKKLKQPGDGN 120
Query: 367 DRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSP 426
+ + + V + + N A A+PN +PN N P +P
Sbjct: 121 PDPNANPNVDPNANPNVDPNANPNANPNANPNANPNANPNA-NPNANPNANPNANPNANP 179
Query: 427 YKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
PN PN A P NP NP A N N N
Sbjct: 180 NANPNANPN-----ANPNANPNANPNVDPNANPNANPNANPNAN 218
Score = 39.9 bits (91), Expect = 0.048
Identities = 25/81 (30%), Positives = 32/81 (38%), Gaps = 2/81 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 250 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 307
Query: 456 GAGRPNNSGGKYNRNRGPRYP 476
A N+ G + P P
Sbjct: 308 NANPNKNNQGNGQGHNMPNDP 328
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 222 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 279
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 280 NANPNANPNANPNAN 294
Score = 37.5 bits (85), Expect = 0.24
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 182 NPNANPNANPNA-NPNANPNVDPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 239
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 240 NANPNANPNANPNAN 254
Score = 36.0 bits (81), Expect = 0.71
Identities = 24/75 (32%), Positives = 26/75 (34%), Gaps = 6/75 (8%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN PN N P +P PN PN A P NP NP
Sbjct: 190 NPNANPNANPNV-DPNANPNANPNANPNANPNANPNANPN-----ANPNANPNANPNANP 243
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 244 NANPNANPNANPNAN 258
>gi|552190 (M57498) circumsporozoite protein [Plasmodium falciparum]
Length = 393
Score = 39.9 bits (91), Expect = 0.048
Identities = 25/81 (30%), Positives = 32/81 (38%), Gaps = 2/81 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 207 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 264
Query: 456 GAGRPNNSGGKYNRNRGPRYP 476
A N+ G + P P
Sbjct: 265 NANPNKNNQGNGQGHNMPNDP 285
Score = 37.9 bits (86), Expect = 0.18
Identities = 24/75 (32%), Positives = 28/75 (37%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN +A P NP NP
Sbjct: 151 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNV-DPNANPNANPNANPNANP 208
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 209 NANPNANPNANPNAN 223
Score = 37.5 bits (85), Expect = 0.24
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 171 NPNANPNANPNA-NPNANPNVDPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 228
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 229 NANPNANPNANPNAN 243
Score = 37.5 bits (85), Expect = 0.24
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 159 NPNANPNANPNA-NPNANPNANPNANPNANPNVDPNANPN-ANPNANPNANPNANPNANP 216
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 217 NANPNANPNANPNAN 231
Score = 35.6 bits (80), Expect = 0.93
Identities = 23/75 (30%), Positives = 27/75 (35%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A+PN PN N P +P PN PN + +A P NP NP
Sbjct: 115 NPNVDPNANPNV-DPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 172
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 173 NANPNANPNANPNAN 187
Score = 33.2 bits (74), Expect = 4.7
Identities = 22/68 (32%), Positives = 25/68 (36%), Gaps = 2/68 (2%)
Query: 403 AHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNN 462
A+PN PN N P P PN PN + +A P NP NP A N
Sbjct: 106 ANPNV-DPNANPNVDPNANPNVDPNANPNANPN-ANPNANPNANPNANPNANPNANPNAN 163
Query: 463 SGGKYNRN 470
N N
Sbjct: 164 PNANPNAN 171
>gi|552191 (M57499) circumsporozoite protein [Plasmodium falciparum]
Length = 424
Score = 39.9 bits (91), Expect = 0.048
Identities = 25/81 (30%), Positives = 32/81 (38%), Gaps = 2/81 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 238 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 295
Query: 456 GAGRPNNSGGKYNRNRGPRYP 476
A N+ G + P P
Sbjct: 296 NANPNKNNQGNGQGHNMPNDP 316
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 194 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 251
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 252 NANPNANPNANPNAN 266
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 198 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 255
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 256 NANPNANPNANPNAN 270
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 162 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 219
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 220 NANPNANPNANPNAN 234
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 190 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 247
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 248 NANPNANPNANPNAN 262
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 182 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 239
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 240 NANPNANPNANPNAN 254
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 178 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 235
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 236 NANPNANPNANPNAN 250
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 202 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 259
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 260 NANPNANPNANPNAN 274
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 186 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 243
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 244 NANPNANPNANPNAN 258
Score = 35.2 bits (79), Expect = 1.2
Identities = 24/80 (30%), Positives = 28/80 (35%), Gaps = 2/80 (2%)
Query: 391 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 450
P + N A+PN PN N P P PN PN + +A P NP N
Sbjct: 121 PDPNANPNVDPNANPNV-DPNANPNVDPNANPNVDPNANPNANPN-ANPNANPNANPNAN 178
Query: 451 PGQKTGAGRPNNSGGKYNRN 470
P A N N N
Sbjct: 179 PNANPNANPNANPNANPNAN 198
Score = 33.2 bits (74), Expect = 4.7
Identities = 21/80 (26%), Positives = 29/80 (36%), Gaps = 1/80 (1%)
Query: 391 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 450
P++ + G+G +PN + P +P PN PN +A P NP N
Sbjct: 108 PKHKKLKQPGDGNPDPNANPNVDPNANPNVDPNANPNVDPNANPNV-DPNANPNANPNAN 166
Query: 451 PGQKTGAGRPNNSGGKYNRN 470
P A N N N
Sbjct: 167 PNANPNANPNANPNANPNAN 186
>gi|117583|sp|P02893|CSP_PLAFA CIRCUMSPOROZOITE PROTEIN PRECURSOR
(CS) >gi|72381|pir||OZZQAF circumsporozoite protein -
Plasmodium falciparum (isolate IMTM22) >gi|160161
(K02194) circumsporozoite protein [Plasmodium
falciparum]
Length = 412
Score = 39.9 bits (91), Expect = 0.048
Identities = 25/81 (30%), Positives = 32/81 (38%), Gaps = 2/81 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 226 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 283
Query: 456 GAGRPNNSGGKYNRNRGPRYP 476
A N+ G + P P
Sbjct: 284 NANPNKNNQGNGQGHNMPNDP 304
Score = 37.9 bits (86), Expect = 0.18
Identities = 24/75 (32%), Positives = 28/75 (37%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN +A P NP NP
Sbjct: 170 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNV-DPNANPNANPNANPNANP 227
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 228 NANPNANPNANPNAN 242
Score = 37.5 bits (85), Expect = 0.24
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 190 NPNANPNANPNA-NPNANPNVDPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 247
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 248 NANPNANPNANPNAN 262
Score = 36.4 bits (82), Expect = 0.54
Identities = 42/164 (25%), Positives = 54/164 (32%), Gaps = 21/164 (12%)
Query: 310 NNHSTADESRELRRFDTLRDDRGRG--QGKHHFK-DRLTVSGEAAAKQAHKPFKQYKPKN 366
N +S SR L D ++ G +GK K D E K HK KQ
Sbjct: 61 NWYSLKKNSRSLGENDDGNNNNGDNGREGKDEDKRDGNNEDNEKLRKPKHKKLKQP---- 116
Query: 367 DRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSP 426
+G+P + + N A+PN PN N P +P
Sbjct: 117 ----GDGNP--------DPNANPNVDPNANPNVDPNANPNV-DPNANPNANPNANPNANP 163
Query: 427 YKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
PN PN + +A P NP NP A N N N
Sbjct: 164 NANPNANPN-ANPNANPNANPNANPNANPNANPNANPNANPNAN 206
>gi|84198|pir||S05428 circumsporozoite protein - Plasmodium
falciparum (isolate NF54) >gi|160169 (M22982)
circumsporozoite protein [Plasmodium falciparum]
Length = 405
Score = 39.9 bits (91), Expect = 0.048
Identities = 25/81 (30%), Positives = 32/81 (38%), Gaps = 2/81 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 219 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 276
Query: 456 GAGRPNNSGGKYNRNRGPRYP 476
A N+ G + P P
Sbjct: 277 NANPNKNNQGNGQGHNMPNDP 297
Score = 38.7 bits (88), Expect = 0.11
Identities = 32/122 (26%), Positives = 43/122 (35%), Gaps = 10/122 (8%)
Query: 349 EAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKK 408
E K HK KQ ++G+P + V + + N A+PN
Sbjct: 84 EKLRKPKHKKLKQP--------ADGNPDPNANPNVDPNANPNVDPNANPNVDPNANPNA- 134
Query: 409 SPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYN 468
+PN N P +P PN PN + +A P NP NP A N N
Sbjct: 135 NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANPNANPNANPNANPN 193
Query: 469 RN 470
N
Sbjct: 194 AN 195
Score = 37.9 bits (86), Expect = 0.18
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 183 NPNANPNANPNA-NPNANPNANPNVDPNANPNANPNANPN-ANPNANPNANPNANPNANP 240
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 241 NANPNANPNANPNAN 255
Score = 37.5 bits (85), Expect = 0.24
Identities = 24/75 (32%), Positives = 28/75 (37%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P P PN PN + +A P NP NP
Sbjct: 179 NPNANPNANPNA-NPNANPNANPNANPNVDPNANPNANPN-ANPNANPNANPNANPNANP 236
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 237 NANPNANPNANPNAN 251
Score = 37.5 bits (85), Expect = 0.24
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 175 NPNANPNANPNA-NPNANPNANPNANPNANPNVDPNANPN-ANPNANPNANPNANPNANP 232
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 233 NANPNANPNANPNAN 247
Score = 36.7 bits (83), Expect = 0.41
Identities = 23/75 (30%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P +P NP
Sbjct: 159 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNVDPNANPNANP 216
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 217 NANPNANPNANPNAN 231
Score = 32.8 bits (73), Expect = 6.2
Identities = 22/78 (28%), Positives = 28/78 (35%), Gaps = 1/78 (1%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + N +P +
Sbjct: 243 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNKNNQGNGQGHNMPNDPNRNV 301
Query: 456 GAGRPNNSGGKYNRNRGP 473
NS K N N P
Sbjct: 302 DENANANSAVKNNNNEEP 319
>gi|294123|gb|AAA29544.1| (M83166) circumsporozoite protein
[Plasmodium falciparum] >gi|294127|gb|AAA29546.1|
(M83168) circumsporozoite protein [Plasmodium
falciparum] >gi|294131|gb|AAA29548.1| (M83170)
circumsporozoite protein [Plasmodium falciparum]
>gi|294145|gb|AAA29565.1| (M83152) circumsporozoite
protein [Plasmodium falciparum]
>gi|294149|gb|AAA29568.1| (M83155) circumsporozoite
protein [Plasmodium falciparum]
>gi|294154|gb|AAA29571.1| (M83158) circumsporozoite
protein [Plasmodium falciparum]
Length = 432
Score = 39.9 bits (91), Expect = 0.048
Identities = 25/81 (30%), Positives = 32/81 (38%), Gaps = 2/81 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 246 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 303
Query: 456 GAGRPNNSGGKYNRNRGPRYP 476
A N+ G + P P
Sbjct: 304 NANPNKNNQGNGQGHNMPNDP 324
Score = 38.7 bits (88), Expect = 0.11
Identities = 42/167 (25%), Positives = 56/167 (33%), Gaps = 7/167 (4%)
Query: 310 NNHSTADESRELRRFDTLRDDRGRG--QGKHHFK-DRLTVSGEAAAKQAHKPFKQYKPKN 366
N +S SR L D ++ G +GK K D E K HK KQ N
Sbjct: 61 NWYSLKKNSRSLGENDDGNNNNGDNGREGKDEDKRDGNNEDNEKLRKPKHKKLKQPGDGN 120
Query: 367 DRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKK---SPNPNTRNTPGQQTR 423
+ + + V + + N A+PN +PN N P
Sbjct: 121 PDPNANPNVDPNANPNVDPNANPNVDPNANPNVDPNANPNANPNANPNANPNANPNANPN 180
Query: 424 KSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
+P PN PN + +A P NP NP A N N N
Sbjct: 181 ANPNANPNANPN-ANPNANPNANPNANPNANPNANPNANPNANPNAN 226
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 202 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 259
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 260 NANPNANPNANPNAN 274
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 210 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 267
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 268 NANPNANPNANPNAN 282
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 206 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 263
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 264 NANPNANPNANPNAN 278
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 158 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 215
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 216 NANPNANPNANPNAN 230
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 194 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 251
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 252 NANPNANPNANPNAN 266
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 198 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 255
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 256 NANPNANPNANPNAN 270
>gi|294134|gb|AAA29550.1| (M83172) circumsporozoite protein
[Plasmodium falciparum]
Length = 416
Score = 39.9 bits (91), Expect = 0.048
Identities = 25/81 (30%), Positives = 32/81 (38%), Gaps = 2/81 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 230 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 287
Query: 456 GAGRPNNSGGKYNRNRGPRYP 476
A N+ G + P P
Sbjct: 288 NANPNKNNQGNGQGHNMPNDP 308
Score = 37.9 bits (86), Expect = 0.18
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 190 NPNANPNANPNA-NPNANPNANPNVDPNANPNANPNANPN-ANPNANPNANPNANPNANP 247
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 248 NANPNANPNANPNAN 262
Score = 37.5 bits (85), Expect = 0.24
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 194 NPNANPNANPNA-NPNANPNVDPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 251
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 252 NANPNANPNANPNAN 266
Score = 36.4 bits (82), Expect = 0.54
Identities = 23/75 (30%), Positives = 28/75 (36%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + + P NP NP
Sbjct: 170 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNVDPNANPNANPNANP 227
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 228 NANPNANPNANPNAN 242
Score = 36.4 bits (82), Expect = 0.54
Identities = 42/164 (25%), Positives = 54/164 (32%), Gaps = 21/164 (12%)
Query: 310 NNHSTADESRELRRFDTLRDDRGRG--QGKHHFK-DRLTVSGEAAAKQAHKPFKQYKPKN 366
N +S SR L D ++ G +GK K D E K HK KQ
Sbjct: 61 NWYSLKKNSRSLGENDDGNNNNGDNGREGKDEDKRDGNNEDNEKLRKPKHKKLKQP---- 116
Query: 367 DRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSP 426
+G+P + + N A+PN PN N P +P
Sbjct: 117 ----GDGNP--------DPNANPNVDPNANPNVDPNANPNV-DPNANPNANPNANPNANP 163
Query: 427 YKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
PN PN + +A P NP NP A N N N
Sbjct: 164 NANPNANPN-ANPNANPNANPNANPNANPNANPNANPNANPNAN 206
>gi|294138|gb|AAA29552.1| (M83174) circumsporozoite protein
[Plasmodium falciparum]
Length = 420
Score = 39.9 bits (91), Expect = 0.048
Identities = 25/81 (30%), Positives = 32/81 (38%), Gaps = 2/81 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 234 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 291
Query: 456 GAGRPNNSGGKYNRNRGPRYP 476
A N+ G + P P
Sbjct: 292 NANPNKNNQGNGQGHNMPNDP 312
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 198 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 255
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 256 NANPNANPNANPNAN 270
Score = 38.3 bits (87), Expect = 0.14
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 194 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 251
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 252 NANPNANPNANPNAN 266
Score = 37.5 bits (85), Expect = 0.24
Identities = 24/78 (30%), Positives = 29/78 (36%), Gaps = 4/78 (5%)
Query: 396 NAGAGNGAHPN---KKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPG 452
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 178 NPNANPNANPNVDPNANPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPN 236
Query: 453 QKTGAGRPNNSGGKYNRN 470
A N N N
Sbjct: 237 ANPNANPNANPNANPNAN 254
Score = 36.4 bits (82), Expect = 0.54
Identities = 23/75 (30%), Positives = 28/75 (36%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 186 NPNVDPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 243
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 244 NANPNANPNANPNAN 258
Score = 34.4 bits (77), Expect = 2.1
Identities = 43/164 (26%), Positives = 54/164 (32%), Gaps = 21/164 (12%)
Query: 310 NNHSTADESRELRRFDTLRDDRGRG--QGKHHFK-DRLTVSGEAAAKQAHKPFKQYKPKN 366
N +S SR L D ++ G +GK K D E K HK KQ
Sbjct: 61 NWYSLKKNSRSLGENDDGNNNNGDNGREGKDEDKRDGNNEDNEKLRKPKHKKLKQP---- 116
Query: 367 DRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSP 426
+G+P + V + + N A+PN NPN P +P
Sbjct: 117 ----GDGNPDPNANPNVDPNANPNVDPNANPNVDPNANPNA---NPNAN--PNANPNANP 167
Query: 427 YKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
PN PN A P NP NP A N N N
Sbjct: 168 NANPNANPN-----ANPNANPNANPNVDPNANPNANPNANPNAN 206
>gi|6226848|sp|P19597|CSP_PLAFO CIRCUMSPOROZOITE PROTEIN PRECURSOR
(CS) >gi|160153 (M83886) circumsporozoite protein
[Plasmodium falciparum] >gi|2276342|emb|CAA33421|
(X15363) circumsporozoite protein (AA 1 - 405)
[Plasmodium falciparum]
Length = 397
Score = 39.9 bits (91), Expect = 0.048
Identities = 25/81 (30%), Positives = 32/81 (38%), Gaps = 2/81 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 211 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 268
Query: 456 GAGRPNNSGGKYNRNRGPRYP 476
A N+ G + P P
Sbjct: 269 NANPNKNNQGNGQGHNMPNDP 289
Score = 38.7 bits (88), Expect = 0.11
Identities = 32/122 (26%), Positives = 43/122 (35%), Gaps = 10/122 (8%)
Query: 349 EAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKK 408
E K HK KQ ++G+P + V + + N A+PN
Sbjct: 84 EKLRKPKHKKLKQP--------ADGNPDPNANPNVDPNANPNVDPNANPNVDPNANPNA- 134
Query: 409 SPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYN 468
+PN N P +P PN PN + +A P NP NP A N N
Sbjct: 135 NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANPNANPNANPNANPN 193
Query: 469 RN 470
N
Sbjct: 194 AN 195
Score = 37.9 bits (86), Expect = 0.18
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 175 NPNANPNANPNA-NPNANPNANPNVDPNANPNANPNANPN-ANPNANPNANPNANPNANP 232
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 233 NANPNANPNANPNAN 247
Score = 37.9 bits (86), Expect = 0.18
Identities = 24/75 (32%), Positives = 28/75 (37%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN +A P NP NP
Sbjct: 159 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNV-DPNANPNANPNANPNANP 216
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 217 NANPNANPNANPNAN 231
Score = 37.5 bits (85), Expect = 0.24
Identities = 24/75 (32%), Positives = 28/75 (37%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P P PN PN + +A P NP NP
Sbjct: 171 NPNANPNANPNA-NPNANPNANPNANPNVDPNANPNANPN-ANPNANPNANPNANPNANP 228
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 229 NANPNANPNANPNAN 243
Score = 32.8 bits (73), Expect = 6.2
Identities = 22/78 (28%), Positives = 28/78 (35%), Gaps = 1/78 (1%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + N +P +
Sbjct: 235 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNKNNQGNGQGHNMPNDPNRNV 293
Query: 456 GAGRPNNSGGKYNRNRGP 473
NS K N N P
Sbjct: 294 DENANANSAVKNNNNEEP 311
>gi|4493889|emb|CAB38998.1| (AL034558) predicted using hexExon;
MAL3P2.11 (PFC0210c), Circumsporozoite (CS) protein,
len: 397 aa; Similarity to many Plasmodium CS proteins.
[Plasmodium falciparum]
Length = 396
Score = 39.9 bits (91), Expect = 0.048
Identities = 25/81 (30%), Positives = 32/81 (38%), Gaps = 2/81 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 210 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 267
Query: 456 GAGRPNNSGGKYNRNRGPRYP 476
A N+ G + P P
Sbjct: 268 NANPNKNNQGNGQGHNMPNDP 288
Score = 38.7 bits (88), Expect = 0.11
Identities = 32/122 (26%), Positives = 43/122 (35%), Gaps = 10/122 (8%)
Query: 349 EAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKK 408
E K HK KQ ++G+P + V + + N A+PN
Sbjct: 83 EKLRKPKHKKLKQP--------ADGNPDPNANPNVDPNANPNVDPNANPNVDPNANPNA- 133
Query: 409 SPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYN 468
+PN N P +P PN PN + +A P NP NP A N N
Sbjct: 134 NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANPNANPNANPNANPN 192
Query: 469 RN 470
N
Sbjct: 193 AN 194
Score = 37.9 bits (86), Expect = 0.18
Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 174 NPNANPNANPNA-NPNANPNANPNVDPNANPNANPNANPN-ANPNANPNANPNANPNANP 231
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 232 NANPNANPNANPNAN 246
Score = 37.9 bits (86), Expect = 0.18
Identities = 24/75 (32%), Positives = 28/75 (37%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN +A P NP NP
Sbjct: 158 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNV-DPNANPNANPNANPNANP 215
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 216 NANPNANPNANPNAN 230
Score = 37.5 bits (85), Expect = 0.24
Identities = 24/75 (32%), Positives = 28/75 (37%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P P PN PN + +A P NP NP
Sbjct: 170 NPNANPNANPNA-NPNANPNANPNANPNVDPNANPNANPN-ANPNANPNANPNANPNANP 227
Query: 456 GAGRPNNSGGKYNRN 470
A N N N
Sbjct: 228 NANPNANPNANPNAN 242
Score = 32.8 bits (73), Expect = 6.2
Identities = 22/78 (28%), Positives = 28/78 (35%), Gaps = 1/78 (1%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + N +P +
Sbjct: 234 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPNKNNQGNGQGHNMPNDPNRNV 292
Query: 456 GAGRPNNSGGKYNRNRGP 473
NS K N N P
Sbjct: 293 DENANANSAVKNNNNEEP 310
>gi|224051|prf||1008275A protein,circumsporozoite [Plasmodium sp.]
Length = 100
Score = 39.5 bits (90), Expect = 0.062
Identities = 24/75 (32%), Positives = 30/75 (40%), Gaps = 2/75 (2%)
Query: 396 NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
N A A+PN +PN N P +P PN PN + +A P NP NP
Sbjct: 24 NPNANPNANPNA-NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANP 81
Query: 456 GAGRPNNSGGKYNRN 470
A N N+N
Sbjct: 82 NANPNANPNANPNKN 96
Score = 33.6 bits (75), Expect = 3.6
Identities = 19/62 (30%), Positives = 23/62 (36%), Gaps = 1/62 (1%)
Query: 409 SPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYN 468
+PN N P +P PN PN + +A P NP NP A N N
Sbjct: 4 NPNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANPNANPNANPNANPN 62
Query: 469 RN 470
N
Sbjct: 63 AN 64
Score = 33.2 bits (74), Expect = 4.7
Identities = 19/61 (31%), Positives = 22/61 (35%), Gaps = 1/61 (1%)
Query: 410 PNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNR 469
PN N P +P PN PN + +A P NP NP A N N
Sbjct: 1 PNANPNANPNANPNANPNANPNANPN-ANPNANPNANPNANPNANPNANPNANPNANPNA 59
Query: 470 N 470
N
Sbjct: 60 N 60
>gi|283464|pir||A60540 sporozoite surface protein 2 - Plasmodium
yoelii (fragment)
Length = 477
Score = 38.7 bits (88), Expect = 0.11
Identities = 24/84 (28%), Positives = 29/84 (33%), Gaps = 6/84 (7%)
Query: 391 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 450
P N N N + NPN N P + PNN PN P++ P N
Sbjct: 93 PNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNN-PNNPNNPNNP-----NN 146
Query: 451 PGQKTGAGRPNNSGGKYNRNRGPR 474
P PNN N N P+
Sbjct: 147 PNNPNNPNNPNNPNDPSNPNNHPK 170
Score = 37.9 bits (86), Expect = 0.18
Identities = 27/89 (30%), Positives = 33/89 (36%), Gaps = 5/89 (5%)
Query: 391 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNA--PNFPSDHATPTFNPY 448
P N N N + NPN N P + PNN PN P++ P NP
Sbjct: 96 PNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPN-NP- 153
Query: 449 GNPGQKTGAGRPNNSGGKYN-RNRGPRYP 476
NP PNN + N + R P P
Sbjct: 154 NNPNNPNDPSNPNNHPKRRNPKRRNPNKP 182
Score = 37.1 bits (84), Expect = 0.32
Identities = 21/68 (30%), Positives = 29/68 (41%), Gaps = 2/68 (2%)
Query: 403 AHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNN 462
++PNK PNPN + P + P PN PS+ P N NP + + P+N
Sbjct: 198 SNPNK--PNPNEPSNPNKPNPNEPSNPNKPNPNEPSNPNKPNPNEPLNPNEPSNPNEPSN 255
Query: 463 SGGKYNRN 470
N N
Sbjct: 256 PNAPSNPN 263
Score = 37.1 bits (84), Expect = 0.32
Identities = 25/88 (28%), Positives = 31/88 (34%), Gaps = 3/88 (3%)
Query: 383 VPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHAT 442
+P + P N N + NPN N P + PNN PN P++
Sbjct: 67 IPNKIPEKPSNPEEPVNPNDPNDPNNPNNPNNPNNPNNPNNPNNPNNPNN-PNNPNNPNN 125
Query: 443 PTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
P NP NP PNN N N
Sbjct: 126 PN-NP-NNPNNPNNPNNPNNPNNPNNPN 151
Score = 36.4 bits (82), Expect = 0.54
Identities = 23/89 (25%), Positives = 38/89 (41%), Gaps = 11/89 (12%)
Query: 391 PRNHRNAGAGNGAHPNKKSPN----PNTRNTPGQQTRKSPYKYPN-----NAPNFPSDHA 441
P N ++PNK +PN PN + P + + + PN N P+ P++ +
Sbjct: 219 PSNPNKPNPNEPSNPNKPNPNEPLNPNEPSNPNEPSNPNAPSNPNEPSNPNEPSNPNEPS 278
Query: 442 TPTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
P N NP + + +P+N N N
Sbjct: 279 NP--NEPSNPNEPSNPKKPSNPNEPSNPN 305
Score = 36.4 bits (82), Expect = 0.54
Identities = 26/90 (28%), Positives = 30/90 (32%), Gaps = 9/90 (10%)
Query: 389 TGPRNH--------RNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDH 440
+ P NH RN PN PNPN + P + P PN PS+
Sbjct: 163 SNPNNHPKRRNPKRRNPNKPKPNKPNPNKPNPNEPSNPNKPNPNEPSNPNKPNPNEPSNP 222
Query: 441 ATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
P N NP K P N N N
Sbjct: 223 NKPNPNEPSNP-NKPNPNEPLNPNEPSNPN 251
Score = 35.6 bits (80), Expect = 0.93
Identities = 27/88 (30%), Positives = 32/88 (35%), Gaps = 4/88 (4%)
Query: 383 VPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHAT 442
+PE S P N N + NPN N P + PNN PN P++
Sbjct: 71 IPEKPSN-PEEPVNPNDPNDPNNPNNPNNPNNPNNPNNPNNPNNPNNPNN-PNNPNNPNN 128
Query: 443 PTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
P NP NP PNN N N
Sbjct: 129 PN-NP-NNPNNPNNPNNPNNPNNPNNPN 154
Score = 35.6 bits (80), Expect = 0.93
Identities = 24/70 (34%), Positives = 29/70 (41%), Gaps = 4/70 (5%)
Query: 401 NGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRP 460
N +PN + NPN N P + PNN PN P++ P NP NP P
Sbjct: 92 NPNNPNNPN-NPNNPNNPNNPNNPNNPNNPNN-PNNPNNPNNPN-NP-NNPNNPNNPNNP 147
Query: 461 NNSGGKYNRN 470
NN N N
Sbjct: 148 NNPNNPNNPN 157
Score = 33.2 bits (74), Expect = 4.7
Identities = 24/81 (29%), Positives = 34/81 (41%), Gaps = 9/81 (11%)
Query: 391 PRNHRNAGAGNGAHPNKKSPN-PNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYG 449
P N ++PNK +PN P+ N P +P K PN PN P + P+
Sbjct: 197 PSNPNKPNPNEPSNPNKPNPNEPSNPNKPNPNEPSNPNK-PN--PNEPLNPNEPS----- 248
Query: 450 NPGQKTGAGRPNNSGGKYNRN 470
NP + + P+N N N
Sbjct: 249 NPNEPSNPNAPSNPNEPSNPN 269
>gi|6679637|ref|NP_031951.1|| elastin
>gi|1706636|sp|P54320|ELS_MOUSE ELASTIN PRECURSOR
(TROPOELASTIN) >gi|2144808|pir||EAMS elastin precursor -
mouse >gi|473274 (U08210) tropoelastin [Mus musculus]
Length = 860
Score = 38.7 bits (88), Expect = 0.11
Identities = 31/95 (32%), Positives = 34/95 (35%), Gaps = 8/95 (8%)
Query: 373 GSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNN 432
G+ A F V GV TG A G GA P PG P YP
Sbjct: 218 GTGARFPGVGVLPGVPTGTGVKAKAPGGGGAFSGIPGVGPFGGQQPGV-----PLGYPIK 272
Query: 433 APNFPSDHATPTFN---PYGNPGQKTGAGRPNNSG 464
AP P + P N PYG G AG P +G
Sbjct: 273 APKLPGGYGLPYTNGKLPYGVAGAGGKAGYPTGTG 307
>gi|2462758 (AC002292) putative RNA-binding protein [Arabidopsis
thaliana]
Length = 292
Score = 38.7 bits (88), Expect = 0.11
Identities = 23/72 (31%), Positives = 37/72 (50%), Gaps = 3/72 (4%)
Query: 302 DNNKHAYHNNHSTADESRELRRFDTLRDDR-GRGQGKHHFKDRLTVSGEAAAKQAHKPFK 360
D ++ Y + + RE R FD D R R G++ ++DR SG+ + H PF+
Sbjct: 154 DGHRDRYGDRDLERERERE-REFDRYMDGRRDRDGGRYSYRDRFD-SGDKYEPRDHYPFE 211
Query: 361 QYKPKNDRSLSE 372
+Y P DR +S+
Sbjct: 212 RYAPPGDRFVSD 223
>gi|548996|sp|Q01443|SSP2_PLAYO SPOROZOITE SURFACE PROTEIN 2
PRECURSOR >gi|323142|pir||A45559 sporozoite surface
protein 2 - Plasmodium yoelii >gi|160693 (M84732)
sporozoite surface protein [Plasmodium yoelii]
Length = 826
Score = 38.7 bits (88), Expect = 0.11
Identities = 24/84 (28%), Positives = 29/84 (33%), Gaps = 6/84 (7%)
Query: 391 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 450
P N N N + NPN N P + PNN PN P++ P N
Sbjct: 315 PNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNN-PNNPNNPNNP-----NN 368
Query: 451 PGQKTGAGRPNNSGGKYNRNRGPR 474
P PNN N N P+
Sbjct: 369 PNNPNNPNNPNNPNDPSNPNNHPK 392
Score = 37.9 bits (86), Expect = 0.18
Identities = 27/89 (30%), Positives = 33/89 (36%), Gaps = 5/89 (5%)
Query: 391 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNA--PNFPSDHATPTFNPY 448
P N N N + NPN N P + PNN PN P++ P NP
Sbjct: 318 PNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPN-NP- 375
Query: 449 GNPGQKTGAGRPNNSGGKYN-RNRGPRYP 476
NP PNN + N + R P P
Sbjct: 376 NNPNNPNDPSNPNNHPKRRNPKRRNPNKP 404
Score = 37.1 bits (84), Expect = 0.32
Identities = 21/68 (30%), Positives = 29/68 (41%), Gaps = 2/68 (2%)
Query: 403 AHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNN 462
++PNK PNPN + P + P PN PS+ P N NP + + P+N
Sbjct: 420 SNPNK--PNPNEPSNPNKPNPNEPSNPNKPNPNEPSNPNKPNPNEPLNPNEPSNPNEPSN 477
Query: 463 SGGKYNRN 470
N N
Sbjct: 478 PNAPSNPN 485
Score = 37.1 bits (84), Expect = 0.32
Identities = 25/88 (28%), Positives = 31/88 (34%), Gaps = 3/88 (3%)
Query: 383 VPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHAT 442
+P + P N N + NPN N P + PNN PN P++
Sbjct: 289 IPNKIPEKPSNPEEPVNPNDPNDPNNPNNPNNPNNPNNPNNPNNPNNPNN-PNNPNNPNN 347
Query: 443 PTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
P NP NP PNN N N
Sbjct: 348 PN-NP-NNPNNPNNPNNPNNPNNPNNPN 373
Score = 36.4 bits (82), Expect = 0.54
Identities = 23/89 (25%), Positives = 38/89 (41%), Gaps = 11/89 (12%)
Query: 391 PRNHRNAGAGNGAHPNKKSPN----PNTRNTPGQQTRKSPYKYPN-----NAPNFPSDHA 441
P N ++PNK +PN PN + P + + + PN N P+ P++ +
Sbjct: 441 PSNPNKPNPNEPSNPNKPNPNEPLNPNEPSNPNEPSNPNAPSNPNEPSNPNEPSNPNEPS 500
Query: 442 TPTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
P N NP + + +P+N N N
Sbjct: 501 NP--NEPSNPNEPSNPKKPSNPNEPSNPN 527
Score = 36.4 bits (82), Expect = 0.54
Identities = 26/90 (28%), Positives = 30/90 (32%), Gaps = 9/90 (10%)
Query: 389 TGPRNH--------RNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDH 440
+ P NH RN PN PNPN + P + P PN PS+
Sbjct: 385 SNPNNHPKRRNPKRRNPNKPKPNKPNPNKPNPNEPSNPNKPNPNEPSNPNKPNPNEPSNP 444
Query: 441 ATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
P N NP K P N N N
Sbjct: 445 NKPNPNEPSNP-NKPNPNEPLNPNEPSNPN 473
Score = 35.6 bits (80), Expect = 0.93
Identities = 27/88 (30%), Positives = 32/88 (35%), Gaps = 4/88 (4%)
Query: 383 VPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHAT 442
+PE S P N N + NPN N P + PNN PN P++
Sbjct: 293 IPEKPSN-PEEPVNPNDPNDPNNPNNPNNPNNPNNPNNPNNPNNPNNPNN-PNNPNNPNN 350
Query: 443 PTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
P NP NP PNN N N
Sbjct: 351 PN-NP-NNPNNPNNPNNPNNPNNPNNPN 376
Score = 35.6 bits (80), Expect = 0.93
Identities = 24/70 (34%), Positives = 29/70 (41%), Gaps = 4/70 (5%)
Query: 401 NGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRP 460
N +PN + NPN N P + PNN PN P++ P NP NP P
Sbjct: 314 NPNNPNNPN-NPNNPNNPNNPNNPNNPNNPNN-PNNPNNPNNPN-NP-NNPNNPNNPNNP 369
Query: 461 NNSGGKYNRN 470
NN N N
Sbjct: 370 NNPNNPNNPN 379
Score = 33.2 bits (74), Expect = 4.7
Identities = 24/81 (29%), Positives = 34/81 (41%), Gaps = 9/81 (11%)
Query: 391 PRNHRNAGAGNGAHPNKKSPN-PNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYG 449
P N ++PNK +PN P+ N P +P K PN PN P + P+
Sbjct: 419 PSNPNKPNPNEPSNPNKPNPNEPSNPNKPNPNEPSNPNK-PN--PNEPLNPNEPS----- 470
Query: 450 NPGQKTGAGRPNNSGGKYNRN 470
NP + + P+N N N
Sbjct: 471 NPNEPSNPNAPSNPNEPSNPN 491
>gi|6580291|emb|CAB63354.1| (AL032627) cDNA EST EMBL:D65921 comes
from this gene; cDNA EST yk385f3.3 comes from this gene;
cDNA EST yk385f3.5 comes from this gene; cDNA EST
EMBL:D66141 comes from this gene; cDNA EST EMBL:D69818
comes from this gene; cDN...
Length = 373
Score = 37.9 bits (86), Expect = 0.18
Identities = 32/107 (29%), Positives = 41/107 (37%), Gaps = 21/107 (19%)
Query: 386 GVSTGPRNHRNAGAGNGAHPNKKSPNPNTRN-TPGQQTRKSPYKYPNNAPNFPSDHATPT 444
G S + N GNG++ N N N N G + PY P + +P + P
Sbjct: 249 GNSGNGNGNSNGNNGNGSNGNGNGNNGNGNNGNNGNGHPQYPYPVPPHG-YYPPGYPYPP 307
Query: 445 FNPYGNPGQ---------------KTGAGRPN----NSGGKYNRNRG 472
PY PG + G G+PN NSGG RNRG
Sbjct: 308 GYPYPPPGAFYYPPGGIPQNGMNGQNGNGQPNIIVINSGGNKKRNRG 354
Score = 32.8 bits (73), Expect = 6.2
Identities = 30/94 (31%), Positives = 40/94 (41%), Gaps = 23/94 (24%)
Query: 388 STGPRNH--RNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTF 445
S+G N+ N+G+GNG N S N N+ N G NNA N + +
Sbjct: 215 SSGNSNYGSNNSGSGNG---NSNSGNGNSGNGNG-----------NNAGNSGNGNG---- 256
Query: 446 NPYGNPGQKTGAGRPNNSGGKYNRNRG---PRYP 476
N GN G + N+G N N G P+YP
Sbjct: 257 NSNGNNGNGSNGNGNGNNGNGNNGNNGNGHPQYP 290
>gi|2497274|sp|Q17107|AV71_ACAVI MUSCLE CELL INTERMEDIATE FILAMENT
PROTEIN AV71 >gi|283564|pir||S26431 intermediate
filament protein Av71, muscle - nematode
(Acanthocheilonema viteae) (fragment)
>gi|5709|emb|CAA48560| (X68557) Av71 muscle cell
intermediate filament [Acanthocheilonema viteae]
Length = 394
Score = 37.5 bits (85), Expect = 0.24
Identities = 33/110 (30%), Positives = 54/110 (49%), Gaps = 10/110 (9%)
Query: 164 RVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTDSHDWFRVVVKEGRNREVR 223
R +PE E+ ++EL + D A++D I I TD W+++ V+E + + R
Sbjct: 79 RDTTPENREYFKNELSSAI------RDIRAEYDQICNINRTDMESWYKLKVQEIQTQSTR 132
Query: 224 RLWESQGCQVSRLKRTRYGSVLLPREL--LRGQSTELPKTQVEALRTQLK 271
+ E QG +KR R L +L L G+++ L K Q + L QL+
Sbjct: 133 QNLE-QGYAKEEVKRLRVQLSELRGKLADLEGRNSLLEK-QTQELNYQLE 180
>gi|93140|pir||A40505 early protein EP0 - suid herpesvirus 1 (strain
Indiana-Funkhuser or Becker)
Length = 410
Score = 37.5 bits (85), Expect = 0.24
Identities = 44/188 (23%), Positives = 70/188 (36%), Gaps = 28/188 (14%)
Query: 288 QRRSAKATLHVNRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVS 347
QRR+ + VN D ++ H++ + E D G H +D LT +
Sbjct: 228 QRRAPMSHQGVNYIDTSESEAHSDSEVSSPDEE---------DSGASSSGVHTED-LTEA 277
Query: 348 GEAAAKQAHKPFK-----------QYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRN 396
E+A Q P + + + + R L G PE S+G +
Sbjct: 278 SESADDQRPAPRRSPRRARRAAVLRREQRRTRCLRRGRTGGQAQGETPEAPSSGEGSSAQ 337
Query: 397 AGA-GNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
GA G GA P + + R++P S + P+ PS A T P G P +
Sbjct: 338 HGASGAGAGPGSANTAASARSSPSSSPSSS-MRRPS-----PSASAPETAAPRGGPPASS 391
Query: 456 GAGRPNNS 463
+G P ++
Sbjct: 392 SSGSPRSA 399
>gi|124138|sp|P29129|ICP0_PRVIF TRANS-ACTING TRANSCRIPTIONAL PROTEIN
ICP0 (EARLY PROTEIN 0) (EP0) >gi|334048 (M57504) EPO
[Pseudorabies virus]
Length = 410
Score = 37.5 bits (85), Expect = 0.24
Identities = 44/188 (23%), Positives = 70/188 (36%), Gaps = 28/188 (14%)
Query: 288 QRRSAKATLHVNRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVS 347
QRR+ + VN D ++ H++ + E D G H +D LT +
Sbjct: 228 QRRAPMSHQGVNYIDTSESEAHSDSEVSSPDEE---------DSGASSSGVHTED-LTEA 277
Query: 348 GEAAAKQAHKPFK-----------QYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRN 396
E+A Q P + + + + R L G PE S+G +
Sbjct: 278 SESADDQRPAPRRSPRRARRAAVLRREQRRTRCLRRGRTGGQAQGETPEAPSSGEGSSAQ 337
Query: 397 AGA-GNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
GA G GA P + + R++P S + P+ PS A T P G P +
Sbjct: 338 HGASGAGAGPGSANTAASARSSPSSSPSSS-MRRPS-----PSASAPETAAPRGGPPASS 391
Query: 456 GAGRPNNS 463
+G P ++
Sbjct: 392 SSGSPRSA 399
>gi|3875920|emb|CAB04111.1| (Z81503) predicted using Genefinder;
similar to collagen; cDNA EST EMBL:D65450 comes from
this gene; cDNA EST EMBL:D68888 comes from this gene
[Caenorhabditis elegans]
Length = 305
Score = 37.5 bits (85), Expect = 0.24
Identities = 36/107 (33%), Positives = 39/107 (35%), Gaps = 23/107 (21%)
Query: 384 PEGVSTGPRNHRNAGA----GNGAH--------PNKKSPN--PNTRNTPGQQTRKSPYKY 429
P+G P N AGA GN AH P P P PG P
Sbjct: 190 PKGPRGAPGNSGRAGAPGQPGNDAHGYGGGVGAPGPAGPRGAPGPAGHPGSSGGGRPGPA 249
Query: 430 -PNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRY 475
P AP P P G+PGQ GRP SGG NR P+Y
Sbjct: 250 GPKGAPGQPGRPG-----PDGHPGQP---GRPGQSGGSGNRGVCPKY 288
>gi|6681425|dbj|BAA88672.1| (AB030233) CiMsi [Ciona intestinalis]
Length = 393
Score = 37.5 bits (85), Expect = 0.24
Identities = 30/98 (30%), Positives = 36/98 (36%), Gaps = 4/98 (4%)
Query: 373 GSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPG-QQTRKSPYKYPN 431
G+ Q Y P + R N GNGA PN S P + G T Y N
Sbjct: 299 GNMGNMQGGYQPGMMGMQGRGVNN---GNGAQPNAASTYPQNPTSYGPMPTSGGGYNQGN 355
Query: 432 NAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNR 469
N S A G+ GQK+G G N+ Y R
Sbjct: 356 TGSNNSSGQANTGNTGGGSYGQKSGGGSNNSGYHPYRR 393
>gi|2498734|sp|Q60865|P137_MOUSE GPI-ANCHORED PROTEIN P137
>gi|1098569 (U27838)
glycosyl-phosphatidyl-inositol-anchored protein homolog
[Mus musculus]
Length = 656
Score = 37.1 bits (84), Expect = 0.32
Identities = 30/114 (26%), Positives = 38/114 (33%), Gaps = 4/114 (3%)
Query: 361 QYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNA--GAGNGAHPNKKSPNPNTRNTP 418
Q P+ + S + S V G S G R N G NG P+ NTP
Sbjct: 535 QQPPQQNTGFPRSSQPYYNSRGVSRGGSRGARGLMNGYRGPANGFRGGYDGYRPSFSNTP 594
Query: 419 GQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRG 472
+S + P + + D F GQ G P GG NRG
Sbjct: 595 NSGYSQSQFTAPRDYSGYQRDGYQQNFK--RGSGQSGPRGAPRGRGGPPRPNRG 646
>gi|3879921|emb|CAA98545.1| (Z74043) predicted using Genefinder;
Weak similarity to Drosphila nuclear hormpn receptor
FTZ-F1 (SW:FTF1_DROME); cDNA EST EMBL:C07270 comes from
this gene; cDNA EST EMBL:C08578 comes from this gene;
cDNA EST EMBL:C12993 come...
Length = 243
Score = 36.7 bits (83), Expect = 0.41
Identities = 29/98 (29%), Positives = 35/98 (35%), Gaps = 16/98 (16%)
Query: 393 NHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPY--GN 450
N+ N AGN ++ KK N N +N Q ++ P P P F P G
Sbjct: 22 NNNNKKAGNNSNNQKKGNNQNNKNNKNQNGGNRNQNRQHHQP--PMMQMVPPFGPMGPGG 79
Query: 451 PGQKTGAGRPN------------NSGGKYNRNRGPRYP 476
PG G PN N GK N NR P
Sbjct: 80 PGGPGFMGPPNGNMNFFNGQNRRNGSGKNNGNRNGAPP 117
>gi|969095 (U31961) no-on transient A-like protein [Drosophila
melanogaster]
Length = 642
Score = 36.4 bits (82), Expect = 0.54
Identities = 26/86 (30%), Positives = 37/86 (42%), Gaps = 4/86 (4%)
Query: 393 NHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSP-YKYPNNAPNFPSDHAT-PTFNPY-G 449
N +AG G PN + + G+Q + P ++ PN P+ +A N Y G
Sbjct: 139 NELSAGGGGQNQPNHSNKGQGNQGDQGEQGNQGPNFRGRGGGPNQPNQNANQEQSNGYPG 198
Query: 450 NPG-QKTGAGRPNNSGGKYNRNRGPR 474
N G K G G+ GGK+ R R
Sbjct: 199 NQGDNKGGQGQRGAGGGKHQRGNRSR 224
>gi|2497729|sp|Q49537|VLPE_MYCHR VARIANT SURFACE ANTIGEN E PRECURSOR
(VLPE PROLIPOPROTEIN) >gi|1039437 (U35016) VlpE
prolipoprotein [Mycoplasma hyorhinis]
>gi|1583723|prf||2121355B Vlp surface protein
[Mycoplasma hyorhinis]
Length = 243
Score = 36.4 bits (82), Expect = 0.54
Identities = 22/108 (20%), Positives = 40/108 (36%), Gaps = 1/108 (0%)
Query: 366 NDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKS 425
N + G+ ++ S P+G + P N + N + +P N T
Sbjct: 75 NQSGSASGNGSSNSSVSTPDGQHSNPSNPTTSDPKESNPSNPTTSDPKESNPSNPTTSDG 134
Query: 426 PYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGP 473
+ P+N + P+ NP + GQ + P S G+++ P
Sbjct: 135 QHSNPSNPTTSDPKESNPS-NPTTSDGQHSNPSNPTTSDGQHSNPSNP 181
>gi|2689219|emb|CAA67983.1| (X99669) dynamin like protein
[Dictyostelium discoideum]
Length = 853
Score = 36.4 bits (82), Expect = 0.54
Identities = 28/128 (21%), Positives = 44/128 (33%), Gaps = 22/128 (17%)
Query: 354 QAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPN-- 411
Q+ PF Q + + G PA Q P ++ GP+N PN+ P+
Sbjct: 569 QSTNPFLQQQQQGQNKYPGGPPAQQQPNQQPNQLNKGPQN---------MPPNQSKPSSI 619
Query: 412 ----PNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRP------- 460
PN N + ++ + +F P+ YG + P
Sbjct: 620 PQNGPNNNNNNNNNNNRQDHQQGSFFSSFFRASPDPSLGQYGGANNSNNSNNPTSPINSS 679
Query: 461 NNSGGKYN 468
+NSG YN
Sbjct: 680 SNSGNNYN 687
>gi|4758666|ref|NP_004681.1|| LATS (large tumor suppressor,
Drosophila) homolog 1 >gi|4324434|gb|AAD16882|
(AF104413) large tumor suppressor 1 [Homo sapiens]
>gi|5738136|gb|AAD50272.1|AF164041_1 (AF164041) WARTS
protein kinase [Homo sapiens]
Length = 1130
Score = 36.4 bits (82), Expect = 0.54
Identities = 20/88 (22%), Positives = 40/88 (44%), Gaps = 1/88 (1%)
Query: 374 SPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNA 433
+P+++ + +P+ + RN N N + P ++ P + + P Q + S ++ P
Sbjct: 396 APSSYTNGSIPQSMMVPNRNSHNMELYNISVPGLQTNWPQSSSAPAQSSPSSGHEIPTWQ 455
Query: 434 PNFPSDHATPTFNPYGNPGQKTGAGRPN 461
PN P + NP GN + +P+
Sbjct: 456 PNIPV-RSNSFNNPLGNRASHSANSQPS 482
>gi|3172134 (U90209) RNA polymerase II largest subunit [Bonnemaisonia
hamifera]
Length = 1732
Score = 36.4 bits (82), Expect = 0.54
Identities = 33/131 (25%), Positives = 45/131 (34%), Gaps = 17/131 (12%)
Query: 350 AAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKS 409
AA A P QY P R + +P T S GA P+ S
Sbjct: 1591 AAQSPAQSPGVQYSPDKSRVQVQRAPPTAPS-------------AAGGGASRSYSPSSPS 1637
Query: 410 PNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPY----GNPGQKTGAGRPNNSGG 465
N +PG + Y ++P S + F+P G T A N +
Sbjct: 1638 YNGRGAASPGANYVAASPGYSPSSPGAYSPSSPAAFSPSSPAAGGYSPSTPAYTANAAAN 1697
Query: 466 KYNRNRGPRYP 476
+Y+ R PRYP
Sbjct: 1698 QYSYARSPRYP 1708
>gi|112945|sp|P14196|AAC2_DICDI AAC-RICH MRNA CLONE AAC11 PROTEIN
>gi|84120|pir||S05355 hypothetical protein (clone AAC11)
- slime mold (Dictyostelium discoideum) (fragment)
>gi|7174|emb|CAA34529| (X16522) coding region (AA 448)
[Dictyostelium discoideum]
Length = 448
Score = 36.4 bits (82), Expect = 0.54
Identities = 51/214 (23%), Positives = 79/214 (36%), Gaps = 33/214 (15%)
Query: 261 TQVEALRTQLKLEKDMPLALTLQPIIGQRRSAKATLHVNRNDNNKHAYHNNHST--ADES 318
T + L ++ + +P QPI + ++N N+NN + +NN+++ S
Sbjct: 95 TNLNGLSLAIQNQSSLP-----QPINNNNNNNNNNSNINNNNNNSNNNNNNNNSNLGINS 149
Query: 319 RELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATF 378
+ D R RG + R E K + PK D EG+P
Sbjct: 150 SPTQSSANSADKRSRG------RPRKNPPSEPKDTSGPKRKRGRPPKMD---EEGNP--- 197
Query: 379 QSWYVPEGVSTGPRNH--RNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNF 436
Q VP+ S R + + N + NT TP ++ R P K +P+
Sbjct: 198 QPKPVPQPGSNKKRGRPKKPKDENESDYNNTSFSDSNTDGTPKKRGR--PPKAKGESPS- 254
Query: 437 PSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
A+PT N GN G NN+ N N
Sbjct: 255 ----ASPTHNTLGN-----GILNSNNNNNNNNNN 279
>gi|4324420|gb|AAD16881| (AF104350) prespore-specific protein
[Dictyostelium discoideum]
Length = 1231
Score = 36.4 bits (82), Expect = 0.54
Identities = 32/150 (21%), Positives = 54/150 (35%), Gaps = 27/150 (18%)
Query: 322 RRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKND-RSLSEGSPATFQS 380
R F +LRDD G KH+++ ++ + A + K K N+ +L+ +P
Sbjct: 781 RLFGSLRDDIG----KHNYQQNASLFFDFATFLSKKSNKNLGDINNLNNLNNNNP----- 831
Query: 381 WYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDH 440
+ P N+ PN + NPN N + NN N +++
Sbjct: 832 -------NNNPNNN----------PNNNNNNPNNNNNNNNNNNNNNNNNNNNNNNNNNNN 874
Query: 441 ATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
+N + N NN+ N N
Sbjct: 875 NNTNYNNFNNTNNNNNNSNKNNNNNNNNNN 904
>gi|1706637|sp|Q99372|ELS_RAT ELASTIN PRECURSOR (TROPOELASTIN)
>gi|2144809|pir||EART elastin precursor - rat >gi|207445
(M60647) tropoelastin [Rattus norvegicus]
Length = 864
Score = 36.0 bits (81), Expect = 0.71
Identities = 31/96 (32%), Positives = 35/96 (36%), Gaps = 9/96 (9%)
Query: 373 GSPATFQSWYVPEGVSTGPR-NHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPN 431
G+ A F V GV TG + G G GA P PG P YP
Sbjct: 202 GTGARFPGVGVLPGVPTGTGVKAKVPGGGGGAFSGIPGVGPFGGQQPGV-----PLGYPI 256
Query: 432 NAPNFPSDHATPTFN---PYGNPGQKTGAGRPNNSG 464
AP P + P N PYG G AG P +G
Sbjct: 257 KAPKLPGGYGLPYTNGKLPYGVAGAGGKAGYPTGTG 292
>gi|4220540|emb|CAA23013| (AL035356) hypothetical protein
[Arabidopsis thaliana]
Length = 319
Score = 36.0 bits (81), Expect = 0.71
Identities = 44/190 (23%), Positives = 68/190 (35%), Gaps = 29/190 (15%)
Query: 303 NNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQY 362
+N A +NH +S E +RFD D FK T + + +H+
Sbjct: 40 SNPLAETSNHQ--QDSFETQRFDYYTDPMAAYSS---FKKNKTPKQQYISSPSHQGSSPV 94
Query: 363 KPKNDRSLSEGSPAT-----------FQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPN 411
P+ S+ GS + + Y P G++ +HR AG N P
Sbjct: 95 PPQFPPSVPPGSLCSEYQAQTNHGGFHAAHYEPRGMAHLSPSHRGPPAGWN---NNFRPP 151
Query: 412 PNTRNTPGQQTRKSPYKYPNNAPNFPSD---------HATPTFNPYGNPGQKTGAGRPNN 462
P + P Q + P+ + PN ++ + P F+ YG G N
Sbjct: 152 PVNHSGPPQWVPR-PFPFSQEMPNMGNNRFGGRGSYNNTPPQFSNYGRQNANWGGNTYPN 210
Query: 463 SGGKYNRNRG 472
SG +R RG
Sbjct: 211 SGRGRSRGRG 220
>gi|106291|pir||S16681 homeotic protein - human
Length = 316
Score = 35.6 bits (80), Expect = 0.93
Identities = 36/146 (24%), Positives = 60/146 (40%), Gaps = 19/146 (13%)
Query: 330 DRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVST 389
D+G G+ +D E + H P +Q +P S + + ++ P G +T
Sbjct: 169 DKGSGRRLRTLRDSDPEEDEDEDDEDHFPLQQRRPW---STASSDCSVGRTGIAPRGPAT 225
Query: 390 GPRNHRNAGAGNGAHPNKKSPNP-NTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPY 448
PR R+ A + + P + +P P + PG T P P + A P P+
Sbjct: 226 SPRPSRSPAAQDRSRPARSAPGPAASPGGPGAWTH----------PARPREQARPP--PH 273
Query: 449 GNPGQKTGAG--RPNNSGGKYNRNRG 472
G P + GAG R + G++ +G
Sbjct: 274 G-PLAQAGAGGIRRGSGPGRFPFKQG 298
>gi|1078825|pir||S52850 cytoplasmic intermediate filament protein -
common roundworm >gi|763066|emb|CAA60045| (X86088)
cytoplasmic intermediate filament protein [Ascaris
lumbricoides]
Length = 610
Score = 35.6 bits (80), Expect = 0.93
Identities = 31/110 (28%), Positives = 53/110 (48%), Gaps = 10/110 (9%)
Query: 164 RVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTDSHDWFRVVVKEGRNREVR 223
R +PE E+ ++EL + D A++D I + TD W+++ V+E + + R
Sbjct: 294 RDTTPENREYFKNELASAI------RDIRAEYDQICNVNRTDMESWYKLKVQEIQTQSTR 347
Query: 224 RLWESQGCQVSRLKRTRYGSVLLPREL--LRGQSTELPKTQVEALRTQLK 271
+ E Q +KR R L +L L G+++ L K Q + L QL+
Sbjct: 348 QNLE-QNYAKEEVKRLRVQLTDLRGKLADLEGRNSLLEK-QTQELNYQLE 395
>gi|1170758|sp|P38486|LEG3_CANFA GALECTIN-3 (GALACTOSE-SPECIFIC
LECTIN 3) (MAC-2 ANTIGEN) (IGE-BINDING PROTEIN) (35 KD
LECTIN) (CARBOHYDRATE BINDING PROTEIN 35) (CBP 35)
(LAMININ-BINDING PROTEIN) (LECTIN L-29)
Length = 296
Score = 35.6 bits (80), Expect = 0.93
Identities = 36/121 (29%), Positives = 43/121 (34%), Gaps = 21/121 (17%)
Query: 366 NDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGN-------GAHPNKKSPN--PNTRN 416
ND G+P Q W GP ++ AGAG GA+P + P P
Sbjct: 8 NDALSGSGNPNP-QGW-------PGPWGNQPAGAGGYPGASYPGAYPGQAPPGGYPGQAP 59
Query: 417 TPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNS-GGKYNRNRGPRY 475
G + P YP AP P P G PGQ G P + G Y P Y
Sbjct: 60 PGGYPGQAPPGGYPGQAPPGGYPGQAP---PGGYPGQAPPGGYPGQAPPGTYPGPTAPAY 116
Query: 476 P 476
P
Sbjct: 117 P 117
>gi|1724014|sp|P54604|YHCT_BACSU HYPOTHETICAL 33.7 KD PROTEIN IN
CSPB-GLPP INTERGENIC REGION >gi|1239996|emb|CAA65704.1|
(X96983) hypothetical protein [Bacillus subtilis]
>gi|2633244|emb|CAB12749| (Z99108) similar to
hypothetical proteins [Bacillus subtilis]
Length = 302
Score = 35.6 bits (80), Expect = 0.93
Identities = 28/91 (30%), Positives = 44/91 (47%), Gaps = 11/91 (12%)
Query: 29 LHKVLAQAGLGSRRALEQRISNGLIKVNGDIAQLGMSVKSGDKIELDGRSFVASA----- 83
L VL A S+ ++ +S+ IKVN + M VK GD++ +D + AS+
Sbjct: 21 LFSVLKTALKASKPVIQDWMSHQQIKVNHESVLNNMIVKKGDRVFIDLQESEASSVIPEY 80
Query: 84 -----LTEPARVLIYNKPEGEVTTREDPEGR 109
L E +LI NKP G + T + +G+
Sbjct: 81 GELDILFEDNHMLIINKPAG-IATHPNEDGQ 110
>gi|1184072 (U40766) COL-1 [Meloidogyne incognita]
Length = 309
Score = 35.2 bits (79), Expect = 1.2
Identities = 24/76 (31%), Positives = 35/76 (45%), Gaps = 4/76 (5%)
Query: 405 PNKKSPNPNTRNTPGQ--QTRKSPYKYPNNAPNFPSDHATPTF-NPYGNPGQKTGAGRPN 461
P +S NP +PGQ +++++P + P P P P G PGQ G G+P
Sbjct: 122 PPGRSGNPGAPGSPGQPGKSQQAPCQQVTTPPCKPCPGGQPGQPGPPGPPGQPGGPGQPG 181
Query: 462 NSGGKYNRNR-GPRYP 476
GG+ + GP P
Sbjct: 182 QPGGQASPGEPGPAGP 197
>gi|1589837 (U68729) cuticle preprocollagen [Meloidogyne incognita]
Length = 308
Score = 35.2 bits (79), Expect = 1.2
Identities = 24/76 (31%), Positives = 35/76 (45%), Gaps = 4/76 (5%)
Query: 405 PNKKSPNPNTRNTPGQ--QTRKSPYKYPNNAPNFPSDHATPTF-NPYGNPGQKTGAGRPN 461
P +S NP +PGQ +++++P + P P P P G PGQ G G+P
Sbjct: 121 PPGRSGNPGAPGSPGQPGKSQQAPCQQVTTPPCKPCPGGQPGQPGPPGPPGQPGGPGQPG 180
Query: 462 NSGGKYNRNR-GPRYP 476
GG+ + GP P
Sbjct: 181 QPGGQASPGEPGPAGP 196
>gi|2981221 (AF053091) eyelid [Drosophila melanogaster]
Length = 2715
Score = 35.2 bits (79), Expect = 1.2
Identities = 33/125 (26%), Positives = 46/125 (36%), Gaps = 12/125 (9%)
Query: 348 GEAAAKQAHKPFK--QYKPKNDRSLSEGSPATFQSWYVPEGVS---TGPRNHRNAGAGNG 402
G++ Q + P + QY P N + P + + P S G N +GA G
Sbjct: 640 GQSPGAQGYPPQQPQQYPPGNYPPRPQYPPGAYATGPPPPPTSQAGAGGANSMPSGAQAG 699
Query: 403 AHPNKKSPNPNTRNTPGQQTRKSPYK-YPNNAP------NFPSDHATPTFNPYGNPGQKT 455
+P + PN + P Q SP + P AP N TP G P
Sbjct: 700 GYPGRGMPNHTGQYPPYQWVPPSPQQTVPGGAPGGAMVGNHVQGKGTPPPPVVGGPPPPQ 759
Query: 456 GAGRP 460
G+G P
Sbjct: 760 GSGSP 764
Score = 32.5 bits (72), Expect = 8.1
Identities = 32/106 (30%), Positives = 42/106 (39%), Gaps = 15/106 (14%)
Query: 373 GSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNN 432
G P + P V P+ H AG +P S P TP +T SP YP+
Sbjct: 1311 GGPPPAPQQHGPGQVPPSPQQHVRPAAG-APYPPGGSGYP----TPVSRTPGSP--YPSQ 1363
Query: 433 APNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKY---NRNRGPRY 475
+ ++ +N G PGQ G G G+Y NRN P Y
Sbjct: 1364 PGAYGQYGSSDQYNATGPPGQPFGQG-----PGQYPPQNRNMYPPY 1404
Score = 32.5 bits (72), Expect = 8.1
Identities = 28/102 (27%), Positives = 40/102 (38%), Gaps = 14/102 (13%)
Query: 384 PEGVSTGPRNHRNAGAGNG---AHPNKKSPNP--NTRNTPGQQTRKSPYKYPNNAPNFPS 438
P G GP + AG +P ++ P P + P QQ ++ PY+ P
Sbjct: 1537 PPGAPHGPPIQQPAGVAQWDQHRYPPQQGPPPPPQQQQQPQQQQQQPPYQQVAGPPGQQP 1596
Query: 439 DHATPTFNPYGNPGQKTGAG--------RPNNSGGKYNRNRG 472
A P + NPGQ +G RP + G+ NR G
Sbjct: 1597 PQAPPQWAQM-NPGQTAQSGIAPPGSPLRPPSGPGQQNRMPG 1637
>gi|3242649|dbj|BAA29028.1| (AB015440) alpha 1 type I collagen [Rana
catesbeiana]
Length = 1445
Score = 35.2 bits (79), Expect = 1.2
Identities = 31/90 (34%), Positives = 35/90 (38%), Gaps = 11/90 (12%)
Query: 384 PEGVSTGPRNHRNA----GAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPN--NAPNFP 437
P+G S GP + A GA A P + NP T PG K P AP FP
Sbjct: 342 PQG-SRGPDGPQGARGEPGAPGQAGPAGSAGNPGTDGQPGA---KGATGAPGIAGAPGFP 397
Query: 438 SDHATP-TFNPYGNPGQKTGAGRPNNSGGK 466
P P G+PG K G P G K
Sbjct: 398 GARGAPGPQGPGGSPGPKGNNGEPGAQGNK 427
>gi|3879922|emb|CAA98546.1| (Z74043) cDNA EST yk310e2.3 comes from
this gene; cDNA EST yk310e2.5 comes from this gene
[Caenorhabditis elegans]
Length = 125
Score = 35.2 bits (79), Expect = 1.2
Identities = 23/78 (29%), Positives = 31/78 (39%), Gaps = 4/78 (5%)
Query: 393 NHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPY--GN 450
N+ N AGN ++ KK N N +N Q ++ P P P F P G
Sbjct: 22 NNNNKKAGNNSNNQKKGNNQNNKNNKNQNGGNRNQNRQHHQP--PMMQMVPPFGPMGPGG 79
Query: 451 PGQKTGAGRPNNSGGKYN 468
PG G PN + +N
Sbjct: 80 PGGPGFMGPPNGNMNFFN 97
>gi|4808164|emb|CAB42795.1| (Y18878) largest subunit of the RNA
polymerase II complex [Drosophila guanche]
Length = 1889
Score = 35.2 bits (79), Expect = 1.2
Identities = 41/151 (27%), Positives = 55/151 (36%), Gaps = 26/151 (17%)
Query: 350 AAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGP-----RNHRNAGAGNGAH 404
+AA A + P + S S SPA S Y P S P + A + GA
Sbjct: 1537 SAASDASGMSPSWSPAHPGS-SPSSPAPSMSPYYPASPSVSPSYSPTSPNYTASSPGGAS 1595
Query: 405 PNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNF--------PS----DHATPTFNPYGN-P 451
PN +PN T SP +Y + PNF PS +P ++P N P
Sbjct: 1596 PNYSPSSPNYSPTSPLYAAPSP-RYASTTPNFNPQSTGYSPSASGYSPTSPVYSPTSNFP 1654
Query: 452 GQKTGAG------RPNNSGGKYNRNRGPRYP 476
+ AG P N+ + N P P
Sbjct: 1655 SSPSFAGSGSNMYSPGNAYSPSSTNYSPNSP 1685
>gi|4808166|emb|CAB42797.1| (Y18879) largest subunit of the RNA
polymerase II complex [Drosophila pseudoobscura]
Length = 1811
Score = 35.2 bits (79), Expect = 1.2
Identities = 41/151 (27%), Positives = 55/151 (36%), Gaps = 26/151 (17%)
Query: 350 AAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGP-----RNHRNAGAGNGAH 404
+AA A + P + S S SPA S Y P S P + A + GA
Sbjct: 1459 SAASDASGMSPSWSPAHPGS-SPSSPAPSMSPYYPASPSVSPSYSPTSPNYTASSPGGAS 1517
Query: 405 PNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNF--------PS----DHATPTFNPYGN-P 451
PN +PN T SP +Y + PNF PS +P ++P N P
Sbjct: 1518 PNYSPSSPNYSPTSPLYAAASP-RYASTTPNFNPQSTGYSPSASGYSPTSPVYSPTSNFP 1576
Query: 452 GQKTGAG------RPNNSGGKYNRNRGPRYP 476
+ AG P N+ + N P P
Sbjct: 1577 SSPSFAGSGSNMYSPGNAYSPSSTNYSPNSP 1607
>gi|735898 (L40992) core-binding factor, runt domain, alpha subunit
1 [Homo sapiens]
Length = 440
Score = 34.8 bits (78), Expect = 1.6
Identities = 49/229 (21%), Positives = 87/229 (37%), Gaps = 23/229 (10%)
Query: 247 PRELLRGQSTELPKTQVEALRTQLKLEKDMPLALTLQPIIGQRRSAKATLHVNRNDNNKH 306
P EL+R T+ P L + + K +P+A + + G+ + ND N
Sbjct: 51 PAELVR---TDSPNFLCSVLPSHWRCNKTLPVAFKVVAL-GEVPDGTVVTVMAGNDENYS 106
Query: 307 AYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFK-----Q 361
A N S ++ ++ RF+ LR G+GK + H+ K
Sbjct: 107 AELRNASAVMKN-QVARFNDLRFVGRSGRGKSFTLTITVFTNPPQVATYHRAIKVTVDGP 165
Query: 362 YKPKNDRS-LSEGSPATFQSWYVPEG--------VSTGPRNHRNAGAGNGAHPNKKSPNP 412
+P+ R L + P+ F G V P+N R + + P+ +P
Sbjct: 166 REPRRHRQKLDDSKPSLFSDRLSDLGRIPHPSMRVGVPPQNPRPS---LNSAPSPFNPQG 222
Query: 413 NTRNTPGQQTRKSP-YKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRP 460
++ T +Q + SP + Y + P++ S +P+ + G G P
Sbjct: 223 QSQITDPRQAQSSPPWSYDQSYPSYLSQMTSPSIHSTTPLSSTRGTGLP 271
>gi|1280114|gb|AAC25888.1| (U55373) Similar to cuticular collagen;
coded for by C. elegans cDNA yk92h9.3; coded for by C.
elegans cDNA yk100f8.5; coded for by C. elegans cDNA
yk123h6.5; coded for by C. elegans cDNA yk125b5.5; coded
for by C. elegans cDN...
Length = 289
Score = 34.8 bits (78), Expect = 1.6
Identities = 28/84 (33%), Positives = 33/84 (38%), Gaps = 12/84 (14%)
Query: 384 PEGVSTGPRNHRNAGA-GNGAHPNKKSPN--PNTRNTPGQQTRKSPYKYPNNAPNFPSDH 440
P G GP + G GN P P P T PGQ R P P P
Sbjct: 176 PPGGPGGPGEGGDGGRPGNPGRPGPAGPRGEPGTEYKPGQPGRPGP-------PG-PRGE 227
Query: 441 ATPTFNPYGNPGQKTGAGRPNNSG 464
A P P G+PG +G+P N+G
Sbjct: 228 AGPAGQP-GSPGNDGESGKPGNAG 250
>gi|2245687|gb|AAB62574.1| (AF010325) CHIP [Drosophila melanogaster]
Length = 577
Score = 34.8 bits (78), Expect = 1.6
Identities = 24/100 (24%), Positives = 38/100 (38%), Gaps = 7/100 (7%)
Query: 382 YVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHA 441
Y P G S+ P +++ N P +P + P + P N N+P
Sbjct: 65 YPPAGQSS-PAGNQSIVFQNSNQPGSNTPQYTSSPAPSGSSTPGPVGAQNIPGNYPQSAT 123
Query: 442 TPTFN-----PYGNPGQKTGA-GRPNNSGGKYNRNRGPRY 475
FN P+G+P G RP +SG +N + +
Sbjct: 124 AGNFNGPVGGPFGSPSSGLGQFSRPASSGTPFNSGQAGHF 163
>gi|2290720|gb|AAB65158.1| (AF001450) core binding factor alpha1
subunit [Homo sapiens]
Length = 507
Score = 34.8 bits (78), Expect = 1.6
Identities = 49/229 (21%), Positives = 87/229 (37%), Gaps = 23/229 (10%)
Query: 247 PRELLRGQSTELPKTQVEALRTQLKLEKDMPLALTLQPIIGQRRSAKATLHVNRNDNNKH 306
P EL+R T+ P L + + K +P+A + + G+ + ND N
Sbjct: 96 PAELVR---TDSPNFLCSVLPSHWRCNKTLPVAFKVVAL-GEVPDGTVVTVMAGNDENYS 151
Query: 307 AYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFK-----Q 361
A N S ++ ++ RF+ LR G+GK + H+ K
Sbjct: 152 AELRNASAVMKN-QVARFNDLRFVGRSGRGKSFTLTITVFTNPPQVATYHRAIKVTVDGP 210
Query: 362 YKPKNDRS-LSEGSPATFQSWYVPEG--------VSTGPRNHRNAGAGNGAHPNKKSPNP 412
+P+ R L + P+ F G V P+N R + + P+ +P
Sbjct: 211 REPRRHRQKLDDSKPSLFSDRLSDLGRIPHPSMRVGVPPQNPRPS---LNSAPSPFNPQG 267
Query: 413 NTRNTPGQQTRKSP-YKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRP 460
++ T +Q + SP + Y + P++ S +P+ + G G P
Sbjct: 268 QSQITDPRQAQSSPPWSYDQSYPSYLSQMTSPSIHSTTPLSSTRGTGLP 316
>gi|2245693|gb|AAB62577.1| (AF010328) short form of CHIP [Drosophila
melanogaster]
Length = 365
Score = 34.8 bits (78), Expect = 1.6
Identities = 24/100 (24%), Positives = 38/100 (38%), Gaps = 7/100 (7%)
Query: 382 YVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHA 441
Y P G S+ P +++ N P +P + P + P N N+P
Sbjct: 65 YPPAGQSS-PAGNQSIVFQNSNQPGSNTPQYTSSPAPSGSSTPGPVGAQNIPGNYPQSAT 123
Query: 442 TPTFN-----PYGNPGQKTGA-GRPNNSGGKYNRNRGPRY 475
FN P+G+P G RP +SG +N + +
Sbjct: 124 AGNFNGPVGGPFGSPSSGLGQFSRPASSGTPFNSGQAGHF 163
>gi|2293472 (AF010284) Osf2 [Mus musculus]
Length = 596
Score = 34.8 bits (78), Expect = 1.6
Identities = 49/229 (21%), Positives = 87/229 (37%), Gaps = 23/229 (10%)
Query: 247 PRELLRGQSTELPKTQVEALRTQLKLEKDMPLALTLQPIIGQRRSAKATLHVNRNDNNKH 306
P EL+R T+ P L + + K +P+A + + G+ + ND N
Sbjct: 185 PAELVR---TDSPNFLCSVLPSHWRCNKTLPVAFKVVAL-GEVPDGTVVTVMAGNDENYS 240
Query: 307 AYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFK-----Q 361
A N S ++ ++ RF+ LR G+GK + H+ K
Sbjct: 241 AELRNASAVMKN-QVARFNDLRFVGRSGRGKSFTLTITVFTNPPQVATYHRAIKVTVDGP 299
Query: 362 YKPKNDRS-LSEGSPATFQSWYVPEG--------VSTGPRNHRNAGAGNGAHPNKKSPNP 412
+P+ R L + P+ F G V P+N R + + P+ +P
Sbjct: 300 REPRRHRQKLDDSKPSLFSDRLSDLGRIPHPSMRVGVPPQNPRPS---LNSAPSPFNPQG 356
Query: 413 NTRNTPGQQTRKSP-YKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRP 460
++ T +Q + SP + Y + P++ S +P+ + G G P
Sbjct: 357 QSQITDPRQAQSSPPWSYDQSYPSYLSQMTSPSIHSTTPLSSTRGTGLP 405
>gi|2580612|gb|AAB82419.1| (AF005936) PEBP2alphaA major til-1
isoform [Mus musculus]
Length = 528
Score = 34.8 bits (78), Expect = 1.6
Identities = 49/229 (21%), Positives = 87/229 (37%), Gaps = 23/229 (10%)
Query: 247 PRELLRGQSTELPKTQVEALRTQLKLEKDMPLALTLQPIIGQRRSAKATLHVNRNDNNKH 306
P EL+R T+ P L + + K +P+A + + G+ + ND N
Sbjct: 117 PAELVR---TDSPNFLCSVLPSHWRCNKTLPVAFKVVAL-GEVPDGTVVTVMAGNDENYS 172
Query: 307 AYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFK-----Q 361
A N S ++ ++ RF+ LR G+GK + H+ K
Sbjct: 173 AELRNASAVMKN-QVARFNDLRFVGRSGRGKSFTLTITVFTNPPQVATYHRAIKVTVDGP 231
Query: 362 YKPKNDRS-LSEGSPATFQSWYVPEG--------VSTGPRNHRNAGAGNGAHPNKKSPNP 412
+P+ R L + P+ F G V P+N R + + P+ +P
Sbjct: 232 REPRRHRQKLDDSKPSLFSDRLSDLGRIPHPSMRVGVPPQNPRPS---LNSAPSPFNPQG 288
Query: 413 NTRNTPGQQTRKSP-YKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRP 460
++ T +Q + SP + Y + P++ S +P+ + G G P
Sbjct: 289 QSQITDPRQAQSSPPWSYDQSYPSYLSQMTSPSIHSTTPLSSTRGTGLP 337
>gi|3877422|emb|CAB05195.1| (Z82268) predicted using Genefinder;
similar to CUTICLE COLLAGEN 34; cDNA EST EMBL:D65629
comes from this gene; cDNA EST EMBL:D68754 comes from
this gene; cDNA EST EMBL:D68791 comes from this gene;
cDNA EST EMBL:D68988 comes ...
Length = 304
Score = 34.8 bits (78), Expect = 1.6
Identities = 26/85 (30%), Positives = 33/85 (38%), Gaps = 8/85 (9%)
Query: 384 PEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATP 443
P+G S P N AGA P + + + PGQ + P P +P P P
Sbjct: 190 PKGASGAPGNPGQAGAPG--QPGADAQSESIPGAPGQAGPQGP-PGPAGSPGAPGGPGQP 246
Query: 444 TFNPYGNPGQKTGAGRPNNSGGKYN 468
G PGQK +G P G N
Sbjct: 247 -----GAPGQKGPSGAPGQPGADGN 266
>gi|4324436|gb|AAD16883| (AF104414) large tumor suppressor 1 [Mus
musculus]
Length = 962
Score = 34.8 bits (78), Expect = 1.6
Identities = 30/134 (22%), Positives = 52/134 (38%), Gaps = 8/134 (5%)
Query: 334 GQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLS------EGSPATFQSWYVPEGV 387
G G+ F V + +Q P+ P N +S S +P +F + VP+ +
Sbjct: 183 GGGQSDFIVHQNVPTGSVTRQPPPPYP-LTPANGQSPSALQTGASAAPPSFANGNVPQSM 241
Query: 388 STGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNP 447
RN N N P ++ P + + P Q + ++ P PN P + NP
Sbjct: 242 MVPNRNSHNMELYNINVPGLQTAWPQSSSAPAQSSPSGGHEIPTWQPNIPV-RSNSFNNP 300
Query: 448 YGNPGQKTGAGRPN 461
G+ + +P+
Sbjct: 301 LGSRASHSANSQPS 314
>gi|539914|pir||A48233 polyomavirus enhancer-binding protein 2 alpha
chain type 1 - mouse >gi|391767|dbj|BAA03485| (D14636)
PEBP2a1 protein [Mus musculus]
Length = 513
Score = 34.8 bits (78), Expect = 1.6
Identities = 49/229 (21%), Positives = 87/229 (37%), Gaps = 23/229 (10%)
Query: 247 PRELLRGQSTELPKTQVEALRTQLKLEKDMPLALTLQPIIGQRRSAKATLHVNRNDNNKH 306
P EL+R T+ P L + + K +P+A + + G+ + ND N
Sbjct: 102 PAELVR---TDSPNFLCSVLPSHWRCNKTLPVAFKVVAL-GEVPDGTVVTVMAGNDENYS 157
Query: 307 AYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFK-----Q 361
A N S ++ ++ RF+ LR G+GK + H+ K
Sbjct: 158 AELRNASAVMKN-QVARFNDLRFVGRSGRGKSFTLTITVFTNPPQVATYHRAIKVTVDGP 216
Query: 362 YKPKNDRS-LSEGSPATFQSWYVPEG--------VSTGPRNHRNAGAGNGAHPNKKSPNP 412
+P+ R L + P+ F G V P+N R + + P+ +P
Sbjct: 217 REPRRHRQKLDDSKPSLFSDRLSDLGRIPHPSMRVGVPPQNPRPS---LNSAPSPFNPQG 273
Query: 413 NTRNTPGQQTRKSP-YKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRP 460
++ T +Q + SP + Y + P++ S +P+ + G G P
Sbjct: 274 QSQITDPRQAQSSPPWSYDQSYPSYLSQMTSPSIHSTTPLSSTRGTGLP 322
>gi|5724787|gb|AAB65159.2| (AF001450) core binding factor alpha1
subunit [Homo sapiens]
Length = 521
Score = 34.8 bits (78), Expect = 1.6
Identities = 49/229 (21%), Positives = 87/229 (37%), Gaps = 23/229 (10%)
Query: 247 PRELLRGQSTELPKTQVEALRTQLKLEKDMPLALTLQPIIGQRRSAKATLHVNRNDNNKH 306
P EL+R T+ P L + + K +P+A + + G+ + ND N
Sbjct: 110 PAELVR---TDSPNFLCSVLPSHWRCNKTLPVAFKVVAL-GEVPDGTVVTVMAGNDENYS 165
Query: 307 AYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFK-----Q 361
A N S ++ ++ RF+ LR G+GK + H+ K
Sbjct: 166 AELRNASAVMKN-QVARFNDLRFVGRSGRGKSFTLTITVFTNPPQVATYHRAIKVTVDGP 224
Query: 362 YKPKNDRS-LSEGSPATFQSWYVPEG--------VSTGPRNHRNAGAGNGAHPNKKSPNP 412
+P+ R L + P+ F G V P+N R + + P+ +P
Sbjct: 225 REPRRHRQKLDDSKPSLFSDRLSDLGRIPHPSMRVGVPPQNPRPS---LNSAPSPFNPQG 281
Query: 413 NTRNTPGQQTRKSP-YKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRP 460
++ T +Q + SP + Y + P++ S +P+ + G G P
Sbjct: 282 QSQITDPRQAQSSPPWSYDQSYPSYLSQMTSPSIHSTTPLSSTRGTGLP 330
>gi|120943|sp|P13816|GARP_PLAFF GLUTAMIC ACID-RICH PROTEIN PRECURSOR
>gi|627054|pir||A54514 glutamic acid-rich protein
precursor - Plasmodium falciparum
>gi|160299|gb|AAA29605.1| (J03998) glutamic acid-rich
protein [Plasmodium falciparum]
Length = 678
Score = 34.4 bits (77), Expect = 2.1
Identities = 41/175 (23%), Positives = 69/175 (39%), Gaps = 18/175 (10%)
Query: 266 LRTQLKLEKDMPLALTLQPIIGQRRSAKATLHVNRNDNNKHAYHNNH--STADESRELRR 323
L + +LEK+ + ++ + + K + NDN K+A++NN S+ D + +
Sbjct: 49 LLNETELEKNKDDNSKSETLLKEEKDEKDDVPTTSNDNLKNAHNNNEISSSTDPTNIINV 108
Query: 324 FD----TLRDDRGRGQGKHHFKDRLTV-----SGEAAAKQAHKPFKQYKPKNDRSLSEGS 374
D D + + K H KD+ E K+ K K+ K K D+ E S
Sbjct: 109 NDKDNENSVDKKKDKKEKKHKKDKKEKKEKKDKKEKKDKKEKKHKKEKKHKKDKKKKENS 168
Query: 375 PATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKY 429
S Y TG +NA + +++ + N G SPY+Y
Sbjct: 169 EV--MSLY-----KTGQHKPKNATEHGEENLDEEMVSEINNNAQGGLLLSSPYQY 216
>gi|437331 (L23429) beta-galactosides-binding lectin [Canis
familiaris]
Length = 285
Score = 34.4 bits (77), Expect = 2.1
Identities = 30/97 (30%), Positives = 36/97 (36%), Gaps = 13/97 (13%)
Query: 390 GPRNHRNAGAGN-------GAHPNKKSPN--PNTRNTPGQQTRKSPYKYPNNAPNFPSDH 440
GP ++ AGAG GA+P + P P G + P YP AP
Sbjct: 13 GPWGNQPAGAGGYPGASYPGAYPGQAPPGGYPGQAPPGGYPGQAPPGGYPGQAPPGGYPG 72
Query: 441 ATPTFNPYGNPGQKTGAGRPNNS-GGKYNRNRGPRYP 476
P P G PGQ G P + G Y P YP
Sbjct: 73 QAP---PGGYPGQAPPGGYPGQAPPGTYPGPTAPAYP 106
>gi|163002 (M19372) elastin [Bos taurus]
Length = 707
Score = 34.4 bits (77), Expect = 2.1
Identities = 30/100 (30%), Positives = 34/100 (34%), Gaps = 13/100 (13%)
Query: 373 GSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNN 432
G+ A F V GV TG A G GA P PG P YP
Sbjct: 111 GAGARFPGIGVLPGVPTGAGVKPKAPGGGGAFAGIPGVGPFGGQQPGV-----PLGYPIK 165
Query: 433 APNFPSDHATP--------TFNPYGNPGQKTGAGRPNNSG 464
AP P+ + P F P G G AG P +G
Sbjct: 166 APKLPAGYGLPYKTGKLPYGFGPGGVAGSAGKAGYPTGTG 205
>gi|163004 (M19372) elastin-cBEL2 [Bos taurus]
Length = 679
Score = 34.4 bits (77), Expect = 2.1
Identities = 30/100 (30%), Positives = 34/100 (34%), Gaps = 13/100 (13%)
Query: 373 GSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNN 432
G+ A F V GV TG A G GA P PG P YP
Sbjct: 111 GAGARFPGIGVLPGVPTGAGVKPKAPGGGGAFAGIPGVGPFGGQQPGV-----PLGYPIK 165
Query: 433 APNFPSDHATP--------TFNPYGNPGQKTGAGRPNNSG 464
AP P+ + P F P G G AG P +G
Sbjct: 166 APKLPAGYGLPYKTGKLPYGFGPGGVAGSAGKAGYPTGTG 205
>gi|71823|pir||FGHUA fibrinogen alpha chain precursor, short splice
form - human >gi|182426 (J00128) A-alpha fibrinogen
[Homo sapiens] >gi|458554 (M64982) common fibrinogen
alpha chain [Homo sapiens] >gi|4033511 (M58569)
fibrinogen alpha subunit [Homo sapiens]
Length = 644
Score = 34.4 bits (77), Expect = 2.1
Identities = 32/114 (28%), Positives = 48/114 (42%), Gaps = 18/114 (15%)
Query: 365 KNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNT--PGQQT 422
+N S G AT++ G STG N ++G G+ + N SP P + T PG
Sbjct: 308 RNPGSSGTGGTATWKPGSSGPG-STGSWNSGSSGTGSTGNQNPGSPRPGSTGTWNPGSSE 366
Query: 423 RKSPYKYPNNAPNFPSDHATPTFNPYGNPGQ---KTGAGRPNNSGGKYNRNRGP 473
R S + H T + G+ GQ ++G+ RP++ G R P
Sbjct: 367 RGS------------AGHWTSESSVSGSTGQWHSESGSFRPDSPGSGNARPNNP 408
>gi|4503689|ref|NP_000499.1|| fibrinogen, A alpha polypeptide
>gi|1706799|sp|P02671|FIBA_HUMAN FIBRINOGEN
ALPHA/ALPHA-E CHAIN PRECURSOR >gi|2135107|pir||D44234
fibrinogen alpha chain precursor, extended splice form -
human >gi|182407 (M58569) fibrinogen alpha subunit
precursor [Homo sapiens] >gi|458555 (M64982) fibrinogen
alpha-E chain [Homo sapiens]
Length = 866
Score = 34.4 bits (77), Expect = 2.1
Identities = 32/114 (28%), Positives = 48/114 (42%), Gaps = 18/114 (15%)
Query: 365 KNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNT--PGQQT 422
+N S G AT++ G STG N ++G G+ + N SP P + T PG
Sbjct: 308 RNPGSSGTGGTATWKPGSSGPG-STGSWNSGSSGTGSTGNQNPGSPRPGSTGTWNPGSSE 366
Query: 423 RKSPYKYPNNAPNFPSDHATPTFNPYGNPGQ---KTGAGRPNNSGGKYNRNRGP 473
R S + H T + G+ GQ ++G+ RP++ G R P
Sbjct: 367 RGS------------AGHWTSESSVSGSTGQWHSESGSFRPDSPGSGNARPNNP 408
>gi|1082938|pir||A49688 lactose-binding lectin L-29 - dog
Length = 294
Score = 34.4 bits (77), Expect = 2.1
Identities = 30/97 (30%), Positives = 36/97 (36%), Gaps = 13/97 (13%)
Query: 390 GPRNHRNAGAGN-------GAHPNKKSPN--PNTRNTPGQQTRKSPYKYPNNAPNFPSDH 440
GP ++ AGAG GA+P + P P G + P YP AP
Sbjct: 22 GPWGNQPAGAGGYPGASYPGAYPGQAPPGGYPGQAPPGGYPGQAPPGGYPGQAPPGGYPG 81
Query: 441 ATPTFNPYGNPGQKTGAGRPNNS-GGKYNRNRGPRYP 476
P P G PGQ G P + G Y P YP
Sbjct: 82 QAP---PGGYPGQAPPGGYPGQAPPGTYPGPTAPAYP 115
>gi|3810839|emb|CAA21800| (AL032684) conserved hypothetical
zinc-finger protein [Schizosaccharomyces pombe]
Length = 482
Score = 34.4 bits (77), Expect = 2.1
Identities = 39/155 (25%), Positives = 62/155 (39%), Gaps = 7/155 (4%)
Query: 281 TLQPIIGQRRSAKATLHVNRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHF 340
TL P ++R +A + N+K++ + T+D++ R+D G + F
Sbjct: 330 TLNPDYQKQREIEAVVKSVLGSNSKNS--DKVGTSDDNNTPMSEKRKREDDD-ANGPNKF 386
Query: 341 KDRLT-VSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGA 399
R + V +A A+ A K +G PA F + +P G+ P NA A
Sbjct: 387 AARSSAVFSKATAEPAFKSAMAIPDMPSMPHVQGFPAPFPPFMMP-GLPQMPPMMMNAIA 445
Query: 400 GNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAP 434
G H N+ P N+R P + P N P
Sbjct: 446 GQVYHNNRNPPRTNSR--PSNASVPPPSSLHKNPP 478
>gi|3834293 (U80846) No definition line found [Caenorhabditis
elegans]
Length = 1032
Score = 34.4 bits (77), Expect = 2.1
Identities = 27/98 (27%), Positives = 40/98 (40%), Gaps = 8/98 (8%)
Query: 369 SLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNT-RNTPGQQTRKSPY 427
S GS T QS + G + + + P+ +SP PNT TP Q + +SP
Sbjct: 564 STVSGSTGTSQSTLASSTATPGSSS--TVPSSSSPQPSSQSPAPNTGSTTPSQTSSQSPS 621
Query: 428 KYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGG 465
N + + P+ + T P G+ A P S G
Sbjct: 622 PSMNPSSSTPTGSSQSTITPEGST-----ASSPTGSTG 654
>gi|3834294 (U80846) No definition line found [Caenorhabditis
elegans]
Length = 2232
Score = 34.4 bits (77), Expect = 2.1
Identities = 27/98 (27%), Positives = 40/98 (40%), Gaps = 8/98 (8%)
Query: 369 SLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNT-RNTPGQQTRKSPY 427
S GS T QS + G + + + P+ +SP PNT TP Q + +SP
Sbjct: 564 STVSGSTGTSQSTLASSTATPGSSS--TVPSSSSPQPSSQSPAPNTGSTTPSQTSSQSPS 621
Query: 428 KYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGG 465
N + + P+ + T P G+ A P S G
Sbjct: 622 PSMNPSSSTPTGSSQSTITPEGST-----ASSPTGSTG 654
>gi|3851592 (AF093575) surface antigen ariel1 [Entamoeba
histolytica]
Length = 215
Score = 34.4 bits (77), Expect = 2.1
Identities = 25/102 (24%), Positives = 40/102 (38%), Gaps = 3/102 (2%)
Query: 365 KNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRK 424
+ D+ S S S P+ S N + N + NK PN ++ N P + +
Sbjct: 47 EEDKKSSSNSELDENSNNQPDESSNNKPNESSDNKPNESSDNK--PNESSNNKPSESSNN 104
Query: 425 SPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGK 466
P + NN PN SD+ P + P + + +S K
Sbjct: 105 KPDESSNNKPNESSDN-KPNESSNNKPNESSNNKPSESSNNK 145
Score = 33.2 bits (74), Expect = 4.7
Identities = 22/80 (27%), Positives = 35/80 (43%), Gaps = 4/80 (5%)
Query: 384 PEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATP 443
P+ S N + N + NK PN ++ N P + + P + NN PN SD+ P
Sbjct: 106 PDESSNNKPNESSDNKPNESSNNK--PNESSNNKPSESSNNKPDESSNNKPNESSDN-KP 162
Query: 444 TFNPYGNPGQKTGAGRPNNS 463
+ P + + +PN S
Sbjct: 163 NESSNNKPNESSD-NKPNES 181
>gi|1280073 (U55366) Similar to cuticle collagen [Caenorhabditis
elegans]
Length = 310
Score = 34.0 bits (76), Expect = 2.7
Identities = 29/102 (28%), Positives = 39/102 (37%), Gaps = 13/102 (12%)
Query: 384 PEGVSTGP----RNHRNAGAG----NGAHPNKKSPN--PNTRNTPGQQTRKSPYKYPNNA 433
P+G GP R+ + AG + + P + PN P R PGQ P
Sbjct: 195 PKGAPGGPGQPGRDGQPGQAGQPGSSSSEPGQPGPNGQPGPRGPPGQAGSPGGNGQPGG- 253
Query: 434 PNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRY 475
P P + GN GQ G+P SGG + P+Y
Sbjct: 254 PGQPGQRGSD--GQPGNDGQPGAPGQPGQSGGSGEKGICPKY 293
>gi|2501678|sp|Q45480|YLYB_BACSU HYPOTHETICAL 33.7 KD PROTEIN IN
LSP-PYRR INTERGENIC REGION (ORF-X) >gi|1373157 (U48870)
orf-X; hypothetical protein; Method: conceptual
translation supplied by author
Length = 303
Score = 34.0 bits (76), Expect = 2.7
Identities = 28/94 (29%), Positives = 44/94 (46%), Gaps = 12/94 (12%)
Query: 18 TATEAPKLEERLHKVLAQAGLG-SRRALEQRISNGLIKVNGDIAQLGMSVKSGDKI---- 72
TA+E K ER+ K LA SR ++Q + +G + VNG + ++ GD++
Sbjct: 7 TASEEQK-SERIDKFLASTENDWSRTQVQQWVKDGQVVVNGSAVKANYKIQPGDQVTVTV 65
Query: 73 -ELDGRSFVASALT-----EPARVLIYNKPEGEV 100
E + +A + E VL+ NKP G V
Sbjct: 66 PEPEALDVLAEPMDLDIYYEDQDVLVVNKPRGMV 99
>gi|2633919|emb|CAB13420| (Z99112) alternate gene name: ylmL;
similar to hypothetical proteins [Bacillus subtilis]
Length = 273
Score = 34.0 bits (76), Expect = 2.7
Identities = 28/94 (29%), Positives = 44/94 (46%), Gaps = 12/94 (12%)
Query: 18 TATEAPKLEERLHKVLAQAGLG-SRRALEQRISNGLIKVNGDIAQLGMSVKSGDKI---- 72
TA+E K ER+ K LA SR ++Q + +G + VNG + ++ GD++
Sbjct: 7 TASEEQK-SERIDKFLASTENDWSRTQVQQWVKDGQVVVNGSAVKANYKIQPGDQVTVTV 65
Query: 73 -ELDGRSFVASALT-----EPARVLIYNKPEGEV 100
E + +A + E VL+ NKP G V
Sbjct: 66 PEPEALDVLAEPMDLDIYYEDQDVLVVNKPRGMV 99
>gi|5901942|ref|NP_008971.1|| E1B-55kDa-associated protein 5
>gi|3319956|emb|CAA07548| (AJ007509)
E1B-55kDa-associated protein [Homo sapiens]
Length = 856
Score = 34.0 bits (76), Expect = 2.7
Identities = 24/75 (32%), Positives = 34/75 (45%), Gaps = 8/75 (10%)
Query: 405 PNKKSPNPN---TRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPN 461
P + P P+ RN PG T Y +N P ++ +TPT + Y +P Q + + P
Sbjct: 708 PQQPPPPPSYSPARNPPGAST----YNKNSNIPGSSANTSTPTVSSY-SPPQPSYSQPPY 762
Query: 462 NSGGKYNRNRGPRYP 476
N GG GP P
Sbjct: 763 NQGGYSQGYTGPPPP 777
>gi|3924913|emb|CAB03474.1| (Z81138) predicted using Genefinder;
cDNA EST EMBL:D65543 comes from this gene
[Caenorhabditis elegans]
Length = 304
Score = 34.0 bits (76), Expect = 2.7
Identities = 30/104 (28%), Positives = 40/104 (37%), Gaps = 17/104 (16%)
Query: 384 PEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPN-----NAPNFPS 438
P+G P N AGA P + + + ++PG + P P AP P
Sbjct: 190 PKGAPGAPGNPGQAGA-----PGQPGSDAQSESSPGAPGQAGPQGPPGPAGSPGAPGGPG 244
Query: 439 DHATP-TFNPYGNPGQ-----KTGA-GRPNNSGGKYNRNRGPRY 475
P P G PGQ GA G+P SGG + P+Y
Sbjct: 245 QAGAPGPKGPSGAPGQPGADGNPGAPGQPGQSGGAGEKGICPKY 288
>gi|3924914|emb|CAB03475.1| (Z81138) predicted using Genefinder;
cDNA EST EMBL:D69494 comes from this gene; cDNA EST
EMBL:D69317 comes from this gene [Caenorhabditis
elegans]
Length = 304
Score = 34.0 bits (76), Expect = 2.7
Identities = 30/104 (28%), Positives = 40/104 (37%), Gaps = 17/104 (16%)
Query: 384 PEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPN-----NAPNFPS 438
P+G P N AGA P + + + ++PG + P P AP P
Sbjct: 190 PKGAPGAPGNPGQAGA-----PGQPGSDAQSESSPGAPGQAGPQGPPGPAGSPGAPGGPG 244
Query: 439 DHATP-TFNPYGNPGQ-----KTGA-GRPNNSGGKYNRNRGPRY 475
P P G PGQ GA G+P SGG + P+Y
Sbjct: 245 QAGAPGPKGPSGAPGQPGADGNPGAPGQPGQSGGAGEKGICPKY 288
>gi|6580246|emb|CAB63321.1| (Z81138) cDNA EST EMBL:D65528 comes from
this gene [Caenorhabditis elegans]
Length = 304
Score = 34.0 bits (76), Expect = 2.7
Identities = 30/104 (28%), Positives = 40/104 (37%), Gaps = 17/104 (16%)
Query: 384 PEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPN-----NAPNFPS 438
P+G P N AGA P + + + ++PG + P P AP P
Sbjct: 190 PKGAPGAPGNPGQAGA-----PGQPGSDAQSESSPGAPGQAGPQGPPGPAGSPGAPGGPG 244
Query: 439 DHATP-TFNPYGNPGQ-----KTGA-GRPNNSGGKYNRNRGPRY 475
P P G PGQ GA G+P SGG + P+Y
Sbjct: 245 QAGAPGPKGPSGAPGQPGADGNPGAPGQPGQSGGAGEKGICPKY 288
>gi|628112|pir||S32988 hypothetical protein - human herpesvirus 4
Length = 512
Score = 33.6 bits (75), Expect = 3.6
Identities = 35/128 (27%), Positives = 56/128 (43%), Gaps = 18/128 (14%)
Query: 326 TLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPE 385
T RG+ +G+ + R G++ KQ KP ++P+ + S SP+ +PE
Sbjct: 361 TQGQSRGQSRGRGRGRGRGRGKGKSRDKQ-RKPGGPWRPEPNTS----SPS------MPE 409
Query: 386 GVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYP--NNAPN----FPSD 439
+S H+ GAG+ P + P RN+ SP P +N+P FP D
Sbjct: 410 -LSPVLGLHQGQGAGDSPTPGPSNAAPVCRNSHTATPNVSPIHEPESHNSPEAPILFPDD 468
Query: 440 HATPTFNP 447
P+ +P
Sbjct: 469 WYPPSIDP 476
>gi|3880306|emb|CAA90995.1| (Z54238) similar to cuticle collagen;
cDNA EST EMBL:T01150 comes from this gene; cDNA EST
EMBL:D33882 comes from this gene; cDNA EST EMBL:D65956
comes from this gene; cDNA EST EMBL:D66123 comes from
this gene; cDNA EST EMBL:D...
>gi|3880308|emb|CAA90997.1| (Z54238) similar to cuticle
collagen [Caenorhabditis elegans]
Length = 299
Score = 33.6 bits (75), Expect = 3.6
Identities = 23/76 (30%), Positives = 30/76 (39%), Gaps = 1/76 (1%)
Query: 391 PRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGN 450
P N GA P+ + NP PGQ + +P + P+ A P P G
Sbjct: 174 PGNDGQPGAPGNKGPSGPNGNPGAPGAPGQPGQDAPSEPITPGAPGPAGPAGPQ-GPPGA 232
Query: 451 PGQKTGAGRPNNSGGK 466
PGQ G+P G K
Sbjct: 233 PGQPGHDGQPGAPGPK 248
>gi|2833307|sp|Q21184|YXWK_CAEEL PUTATIVE CUTICLE COLLAGEN F55C10.3
>gi|3877662|emb|CAA98487.1| (Z74036) similar to collagen
[Caenorhabditis elegans]
Length = 266
Score = 33.6 bits (75), Expect = 3.6
Identities = 31/100 (31%), Positives = 37/100 (37%), Gaps = 10/100 (10%)
Query: 384 PEGVSTGPRNHRNAGA-GNGAHPNKKSP-----NPNTRNTPGQQTRKSPYKYPNNAPNFP 437
P G + G N + G G P K P NP PGQ + +P + P P
Sbjct: 128 PSGDAGGNGNPGSPGQDGQPGAPGNKGPSGPNGNPGAPGAPGQPGQDAPSE--PITPGAP 185
Query: 438 SDHATP-TFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRYP 476
TP P G PGQ G+P G K N P P
Sbjct: 186 GPQGTPGPQGPPGQPGQPGHDGQPGAPGPK-GPNGNPGQP 224
>gi|2144807|pir||EABO elastin precursor, splice form a - bovine
Length = 747
Score = 33.6 bits (75), Expect = 3.6
Identities = 30/100 (30%), Positives = 34/100 (34%), Gaps = 13/100 (13%)
Query: 373 GSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNN 432
G+ A F V GV TG A G GA P PG P YP
Sbjct: 166 GAGARFPGIGVLPGVPTGAGVKPKAQVGAGAFAGIPGVGPFGGQQPGL-----PLGYPIK 220
Query: 433 APNFPSDHATP--------TFNPYGNPGQKTGAGRPNNSG 464
AP P+ + P F P G G AG P +G
Sbjct: 221 APKLPAGYGLPYKTGKLPYGFGPGGVAGSAGKAGYPTGTG 260
>gi|3877661|emb|CAA98486.1| (Z74036) predicted using Genefinder;
similar to collagen [Caenorhabditis elegans]
Length = 299
Score = 33.6 bits (75), Expect = 3.6
Identities = 31/100 (31%), Positives = 37/100 (37%), Gaps = 10/100 (10%)
Query: 384 PEGVSTGPRNHRNAGA-GNGAHPNKKSP-----NPNTRNTPGQQTRKSPYKYPNNAPNFP 437
P G + G N + G G P K P NP PGQ + +P + P P
Sbjct: 161 PSGDAGGNGNPGSPGQDGQPGAPGNKGPSGPNGNPGAPGAPGQPGQDAPSE--PITPGAP 218
Query: 438 SDHATP-TFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRYP 476
TP P G PGQ G+P G K N P P
Sbjct: 219 GPQGTPGPQGPPGQPGQPGHDGQPGAPGPK-GPNGNPGQP 257
>gi|5833486|gb|AAD53528.1| (AF162149) variable surface lipoprotein
[Mycoplasma bovis]
Length = 202
Score = 33.6 bits (75), Expect = 3.6
Identities = 27/89 (30%), Positives = 39/89 (43%), Gaps = 9/89 (10%)
Query: 385 EGVSTGPRNHR--NAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHAT 442
+G T P N G G A+P++ +P + TP + +P P P D T
Sbjct: 105 QGTPTNPDQGTPANPGQGTPANPDQGTPTNPDQGTPANPGQGTPANPDQGTPANP-DQGT 163
Query: 443 PTFNPYGNPGQKTGAGRPNNSGGKYNRNR 471
PT NPGQ T A +P+ S + N +
Sbjct: 164 PT-----NPGQGTPA-KPHFSPEEENAEK 186
>gi|119293|sp|P04985|ELS_BOVIN ELASTIN PRECURSOR (TROPOELASTIN)
>gi|163020 (J02717) elastin a precursor [Bos taurus]
Length = 747
Score = 33.6 bits (75), Expect = 3.6
Identities = 30/100 (30%), Positives = 34/100 (34%), Gaps = 13/100 (13%)
Query: 373 GSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNN 432
G+ A F V GV TG A G GA P PG P YP
Sbjct: 166 GAGARFPGIGVLPGVPTGAGVKPKAQVGAGAFAGIPGVGPFGGQQPGL-----PLGYPIK 220
Query: 433 APNFPSDHATP--------TFNPYGNPGQKTGAGRPNNSG 464
AP P+ + P F P G G AG P +G
Sbjct: 221 APKLPAGYGLPYKTGKLPYGFGPGGVAGSAGKAGYPTGTG 260
>gi|543543|pir||JN0690 glutenin, high-molecular-weight Bx7 chain
precursor - wheat
Length = 791
Score = 33.6 bits (75), Expect = 3.6
Identities = 28/136 (20%), Positives = 45/136 (32%), Gaps = 12/136 (8%)
Query: 334 GQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRN 393
GQG+ H + +Q + P +P + L +G P Y P G +
Sbjct: 139 GQGQQHQQP-------GQRQQGYYPTSPQQPGQGQQLGQGQPG-----YYPTSQQPGQKQ 186
Query: 394 HRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQ 453
G +G P ++ GQQ + Y +P P GQ
Sbjct: 187 QAGQGQQSGQGQQGYYPTSPQQSGQGQQPGQGQPGYYPTSPQQSGQWQQPGQGQQPGQGQ 246
Query: 454 KTGAGRPNNSGGKYNR 469
++G G+ G+ R
Sbjct: 247 QSGQGQQGQQPGQGQR 262
>gi|119111|sp|P12978|EBN2_EBV EBNA-2 NUCLEAR PROTEIN
>gi|1083964|pir||S42442 EBNA2 protein - human
herpesvirus 4 >gi|1632787|emb|CAA24877.1| (V01555)
BYRF1, encodes EBNA-2 (Dambaugh et al, 1984; Dillner et
al, 1984) [Human herpesvirus 4]
Length = 487
Score = 33.6 bits (75), Expect = 3.6
Identities = 35/128 (27%), Positives = 56/128 (43%), Gaps = 18/128 (14%)
Query: 326 TLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPE 385
T RG+ +G+ + R G++ KQ KP ++P+ + S SP+ +PE
Sbjct: 336 TQGQSRGQSRGRGRGRGRGRGKGKSRDKQ-RKPGGPWRPEPNTS----SPS------MPE 384
Query: 386 GVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYP--NNAPN----FPSD 439
+S H+ GAG+ P + P RN+ SP P +N+P FP D
Sbjct: 385 -LSPVLGLHQGQGAGDSPTPGPSNAAPVCRNSHTATPNVSPIHEPESHNSPEAPILFPDD 443
Query: 440 HATPTFNP 447
P+ +P
Sbjct: 444 WYPPSIDP 451
>gi|628185|pir||S42447 nuclear protein EBNA2 - Epstein-Barr virus
>gi|330444 (K03333) nuclear protein EBNA2 [Epstein-Barr
virus]
Length = 490
Score = 33.6 bits (75), Expect = 3.6
Identities = 35/128 (27%), Positives = 56/128 (43%), Gaps = 18/128 (14%)
Query: 326 TLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPE 385
T RG+ +G+ + R G++ KQ KP ++P+ + S SP+ +PE
Sbjct: 339 TQGQSRGQSRGRGRGRGRGRGKGKSRDKQ-RKPGGPWRPEPNTS----SPS------MPE 387
Query: 386 GVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYP--NNAPN----FPSD 439
+S H+ GAG+ P + P RN+ SP P +N+P FP D
Sbjct: 388 -LSPVLGLHQGQGAGDSPTPGPSNAAPVCRNSHTATPNVSPIHEPESHNSPEAPILFPDD 446
Query: 440 HATPTFNP 447
P+ +P
Sbjct: 447 WYPPSIDP 454
>gi|282331|pir||S28037 penicillin-binding protein 1a - Streptococcus
pneumoniae (strain 63915) (fragment)
>gi|47418|emb|CAA48072| (X67872) penicillin-binding
protein 1a [Streptococcus pneumoniae]
Length = 719
Score = 33.2 bits (74), Expect = 4.7
Identities = 30/109 (27%), Positives = 43/109 (38%), Gaps = 16/109 (14%)
Query: 370 LSEGSPATFQSWYVPEGVSTGPRNHRNAGA--GNGAHPNKKSPNPNTRNTPGQQTRKSPY 427
LSEGS + W +PEG+ +RN NGA SP P + + S
Sbjct: 625 LSEGSNP--EDWNIPEGL------YRNGEFVFKNGARSTWNSPAPQQPPSTESSSSSSDS 676
Query: 428 KYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRYP 476
++ PS + + T NP N Q N + + N+N P P
Sbjct: 677 STSQSSSTTPSTNNSTTTNPNNNTQQS------NTTPDQQNQNPQPAQP 719
>gi|585647|sp|Q04707|PBPA_STRPN PENICILLIN-BINDING PROTEIN 1A
(PBP-1A) (EXPORTED PROTEIN 2) >gi|282329|pir||S28038
penicillin-binding protein 1a - Streptococcus pneumoniae
(strain 45607) (fragment) >gi|47420|emb|CAA48073|
(X67873) penicillin-binding protein 1a [Streptococcus
pneumoniae]
Length = 719
Score = 33.2 bits (74), Expect = 4.7
Identities = 30/109 (27%), Positives = 43/109 (38%), Gaps = 16/109 (14%)
Query: 370 LSEGSPATFQSWYVPEGVSTGPRNHRNAGA--GNGAHPNKKSPNPNTRNTPGQQTRKSPY 427
LSEGS + W +PEG+ +RN NGA SP P + + S
Sbjct: 625 LSEGSNP--EDWNIPEGL------YRNGEFVFKNGARSTWSSPAPQQPPSTESSSSSSDS 676
Query: 428 KYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRYP 476
++ PS + + T NP N Q N + + N+N P P
Sbjct: 677 STSQSSSTTPSTNNSTTTNPNNNTQQS------NTTPDQQNQNPQPAQP 719
>gi|538257 (L23872) intermediate filament protein [Onchocerca
volvulus]
Length = 542
Score = 33.2 bits (74), Expect = 4.7
Identities = 31/110 (28%), Positives = 54/110 (48%), Gaps = 10/110 (9%)
Query: 164 RVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTDSHDWFRVVVKEGRNREVR 223
R + E E+ ++EL + D A++D I + TD W+++ V+E + + R
Sbjct: 227 RDTTSENREYFKNELSSAI------RDIRAEYDQICNMNRTDMESWYKLKVQEIQTQSTR 280
Query: 224 RLWESQGCQVSRLKRTRYGSVLLPREL--LRGQSTELPKTQVEALRTQLK 271
+ E QG +KR R L +L L G+++ L K Q++ L QL+
Sbjct: 281 QNLE-QGYAKEEVKRLRVQLSDLRGKLADLEGRNSLLEK-QMQELNYQLE 328
>gi|556343 (L31528) intermediate filament protein [Onchocerca
volvulus] >gi|914120|bbs|164183 (S77073)
OVIF=intermediate-filament protein [Onchocerca volvulus,
host=human, Peptide, 613 aa] [Onchocerca volvulus]
Length = 613
Score = 33.2 bits (74), Expect = 4.7
Identities = 31/110 (28%), Positives = 54/110 (48%), Gaps = 10/110 (9%)
Query: 164 RVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTDSHDWFRVVVKEGRNREVR 223
R + E E+ ++EL + D A++D I + TD W+++ V+E + + R
Sbjct: 298 RDTTSENREYFKNELSSAI------RDIRAEYDQICNMNRTDMESWYKLKVQEIQTQSTR 351
Query: 224 RLWESQGCQVSRLKRTRYGSVLLPREL--LRGQSTELPKTQVEALRTQLK 271
+ E QG +KR R L +L L G+++ L K Q++ L QL+
Sbjct: 352 QNLE-QGYAKEEVKRLRVQLSDLRGKLADLEGRNSLLEK-QMQELNYQLE 399
>gi|400698|sp|P31732|OV71_ONCVO MUSCLE CELL INTERMEDIATE FILAMENT
PROTEIN OV71 >gi|283569|pir||S26432 intermediate
filament protein Ov71, muscle - nematode (Onchocerca
volvulus) (fragment) >gi|9771|emb|CAA48561| (X68558)
Ov71 muscle cell intermediate filament [Onchocerca
volvulus]
Length = 432
Score = 33.2 bits (74), Expect = 4.7
Identities = 31/110 (28%), Positives = 54/110 (48%), Gaps = 10/110 (9%)
Query: 164 RVRSPEGEEHVQDELLEQLTRGVMLEDGTAKFDTIERIGNTDSHDWFRVVVKEGRNREVR 223
R + E E+ ++EL + D A++D I + TD W+++ V+E + + R
Sbjct: 117 RDTTSENREYFKNELSSAI------RDIRAEYDQICNMNRTDMESWYKLKVQEIQTQSTR 170
Query: 224 RLWESQGCQVSRLKRTRYGSVLLPREL--LRGQSTELPKTQVEALRTQLK 271
+ E QG +KR R L +L L G+++ L K Q++ L QL+
Sbjct: 171 QNLE-QGYAKEEVKRLRVQLSDLRGKLADLEGRNSLLEK-QMQELNYQLE 218
>gi|418972|pir||S31035 retrovirus-related gag polyprotein - mouse
intracisternal A-particle MIAD8 (fragment)
Length = 717
Score = 33.2 bits (74), Expect = 4.7
Identities = 19/52 (36%), Positives = 27/52 (51%), Gaps = 3/52 (5%)
Query: 375 PATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSP 426
PA QS Y+P+ S+GPR+ GN + P + R+ PG+ TR P
Sbjct: 402 PADSQSAYMPKNGSSGPRSQGPQRYGN---QFVEDPGSSQRDDPGRPTRVEP 450
>gi|4503493|ref|NP_001955.1|| early growth response 1
>gi|119242|sp|P18146|EGR1_HUMAN EARLY GROWTH RESPONSE
PROTEIN 1 (EGR-1) (KROX24) (ZIF268) (TRANSCRIPTION
FACTOR ETR103) (ZINC FINGER PROTEIN 225) (AT225)
>gi|87347|pir||A41211 early growth response protein 1 -
human >gi|31130|emb|CAA36777| (X52541) early growth
response protein 1 (AA 1-543) [Homo sapiens] >gi|182263
(M62829) ETR103 [Homo sapiens]
>gi|5420379|emb|CAB46678.1| (AJ243425) early growth
response protein 1 [Homo sapiens]
Length = 543
Score = 33.2 bits (74), Expect = 4.7
Identities = 24/84 (28%), Positives = 37/84 (43%), Gaps = 4/84 (4%)
Query: 356 HKPFKQYKPKNDRS--LSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPN 413
H P PK + LS G+P + PEG + + + G G G + S + +
Sbjct: 25 HSPTMDNYPKLEEMMLLSNGAPQFLGAAGAPEGSGSNSSSSSSGGGGGGGGGSNSSSSSS 84
Query: 414 TRNTPGQQTRKSPYKYPNNAPNFP 437
T N P T + PY++ A +FP
Sbjct: 85 TFN-PQADTGEQPYEH-LTAESFP 106
>gi|2833328|sp|Q27200|FBRL_TETTH FIBRILLARIN
>gi|2654298|emb|CAA54924| (X77962) fibrillarin
[Tetrahymena thermophila]
Length = 294
Score = 33.2 bits (74), Expect = 4.7
Identities = 15/28 (53%), Positives = 17/28 (60%), Gaps = 1/28 (3%)
Query: 449 GNPGQKTGAGRPNNSGGKYNRNRGPRYP 476
G PG K G GRP GGK+ +GPR P
Sbjct: 37 GGPGGKFGGGRPGGPGGKFGA-KGPRGP 63
>gi|2707270 (AF036171) homeobox-containing protein [Dictyostelium
discoideum]
Length = 534
Score = 33.2 bits (74), Expect = 4.7
Identities = 31/179 (17%), Positives = 61/179 (33%), Gaps = 19/179 (10%)
Query: 297 HVNRNDNNKHAYHNNHSTADESRELRRFDTL-----RDDRGRGQGKHHFKDRLTVSGEAA 351
H N N+NN + Y+N +S ++ +R ++ + H D
Sbjct: 285 HNNNNNNNSNNYNNGNSNSNNNRNNNNNYNYNNYINNNNYNNNNNRQHCDDE-------- 336
Query: 352 AKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPN 411
++ + F N+ + + + Y + + N+ N + N + N
Sbjct: 337 -EEDEQYFNNNNNNNNNNNNNRISDSSDDQYFSDDTNNNNDNYNNGNNNYNNNNNNNNFN 395
Query: 412 PNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRN 470
N N + Y NN ++ + + FN N + NN+ +YN N
Sbjct: 396 NNYMNNYNNNYNNNNY---NNNNSYNNSNGNNNFNNNNNNNNQN--NNNNNNNNQYNNN 449
>gi|3319463 (AF077544) unknown [Caenorhabditis elegans]
Length = 235
Score = 33.2 bits (74), Expect = 4.7
Identities = 22/75 (29%), Positives = 26/75 (34%), Gaps = 2/75 (2%)
Query: 397 AGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTG 456
+G NG HPN PN N N + PT +PY N G G
Sbjct: 132 SGYNNGPHPNGNFPNLNGYNNGPSSFNGGNTNVDDGIKGSVGAAVEPTKSPYPNNGY--G 189
Query: 457 AGRPNNSGGKYNRNR 471
G N G + NR
Sbjct: 190 YGNRNGYGNNFGFNR 204
>gi|3879844|emb|CAB04725.1| (Z81592) predicted using Genefinder;
cDNA EST EMBL:M89005 comes from this gene
[Caenorhabditis elegans]
Length = 695
Score = 33.2 bits (74), Expect = 4.7
Identities = 25/92 (27%), Positives = 34/92 (36%), Gaps = 1/92 (1%)
Query: 384 PEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNT-PGQQTRKSPYKYPNNAPNFPSDHAT 442
P S+G +RN G GN + NK S N N N G Y N+ +F +
Sbjct: 539 PPPRSSGANGNRNGGGGNRRNNNKNSSNSNNNNNFNGNGNGDGSYNNNNDNCDFENRCGG 598
Query: 443 PTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPR 474
GN Q+ + + N N G R
Sbjct: 599 QGGFENGNENQRFSSRKQPPPKPSANNNNGDR 630
Score = 32.8 bits (73), Expect = 6.2
Identities = 26/83 (31%), Positives = 33/83 (39%), Gaps = 7/83 (8%)
Query: 398 GAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGA 457
G GNG P P P + ++T NN N + + P P N G G
Sbjct: 458 GGGNGPVPPIPEPKPLCKGLKFKKTANGGGGNNNNNNNNNNRNNGP---PPRNNGNNNGN 514
Query: 458 GRP----NNSGGKYNRNRGPRYP 476
GRP ++SG NR GP P
Sbjct: 515 GRPMKPPSSSGSGSNRRSGPPPP 537
>gi|4140029|dbj|BAA36973.1| (AB015438) alpha 1 type I collagen
[Cynops pyrrhogaster]
Length = 1450
Score = 33.2 bits (74), Expect = 4.7
Identities = 26/87 (29%), Positives = 29/87 (32%), Gaps = 10/87 (11%)
Query: 390 GPRNHRNAGAGNGAHPNKKSPNP-------NTRNTPGQQTRKSPYKYPN--NAPNFPSDH 440
GP+ R + GA +P P T GQ K P AP FP
Sbjct: 342 GPQGSRGSEGPQGARGEPGAPGPAGAAGPSGNPGTDGQPGGKGATGSPGIAGAPGFPGAR 401
Query: 441 ATP-TFNPYGNPGQKTGAGRPNNSGGK 466
P P G PG K G P G K
Sbjct: 402 GAPGPQGPAGAPGPKGNNGEPGAQGNK 428
>gi|5410459|gb|AAD43067.1|AF139884_1 (AF139884) penicillin-binding
protein 1a [Streptococcus pneumoniae]
>gi|5410461|gb|AAD43068.1|AF139885_1 (AF139885)
penicillin-binding protein 1a [Streptococcus pneumoniae]
>gi|5410463|gb|AAD43069.1|AF139886_1 (AF139886)
penicillin-binding protein 1a [Streptococcus pneumoniae]
Length = 719
Score = 33.2 bits (74), Expect = 4.7
Identities = 30/109 (27%), Positives = 43/109 (38%), Gaps = 16/109 (14%)
Query: 370 LSEGSPATFQSWYVPEGVSTGPRNHRNAGA--GNGAHPNKKSPNPNTRNTPGQQTRKSPY 427
LSEGS + W +PEG+ +RN NGA SP P + + S
Sbjct: 625 LSEGSNP--EDWNIPEGL------YRNGEFVFKNGARSTWSSPAPQQPPSTESSSSSSDS 676
Query: 428 KYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRYP 476
++ PS + + T NP N Q N + + N+N P P
Sbjct: 677 STSQSSSTTPSTNNSTTTNPNNNTQQS------NTTPDQQNQNPQPAQP 719
>gi|5815245|gb|AAD52614.1|AF175223_1 (AF175223) SANT domain protein
SMRTER [Drosophila melanogaster]
Length = 3469
Score = 33.2 bits (74), Expect = 4.7
Identities = 32/126 (25%), Positives = 52/126 (40%), Gaps = 38/126 (30%)
Query: 354 QAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPN 413
Q + +Q + + R++S GS A+ G G G +K+SP+P
Sbjct: 2462 QGQQQQQQQQQQQQRNMSRGSSAS--------------------GGGGGGGSDKESPSP- 2500
Query: 414 TRNTPGQQTRKSPYKY-------PNNAPNF-----PSDHATPTFNPYGN-PGQKTGAGRP 460
RN+ G S + Y P P + P+DH T +P+ P Q+ G +
Sbjct: 2501 -RNSVGS---ASGFAYGGDKESAPRGRPEYSSRASPADHVNSTPSPHRTPPPQRQGVIQR 2556
Query: 461 NNSGGK 466
+N+G K
Sbjct: 2557 HNTGSK 2562
>gi|5852937|gb|AAD54275.1|AF169793_1 (AF169793) HET-C protein
[Podospora anserina]
Length = 735
Score = 33.2 bits (74), Expect = 4.7
Identities = 34/153 (22%), Positives = 54/153 (35%), Gaps = 10/153 (6%)
Query: 309 HNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDR 368
H+NH A S R + G G H + ++ + +QA +Y P+
Sbjct: 576 HHNHILASNSSSSSRSVSAPSHGGNGCDHGHGRPAGSLWEQVKKQQADA---RYSPRPGS 632
Query: 369 SLSEGSPATFQSWYVPEGVST----GPRNHRNAGAGNGAHPNKKSPN---PNTRNTPGQQ 421
S S Y G + P+ + G G+GA+P ++ P G Q
Sbjct: 633 SGGGYGQRPGSSGYGSGGGGSYGRPSPQPGYSGGGGSGAYPPQQQPQYGGGGYGGPGGYQ 692
Query: 422 TRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQK 454
P+ + +P H P G PGQ+
Sbjct: 693 QPPPPHHHGQYGGGYPGQHPPPPPQGGGYPGQQ 725
>gi|6563337|gb|AAF17255.1|AF210745_1 (AF210745) penicillin-binding
protein 1A [Streptococcus pneumoniae]
Length = 719
Score = 33.2 bits (74), Expect = 4.7
Identities = 30/109 (27%), Positives = 43/109 (38%), Gaps = 16/109 (14%)
Query: 370 LSEGSPATFQSWYVPEGVSTGPRNHRNAGA--GNGAHPNKKSPNPNTRNTPGQQTRKSPY 427
LSEGS + W +PEG+ +RN NGA SP P + + S
Sbjct: 625 LSEGSNP--EDWNIPEGL------YRNGEFVFKNGARSTWNSPAPQQPPSTESSSSSSDS 676
Query: 428 KYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRYP 476
++ PS + + T NP N Q N + + N+N P P
Sbjct: 677 STSQSSSTTPSTNNSTTTNPNNNTQQS------NTTPDQQNQNPQPAQP 719
>gi|6563341|gb|AAF17257.1|AF210747_1 (AF210747) penicillin-binding
protein 1A [Streptococcus pneumoniae]
Length = 719
Score = 33.2 bits (74), Expect = 4.7
Identities = 30/109 (27%), Positives = 43/109 (38%), Gaps = 16/109 (14%)
Query: 370 LSEGSPATFQSWYVPEGVSTGPRNHRNAGA--GNGAHPNKKSPNPNTRNTPGQQTRKSPY 427
LSEGS + W +PEG+ +RN NGA SP P + + S
Sbjct: 625 LSEGSNP--EDWNIPEGL------YRNGEFVFKNGARSTWSSPAPQQPPSTESSSSSSDS 676
Query: 428 KYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRYP 476
++ PS + + T NP N Q N + + N+N P P
Sbjct: 677 STSQSSSTTPSTNNSTTTNPNNNTQQS------NTTPDQQNQNPQPAQP 719
>gi|6594249|emb|CAB63561.1| (AL031746) hypothetical garp protein
[Plasmodium falciparum]
Length = 673
Score = 33.2 bits (74), Expect = 4.7
Identities = 41/175 (23%), Positives = 68/175 (38%), Gaps = 18/175 (10%)
Query: 266 LRTQLKLEKDMPLALTLQPIIGQRRSAKATLHVNRNDNNKHAYHNNH--STADESRELRR 323
L + +LEK+ + ++ + + K + NDN K+A++NN S+ D + +
Sbjct: 49 LLNETELEKNKDDNSKSETLLKEEKDEKDDVPTTSNDNLKNAHNNNEISSSTDPTNIINV 108
Query: 324 FD----TLRDDRGRGQGKHHFKDRLTV-----SGEAAAKQAHKPFKQYKPKNDRSLSEGS 374
D D + + K H KD+ E K+ K K+ K K D+ E S
Sbjct: 109 NDKDNENSVDKKKDKKEKKHKKDKKEKKEKKDKKEKKDKKEKKHKKEKKHKKDKKKEENS 168
Query: 375 PATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKY 429
S Y TG +NA + ++ + N G SPY+Y
Sbjct: 169 EV--MSLY-----KTGQHKPKNATEHGEENLYEEMVSEINNNAQGGLLLSSPYQY 216
>gi|476822|pir||A42893 penicillin-binding protein 1A - Streptococcus
pneumoniae >gi|153768 (M90527) penicillin-binding
protein [Streptococcus pneumoniae]
Length = 719
Score = 33.2 bits (74), Expect = 4.7
Identities = 30/109 (27%), Positives = 43/109 (38%), Gaps = 16/109 (14%)
Query: 370 LSEGSPATFQSWYVPEGVSTGPRNHRNAGA--GNGAHPNKKSPNPNTRNTPGQQTRKSPY 427
LSEGS + W +PEG+ +RN NGA SP P + + S
Sbjct: 625 LSEGSNP--EDWNIPEGL------YRNGEFVFKNGARSTWSSPAPQQPPSTESSSSSSDS 676
Query: 428 KYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYNRNRGPRYP 476
++ PS + + T NP N Q N + + N+N P P
Sbjct: 677 STSQSSSTTPSTNNSTTTNPNNNTQQS------NTTPDQQNQNPQPAQP 719
>gi|6601502|gb|AAF19004.1|AF151366_1 (AF151366) arginine/serine-rich
protein [Arabidopsis thaliana]
Length = 414
Score = 33.2 bits (74), Expect = 4.7
Identities = 31/121 (25%), Positives = 46/121 (37%), Gaps = 12/121 (9%)
Query: 360 KQYKPKNDRSLSE----GSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTR 415
K+ PK+D + ++ G P + PR R+ G P ++SP+ R
Sbjct: 193 KRDAPKSDNAAADAEKDGGPRRPRETSPQRKTGLSPRR-RSPLPRRGLSPRRRSPDSPHR 251
Query: 416 NTPGQQTRKSPYKYPNNAPNFPSDHATPTFNP---YGNPGQKTGAGRPNNSGGKYNRNRG 472
PG R+ P P PS +P+ P Y +P + G P G R R
Sbjct: 252 RRPGSPIRRRGDTPPRRRPASPSRGRSPSSPPPRRYRSPPR----GSPRRIRGSPVRRRS 307
Query: 473 P 473
P
Sbjct: 308 P 308
>gi|82601|pir||A30843 glutenin high molecular weight chain Bx7
precursor - wheat >gi|21749|emb|CAA32115| (X13927) HMW
glutenin subunit (AA 1-789) [Triticum aestivum]
>gi|170745 (M22209) high MW glutenin subunit (Bx7)
[Triticum aestivum]
Length = 789
Score = 32.8 bits (73), Expect = 6.2
Identities = 28/141 (19%), Positives = 48/141 (33%), Gaps = 10/141 (7%)
Query: 334 GQGKHHFKDRLTVSGE-----AAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVS 388
GQG+ +++ G+ +Q + P +P + L +G P Y P
Sbjct: 127 GQGQQPGQEQQPGQGQQDQQPGQRQQGYYPTSPQQPGQGQQLGQGQPG-----YYPTSQQ 181
Query: 389 TGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPY 448
G + G +G P ++ GQQ + Y +P P
Sbjct: 182 PGQKQQAGQGQQSGQGQQGYYPTSPQQSGQGQQPGQGQPGYYPTSPQQSGQWQQPGQGQQ 241
Query: 449 GNPGQKTGAGRPNNSGGKYNR 469
GQ++G G+ G+ R
Sbjct: 242 PGQGQQSGQGQQGQQPGQGQR 262
>gi|543541|pir||JC2099 glutenin, high molecular weight chain Bx17 -
wheat
Length = 753
Score = 32.8 bits (73), Expect = 6.2
Identities = 28/141 (19%), Positives = 48/141 (33%), Gaps = 10/141 (7%)
Query: 334 GQGKHHFKDRLTVSGE-----AAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVS 388
GQG+ +++ G+ +Q + P +P + L +G P Y P
Sbjct: 127 GQGQQPGQEQQPGQGQQDQQPGQRQQGYYPTSPQQPGQGQQLGQGQPG-----YYPTSQQ 181
Query: 389 TGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPY 448
G + G +G P ++ GQQ + Y +P P
Sbjct: 182 PGQKQQAGQGQQSGQGQQGYYPTSPQQSGQGQQPGQGQPGYYPTSPQQSGQWQQPGQGQQ 241
Query: 449 GNPGQKTGAGRPNNSGGKYNR 469
GQ++G G+ G+ R
Sbjct: 242 PGQGQQSGQGQQGQQPGQGQR 262
>gi|330361 (M10593) major outer envelope glycoprotein gp220
[Epstein-Barr virus]
Length = 658
Score = 32.8 bits (73), Expect = 6.2
Identities = 19/67 (28%), Positives = 28/67 (41%), Gaps = 3/67 (4%)
Query: 409 SPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYN 468
SP P + SP + N + P +AT +P GQKT ++GGK N
Sbjct: 475 SPTPAGTTSGASPVTPSPSPWDNGTESTPPQNAT---SPQAPSGQKTAVPTVTSTGGKAN 531
Query: 469 RNRGPRY 475
G ++
Sbjct: 532 STTGGKH 538
>gi|2119159|pir||I50694 alpha-1 collagen type III - chicken
(fragment) >gi|537432 (U07973) alpha-1 collagen type III
[Gallus gallus]
Length = 886
Score = 32.8 bits (73), Expect = 6.2
Identities = 24/84 (28%), Positives = 27/84 (31%), Gaps = 2/84 (2%)
Query: 384 PEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATP 443
P G S P G A P P +PG + P P P P P
Sbjct: 357 PAGASGNPGERGEPGPQGQAGPPGPQGPPGRAGSPGGKGEMGPSGIPGG-PGPPGGRGLP 415
Query: 444 -TFNPYGNPGQKTGAGRPNNSGGK 466
GNPG K G P +G K
Sbjct: 416 GPPGTSGNPGAKGTPGEPGKNGAK 439
>gi|2314597|gb|AAD08465.1| (AE000642) conserved hypothetical
protein [Helicobacter pylori 26695]
Length = 84
Score = 32.8 bits (73), Expect = 6.2
Identities = 19/48 (39%), Positives = 25/48 (51%), Gaps = 1/48 (2%)
Query: 28 RLHKVLAQAGLGSRRALEQRISN-GLIKVNGDIAQLGMSVKSGDKIEL 74
R+ K L GL RR L + N G + +NG A+ VK+GD I L
Sbjct: 2 RIDKFLQSVGLVKRRVLATDMCNVGAVWLNGSCAKASKEVKAGDTISL 49
>gi|2388676 (AF015539) precollagen P [Mytilus edulis]
Length = 902
Score = 32.8 bits (73), Expect = 6.2
Identities = 25/80 (31%), Positives = 32/80 (39%), Gaps = 5/80 (6%)
Query: 402 GAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNA--PNFPSDHATPTF-NPYGNPGQKTGAG 458
G+ P + NP PG R P + P P TP G PGQ G G
Sbjct: 256 GSTPPGRLGNPGPPGQPGNPGRPGSSGRPGGSGQPGGPGRPGTPGKPGNRGQPGQPGGPG 315
Query: 459 RPNN--SGGKYNRNRGPRYP 476
+P + +GG+ RN P P
Sbjct: 316 QPGHPGAGGQPGRNGNPGNP 335
>gi|2854193 (AF045645) Similar to cuticular collagen; coded for by
C. elegans cDNA yk69e12.3; coded for by C. elegans cDNA
yk69e12.5; coded for by C. elegans cDNA yk307b3.5; coded
for by C. elegans cDNA yk307b3.3 [Caenorhabditis
elegans]
Length = 314
Score = 32.8 bits (73), Expect = 6.2
Identities = 31/91 (34%), Positives = 36/91 (39%), Gaps = 15/91 (16%)
Query: 384 PEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSD--HA 441
P G S P ++ NAGA P + TPG P P AP P HA
Sbjct: 211 PPGPSGQPGSNGNAGA-----PGAPGHVVDVPGTPGPAGPPGPAG-PAGAPGQPGQAGHA 264
Query: 442 TPTF-NPYGN------PGQKTGAGRPNNSGG 465
P P G+ PGQ AG+P N GG
Sbjct: 265 QPGQPGPQGDAGAPGAPGQPGSAGQPGNDGG 295
>gi|3875708|emb|CAA84658.1| (Z35598) Asparagine, Serine and Glycine
rich predicted protein [Caenorhabditis elegans]
>gi|3880108|emb|CAA86461.1| (Z46343) Asparagine, Serine
and Glycine rich predicted protein [Caenorhabditis
elegans]
Length = 549
Score = 32.8 bits (73), Expect = 6.2
Identities = 24/83 (28%), Positives = 32/83 (37%), Gaps = 13/83 (15%)
Query: 390 GPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYG 449
G N+RN G G+ + N + N N G Y NN + S++ N G
Sbjct: 424 GSNNNRNDGWGSSSSNNNNNNNNNNNGGTG--------GYSNNGGGWGSNN-----NNNG 470
Query: 450 NPGQKTGAGRPNNSGGKYNRNRG 472
N G + N GG N N G
Sbjct: 471 NDGNNWESNNGGNGGGGDNWNNG 493
>gi|4028688 (U91649) merozoite surface antigen 2 [Plasmodium
falciparum]
Length = 264
Score = 32.8 bits (73), Expect = 6.2
Identities = 30/109 (27%), Positives = 45/109 (40%), Gaps = 13/109 (11%)
Query: 368 RSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSP-NPNTRNTPGQQTRKSP 426
RS+ E P T S G +G A AGNGA+P + +P+T TP T +
Sbjct: 41 RSMEESKPPTGASGSAGSGSGSG----AVASAGNGANPGADAERSPSTPATPATTTTTTT 96
Query: 427 YKYPNNAPNFPSDHATPTFNPYGNPGQKTGAG-----RPNNSGGKYNRN 470
N+A + +T + NP N + G +PN + + N
Sbjct: 97 TTTTNDA---EASTSTSSENPNHNKAETNPKGKGEVQKPNQANKETQNN 142
>gi|4028690 (U91650) merozoite surface antigen 2 [Plasmodium
falciparum] >gi|6649612|gb|AAF21480.1|U91657_1 (U91657)
merozoite surface antigen 2 [Plasmodium falciparum]
>gi|6649614|gb|AAF21481.1|U91658_1 (U91658) merozoite
surface antigen 2 [Plasmodium falciparum]
Length = 184
Score = 32.8 bits (73), Expect = 6.2
Identities = 30/109 (27%), Positives = 45/109 (40%), Gaps = 13/109 (11%)
Query: 368 RSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSP-NPNTRNTPGQQTRKSP 426
RS+ E P T S G +G A AGNGA+P + +P+T TP T +
Sbjct: 20 RSMEESKPPTGASGSAGSGSGSG----AVASAGNGANPGADAERSPSTPATPATTTTTTT 75
Query: 427 YKYPNNAPNFPSDHATPTFNPYGNPGQKTGAG-----RPNNSGGKYNRN 470
N+A + +T + NP N + G +PN + + N
Sbjct: 76 TTTTNDA---EASTSTSSENPNHNKAETNPKGKGEVQKPNQANKETQNN 121
>gi|6324130|ref|NP_014200.1|GCR2| Transcription factor; Gcr2p
>gi|417039|sp|Q01722|GCR2_YEAST GLYCOLYTIC GENES
TRANSCRIPTIONAL ACTIVATOR GCR2 >gi|320841|pir||S31300
regulatory protein GCR2 - yeast (Saccharomyces
cerevisiae) >gi|218427|dbj|BAA00985| (D10104) GCR2
protein [Saccharomyces cerevisiae]
>gi|600066|emb|CAA55509| (X78898) Gcr2; acc.#:D10104
[Saccharomyces cerevisiae] >gi|1302197|emb|CAA96097|
(Z71475) ORF YNL199c [Saccharomyces cerevisiae]
Length = 534
Score = 32.5 bits (72), Expect = 8.1
Identities = 19/65 (29%), Positives = 30/65 (45%), Gaps = 10/65 (15%)
Query: 371 SEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYP 430
++GSP+ Q G++ G N N GNG++ N +N P +T+K +
Sbjct: 244 TKGSPSDLQ------GINNGNNNGNNGNIGNGSNIK----NYGNKNMPNNRTKKRGTRVA 293
Query: 431 NNAPN 435
NA N
Sbjct: 294 KNAKN 298
>gi|182424 (J00127) alpha-fibrinogen precursor [Homo sapiens]
Length = 644
Score = 32.5 bits (72), Expect = 8.1
Identities = 31/114 (27%), Positives = 47/114 (41%), Gaps = 18/114 (15%)
Query: 365 KNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNT--PGQQT 422
+N S G AT++ G S G N ++G G+ + N SP P + T PG
Sbjct: 308 RNPGSSGTGGTATWKPGSSGPG-SAGSWNSGSSGTGSTGNQNPGSPRPGSTGTWNPGSSE 366
Query: 423 RKSPYKYPNNAPNFPSDHATPTFNPYGNPGQ---KTGAGRPNNSGGKYNRNRGP 473
R S + H T + G+ GQ ++G+ RP++ G R P
Sbjct: 367 RGS------------AGHWTSESSVSGSTGQWHSESGSFRPDSPGSGNARPNNP 408
>gi|1136289 (U42597) histidine kinase A [Dictyostelium discoideum]
Length = 2150
Score = 32.5 bits (72), Expect = 8.1
Identities = 29/146 (19%), Positives = 54/146 (36%), Gaps = 11/146 (7%)
Query: 299 NRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKP 358
+ N NN + +NN++ D + +R T + G Q + + + A+
Sbjct: 229 SNNSNNNNNGNNNNNITDSPTKSKRHSTYETNIGSHQRRKSIQSLI----------ANSA 278
Query: 359 FKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGN-GAHPNKKSPNPNTRNT 417
+ ++ LS +P+T + S N+ N G+ GA P +S + N
Sbjct: 279 IHSFSKLKNKPLSSSTPSTVNTCGAVNNNSNNNNNNNNNSTGSLGAIPMDRSFDGNINTI 338
Query: 418 PGQQTRKSPYKYPNNAPNFPSDHATP 443
+ T + N N S+ P
Sbjct: 339 TEESTGGNNSPRSNCGSNCGSNGGIP 364
>gi|1723450|sp|Q10268|YD34_SCHPO HYPOTHETICAL 81.7 KD PROTEIN
C13G7.04C IN CHROMOSOME I PRECURSOR
>gi|2130226|pir||S67433 hypothetical protein - fission
yeast (Schizosaccharomyces pombe)
>gi|1204171|emb|CAA93592.1| (Z69729) hypothetical
protein. [Schizosaccharomyces pombe]
Length = 756
Score = 32.5 bits (72), Expect = 8.1
Identities = 49/231 (21%), Positives = 77/231 (33%), Gaps = 34/231 (14%)
Query: 252 RGQSTELPKTQVEALRTQLKLEKDMPLALTLQPIIGQRRSAKATLHV------NRNDNNK 305
R T P + + L ++ +M +T P +G R ++ LH N + +
Sbjct: 432 RMPQTRSPVNDHSSFPSDLPIKGEMSTNMTGAPRVGSRNNSSNDLHAQAGMLKNVGNGPR 491
Query: 306 HAYHNNHST---ADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQY 362
+A NN S A + R G H + + ++G
Sbjct: 492 NAPRNNSSNNLHAQGGMPMNMRGPRGAPRNNSSGDLHIQSGMPMNGRNG----------- 540
Query: 363 KPKNDRSLSEGSPATFQSWYVP---EGVSTGPRNH--RNAGAGNGAHPNKKSPNPNTRNT 417
P++ + S QS P G PRN+ + A G N + P +RN
Sbjct: 541 -PRDTSRNNSSSDLYAQSGMHPNMNNGHRGAPRNNSSNDLHAHGGMPVNMRGPRNTSRNN 599
Query: 418 PGQQTRKSPYKYPNNAPNFP-----SDHATPTFNPYGNPGQKTGAGRPNNS 463
+ + P N N P S+ +T F G PG G NS
Sbjct: 600 SSSEFNA---QIPMNLRNGPRNASRSNSSTDLFGQSGIPGNSRGMPTSPNS 647
>gi|1448962 (L79944) OFR1 of non-LTR retrotransposon; putative
[Chironomus tentans]
Length = 165
Score = 32.5 bits (72), Expect = 8.1
Identities = 19/81 (23%), Positives = 28/81 (34%), Gaps = 7/81 (8%)
Query: 345 TVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAH 404
T SG A Q ++ KP+N P+ +G +N+R G G G
Sbjct: 13 TTSGRAIHSQQNRATVNQKPQNTNPPPNNKPSN-------QGNENNQQNNRGKGRGKGKR 65
Query: 405 PNKKSPNPNTRNTPGQQTRKS 425
+ P +N Q S
Sbjct: 66 RRQNKSKPRNKNNKNQNKNSS 86
>gi|1523876|emb|CAA99996| (Z75666) BRCA2 [Chlorocebus aethiops]
Length = 1548
Score = 32.5 bits (72), Expect = 8.1
Identities = 39/144 (27%), Positives = 62/144 (42%), Gaps = 25/144 (17%)
Query: 247 PRELLRGQSTELPKTQVEALRTQLKLEKDMPLALTLQPIIGQRRSAK---ATLHVN---- 299
P +L+ + E+P+ QV L T + +D L + P IGQ S+K T+ +
Sbjct: 463 PSYILQKNTFEVPENQVTILNTTTEENRDAGLVIMNAPSIGQVNSSKQFEGTVGIKQKFA 522
Query: 300 ---RNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAH 356
++D NK A + + T + E R F H K L VS EA K A
Sbjct: 523 GLLKSDCNKSA--SGYLTDENEVEFRGF----------YSAHGVK--LNVSTEALQK-AV 567
Query: 357 KPFKQYKPKNDRSLSEGSPATFQS 380
K F + ++++ +E P + S
Sbjct: 568 KLFSDIENISEKTSAEVDPISLSS 591
>gi|1870123|emb|CAB06788| (Z86109) unknown [Saccharomyces
pastorianus]
Length = 193
Score = 32.5 bits (72), Expect = 8.1
Identities = 22/67 (32%), Positives = 29/67 (42%), Gaps = 2/67 (2%)
Query: 409 SPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKYN 468
+P+P + TPG+ K+P K P AP A P P PG+ G P + GK
Sbjct: 98 TPSPQGKKTPGKAPGKAPGKAPGKAPGKAPGKA-PGKAPGKAPGKAPGKA-PGKAPGKAP 155
Query: 469 RNRGPRY 475
G Y
Sbjct: 156 GKAGRSY 162
>gi|4033468|sp|P92965|RS40_ARATH ARGININE/SERINE-RICH SPLICING
FACTOR RSP40 >gi|2582641|emb|CAA67800| (X99437) splicing
factor [Arabidopsis thaliana]
>gi|2980800|emb|CAA18176.1| (AL022197) splicing factor
At-SRp40 [Arabidopsis thaliana]
Length = 350
Score = 32.5 bits (72), Expect = 8.1
Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 8/147 (5%)
Query: 327 LRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPE- 385
++DD RG G H +R +++ P+K+ + D A ++
Sbjct: 167 VKDDDARGNG--HSPERRRDRSPERRRRSPSPYKRERGSPDYGRGASPVAAYRKERTSPD 224
Query: 386 -GVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPT 444
G P ++ + G+ + + N + R ++ SP KY + +PN + +P
Sbjct: 225 YGRRRSPSPYKKSRRGSPEYGRDRRGNDSPRR---RERVASPTKY-SRSPNNKRERMSPN 280
Query: 445 FNPYGNPGQKTGAGRPNNSGGKYNRNR 471
+P+ + G G + + R+R
Sbjct: 281 HSPFKKESPRNGVGEVESPIERRERSR 307
>gi|2952545 (AF051898) coronin binding protein [Dictyostelium
discoideum]
Length = 560
Score = 32.5 bits (72), Expect = 8.1
Identities = 28/183 (15%), Positives = 63/183 (34%), Gaps = 13/183 (7%)
Query: 301 NDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFK 360
N+NN + N++S + + R ++ + + + ++ + A +++ P
Sbjct: 316 NNNNSNNNSNSNSNNNNNGINNRNNSNNNSNNNSNNNSNNSNNRNITNGSNANKSNSPNN 375
Query: 361 QYKPKNDR-------------SLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNK 407
ND + + G+ + + ++ ++ N+ + + N+
Sbjct: 376 NLNTNNDNKNNNSNNNNNSNNNSNNGNSNNNNNNNIINNNNSNSNSNNNSNNNSNNNSNR 435
Query: 408 KSPNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNNSGGKY 467
SPN N T + NN N +++ N N A NN+
Sbjct: 436 NSPNHNNNGDNDNNTNNNTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNYADNSNNNSSNS 495
Query: 468 NRN 470
N N
Sbjct: 496 NNN 498
>gi|3876265|emb|CAB02976.1| (Z81067) predicted using Genefinder;
cDNA EST yk488h9.3 comes from this gene [Caenorhabditis
elegans]
Length = 1307
Score = 32.5 bits (72), Expect = 8.1
Identities = 38/153 (24%), Positives = 55/153 (35%), Gaps = 19/153 (12%)
Query: 303 NNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVSGEAAAKQAHKPFKQY 362
+ H NN+ A R RD R G + TVS E + +H P
Sbjct: 70 HGNHQLQNNYGGASSRGAQSRGSPPRDPRRHANGSSSHRRDKTVSDELQHENSHTP---- 125
Query: 363 KPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNTPGQQT 422
E S +TF S + P S+ R+ R +G+ +KSP+ P QQ
Sbjct: 126 -------RQEESQSTFGSSFRPSQYSSILRDPRLSGSCPPG--QEKSPSNGHNLLPHQQ- 175
Query: 423 RKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKT 455
K+ + P + + T N P Q T
Sbjct: 176 -----KFGGSIPVSSTLSDSHTSNGGSTPNQDT 203
>gi|4996369|dbj|BAA78427.1| (AB021267) polyprotein [Arabidopsis
thaliana]
Length = 1421
Score = 32.5 bits (72), Expect = 8.1
Identities = 26/90 (28%), Positives = 39/90 (42%), Gaps = 10/90 (11%)
Query: 364 PKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAG----NGAHPNKKSPNPNTRNTPG 419
P + S S T S+ P+ +T P +N+ + N +PN SPN +N+P
Sbjct: 817 PSSSISSPSSSEPTAPSYNGPQP-TTQPHQTQNSNSNSPILNNPNPNSPSPNSPNQNSPL 875
Query: 420 QQTRKSPYKYPN-----NAPNFPSDHATPT 444
Q+ S P + PN PS +T T
Sbjct: 876 PQSPISSPHIPTPSTSISEPNSPSSSSTST 905
>gi|5824601|emb|CAA82571.2| (Z29443) similar to Annexin; cDNA EST
EMBL:C10640 comes from this gene; cDNA EST EMBL:C12433
comes from this gene; cDNA EST yk192f7.5 comes from this
gene; cDNA EST yk318c1.5 comes from this gene; cDNA EST
yk494a12.3 comes fr...
Length = 497
Score = 32.5 bits (72), Expect = 8.1
Identities = 25/94 (26%), Positives = 38/94 (39%), Gaps = 19/94 (20%)
Query: 402 GAHPNKKSPNPNTRNTPGQQTR-KSPYKYPNNAPNFPSDHATP--------TFNPYGNPG 452
G N K P+++ + + + S YPN P++ P +++PYG P
Sbjct: 21 GLGGNNKQQQPSSQQSSQEPSNMNSGGGYPNQQPSYGGYGQPPQQPGYGNGSYDPYGQPQ 80
Query: 453 QKT---GAGRP-------NNSGGKYNRNRGPRYP 476
Q+ G G+P N GG Y G YP
Sbjct: 81 QQPYPGGGGQPPYPGSNSNQGGGGYPGQGGAPYP 114
>gi|223918|prf||1004351A fibrinogen alphaA [Homo sapiens]
Length = 462
Score = 32.5 bits (72), Expect = 8.1
Identities = 31/114 (27%), Positives = 47/114 (41%), Gaps = 18/114 (15%)
Query: 365 KNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNKKSPNPNTRNT--PGQQT 422
+N S G AT++ G S G N ++G G+ + N SP P + T PG
Sbjct: 237 RNPGSSGTGGTATWKPGSSGPG-SXGSWNSGSSGTGSTGNQNPGSPRPGSTGTWNPGSSE 295
Query: 423 RKSPYKYPNNAPNFPSDHATPTFNPYGNPGQ---KTGAGRPNNSGGKYNRNRGP 473
R S + H T + G+ GQ ++G+ RP++ G R P
Sbjct: 296 RGS------------AGHWTSESSVSGSTGQWHSESGSFRPDSPGSGNARPNNP 337
>gi|5901828|gb|AAD55422.1|AF181636_1 (AF181636) BcDNA.GH07269
[Drosophila melanogaster]
Length = 682
Score = 32.5 bits (72), Expect = 8.1
Identities = 35/142 (24%), Positives = 57/142 (39%), Gaps = 19/142 (13%)
Query: 338 HHFKDRLTVSGEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNA 397
HH RL+ G + ++ P + S + +PA+ W VP P + +
Sbjct: 70 HHDNVRLSYGGGSHSQ----PVSKVSSSQTHSAAPSAPASPIGWNVPAKPQGPPPAYSAS 125
Query: 398 GAGNGAHPN--KKSP--NPNTRNTP-----GQQTR-----KSPYKYPNNAPNFPSDHATP 443
GAH N ++ P NP + P QT +SPY+ P A + + ++
Sbjct: 126 NPVGGAHTNIHERPPAYNPAYKPAPPSYSAATQTHSNTNLQSPYR-PAGAASPGASSSSS 184
Query: 444 TFNPYGNPGQKTGAGRPNNSGG 465
+ YG GR N++GG
Sbjct: 185 GSHYYGGAHNTAYRGRNNSTGG 206
>gi|6103602|gb|AAF03681.1| (AF160252) KIAA0553 protein [Homo
sapiens]
Length = 1089
Score = 32.5 bits (72), Expect = 8.1
Identities = 32/189 (16%), Positives = 76/189 (39%), Gaps = 6/189 (3%)
Query: 288 QRRSAKATLHVNRNDNNKHAYHNNHSTADESRELRRFDTLRDDRGRGQGKHHFKDRLTVS 347
+ RS K+ H + ++ K + H AD + + ++ + R + K K++ +
Sbjct: 253 KERSGKSHRHKKKKEHKKSSKHKRKPKADTEEKSSKAESGEKSKKRKKRKRK-KNKSSAP 311
Query: 348 GEAAAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNAGAGNGAHPNK 407
++ +P P R + + E S+G ++ G+ + H +
Sbjct: 312 ADSERGPKPEPPGSGSPAPPRRRRRAQDDSQRRSLPAEEGSSGKKDEGGGGSSSQDHGGR 371
Query: 408 KS-----PNPNTRNTPGQQTRKSPYKYPNNAPNFPSDHATPTFNPYGNPGQKTGAGRPNN 462
K P+ R +++ +S ++ ++ + SD A+ +P Q + +
Sbjct: 372 KHKGELPPSSCQRRAGTKRSSRSSHRSQPSSGDEDSDDASSHRLHQKSPSQYSEEEEEED 431
Query: 463 SGGKYNRNR 471
SG +++R+R
Sbjct: 432 SGSEHSRSR 440
>gi|6273747|gb|AAF06358.1|AF102865_1 (AF102865) calymmin [Danio
rerio]
Length = 1207
Score = 32.5 bits (72), Expect = 8.1
Identities = 36/144 (25%), Positives = 49/144 (34%), Gaps = 20/144 (13%)
Query: 351 AAKQAHKPFKQYKPKNDRSLSEGSPATFQSWYVPEGVSTGPRNHRNA----GAGNGAHPN 406
+A Q KP N + ++ + FQ+ P G + GP+ A AG+ A PN
Sbjct: 288 SAGQVAKPNGYGGYPNAGATNQPNGGPFQNMGYPNGGTKGPKPGYGAKAGPSAGHVAKPN 347
Query: 407 KKSPNPNTRNTPGQQTRKSPYK-YPNNAPNFPSDH------------ATPTFN---PYGN 450
PN T S + YPN P A P N P G
Sbjct: 348 GNGGYPNGGATSQHNGGSSQFMGYPNGGTKGPKSGYGANAGPSAGQVAKPNGNGRYPIGG 407
Query: 451 PGQKTGAGRPNNSGGKYNRNRGPR 474
+ G N G R +GP+
Sbjct: 408 VANQPNRGSSQNMGYPNGRTKGPK 431
Database: nr
Posted date: Feb 13, 2000 1:18 AM
Number of letters in database: 140,124,617
Number of sequences in database: 455,460
Lambda K H
0.312 0.132 0.383
Gapped
Lambda K H
0.270 0.0470 0.230
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 142232432
Number of Sequences: 455460
Number of extensions: 6589090
Number of successful extensions: 18239
Number of sequences better than 10.0: 185
Number of HSP's better than 10.0 without gapping: 33
Number of HSP's successfully gapped in prelim test: 153
Number of HSP's that attempted gapping in prelim test: 16504
Number of HSP's gapped (non-prelim): 517
length of query: 476
length of database: 140,124,617
effective HSP length: 58
effective length of query: 418
effective length of database: 113,707,937
effective search space: 47529917666
effective search space used: 47529917666
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 42 (21.8 bits)
S2: 72 (32.5 bits)
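
Note on the statistics above: the effective lengths, bit scores and Expect values in this report can be cross-checked from the footer figures. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul relations (bit score = (Lambda * S - ln K) / ln 2, Expect = search space * 2^-bits) and uses the gapped Lambda and K as printed, which are rounded, so results may differ from the report in the last digit. All function and constant names are illustrative.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 476
DB_LETTERS = 140_124_617
DB_SEQS = 455_460
EFF_HSP_LEN = 58
GAPPED_LAMBDA, GAPPED_K = 0.270, 0.0470  # rounded as printed


def effective_search_space():
    """Effective query length times effective database length."""
    eff_query = QUERY_LEN - EFF_HSP_LEN              # 476 - 58
    eff_db = DB_LETTERS - DB_SEQS * EFF_HSP_LEN      # one HSP length per sequence
    return eff_query * eff_db


def bit_score(raw, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits (assumed Karlin-Altschul form)."""
    return (lam * raw - math.log(k)) / math.log(2)


def expect(raw):
    """Expected number of chance hits scoring at least `raw`."""
    return effective_search_space() * 2.0 ** (-bit_score(raw))


if __name__ == "__main__":
    print("effective search space:", effective_search_space())
    for raw in (75, 74, 73, 72):
        print(raw, round(bit_score(raw), 1), round(expect(raw), 1))
```

With the footer values, effective_search_space() reproduces the "effective search space" line (47529917666), and a raw score of 72 converts to about 32.5 bits, in line with the S2 entry.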