BLASTP 2.12.0+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: 65cc171fb4831e9aaa3f7532dc1d6ee3.SwissProt.fasta
15 sequences; 6,445 total letters
Query= ACIAD0087
Length=431
Score E
Sequences producing significant alignments: (Bits) Value
Q04972 Vi polysaccharide biosynthesis protein VipA/TviB [Salmonel... 600 0.0
P39861 Protein CapL [Staphylococcus aureus] 473 2e-169
G3XD94 UDP-N-acetyl-D-glucosamine 6-dehydrogenase [Pseudomonas ae... 234 7e-76
O59284 UDP-N-acetyl-D-mannosamine dehydrogenase [Pyrococcus horik... 199 1e-62
A6VK13 UDP-N-acetyl-D-mannosamine dehydrogenase [Methanococcus ma... 191 2e-59
Q6LZC3 UDP-N-acetyl-D-mannosamine dehydrogenase [Methanococcus ma... 187 3e-58
Q57871 UDP-N-acetyl-D-mannosamine dehydrogenase [Methanocaldococc... 187 7e-58
A6USK4 UDP-N-acetyl-D-mannosamine dehydrogenase [Methanococcus va... 182 3e-56
Q45410 NDP-N-acetyl-D-galactosaminuronic acid dehydrogenase [Rals... 181 7e-56
A4FY94 UDP-N-acetyl-D-mannosamine dehydrogenase [Methanococcus ma... 181 1e-55
P27829 UDP-N-acetyl-D-mannosamine dehydrogenase [Escherichia coli... 159 2e-47
D4GYH5 UDP-glucose 6-dehydrogenase AglM [Haloferax volcanii (stra... 120 2e-33
O32271 UDP-glucose 6-dehydrogenase TuaD [Bacillus subtilis (strai... 112 2e-30
P11759 GDP-mannose 6-dehydrogenase [Pseudomonas aeruginosa (strai... 92.4 1e-23
O54068 UDP-glucose 6-dehydrogenase [Rhizobium meliloti (strain 10... 92.0 2e-23
>Q04972 Vi polysaccharide biosynthesis protein VipA/TviB [Salmonella
typhi]
Length=425
Score = 600 bits (1547), Expect = 0.0
Identities = 283/422 (67%), Positives = 349/422 (83%), Gaps = 0/422 (0%)
Query 9 LQNLRIAIIGLGYVGLPLAVEFGKHVSVVGFDIHQKRVAELQQGQDHTLEVTPEELKQAQ 68
+ ++IAIIGLGYVGLPLAVEFGK VVGFD+++KR+ EL+ G D LE T EEL++A+
Sbjct 4 IDEVKIAIIGLGYVGLPLAVEFGKSRQVVGFDVNKKRILELKNGVDVNLETTEEELREAR 63
Query 69 LLSYSCDLSDLKDCNFFIVTVPTPIDQFKQPDLTPLIKASQSIGKVLKKDDIVVYESTVY 128
L ++ ++ +K+CNF+I+TVPTPI+ +KQPDLTPLIKAS+++G VL + DIVVYESTVY
Sbjct 64 YLKFTSEIEKIKECNFYIITVPTPINTYKQPDLTPLIKASETVGTVLNRGDIVVYESTVY 123
Query 129 PGATEEVCVPVLEQVSGLTFNQDFFTGYSPERINPGDKAHRVTNILKITAGSTPEVADYV 188
PG TEE CVP+L ++SG+TFNQDF+ GYSPERINPGDK HR+TNI KIT+GST ++A+ +
Sbjct 124 PGCTEEECVPILARMSGMTFNQDFYVGYSPERINPGDKKHRLTNIKKITSGSTAQIAELI 183
Query 189 DAVYQLIIEVGTHKAPSIKVAEAAKVIENTQRDVNIALINELAIIFNKMGIDTQDVLEAA 248
D VYQ II GT+KA SIKVAEAAKVIENTQRD+NIAL+NELAIIFN++ IDT+ VL AA
Sbjct 184 DEVYQQIISAGTYKAESIKVAEAAKVIENTQRDLNIALVNELAIIFNRLNIDTEAVLRAA 243
Query 249 GTKWNFLPFRPGLVGGHCIGVDPYYLTHKAQAIGYHPEIILAGRRLNDNMSMYVANQLIK 308
G+KWNFLPFRPGLVGGHCIGVDPYYLTHK+Q IGY+PEIILAGRRLNDNM YV+ QLIK
Sbjct 244 GSKWNFLPFRPGLVGGHCIGVDPYYLTHKSQGIGYYPEIILAGRRLNDNMGNYVSEQLIK 303
Query 309 SMNKKRIQIEDAKVLILGLTFKENCPDIRNTKIVDIISELKDFNMQVDVYDPWADAEETR 368
+M KK I +E + VLILG TFKENCPDIRNT+I+D++ EL ++ +VD++DPW DAEE R
Sbjct 304 AMIKKGINVEGSSVLILGFTFKENCPDIRNTRIIDVVKELGKYSCKVDIFDPWVDAEEVR 363
Query 369 HEYAIDLIAEPAQGSYDAIILAVAHQQFKAMGAAAIHALGRSNHVIYDLKYVLDRQQSDI 428
EY I ++E YDAII+AV HQQFK MG+ I G+ HV+YDLKYVL +QSD+
Sbjct 364 REYGIIPVSEVKSSHYDAIIVAVGHQQFKQMGSEDIRGFGKDKHVLYDLKYVLPAEQSDV 423
Query 429 RL 430
RL
Sbjct 424 RL 425
>P39861 Protein CapL [Staphylococcus aureus]
Length=424
Score = 473 bits (1217), Expect = 2e-169
Identities = 230/418 (55%), Positives = 305/418 (73%), Gaps = 2/418 (0%)
Query 11 NLRIAIIGLGYVGLPLAVEFGKHVSVVGFDIHQKRVAELQQGQDHTLEVTPEELKQAQLL 70
N IA++GLGYVGLP+AV FG V+GFDI++ R+ EL+ D T EVT +LK +
Sbjct 2 NRNIAVVGLGYVGLPVAVTFGNKHKVIGFDINESRIKELKNNYDRTNEVTENKLKNTNI- 60
Query 71 SYSCDLSDLKDCNFFIVTVPTPIDQFKQPDLTPLIKASQSIGKVLKKDDIVVYESTVYPG 130
Y+ + DLK +F I+ VPTPID+ +PDL PL+KAS+++GKV+ D IVVYESTVYPG
Sbjct 61 EYTSNAEDLKKADFIIIAVPTPIDKHNKPDLLPLLKASETVGKVITPDTIVVYESTVYPG 120
Query 131 ATEEVCVPVLEQVSGLTFNQDFFTGYSPERINPGDKAHRVTNILKITAGSTPEVADYVDA 190
ATEE CVPVLE+ SGL +DFF GYSPERINPGDK H I K+ +G T EV + V
Sbjct 121 ATEEECVPVLEKYSGLVCGKDFFVGYSPERINPGDKVHTFETITKVVSGQTLEVLEIVAD 180
Query 191 VYQLIIEVGTHKAPSIKVAEAAKVIENTQRDVNIALINELAIIFNKMGIDTQDVLEAAGT 250
VY ++ G HKA SIKVAEAAKVIENTQRDVNIAL+NELAIIF+K+ IDT +VL+A+GT
Sbjct 181 VYSSVVTAGVHKASSIKVAEAAKVIENTQRDVNIALMNELAIIFDKLDIDTNEVLKASGT 240
Query 251 KWNFLPFRPGLVGGHCIGVDPYYLTHKAQAIGYHPEIILAGRRLNDNMSMYVANQLIKSM 310
KWNFL F+PGLVGGHCIGVDPYYLTHKAQ +G+HPE+ILAGRR+NDNM+ Y+A+ +IK +
Sbjct 241 KWNFLNFKPGLVGGHCIGVDPYYLTHKAQEVGHHPEVILAGRRINDNMAKYIASNVIKEL 300
Query 311 NKKRIQIEDAKVLILGLTFKENCPDIRNTKIVDIISELKDFNMQVDVYDPWADAEETRHE 370
K+ ++++ A V +LGLTFKENCPD+RNTK++ II ELK++ + V V D AD E +
Sbjct 301 LKQGLEVQGATVNVLGLTFKENCPDLRNTKVIHIIEELKEYGLNVTVNDVEADKNEAKKF 360
Query 371 YAIDLIAEPAQGSYDAIILAVAHQQFKAMGAAAIHALGRSNHVIYDLKYVLDRQQSDI 428
+ +DLI D ++ AV H+ + I+ L + +++D+K +++ + ++
Sbjct 361 FGLDLIDTKELKMVDVVLFAVPHKDYMENKKDYIN-LVKDCGIVFDIKGIINSDELNV 417
Score = 21.2 bits (43), Expect = 0.90
Identities = 11/38 (29%), Positives = 19/38 (50%), Gaps = 2/38 (5%)
Query 215 IENTQRDVNIALINELAIIFNKMGIDTQDVLEAAGTKW 252
+EN + +N L+ + I+F+ GI D L + W
Sbjct 387 MENKKDYIN--LVKDCGIVFDIKGIINSDELNVSQRLW 422
>G3XD94 UDP-N-acetyl-D-glucosamine 6-dehydrogenase [Pseudomonas
aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM
14847 / LMG 12228 / 1C / PRS 101 / PAO1)]
Length=436
Score = 234 bits (597), Expect = 7e-76
Identities = 133/392 (34%), Positives = 220/392 (56%), Gaps = 11/392 (3%)
Query 14 IAIIGLGYVGLPLAVEFGK-HVSVVGFDIHQKRVAELQQGQDHTLEVTPEELKQAQLLSY 72
I I+GLGYVGLPL + + V+G DI +V +L GQ + + ++ +A+ +
Sbjct 18 IGIVGLGYVGLPLMLRYNAIGFDVLGIDIDDVKVDKLNAGQCYIEHIPQAKIAKARASGF 77
Query 73 SC--DLSDLKDCNFFIVTVPTPIDQFKQPDLTPLIKASQSIGKVLKKDDIVVYESTVYPG 130
D S + +C+ I+ VPTP++++++PD++ +I + ++ L+ +V EST YPG
Sbjct 78 EATTDFSRVSECDALILCVPTPLNKYREPDMSFVINTTDALKPYLRVGQVVSLESTTYPG 137
Query 131 ATEEVCVPVLEQVSGLTFNQDFFTGYSPERINPGDKAHRVTNILKITAGSTPEVADYVDA 190
TEE +P +++ GL +D + YSPER +PG+ I K+ G TP+ + A
Sbjct 138 TTEEELLPRVQE-GGLVVGRDIYLVYSPEREDPGNPNFETRTIPKVIGGHTPQCLEVGIA 196
Query 191 VYQLIIEVGTHKAPSIKVAEAAKVIENTQRDVNIALINELAIIFNKMGIDTQDVLEAAGT 250
+Y+ I+ S K AE K++EN R VNI L+NE+ I+ ++MGID +V++AA T
Sbjct 197 LYEQAID-RVVPVSSTKAAEMTKLLENIHRAVNIGLVNEMKIVADRMGIDIFEVVDAAAT 255
Query 251 K-WNFLPFRPGL-VGGHCIGVDPYYLTHKAQAIGYHPEIILAGRRLNDNMSMYVANQLIK 308
K + F P+ PG +GGHCI +DP+YLT KA+ G H I +N M YV +L+
Sbjct 256 KPFGFTPYYPGPGLGGHCIPIDPFYLTWKAREYGLHTRFIELSGEVNQAMPEYVLGKLMD 315
Query 309 SMNKKRIQIEDAKVLILGLTFKENCPDIRNTKIVDIISELKDFNMQVDVYDPWADAEETR 368
+N+ ++ ++VL+LG+ +K+N D+R + V+I+ ++ V DP
Sbjct 316 GLNEAGRALKGSRVLVLGIAYKKNVDDMRESPSVEIMELIEAKGGMVAYSDPHVPVFPKM 375
Query 369 HEYAIDLIAEPAQGS----YDAIILAVAHQQF 396
E+ +L +EP +DA++LA H +F
Sbjct 376 REHHFELSSEPLTAENLARFDAVVLATDHDKF 407
>O59284 UDP-N-acetyl-D-mannosamine dehydrogenase [Pyrococcus horikoshii
(strain ATCC 700860 / DSM 12428 / JCM 9974 / NBRC
100139 / OT-3)]
Length=418
Score = 199 bits (506), Expect = 1e-62
Identities = 126/401 (31%), Positives = 216/401 (54%), Gaps = 24/401 (6%)
Query 12 LRIAIIGLGYVGLPLAVEFGKH-VSVVGFDIHQKRVAELQQGQDHTLEVTPE------EL 64
+RIA++GLGY+GLP A+ F VVG+DI + + ++ G H +E PE E+
Sbjct 1 MRIAVLGLGYIGLPTAIMFASSGYDVVGYDIRSEVIKKINSGVAHIIE--PEIDRRLKEV 58
Query 65 KQAQLLSYSCDLSDLKDCNFFIVTVPTPIDQFKQPDLTPLIKASQSIGKVLKKDDIVVYE 124
L + + DLK N FI+ V TP+ PDL+ L +A +++ +V+ + +V+ E
Sbjct 59 LSLGKLKVTDRVEDLKGSNVFIICVQTPLSG-DDPDLSYLERAIRTVAEVMDRGALVIIE 117
Query 125 STVYPGATEEVCVPVLEQVSGLTFNQDFFTGYSPERINPGDKAHRVTNILKITAGSTPEV 184
ST+ PG TE++ +LE ++GL DF+ ++PER+ PG + +I G + +
Sbjct 118 STIPPGTTEKMA-RLLENLTGLREGVDFYVAHAPERVMPGRIFKELVYNSRIIGGVSEKA 176
Query 185 ADYVDAVYQLIIEVGTHKAPSIKVAEAAKVIENTQRDVNIALINELAIIFNKMGIDTQDV 244
A+ + +Y+ ++ G + AE K++ENT RDVNIAL NE A++ ++ G++ +
Sbjct 177 ANLAEKLYRSFVK-GRIFLTNATTAEMVKLMENTFRDVNIALANEFALLAHQYGVNVYEA 235
Query 245 LEAAGTKWNFLPFRPGL-VGGHCIGVDPYYLTHKAQAIGYHPEIILAGRRLNDNMSMYVA 303
+E A T PG+ VGGHC+ DPY L A+ +I RR+N+ M + A
Sbjct 236 IELANTHPRVKIHTPGIGVGGHCLPKDPYLLLSNAKE---DFGLIRIARRINERMPAFAA 292
Query 304 NQLIKSMNKKRIQIEDAKVLILGLTFKENCPDIRNTKIVDIISELKDFNMQVDVYDPWAD 363
L +++ K I+ +A + +LGL +K D RN+ + + +++ +V YDP+
Sbjct 293 GLLFEALEKANIKPSEAIIAVLGLAYKGGTDDTRNSPALKFVEIIRNSVKEVRTYDPYVR 352
Query 364 AEETRHEYAIDLIAEPAQGSYDAIILAVAHQQFKAMGAAAI 404
D + + +G+ DAI++A H +FK++ +I
Sbjct 353 GTH-------DSLEKVVEGA-DAIVIATDHPEFKSVNWESI 385
>A6VK13 UDP-N-acetyl-D-mannosamine dehydrogenase [Methanococcus
maripaludis (strain C7 / ATCC BAA-1331)]
Length=427
Score = 191 bits (484), Expect = 2e-59
Identities = 133/420 (32%), Positives = 214/420 (51%), Gaps = 32/420 (8%)
Query 13 RIAIIGLGYVGLPLAVEFGKH-VSVVGFDIHQKRVAELQQGQDHTLEVTPEELKQAQLLS 71
+I +IGLGY+GLP A H VVG D+++KRV ++ G+ E L + + S
Sbjct 11 KICVIGLGYIGLPTASMLANHGYEVVGVDVNEKRVNHIKNGELKIEEPGLLTLVKGAINS 70
Query 72 YSCDL-SDLKDCNFFIVTVPTPI----DQFKQPDLTPLIKASQSIGKVLKKDDIVVYEST 126
+ ++ + ++ + FI+ VPTP D K+ DLT ++ A Q+I LK +++V EST
Sbjct 71 KNLNVQTSAEEADAFIICVPTPALENEDGSKKCDLTYVMSAVQAIIPFLKDGNLIVVEST 130
Query 127 VYPGATEEVCVPVLEQVSGLTFNQDFFTGYSPERINPGDKAHRVTNILKITAGSTPEVAD 186
+ P T+++ T N+ + + PER+ PG + +I G + A+
Sbjct 131 IPPETTKKIYE---------TINKKIYVAHCPERVLPGKILKELVENDRIIGGINKKSAE 181
Query 187 YVDAVYQLIIEVGTHKAPSIKVAEAAKVIENTQRDVNIALINELAIIFNKMGIDTQDVLE 246
+Y+ +E + S AE K++ENT RD+NIAL NE A I +++G++ D ++
Sbjct 182 MAKEIYKSFVEGKIYITDS-NTAEMVKLMENTYRDINIALANEFAKICDEIGVNVWDAIK 240
Query 247 AAG--TKWNFLPFRPGLVGGHCIGVDPYYLTHKAQAIGYHPEIILAGRRLNDNMSMYVAN 304
A + N L PG VGGHCI +DP+++ K + + I A R LNDNM YV
Sbjct 241 IANKHPRVNILNPGPG-VGGHCISIDPWFIVEKTN----NAKFIRAARELNDNMPAYVCK 295
Query 305 QLIKSMNKKRIQIEDAKVLILGLTFKENCPDIRNTKIVDIISELKDFNMQVDVYDPWADA 364
++ +NK + IE K+ I G T+K N D R + ++I L + V +DP AD
Sbjct 296 SVLSELNK--LGIEKPKISIFGATYKGNVEDTRESPSKNVIKMLLENGATVSTFDPHADC 353
Query 365 EETRHEYAIDLIAEPAQGSYDAIILAVAHQQFKAMGAAAIHAL--GRSNHVIYDLKYVLD 422
EY + + E GS D I++ H FK + I + N +++D K +L+
Sbjct 354 ----FEYPLSTLDECISGS-DCIVVLTDHDAFKNIKKDDIDEICPKLKNKIVFDTKNILE 408
>Q6LZC3 UDP-N-acetyl-D-mannosamine dehydrogenase [Methanococcus
maripaludis (strain S2 / LL)]
Length=427
Score = 187 bits (476), Expect = 3e-58
Identities = 130/420 (31%), Positives = 215/420 (51%), Gaps = 32/420 (8%)
Query 13 RIAIIGLGYVGLPLAVEFGKH-VSVVGFDIHQKRVAELQQGQDHTLEVTPEELKQAQLLS 71
+I +IGLGY+GLP A H VVG D+++KRV +++ G+ E L + + S
Sbjct 11 KICVIGLGYIGLPTASMLANHGYDVVGVDVNEKRVNQIKNGELKIEEPGLLTLVKGAINS 70
Query 72 YSCDL-SDLKDCNFFIVTVPTPI----DQFKQPDLTPLIKASQSIGKVLKKDDIVVYEST 126
+ ++ + + + FI+ VPTP D K+ DL+ ++ A ++I +K +++V EST
Sbjct 71 KNLNVRTSATEADAFIICVPTPALAKEDGSKKCDLSYVMSAVEAILPFVKDGNLIVIEST 130
Query 127 VYPGATEEVCVPVLEQVSGLTFNQDFFTGYSPERINPGDKAHRVTNILKITAGSTPEVAD 186
+ P T+++ T N+ + + PER+ PG + +I G + A+
Sbjct 131 IPPETTKKIYE---------TLNKKIYVAHCPERVLPGKILKELVENDRIIGGINKKSAE 181
Query 187 YVDAVYQLIIEVGTHKAPSIKVAEAAKVIENTQRDVNIALINELAIIFNKMGIDTQDVLE 246
+Y+ +E + S AE K++ENT RD+NIAL NE A I +++G++ D ++
Sbjct 182 MAKEIYKSFVEGQIYTTDS-NTAEMVKLMENTYRDINIALANEFAKICDEIGVNVWDAIK 240
Query 247 AAG--TKWNFLPFRPGLVGGHCIGVDPYYLTHKAQAIGYHPEIILAGRRLNDNMSMYVAN 304
A + N L PG VGGHCI +DP+++ K + + I A R LNDNM YV N
Sbjct 241 IANKHPRVNILNPGPG-VGGHCISIDPWFIVEKTN----NAKFIRAARELNDNMPAYVCN 295
Query 305 QLIKSMNKKRIQIEDAKVLILGLTFKENCPDIRNTKIVDIISELKDFNMQVDVYDPWADA 364
++ + K++ IE K+ I G T+K N D R + ++I L + V YDP A
Sbjct 296 SVLSEL--KKLGIEKPKISIFGATYKGNVEDTRESPSKNVIKMLLENGATVSTYDPHA-- 351
Query 365 EETRHEYAIDLIAEPAQGSYDAIILAVAHQQFKAMGAAAIHAL--GRSNHVIYDLKYVLD 422
+ EY + + E GS D I++ H FK + I + N +++D K +L+
Sbjct 352 --SYFEYPLSTLDECISGS-DCIVVLTDHDVFKTIKKDDIDEICPKLKNKIVFDTKNILE 408
>Q57871 UDP-N-acetyl-D-mannosamine dehydrogenase [Methanocaldococcus
jannaschii (strain ATCC 43067 / DSM 2661 / JAL-1 / JCM
10045 / NBRC 100440)]
Length=427
Score = 187 bits (474), Expect = 7e-58
Identities = 131/421 (31%), Positives = 218/421 (52%), Gaps = 32/421 (8%)
Query 13 RIAIIGLGYVGLPLAVEFG-KHVSVVGFDIHQKRVAELQQGQDHTLEVTPEELKQAQLLS 71
RI +IGLGY+GLP A + V+G DI++KRV E+++ T E L + + S
Sbjct 12 RICVIGLGYIGLPTASMLAIQGFDVIGVDINEKRVKEIKELSFKTTEKDLMTLVKGAINS 71
Query 72 YSCDLSDLKD-CNFFIVTVPTPI---DQFKQPDLTPLIKASQSIGKVLKKDDIVVYESTV 127
+ + + + FI+ VPTP D K+ DLT L KA +SI L+ ++++ EST+
Sbjct 72 GNLKVQTKPEKADVFIICVPTPCIECDGEKKCDLTYLNKAIESIKPYLENGNLIIIESTI 131
Query 128 YPGATEEVCVPVLEQVSGLTFNQDFFTGYSPERINPGDKAHRVTNILKITAGSTPEVADY 187
PG T+++ L+ ++ + + PER+ PG + ++ G + A+
Sbjct 132 PPGTTDDI-------YKKLSKDKKIYVAHCPERVLPGSILKELVENDRVIGGVDEKSAEM 184
Query 188 VDAVYQLIIEVGTHKAPSIKVAEAAKVIENTQRDVNIALINELAIIFNKMGIDTQDVLEA 247
+Y+ + G K AE K++ENT RDVNIAL NE A I ++GI+ + +E
Sbjct 185 AKEIYETFV-TGKIYLTDAKTAEMVKLMENTYRDVNIALANEFAKIAEEIGINVWEAIEL 243
Query 248 AG--TKWNFLPFRPGLVGGHCIGVDPYYLTHKAQAIGYHPEIILAGRRLNDNMSMYVANQ 305
A + N L PG VGGHCI +DP+++ K++ + ++I R LND+M ++V +
Sbjct 244 ANKHPRVNILKPGPG-VGGHCISIDPWFIVEKSK----NAKLIRTARELNDSMPLFVVEK 298
Query 306 LIKSMNKKRIQIEDAKVLILGLTFKENCPDIRNTKIVDIISELKDFNMQVDVYDPWADAE 365
+ KK I+ + KV I G+T+K N D R + ++S+L D +V YD +A
Sbjct 299 I-----KKIIKKDIGKVAIFGVTYKGNVDDTRESPAEKVVSKLIDEGFEVKCYDKYA--- 350
Query 366 ETRHEYAIDLIAEPAQGSYDAIILAVAHQQFKAMGAAAIHALGR--SNHVIYDLKYVLDR 423
Y ++ + E +G+ D I++ H ++K I + N +I D K +L+R
Sbjct 351 -RDFIYPLNSLDEAVEGA-DIIVILAEHDEYKNFDKEDIKNIASKVKNKIILDTKNILNR 408
Query 424 Q 424
+
Sbjct 409 E 409
>A6USK4 UDP-N-acetyl-D-mannosamine dehydrogenase [Methanococcus
vannielii (strain ATCC 35089 / DSM 1224 / JCM 13029 / OCM
148 / SB)]
Length=427
Score = 182 bits (463), Expect = 3e-56
Identities = 129/420 (31%), Positives = 213/420 (51%), Gaps = 32/420 (8%)
Query 13 RIAIIGLGYVGLPLAVEFGKH-VSVVGFDIHQKRVAELQQGQDHTLEVTPEELKQAQLLS 71
+I +IGLGY+GLP A H V+G DI +KRV E++ G E L + + S
Sbjct 11 KICVIGLGYIGLPTASMLANHGYEVIGVDISEKRVNEIKNGDFKIEEPGLLTLLKGAINS 70
Query 72 YSCDL-SDLKDCNFFIVTVPTPI----DQFKQPDLTPLIKASQSIGKVLKKDDIVVYEST 126
+ ++ + + + FI+ VPTP D K+ DL+ ++ A SI + + +++V EST
Sbjct 71 KNLNVKTKAEKADAFIICVPTPAIGCDDGSKKCDLSYVLDAVNSILPYIDEGNLIVIEST 130
Query 127 VYPGATEEVCVPVLEQVSGLTFNQDFFTGYSPERINPGDKAHRVTNILKITAGSTPEVAD 186
+ P T+++ + ++V + + PER+ PG + +I G + A+
Sbjct 131 IPPETTQKIYDIIDKKV---------YVAHCPERVLPGKILKELVENDRIIGGINKKSAE 181
Query 187 YVDAVYQLIIEVGTHKAPSIKVAEAAKVIENTQRDVNIALINELAIIFNKMGIDTQDVLE 246
+Y+ +E + S AE K++ENT RD+NIAL NE A I +++G++ D ++
Sbjct 182 MAKEIYKSFVEGKIYITDS-NTAEMVKLMENTYRDINIALANEFAKICDEIGVNVWDAIK 240
Query 247 AAG--TKWNFLPFRPGLVGGHCIGVDPYYLTHKAQAIGYHPEIILAGRRLNDNMSMYVAN 304
A + N L PG VGGHCI +DP+++ K + + I + R LND M YV N
Sbjct 241 IANKHPRVNILNPGPG-VGGHCISIDPWFIVEKTN----NAKFIRSARELNDKMPYYVCN 295
Query 305 QLIKSMNKKRIQIEDAKVLILGLTFKENCPDIRNTKIVDIISELKDFNMQVDVYDPWADA 364
+I + K + IE KV + G T+K N D R + +I L + N+ V YDP A++
Sbjct 296 MIISEL--KNLNIEKPKVTVFGATYKGNVEDTRESPSKKVIDALAEKNIPVSTYDPHANS 353
Query 365 EETRHEYAIDLIAEPAQGSYDAIILAVAHQQFKAMGAAAIHALGR--SNHVIYDLKYVLD 422
EY + + + S D I++ H +FK+ I + + N +I D K +L+
Sbjct 354 ----FEYELHSLEDSIVNS-DCIVVLTDHNEFKSFKKEEIDEISKKLKNKLIIDTKNILN 408
>Q45410 NDP-N-acetyl-D-galactosaminuronic acid dehydrogenase [Ralstonia
solanacearum]
Length=423
Score = 181 bits (460), Expect = 7e-56
Identities = 125/397 (31%), Positives = 205/397 (52%), Gaps = 20/397 (5%)
Query 14 IAIIGLGYVGLPLA-VEFGKHVSVVGFDIHQKRVAELQQGQDHTLEVTPEELKQAQLLS- 71
I+++GLGY+GLP A V + ++G DI+Q V + Q + H +E + L +A +
Sbjct 12 ISVVGLGYIGLPTATVLASRQRELIGVDINQHAVDTINQARIHIVEPDLDMLVRAAVSQG 71
Query 72 YSCDLSDLKDCNFFIVTVPTPIDQFKQPDLTPLIKASQSIGKVLKKDDIVVYESTVYPGA 131
Y ++ + + F++ VPTP + KQPDLT + A+++I VLK+ D+VV EST GA
Sbjct 72 YLRATTEPEPADAFLIAVPTPFLEDKQPDLTYIEAAAKAIAPVLKRGDLVVLESTSPVGA 131
Query 132 TEEVCVPVLEQVSGLTF------NQDFFTGYSPERINPGDKAHRVTNILKITAGSTPEVA 185
TE++ + EQ S L+F D + PER+ PG + +I G TP +
Sbjct 132 TEQLSAWLSEQRSDLSFPHQLGEESDIRVAHCPERVLPGHVLRELVENDRIIGGMTPRCS 191
Query 186 DYVDAVYQLIIEVGTHKAPSIKVAEAAKVIENTQRDVNIALINELAIIFNKMGIDTQDVL 245
+Y+L + G + AE K+ EN RDVNIA NEL++I +++G++ +++
Sbjct 192 QAAQRLYELFVR-GRCIVTDARTAEMCKLTENAFRDVNIAFANELSMICDEIGVNVWELI 250
Query 246 EAAG--TKWNFLPFRPGLVGGHCIGVDPYYLTHKAQAIGYHPEIILAGRRLNDNMSMYVA 303
A + N L PG VGGHCI VDP+++ A +I R +ND YV
Sbjct 251 SVANRHPRVNILQPGPG-VGGHCIAVDPWFIVDAAPE---SARLIRTAREVNDAKPHYVL 306
Query 304 NQLIKSMNKKRIQIEDAKVLILGLTFKENCPDIRNTKIVDIISELKDFNM-QVDVYDPWA 362
+++ ++ + ++ + GL+FK N D+R + ++I+ + + V V +P
Sbjct 307 DRVKQAARR----FKEPVIACFGLSFKANIDDLRESPAIEIVRTMVQQQLGTVLVVEPHI 362
Query 363 DAEETRHEYAIDLIAEPAQGSYDAIILAVAHQQFKAM 399
E L AEPA D ++L V HQ+F+ +
Sbjct 363 KVLPASLEGVELLNAEPALSRADIVVLLVDHQKFRKL 399
>A4FY94 UDP-N-acetyl-D-mannosamine dehydrogenase [Methanococcus
maripaludis (strain C5 / ATCC BAA-1333)]
Length=427
Score = 181 bits (459), Expect = 1e-55
Identities = 129/420 (31%), Positives = 214/420 (51%), Gaps = 32/420 (8%)
Query 13 RIAIIGLGYVGLPLAVEFGKH-VSVVGFDIHQKRVAELQQGQDHTLEVTPEELKQAQLLS 71
+I +IGLGY+GLP A H VVG D+++KRV +++ G+ E L + + S
Sbjct 11 KICVIGLGYIGLPTASMLANHGYEVVGVDVNEKRVNQIKNGELKIEEPGLLTLVKGAINS 70
Query 72 YSCDL-SDLKDCNFFIVTVPTPI----DQFKQPDLTPLIKASQSIGKVLKKDDIVVYEST 126
+ ++ + + + FI+ VPTP D K+ DLT ++ A Q+I LK+ +++V EST
Sbjct 71 KNLNVQTSATEADAFIICVPTPALENEDGSKKCDLTYVMGAVQNIIPFLKEGNLIVIEST 130
Query 127 VYPGATEEVCVPVLEQVSGLTFNQDFFTGYSPERINPGDKAHRVTNILKITAGSTPEVAD 186
+ P T+++ T ++ + + PER+ PG + +I G + A+
Sbjct 131 IPPEITKKIYE---------TIDKKIYVAHCPERVLPGKILKELVENDRIIGGINKKSAE 181
Query 187 YVDAVYQLIIEVGTHKAPSIKVAEAAKVIENTQRDVNIALINELAIIFNKMGIDTQDVLE 246
+Y+ +E + S AE K++ENT RD+NIAL NE A I +++G++ D ++
Sbjct 182 MAKEIYKSFVEGKIYITDS-NTAEMVKLMENTYRDINIALANEFAKICDEIGVNVWDAIK 240
Query 247 AAG--TKWNFLPFRPGLVGGHCIGVDPYYLTHKAQAIGYHPEIILAGRRLNDNMSMYVAN 304
A + N L PG VGGHCI +DP+++ K + + I A R LNDNM YV
Sbjct 241 IANKHPRVNILNPGPG-VGGHCISIDPWFIVEKTN----NAKFIRAARELNDNMPAYVCK 295
Query 305 QLIKSMNKKRIQIEDAKVLILGLTFKENCPDIRNTKIVDIISELKDFNMQVDVYDPWADA 364
++ + K+ I+ K+ + G T+K N D R + ++I L + V +DP A
Sbjct 296 SVLSEL--KKHGIKKPKISVFGATYKGNVEDTRESPSKNVIEMLLKNGVTVSTFDPHA-- 351
Query 365 EETRHEYAIDLIAEPAQGSYDAIILAVAHQQFKAMGAAAIHAL--GRSNHVIYDLKYVLD 422
T EY + + E GS D I++ H FK + I + N +++D K +L+
Sbjct 352 --TCFEYPLSTLDECISGS-DCIVVLTDHDAFKNIKKDDIDEICPKLKNKIVFDTKNILE 408
>P27829 UDP-N-acetyl-D-mannosamine dehydrogenase [Escherichia
coli (strain K12)]
Length=420
Score = 159 bits (401), Expect = 2e-47
Identities = 120/419 (29%), Positives = 204/419 (49%), Gaps = 43/419 (10%)
Query 14 IAIIGLGYVGLPLAVEFG-KHVSVVGFDIHQKRVAELQQGQDHTLEVT-PEELKQAQLLS 71
I++IGLGY+GLP A F + V+G DI+Q V + +G+ H +E +K A
Sbjct 6 ISVIGLGYIGLPTAAAFASRQKQVIGVDINQHAVDTINRGEIHIVEPDLASVVKTAVEGG 65
Query 72 YSCDLSDLKDCNFFIVTVPTPIDQFKQPDLTPLIKASQSIGKVLKKDDIVVYESTVYPGA 131
+ + + + +++ VPTP +PD+T + A++SI VLKK +V+ EST G+
Sbjct 66 FLRASTTPVEADAWLIAVPTPFKGDHEPDMTYVESAARSIAPVLKKGALVILESTSPVGS 125
Query 132 TEEVCVPVLEQVSGLTFNQ------DFFTGYSPERINPGDKAHRVTNILKITAGSTPEVA 185
TE++ + E LTF Q D Y PER+ PG + ++ G TP +
Sbjct 126 TEKMAEWLAEMRPDLTFPQQVGEQADVNIAYCPERVLPGQVMVELIKNDRVIGGMTPVCS 185
Query 186 DYVDAVYQLIIEVGTHKAPSIKVAEAAKVIENTQRDVNIALINELAIIFNKMGIDTQDVL 245
+Y++ +E G + + AE K+ EN+ RDVNIA NEL++I GI+ +++
Sbjct 186 ARASELYKIFLE-GECVVTNSRTAEMCKLTENSFRDVNIAFANELSLICADQGINVWELI 244
Query 246 EAAG--TKWNFLPFRPGLVGGHCIGVDPYYLTHKAQAIGYHPE---IILAGRRLNDNMSM 300
A + N L PG VGGHCI VDP+++ + +P+ +I R +ND+
Sbjct 245 RLANRHPRVNILQPGPG-VGGHCIAVDPWFI------VAQNPQQARLIRTAREVNDHKPF 297
Query 301 YVANQL-------IKSMNKKRIQIEDAKVLILGLTFKENCPDIRNTKIVDIISELKDFNM 353
+V +Q+ + + +K+ ++ K+ GL FK N D+R + ++I + ++
Sbjct 298 WVIDQVKAAVADCLAATDKRASEL---KIACFGLAFKPNIDDLRESPAMEIAELIAQWHS 354
Query 354 QVDVYDPWADAEETRHEYAIDLIA-------EPAQGSYDAIILAVAHQQFKAMGAAAIH 405
+ E H+ L + A + D +++ V H QFK + +H
Sbjct 355 GETLV-----VEPNIHQLPKKLTGLCTLAQLDEALATADVLVMLVDHSQFKVINGDNVH 408
>D4GYH5 UDP-glucose 6-dehydrogenase AglM [Haloferax volcanii (strain
ATCC 29605 / DSM 3757 / JCM 8879 / NBRC 14742 / NCIMB
2012 / VKM B-1768 / DS2)]
Length=430
Score = 120 bits (302), Expect = 2e-33
Identities = 117/406 (29%), Positives = 179/406 (44%), Gaps = 25/406 (6%)
Query 12 LRIAIIGLGYVGLPLAVEFGK--HVSVVGFDIHQKRVAELQQGQDHTLEVTPEELKQ--- 66
+ ++IIG GYVG +A F + H VV DI + VA L GQ E EL +
Sbjct 1 MELSIIGSGYVGTTIAACFAELGH-DVVNVDIDEDIVASLNDGQAPIHEPGLAELVERYA 59
Query 67 AQLLSYSCDLSDLKDCNFFIVTVPTPIDQFKQPDLTPLIKASQSIGKVL-KKDD--IVVY 123
L + D ++ D + + +PTP DL + A+ S+G+ L +KDD +VV
Sbjct 60 GDRLRATTDYDEILDTDATFLALPTPSTDDGSIDLGAMKTAATSLGETLARKDDSHLVVT 119
Query 124 ESTVYPGATEEVCVPVLEQVSGLTFNQDFFTGYSPERINPGDKAHRVTNILKITAGS-TP 182
+STV P T +V P +E+ SG +PE + G + KI G+ T
Sbjct 120 KSTVVPRTTVDVIGPRIEEASGKRVGDGLDIAMNPEFLREGTAVDDFLSPDKIVLGAQTD 179
Query 183 EVADYVDAVYQLIIEVGTHKA---PSIKVAEAAKVIENTQRDVNIALINELAIIFNKMGI 239
+ + ++ ++E + I AE K N I+L N+LA I G+
Sbjct 180 RAYETLAEIFAPLVERAGNPPVVKTGISEAEMIKYANNAFLASKISLANDLANICKVFGV 239
Query 240 DTQDVLEAAGTKWN----FLPFRPGLVGGHCIGVDPYYLTHKAQAIGYHPEIILAGRRLN 295
D+ +VLE+ G FL G GG C D + A+A GY P ++ A +N
Sbjct 240 DSAEVLESIGLDSRIGSAFLGAGLGW-GGSCFPKDTAAIIAAARAQGYEPRLLQAAVDVN 298
Query 296 DNMSMYVANQLIKSMNKKRIQIEDAKVLILGLTFKENCPDIRNTKIVDIISELKDFNMQV 355
D + + L ++R ++ +V +LGL FK DIR ++ + +I L D V
Sbjct 299 DGQPERMLDLL-----RERFDLDGKRVAVLGLAFKPGTDDIRKSRAILLIQALLDAGADV 353
Query 356 DVYDPWADAEETRHEYAIDLI--AEPAQGSYDAIILAVAHQQFKAM 399
YDP A ID A A + DA ++A +F A+
Sbjct 354 VGYDPVATENMRERFPDIDYADSAADALANADAALVATDWDEFAAL 399
Score = 19.6 bits (39), Expect = 2.7
Identities = 7/18 (39%), Positives = 11/18 (61%), Gaps = 0/18 (0%)
Query 4 RMPLDLQNLRIAIIGLGY 21
R DL R+A++GL +
Sbjct 310 RERFDLDGKRVAVLGLAF 327
>O32271 UDP-glucose 6-dehydrogenase TuaD [Bacillus subtilis (strain
168)]
Length=461
Score = 112 bits (280), Expect = 2e-30
Identities = 111/415 (27%), Positives = 190/415 (46%), Gaps = 36/415 (9%)
Query 13 RIAIIGLGYVGLPLAVEFGKHVS-VVGFDIHQKRVAELQQGQDHTLEVTPEELKQA---- 67
+IA+IG GYVGL F + + VV DI + ++ L+ G E +L +
Sbjct 3 KIAVIGTGYVGLVSGTCFAEIGNKVVCCDIDESKIRSLKNGVIPIYEPGLADLVEKNVLD 62
Query 68 QLLSYSCDL-SDLKDCNFFIVTVPTPIDQFKQPDLTPLIKASQSIGKVLKKDDIVVYEST 126
Q L+++ D+ S ++ + + V TP+ + + DLT + A+++IG+ L ++V +ST
Sbjct 63 QRLTFTNDIPSAIRASDIIYIAVGTPMSKTGEADLTYVKAAAKTIGEHLNGYKVIVNKST 122
Query 127 VYPGATEEVCVPVLEQVSGLTFNQDFFTGYSPERINPGDKAHRVTNILKITAGSTP-EVA 185
V P T ++ ++++ S ++ D + +PE + G H N+ + GST + A
Sbjct 123 V-PVGTGKLVQSIVQKASKGRYSFDVVS--NPEFLREGSAIHDTMNMERAVIGSTSHKAA 179
Query 186 DYVDAVYQLIIEVGTHKAPSIKV----AEAAKVIENTQRDVNIALINELAIIFNKMGIDT 241
++ ++Q AP IK AE K N I+ IN++A I ++G D
Sbjct 180 AIIEELHQ------PFHAPVIKTNLESAEMIKYAANAFLATKISFINDIANICERVGADV 233
Query 242 QDVLEAAGTKWN----FLPFRPGLVGGHCIGVDPYYLTHKAQAIGYHPEIILAGRRLNDN 297
V + G FL G GG C D L A++ GY ++I A N+
Sbjct 234 SKVADGVGLDSRIGRKFLKAGIGF-GGSCFPKDTTALLQIAKSAGYPFKLIEAVIETNEK 292
Query 298 MSMYVANQLIKSMNKKRIQIEDAKVLILGLTFKENCPDIRNTKIVDIISELKDFNMQVDV 357
+++ ++L+ M ++ + +LGL FK N D+R+ +DII L+ V
Sbjct 293 QRVHIVDKLLTVMGS----VKGRTISVLGLAFKPNTNDVRSAPALDIIPMLQQLGAHVKA 348
Query 358 YDPWADAEET-----RHEYAIDLIAEPAQGSYDAIILAVAHQQFKAMGAAAIHAL 407
YDP A E + + EY D+ A A DA ++ + K M + L
Sbjct 349 YDPIAIPEASAILGEQVEYYTDVYA--AMEDTDACLILTDWPEVKEMELVKVKTL 401
>P11759 GDP-mannose 6-dehydrogenase [Pseudomonas aeruginosa (strain
ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG
12228 / 1C / PRS 101 / PAO1)]
Length=436
Score = 92.4 bits (228), Expect = 1e-23
Identities = 92/363 (25%), Positives = 159/363 (44%), Gaps = 25/363 (7%)
Query 12 LRIAIIGLGYVGLPLA-VEFGKHVSVVGFDIHQKRVAELQQGQDHTLEVTPEEL----KQ 66
+RI+I GLGYVG A + V+G D+ ++ + QG+ +E E L +Q
Sbjct 1 MRISIFGLGYVGAVCAGCLSARGHEVIGVDVSSTKIDLINQGKSPIVEPGLEALLQQGRQ 60
Query 67 AQLLSYSCDLSD-LKDCNFFIVTVPTPIDQFKQPDLTPLIKASQSIGKVLKKDD---IVV 122
LS + D + D + + V TP + DL + + IG +++ VV
Sbjct 61 TGRLSGTTDFKKAVLDSDVSFICVGTPSKKNGDLDLGYIETVCREIGFAIREKSERHTVV 120
Query 123 YESTVYPGATEEVCVPVLEQVSGLTFNQDFFTGYSPERINPGDKAHRVT-NILKITAGST 181
STV PG V +P++E SG DF G +PE + + +
Sbjct 121 VRSTVLPGTVNNVVIPLIEDCSGKKAGVDFGVGTNPEFLRESTAIKDYDFPPMTVIGELD 180
Query 182 PEVADYVDAVYQLIIEVGTHKAPSIKVAEAAKVIENTQRDVNIALINELAIIFNKMGIDT 241
+ D ++ +Y+ + K +++VAE K N + NE+ I +G+D
Sbjct 181 KQTGDLLEEIYRELDAPIIRK--TVEVAEMIKYTCNVWHAAKVTFANEIGNIAKAVGVDG 238
Query 242 QDVLE--AAGTKWNFLPF--RPGLV-GGHCIGVDPYYLTHKAQAIGYHPEIILAGRRLND 296
++V++ K N + RPG GG C+ D LT++A + ++ + R N
Sbjct 239 REVMDVICQDHKLNLSRYYMRPGFAFGGSCLPKDVRALTYRASQLDVEHPMLGSLMRSNS 298
Query 297 NMSMYVANQLIKSMNKKRIQIEDAKVLILGLTFKENCPDIRNTKIVDIISELKDFNMQVD 356
N + A LI S + + KV +LGL+FK D+R + +V++ L ++
Sbjct 299 N-QVQKAFDLITSHDTR-------KVGLLGLSFKAGTDDLRESPLVELAEMLIGKGYELR 350
Query 357 VYD 359
++D
Sbjct 351 IFD 353
>O54068 UDP-glucose 6-dehydrogenase [Rhizobium meliloti (strain
1021)]
Length=437
Score = 92.0 bits (227), Expect = 2e-23
Identities = 103/408 (25%), Positives = 171/408 (42%), Gaps = 31/408 (8%)
Query 12 LRIAIIGLGYVGLPLAV---EFGKHVSVVGFDIHQKRVAELQQGQDHTLEVTPEELKQAQ 68
++I +IG GYVGL V +FG V V D + +++ L++GQ E + L +
Sbjct 1 MKITMIGAGYVGLVSGVCFADFGHDVVCVDKD--EGKISALKKGQIPIFEPGLDHLVASN 58
Query 69 LLSYSCDLSD-------LKDCNFFIVTVPTPIDQFKQPDLTPLIKASQSIGKVLKKDDIV 121
+ S + +D D F V P+ DL+ + A++ I L+ +V
Sbjct 59 VASGRLNFTDDLKTAVAASDVVFIAVGTPSRRGD-GHADLSYVYAAAREIAANLQGFTVV 117
Query 122 VYESTVYPGATEEVCVPVLEQVSGLTFNQDFFTGYSPERINPG---DKAHRVTNILKITA 178
V +STV G +EV + E D +PE + G + R I+
Sbjct 118 VTKSTVPVGTGDEVERIIRETNPAA----DVTVVSNPEFLREGAAIEDFKRPDRIVIGVD 173
Query 179 GSTPEVADYVDAVYQ-LIIEVGTHKAPSIKVAEAAKVIENTQRDVNIALINELAIIFNKM 237
GS + + VY+ L + + + +E K N + I INE+A + K+
Sbjct 174 GSDGRAREVMTEVYRPLYLNQSPLVFTTRRTSELIKYAGNAFLAMKITFINEIADLCEKV 233
Query 238 GIDTQDVLEAAGTKWN----FLPFRPGLVGGHCIGVDPYYLTHKAQAIGYHPEIILAGRR 293
G + QDV G FL PG GG C D L AQ ++
Sbjct 234 GANVQDVARGIGLDGRIGSKFLHAGPGY-GGSCFPKDTLALVKTAQDHDTPVRLVETTVA 292
Query 294 LNDNMSMYVANQLIKSMNKKRIQIEDAKVLILGLTFKENCPDIRNTKIVDIISELKDFNM 353
+NDN + ++I + I +K+ +LGLTFK N D+R++ + ++ L+D
Sbjct 293 VNDNRKRAMGRKVIAAAGG---DIRGSKIAVLGLTFKPNTDDMRDSPAIAVVQALQDAGA 349
Query 354 QVDVYDPWADAEETRHEYAIDLIAEP--AQGSYDAIILAVAHQQFKAM 399
+V YDP + +D +P A DA+++ +F+A+
Sbjct 350 RVTGYDPEGMENARKLIEGLDCARDPYEAAAEADALVIITEWNEFRAL 397
Lambda K H a alpha
0.321 0.138 0.404 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 2158400
Database: 65cc171fb4831e9aaa3f7532dc1d6ee3.SwissProt.fasta
Posted date: May 19, 2024 10:33 PM
Number of letters in database: 6,445
Number of sequences in database: 15
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40