BLASTP 2.12.0+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: 1a693f7583086474516d251cfcd823e6.SwissProt.fasta
10 sequences; 2,883 total letters
Query= ACIAD1442
Length=312
Sequences producing significant alignments:    Score (Bits)    E Value
P07773 Catechol 1,2-dioxygenase [Acinetobacter baylyi (strain ATC... 639 0.0
P31019 Catechol 1,2-dioxygenase [Pseudomonas sp. (strain EST1001)] 366 2e-131
O33948 Catechol 1,2-dioxygenase 1 [Acinetobacter lwoffii] 362 8e-130
Q43984 Catechol 1,2-dioxygenase [Acinetobacter guillouiae] 320 2e-113
O33950 Catechol 1,2-dioxygenase 2 [Acinetobacter lwoffii] 313 9e-111
A1IIX3 Hydroxyquinol 1,2-dioxygenase [Rhizobium sp. (strain MTP-1... 142 6e-44
P86029 Catechol 1,2-dioxygenase [Candida albicans (strain SC5314 ... 131 1e-39
P11451 Chlorocatechol 1,2-dioxygenase [Pseudomonas putida] 124 3e-37
P95607 Catechol 1,2-dioxygenase [Rhodococcus opacus] 122 2e-36
P27098 Chlorocatechol 1,2-dioxygenase [Pseudomonas sp. (strain P51)] 120 5e-36
>P07773 Catechol 1,2-dioxygenase [Acinetobacter baylyi (strain
ATCC 33305 / BD413 / ADP1)]
Length=311
Score = 639 bits (1648), Expect = 0.0
Identities = 311/311 (100%), Positives = 311/311 (100%), Gaps = 0/311 (0%)
Query 1 MEVKIFNTQDVQDFLRVASGLEQEGGNPRVKQIIHRVLSDLYKAIEDLNITSDEYWAGVA 60
MEVKIFNTQDVQDFLRVASGLEQEGGNPRVKQIIHRVLSDLYKAIEDLNITSDEYWAGVA
Sbjct 1 MEVKIFNTQDVQDFLRVASGLEQEGGNPRVKQIIHRVLSDLYKAIEDLNITSDEYWAGVA 60
Query 61 YLNQLGANQEAGLLSPGLGFDHYLDMRMDAEDAALGIENATPRTIEGPLYVAGAPESVGY 120
YLNQLGANQEAGLLSPGLGFDHYLDMRMDAEDAALGIENATPRTIEGPLYVAGAPESVGY
Sbjct 61 YLNQLGANQEAGLLSPGLGFDHYLDMRMDAEDAALGIENATPRTIEGPLYVAGAPESVGY 120
Query 121 ARMDDGSDPNGHTLILHGTIFDADGKPLPNAKVEIWHANTKGFYSHFDPTGEQQAFNMRR 180
ARMDDGSDPNGHTLILHGTIFDADGKPLPNAKVEIWHANTKGFYSHFDPTGEQQAFNMRR
Sbjct 121 ARMDDGSDPNGHTLILHGTIFDADGKPLPNAKVEIWHANTKGFYSHFDPTGEQQAFNMRR 180
Query 181 SIITDENGQYRVRTILPAGYGCPPEGPTQQLLNQLGRHGNRPAHIHYFVSADGHRKLTTQ 240
SIITDENGQYRVRTILPAGYGCPPEGPTQQLLNQLGRHGNRPAHIHYFVSADGHRKLTTQ
Sbjct 181 SIITDENGQYRVRTILPAGYGCPPEGPTQQLLNQLGRHGNRPAHIHYFVSADGHRKLTTQ 240
Query 241 INVAGDPYTYDDFAYATREGLVVDAVEHTDPEAIKANDVEGPFAEMVFDLKLTRLVDGVD 300
INVAGDPYTYDDFAYATREGLVVDAVEHTDPEAIKANDVEGPFAEMVFDLKLTRLVDGVD
Sbjct 241 INVAGDPYTYDDFAYATREGLVVDAVEHTDPEAIKANDVEGPFAEMVFDLKLTRLVDGVD 300
Query 301 NQVVDRPRLAV 311
NQVVDRPRLAV
Sbjct 301 NQVVDRPRLAV 311
>P31019 Catechol 1,2-dioxygenase [Pseudomonas sp. (strain EST1001)]
Length=302
Score = 366 bits (940), Expect = 2e-131
Identities = 172/293 (59%), Positives = 220/293 (75%), Gaps = 2/293 (1%)
Query 1 MEVKIFNTQDVQDFLRVASGLEQEGGNPRVKQIIHRVLSDLYKAIEDLNITSDEYWAGVA 60
M VKI++T +VQDFL++ +GL+QEGGN R KQIIHR+LSDLY+ I+D +IT+++YW+ V+
Sbjct 1 MTVKIYDTPEVQDFLKIVAGLDQEGGNDRGKQIIHRILSDLYRTIDDFDITAEQYWSAVS 60
Query 61 YLNQLGANQEAGLLSPGLGFDHYLDMRMDAEDAALGIENATPRTIEGPLYVAGAPESVGY 120
LN LG + GLLSPGLGFDHY+DMRMDA DA TPRTIEGPLYVAGAPE+ G+
Sbjct 61 LLNALGQASQFGLLSPGLGFDHYMDMRMDAADAEAKRTGGTPRTIEGPLYVAGAPEAEGF 120
Query 121 ARMDDGSDPNGHTLILHGTIFDADGKPLPNAKVEIWHANTKGFYSHFDPTGEQQAFNMRR 180
ARMDD D +G T+ LHG + D GKP+P AKVEIWH N+KG YS FD + Q +N+RR
Sbjct 121 ARMDDDPDTDGETMWLHGQVRDTAGKPIPGAKVEIWHCNSKGGYSFFDKS--QTPYNLRR 178
Query 181 SIITDENGQYRVRTILPAGYGCPPEGPTQQLLNQLGRHGNRPAHIHYFVSADGHRKLTTQ 240
+II D G YR R+++P+GYG P PT Q+L LGRHG RPAHIHYF+SA GH+ LTTQ
Sbjct 179 TIIADNEGYYRARSVIPSGYGVPEGAPTDQVLKLLGRHGERPAHIHYFISAPGHQHLTTQ 238
Query 241 INVAGDPYTYDDFAYATREGLVVDAVEHTDPEAIKANDVEGPFAEMVFDLKLT 293
IN+AGDPYTYDDFA+ATR+ L + + A + VEG E++F+++L+
Sbjct 239 INLAGDPYTYDDFAFATRQDLAAEGKRVENHPAAQQYGVEGTVTEVIFNIELS 291
>O33948 Catechol 1,2-dioxygenase 1 [Acinetobacter lwoffii]
Length=311
Score = 362 bits (930), Expect = 8e-130
Identities = 177/311 (57%), Positives = 223/311 (72%), Gaps = 2/311 (1%)
Query 1 MEVKIFNTQDVQDFLRVASGLEQEGGNPRVKQIIHRVLSDLYKAIEDLNITSDEYWAGVA 60
M +K+F T++VQD L+ A+ LE +GGN R KQI+HR+LSDL+KAI+DL+IT DE WAGV
Sbjct 1 MSIKVFGTKEVQDLLKAATNLEGKGGNARSKQIVHRLLSDLFKAIDDLDITPDEVWAGVN 60
Query 61 YLNQLGANQEAGLLSPGLGFDHYLDMRMDAEDAALGIENATPRTIEGPLYVAGAPESVGY 120
YLN+LG + EA LL+ G G + YLD+R+DA D A GIE TPRTIEGPLYVAGA G
Sbjct 61 YLNKLGQDGEATLLAAGSGLEKYLDIRLDAADKAEGIEGGTPRTIEGPLYVAGATVHDGV 120
Query 121 ARMDDGSDPNGHTLILHGTIFDADGKPLPNAKVEIWHANTKGFYSHFDPTGEQQAFNMRR 180
+++D D + L++HGT+ DGKP+ A VE WHAN+KGFYSHFDPTG Q FN+R
Sbjct 121 SKIDINPDEDAGPLVIHGTVTGPDGKPVAGAVVECWHANSKGFYSHFDPTGAQSDFNLRG 180
Query 181 SIITDENGQYRVRTILPAGYGCPPEGPTQQLLNQLGRHGNRPAHIHYFVSADGHRKLTTQ 240
++ T +G+Y RT++P GYGCPP+G TQQLLN LGRHGNRPAH+H+FVS+D RKLTTQ
Sbjct 181 AVKTGADGKYEFRTLMPVGYGCPPQGATQQLLNVLGRHGNRPAHVHFFVSSDSARKLTTQ 240
Query 241 INVAGDPYTYDDFAYATREGLVVDAVEHTDPEAIKANDVEGPFAEMVFDLKLTRLVDGVD 300
N+ GDP +DDFAYATRE L+ E A+ + ++ F+L LT LV G D
Sbjct 241 FNIEGDPLIWDDFAYATREELIPPVTEKKGGTALGLK--ADTYKDIEFNLTLTSLVKGKD 298
Query 301 NQVVDRPRLAV 311
NQVV R R V
Sbjct 299 NQVVHRLRAEV 309
>Q43984 Catechol 1,2-dioxygenase [Acinetobacter guillouiae]
Length=305
Score = 320 bits (821), Expect = 2e-113
Identities = 159/277 (57%), Positives = 198/277 (71%), Gaps = 3/277 (1%)
Query 27 NPRVKQIIHRVLSDLYKAIEDLNITSDEYWAGVAYLNQLGANQEAGLLSPGLGFDHYLDM 86
N RV+QI+ R+L DL++AIEDLN++ E W G+ YL G E GLL+ GLG +HYLD+
Sbjct 25 NLRVQQIVVRLLGDLFQAIEDLNMSQTELWKGLEYLTDAGQANELGLLAAGLGLEHYLDL 84
Query 87 RMDAEDAALGIENATPRTIEGPLYVAGAPESVGYARMDDGSDP-NGHTLILHGTIFDADG 145
R D DA GI TPRTIEGPLYVAGAPESVG+ARMDDGS+ + LI+ G + D G
Sbjct 85 RADEADAKAGITGGTPRTIEGPLYVAGAPESVGFARMDDGSESAHVDALIIEGNVTDTAG 144
Query 146 KPLPNAKVEIWHANTKGFYSHFDPTGEQQAFNMRRSIITDENGQYRVRTILPAGYGCPPE 205
+ +PNAKVEIWHAN+ G YS FD + Q AFN+RRSI TD GQY +T +P GYGCPPE
Sbjct 145 QIIPNAKVEIWHANSLGNYSFFDKS--QSAFNLRRSIFTDTQGQYIAQTTMPVGYGCPPE 202
Query 206 GPTQQLLNQLGRHGNRPAHIHYFVSADGHRKLTTQINVAGDPYTYDDFAYATREGLVVDA 265
G TQ LLN LGRHGNRP+H+HYFVSA G+RKLTTQ N+ GD Y +DDFA+ATR+GL+ A
Sbjct 203 GTTQALLNLLGRHGNRPSHVHYFVSAPGYRKLTTQFNIEGDKYLWDDFAFATRDGLIATA 262
Query 266 VEHTDPEAIKANDVEGPFAEMVFDLKLTRLVDGVDNQ 302
++ TD IK ++ F + F+ +L + D V Q
Sbjct 263 LDVTDLAKIKQYNLNKAFKHIKFNFQLVQDADQVPLQ 299
>O33950 Catechol 1,2-dioxygenase 2 [Acinetobacter lwoffii]
Length=275
Score = 313 bits (801), Expect = 9e-111
Identities = 149/256 (58%), Positives = 190/256 (74%), Gaps = 3/256 (1%)
Query 7 NTQDVQDFLRVASGLEQEGGNPRVKQIIHRVLSDLYKAIEDLNITSDEYWAGVAYLNQLG 66
N Q + L+ + GNPR KQI++R++ DL+ IEDL++ DE+W + YL G
Sbjct 2 NKQAIDALLQKINDSAINEGNPRTKQIVNRIVRDLFYTIEDLDVQPDEFWTALNYLGDAG 61
Query 67 ANQEAGLLSPGLGFDHYLDMRMDAEDAALGIENATPRTIEGPLYVAGAPESVGYARMDDG 126
+ E GLL+ GLGF+H+LD+RMD +A G+E TPRTIEGPLYVAGAP S G+AR+DDG
Sbjct 62 RSGELGLLAAGLGFEHFLDLRMDEAEAKAGVEGGTPRTIEGPLYVAGAPVSDGHARLDDG 121
Query 127 SDPNGHTLILHGTIFDADGKPLPNAKVEIWHANTKGFYSHFDPTGEQQAFNMRRSIITDE 186
+DP G TL++ G +F DGKPL NA VE+WHAN G YS+FD + Q AFN+RRSI TD
Sbjct 122 TDP-GQTLVMRGRVFGEDGKPLANALVEVWHANHLGNYSYFDKS--QPAFNLRRSIRTDA 178
Query 187 NGQYRVRTILPAGYGCPPEGPTQQLLNQLGRHGNRPAHIHYFVSADGHRKLTTQINVAGD 246
G+Y R+++P GY PP+G TQ LL+QLGRHG+RPAHIH+FVSA G RKLTTQIN+ GD
Sbjct 179 EGKYSFRSVVPVGYSVPPQGQTQLLLDQLGRHGHRPAHIHFFVSAPGFRKLTTQINIDGD 238
Query 247 PYTYDDFAYATREGLV 262
PY +DDFA+ATR+GLV
Sbjct 239 PYLWDDFAFATRDGLV 254
>A1IIX3 Hydroxyquinol 1,2-dioxygenase [Rhizobium sp. (strain MTP-10005)]
Length=295
Score = 142 bits (358), Expect = 6e-44
Identities = 84/243 (35%), Positives = 127/243 (52%), Gaps = 13/243 (5%)
Query 27 NPRVKQIIHRVLSDLYKAIEDLNITSDEYWAGVAYLNQLGA-----NQEAGLLSPGLGFD 81
+PR+K+I+ V L++A++++ T +E+ + +L ++G QE L S LG
Sbjct 30 DPRLKEIMAVVTRKLHEAVKEIEPTEEEWMKAIHFLTEVGQICNEWRQEWILFSDILGVS 89
Query 82 HYLDMRMDAEDAALGIENATPRTIEGPLYVAGAPESVGYARMDDGSDPNGHTLILHGTIF 141
+D + + A+ T+ GP +VA APE A + D G +++ G I
Sbjct 90 MLVDAINHRKPSG-----ASESTVLGPFHVADAPEMPMGANIC--LDGKGEDMLVTGRIL 142
Query 142 DADGKPLPNAKVEIWHANTKGFYSHFDPTGEQQAFNMRRSIITDENGQYRVRTILPAGYG 201
D DG P+ A++++W AN +GFY G Q FN+R +T E+G+Y R P Y
Sbjct 143 DTDGVPVAGARIDVWQANDEGFYD-VQQKGIQPDFNLRGVFVTGEDGRYWFRAAKPKYYP 201
Query 202 CPPEGPTQQLLNQLGRHGNRPAHIHYFVSADGHRKLTTQINVAGDPYTYDDFAYATREGL 261
P +GP QLL +GRH RPAH+HY VSA+G L T I DPY D + +E L
Sbjct 202 IPDDGPVGQLLRAMGRHPYRPAHLHYIVSAEGFTTLVTHIFDPDDPYIRSDAVFGVKESL 261
Query 262 VVD 264
+ D
Sbjct 262 LAD 264
Score = 24.3 bits (51), Expect = 0.027
Identities = 9/23 (39%), Positives = 13/23 (57%), Gaps = 0/23 (0%)
Query 261 LVVDAVEHTDPEAIKANDVEGPF 283
++VDA+ H P + V GPF
Sbjct 90 MLVDAINHRKPSGASESTVLGPF 112
Score = 19.6 bits (39), Expect = 0.80
Identities = 14/51 (27%), Positives = 22/51 (43%), Gaps = 1/51 (2%)
Query 227 YFVSADGHRKLTTQINVAGDPYTYDDFAYATRE-GLVVDAVEHTDPEAIKA 276
YFV + ++ DP + A TR+ V +E T+ E +KA
Sbjct 11 YFVEERSAETVIARMRDCDDPRLKEIMAVVTRKLHEAVKEIEPTEEEWMKA 61
>P86029 Catechol 1,2-dioxygenase [Candida albicans (strain SC5314
/ ATCC MYA-2876)]
Length=303
Score = 131 bits (329), Expect = 1e-39
Identities = 77/255 (30%), Positives = 132/255 (52%), Gaps = 14/255 (5%)
Query 28 PRVKQIIHRVLSDLYKAIEDLNITSDEYWAGVAYLNQLGA-----NQEAGLLSPGLGFDH 82
PR K++I ++ ++ + ++T++++ GV ++N++G E L+ +G +
Sbjct 19 PRAKKLIASLVQHVHDFARENHLTTEDWLWGVDFINRIGQMSDSRRNEGILVCDIIGLET 78
Query 83 YLDMRMDAEDAALGIENATPRTIEGPLYVAGAPESVGYARMDDGSDPNGHTLILHGTIFD 142
+D + + + N T I GP Y+ +P + + P + G + D
Sbjct 79 LVDALTNESEQS----NHTSSAILGPFYLPDSPVYPNGGSIVQKAIPTDVKCFVRGKVTD 134
Query 143 ADGKPLPNAKVEIWHANTKGFYSH-FDPTGEQQAFNMRRSIITDENGQYRVRTILPAGYG 201
+GKPL A++E+W N+ GFYS D G + FN+R + ITD+ G Y + P Y
Sbjct 135 TEGKPLGGAQLEVWQCNSAGFYSQQADHDGPE--FNLRGTFITDDEGNYSFECLRPTSYP 192
Query 202 CPPEGPTQQLLNQLGRHGNRPAHIHYFVSADGHRKLTTQINVAGDPYTYDDFAYATREGL 261
P +GP LL + RH NRP+HIH+ VS G+ L TQI A PYT +D YA ++ +
Sbjct 193 IPYDGPAGDLLKIMDRHPNRPSHIHWRVSHPGYHTLITQIYDAECPYTNNDSVYAVKDDI 252
Query 262 VV--DAVEHTDPEAI 274
+V + V++ D + +
Sbjct 253 IVHFEKVDNKDKDLV 267
>P11451 Chlorocatechol 1,2-dioxygenase [Pseudomonas putida]
Length=260
Score = 124 bits (310), Expect = 3e-37
Identities = 78/234 (33%), Positives = 113/234 (48%), Gaps = 11/234 (5%)
Query 29 RVKQIIHRVLSDLYKAIEDLNITSDEYWAGVAYLNQLGANQEAGLLSPGLGFDHYLDMRM 88
RV ++ ++ + K + D +T EY AGV YL ++ +E LL D +L+ +
Sbjct 4 RVAEVAGAIVEAVRKILLDKRVTEAEYRAGVDYLTEVAQTRETALL-----LDVFLNSTI 58
Query 89 DAEDAALGIENATPRTIEGPLYVAGAPESVGYARMDDGSDPNGHTLILHGTIFDADGKPL 148
E A + P I+GP ++ GAP G + D D LI+ GT+ G+ L
Sbjct 59 -IEGKAQRSRTSAP-AIQGPYFLEGAPVVEGVLKTYDTDDHK--PLIIRGTVRSDTGELL 114
Query 149 PNAKVEIWHANTKGFYSHFDPTGEQQAFNMRRSIITDENGQYRVRTILPAGYGCPPEGPT 208
A +++WH+ G YS + R ++TD G YRVRT +P Y P EGPT
Sbjct 115 AGAVIDVWHSTPDGLYSGIHDNIPVDYY--RGKLVTDSQGNYRVRTTMPVPYQIPYEGPT 172
Query 209 QQLLNQLGRHGNRPAHIHYFVSADGHRKLTTQINVAGDPYTYDDFAYATREGLV 262
+LL LG H RPAH+H+ V DG LTTQ G + DD + L+
Sbjct 173 GRLLGHLGSHTWRPAHVHFKVRKDGFEPLTTQYYFEGGKWVDDDCCHGVTPDLI 226
>P95607 Catechol 1,2-dioxygenase [Rhodococcus opacus]
Length=270
Score = 122 bits (305), Expect = 2e-36
Identities = 80/239 (33%), Positives = 113/239 (47%), Gaps = 14/239 (6%)
Query 29 RVKQIIHRVLSDLYKAIEDLNITSDEYWAGVAYLNQLGANQEAGLLSPGLGFDHYLDMRM 88
R+ I L L I +T EY +L +G E L +LD+ +
Sbjct 23 RLAAIAKDALGALNDVILKHGVTYPEYRVFKQWLIDVGEGGEWPL---------FLDVFI 73
Query 89 D--AEDAALGIENATPRTIEGPLYVAGAPESVGYARMDDGSDPNGHT-LILHGTIFDADG 145
+ E+ T +IEGP Y+ +PE + + T L+ G + D DG
Sbjct 74 EHSVEEVLARSRKGTMGSIEGPYYIENSPELPSKCTLPMREEDEKITPLVFSGQVTDLDG 133
Query 146 KPLPNAKVEIWHANTKGFYSHFDPTGEQQAFNMRRSIITDENGQYRVRTILPAGYGCPPE 205
L AKVE+WHA+ G+YS F P +N+R +II DE G+Y + TI PA Y P +
Sbjct 134 NGLAGAKVELWHADNDGYYSQFAP--HLPEWNLRGTIIADEEGRYEITTIQPAPYQIPTD 191
Query 206 GPTQQLLNQLGRHGNRPAHIHYFVSADGHRKLTTQINVAGDPYTYDDFAYATREGLVVD 264
GPT Q + H RPAH+H VSA G +TTQ+ G + D A AT+ L++D
Sbjct 192 GPTGQFIEAQNGHPWRPAHLHLIVSAPGKESVTTQLYFKGGEWIDSDVASATKPELILD 250
>P27098 Chlorocatechol 1,2-dioxygenase [Pseudomonas sp. (strain
P51)]
Length=251
Score = 120 bits (301), Expect = 5e-36
Identities = 70/237 (30%), Positives = 112/237 (47%), Gaps = 11/237 (5%)
Query 27 NPRVKQIIHRVLSDLYKAIEDLNITSDEYWAGVAYLNQLGANQEAGLLSPGLGFDHYLDM 86
N RVKQ+ ++ + K + + +T +E+ AGV Y+ +L +E +L +D+
Sbjct 2 NERVKQVASALVDAIQKTLTEQRVTEEEWRAGVGYMMKLAEAKEVAVLLDAFFNHTIVDL 61
Query 87 RMDAEDAALGIENATPRTIEGPLYVAGAPESVGYARMDDGSDPNGHTLILHGTIFDADGK 146
+ A + ++GP ++ GAP G + + D + H L++ G + DG
Sbjct 62 KAQAT-------RGSRPAMQGPYFLEGAPVVAGALKTYE--DDSHHPLVIRGAVRTDDGA 112
Query 147 PLPNAKVEIWHANTKGFYSHFDPTGEQQAFNMRRSIITDENGQYRVRTILPAGYGCPPEG 206
P A +++WH+ G YS + R ++ D G+Y VRT +PA Y P +G
Sbjct 113 PAAGAVIDVWHSTPDGKYSGIHDQIPTDMY--RGKVVADAQGKYAVRTTMPAPYQIPNKG 170
Query 207 PTQQLLNQLGRHGNRPAHIHYFVSADGHRKLTTQINVAGDPYTYDDFAYATREGLVV 263
PT LL +G H RPAH+H+ V DG LTTQ G + D LV+
Sbjct 171 PTGVLLEMMGSHTWRPAHVHFKVRKDGFAPLTTQYYFEGGDWVDSDCCKGVAPDLVM 227
Lambda      K        H        a         alpha
   0.319    0.139    0.419    0.792     4.96
Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6
Effective search space used: 665010
Database: 1a693f7583086474516d251cfcd823e6.SwissProt.fasta
Posted date: May 30, 2024 11:48 PM
Number of letters in database: 2,883
Number of sequences in database: 10
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40