
#133 Regex Include Only

closed
nobody
1
2021-06-08
2017-08-24
John Fraser
No

I am finding that it would be easier to have a regex option to include files, rather than just exclude. My list of file suffixes that are useful to index on our network drive is about 1/100th the list of those to exclude.

Cheers

Current regex to exclude list of suffixes:

(?i).*?\.(A\b|A1 1|A11B|A13S|accdb|ACK|ACK\_code\_tester\_test\_1|ACK\_code\_tester\_test\_2|ACK\_code\_tester\_test\_3|ACK\_code\_tester\_test\_4|ACK\_code\_tester\_test\_5|ACK\_code\_tester\_test6|ACK\_for\_test1|ACK\_for\_test2|ACK\_for\_test3|ACK\_for\_test4|ACK\_for\_test5|ACK\_for\_test6|ACK\_for\_test7|ACK\_for\_test8|actions|adf|adfc\_diagram|AdvancedMessageFormat$Function|AdvancedMessageFormat$ObjectFormatter|AdvancedMessageFormat$PartFactory|ai\b|air|aj\b|api\_description|apk|app|applet|Archive|as\b|AS1|asd|ase|ashx|asmx|asp|aspx|aspx\_files|asscache|axd|backup|bak|bas|base|bcdeploy|bdy|bin|bk\b|btr|buf|build|builders|bundle|BUP|C\b|c4e2d83a441042fa9dfa7f7da5e4ad20|c6149d416a614737befaca1f159bd073|c75bf085219d42ab9ff28207aebb5b9d|cache|CAT|cdi|cdi\_javac|cer|cert|cgi|chk|Citizenship type AP|CLA|class|class\_diagram|classpath|cob|col|com|compiled|completed|component|con|connectivity|container|content|CONTENT\_SOURCE|ContentHandlerFactory|core|coreep|cp\b|cpad|cpd|cpgz|cpi|cpo|cptd|cpth|cptl|cptx|cptx\_lock|cpx|crt|crxml|cs\b|csproj|csr|css\_old|css~|css032\_0|css1|ctl|ctrl|D311CORE|dat|data|datasource|db\b|dbf|dbg0|dbx|dcx|ddl|dep|dependencies|deploy|DIS|disabled|disco|dll|dmd|dmp|do\_not\_use|donotuse|DotSettings|dqy|DS\_Store|dSYM|dtd|dwt|dylib|E\b|e71a9a167378463a954bea6e12ad6281|e97beaba2b6c4160a3b112a61240baa8|ear|eclipseproduct|ed5|editors|eex|ElementMapping|emf|eml|empty|emz|ENFR4210|engine|env|eot|eps|erb|ERP|err|EventExceptionManager$ExceptionFactory|evt0|ExceptionHandler|exclude|exe|exsd|fa7030f1247647baba1e5faef4c190d6|facet|fd822575dc26436cb4f4830651026e15|feature|files\_0|fileTableLock|fm3|fmb|fmb\_fris\_22\_feb|fml|fmx|fnc|fnd|fo\b|FOEventHandler|form|fp5|fpbf|framework|ftl|fxml|fxp|gem|gemspec|gen|gif|gif\_temp|gitignore|gradle|graffle|grant|groovy|grt|gsp|gstencil|gz\b|h\b|haml|handlers|hdr|header|hist|history|hmap|htaccess|htc|HTP|I20100511-0800|icm|icns|ics|ide|idea|idl|ids|idx|IFDocumentHandler|IFO|ImageConverter|ImageHandler|ImageLoaderF
actory|ImagePreloader|img|iml|import|imx|ind|indb|indd|index|index2|index3|indexes|inf|info|ini|instance|intro|ipa|ipr|ipz|iqy|ISchemaTypeProvider$Factory|isf|iws|IXA|IXB|IXC|IXE|IXF|j2ee|jar|jar\_old|JAV|java|java\_copy|java\_new|java2|jaxrpc|jbf|jdb|jfr|jks|jmx|jnlp|jnlp1|jrxml|js\b|js~|jsb|jsdtscope|jsf\_diagram|jsff|jsl|jsp|jspa\_files|jspf|jsps|jsptagregistry|jspx|jspx\_files|jsx|jtp|jwg|jws|K\b|keep|keg|key|keystore|l4t|launch|launches|launching|lazy|lck|ldif|less|library|license|LinkFileList|lis|little|lnk|local|local\_16|local\_targets|location|lock|LOG|log00001|log00076|log00077|log00078|log00079|log00080|log00081|log00082|log00083|log00084|log00085|log00086|log00087|log00088|log00089|log00090|log00091|log00092|log00093|log00094|log00095|log00096|log00097|log00098|log00099|log00100|log00101|log00102|log00103|log00104|log00105|log00106|log00107|log00108|log00109|log00132|log00133|log00134|log00135|log00136|log00137|log00138|LogFactory|LPK|lproj|LST|lsx|lvix|lxrh|lxrh1100|m\b|M20110210-1200|m4a|m4v|mailcap|mainClass|manager|manifest|mark|markdown|markers|mbox|md\b|mdb|mdi|merge|MF\b|mflow|mht|mime|mmb|mml|mmx|mno|mobi|mobileprovision|MOD|mode1v3|module|modulecore|MOI|MRF|msg|msi|mso|msu|muse|mwp|mxp|mylyn|n\b|name|net|Net For BAs|Net Screen Layouts|Net source 
files|new|nib|nm5|nvram|o\b|o~$|o~?|odb|ods|odt|odttf|olb|old|old1|one|onetoc2|oo3|openvms|options|ora|org|org\_trinidad|orig|original|osgi|otf|other|out|out00001|out00002|out00003|out00004|out00005|out00006|out00007|out00008|out00009|out00010|out00011|out00012|out00013|out00014|out00015|out00016|out00017|out00018|out00019|out00020|out00021|out00022|out00023|out00024|out00025|out00026|out00027|out00028|out00029|out00030|out00031|oxd\_faces|oxd\_java|oxd\_struts|oxd\_struts\_9051|P\b|p7b|pack|par|pass1|pass2|pbxbtree|pbxindex|pbxproj|pbxstrings|pbxsymbols|pbxuser|pc\b|pch|pcx|pcz|pdb|pdm|pdn|pem|pez|pfb|pfm|PGI|php|pict|pkb|pkg|pks|pl\b|plb|pld|plist|pll|pll\_old|pls|plugin|pluginInfo|plugins|plx|plz|policy|prc|prn|profile|project|projects|properties|props|providers|proxy|ps\b|psd|psp|pspimage|pvw|py\b|QRP|qry|qxd|R201102081640|r262\_v20110127|r36\_v20100702|r361\_v20100903-897HFa-FX0z-z-ntoaavz0JPX628|r362\_v20101104-9SAxFMKFkSAqi8axkv1ZjegmiBLY|r362\_v20101111-7B77FKv99HL0GvIAF99I8H14B7E|r362\_v20101111-7e7jEVFDr6xg-LSlC\_z0w1D|r362\_v20101117-0800-7z8XFW6FLFlmjJcvz03jyeFBLS\_F|r362\_v20110210-9gF78Gs1FrIGnHDHWkEcopoN8AmxeZflGDGKQi|R36x\_v20101222|R36x\_v20110210-4-09oB5865H6L02434|r7198|r8865|rake|raml|ran|rar|rb\b|rdf|reallyold|refactoring|refactorings|reg|rels|RenderEngine|Renderer|rep|repository|REQ|res|resources|resx|rgb|rgba|richfaces|RLT|RLT\_code\_tester\_test\_1|RLT\_code\_tester\_test\_2|RLT\_code\_tester\_test\_3|RLT\_code\_tester\_test\_4|RLT\_code\_tester\_test\_5|RLT\_code\_tester\_test6|RLT\_for\_test1|RLT\_for\_test2|RLT\_for\_test3|RLT\_for\_test4|RLT\_for\_test5|RLT\_for\_test6|RLT\_for\_test7|RLT\_for\_test8|root|Ross|rpt|rptdesign|rptlibrary|RSA|rsrc|rss|running|runtime|safetable|sampledb|sass|sass-cache|scf|schema|schemas|screenshot|scss|scssx|sdat|search|security|sequence|settings|SF\b|sfap0|sfk|sh\b|sha1|shb|showNOT|shs|simpleconfigurator|skeleton|sln|smbdeleteAAAf0000000062fb|smi|snap|snippets|snp|so\b|spf|Spp|sql|sqs|src|srl|ssa|ssad|sso|ss
sa|sssad|stamp|state|ste|stm|strings|studio|sva|svcinfo|svcmap|svg|svnExe|swf|syn|t\b|tab|table|tags|tagsupport|tar|target|task|template|texteditor|tform|thmx|tif|tiff|timestamps|tkp|tld|tmp|tmpl|tools|tpignore|translated|translator|trc|tree|trg|trg\_acai\_br\_iud|trg\_pdo\_br\_iud|trg\_scapr\_br\_iu|trg\_sfmtr\_br\_iu|trg\_sfotr\_br\_iu|trm|ts\b|tscproj|TST|TTF|txt104867643|txt104892225|txt104892229|txt104893892|txt104893898|txt22652|txt23819|txt23874|txt23884|txt25282|txt25419|TXT25441|txt25505|txt26932|txt28339|types|U1\b|U10|U2\b|U3\b|U4\b|U5\b|U6\b|U7\b|U8\b|U9\b|uef|ui\b|uml\_pck|universal|update|url|uwc|v20100518-1145|v20100520|v20100525|v20100715\_r361|v20100826-2143|v20100915|v20100916-0100-e3x-7F7e1FC7sReRSnX-DReRRMHHM|v201011031800-7B7DFEsF7RZHOaLfS0KpNJ|v201011031800-7C78FPlEdhO\_oXWe4k\_elbMgT1Lq|v20110131-0100-e3x-777728s73543D6D135332KAv|v20110131-0100-e3x-7D774BgJ9DI9YFV4CICE2KAv|v20110131-0100-e3x-7D7F-AkF7B77V7c37B775eMp|v20110131-0100-e3x-7G7J5BgJ9EC9QKN49E997JSm|v20110131-0100-e3x-7Z7f7AFBBoPbVQ7iNFebXJDypa|v20110131-0200-e3x-4327w31221242012110f4y|v201102101200|v201102160550-7C7OFXcF7RZHQKIkMxMsW\_|v201102160550-7H7AFUWDxumQJOi9ghcTb5YgkwEZ|v201102161624-50FXNAkF7B77RBgFFBF|v3\_8\_2\_v20100427-1100|v4\_8\_1\_v20100427-1100|validation|vb\b|vbs|vcf|vch|vdx|version|view|vm\b|vmd|vmdk|vmsd|vmx|vmxf|VOB|vsd|vsdx|vss|vst|vw\b|vws|W\b|W version|war|wbk|wdf|we\b|web|webp|webservice|wft|wk3|wlLibs|wls\_security|wlsjsps|woff|woff2|workbench|workspace|wpl|wrp|wsdd|wsdl|wse|wwv\_setting[1]|xaml|xap|xcent|xcf|xcfg|xcodeproj|xconf|xdo|xhtml|xib|xjb|xlf|xlk|xlsb|xlsx alias|xlw|xmi|xml|xml&clicktrax=False|xml\_10120|XMLEncoder|XMLHandler|xpdl|xpi|xps|xsd|xsl|xslfo|xslt|Y\b|yaml|yardopts|yml|zdl|zip|zzz)$|.*Status.*Report.*|.*Components.*\.xl.*|.*Timesheet.*|~\.*|package\.html|overview\.html|license\.txt|version\.txt|oracle\-faces\-version\.txt|.*compile.*

Discussion

  • Nam-Quang Tran

    Nam-Quang Tran - 2017-08-25

    Hi,

    Request noted. However, as stated in the FAQ, the project is currently not being actively developed, so only bugfixes will be added, not features like this one.

    Best regards
    q:-) <= Quang

     
  • Zhengmian Hu

    Zhengmian Hu - 2018-05-07

    @John Fraser Actually, you can use regex itself to implement NOT logic. For example, if you want to filter out all files except test_file1.txt, you can simply write ^(?!test_file1\.txt$).*$.
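
    The same negative-lookahead trick can be extended from a single filename to a short list of wanted suffixes, which is the case described in the original request. The following is only a sketch in Python: the suffix list and the helper names `exclude_all_but` and `is_excluded` are made up for illustration, and it assumes the exclude pattern is tested against the whole file name. The generated pattern uses only constructs (`(?i)`, lookahead, non-capturing groups) that Java regex also supports, but it has not been tried in DocFetcher itself.

```python
import re


def exclude_all_but(suffixes):
    """Build an exclude regex that matches every name NOT ending in one of the suffixes."""
    alternation = "|".join(re.escape(s) for s in suffixes)
    # The negative lookahead rejects the match at position 0 whenever the
    # name ends in ".<allowed suffix>"; every other name is matched by ".*".
    return re.compile(rf"(?i)^(?!.*\.(?:{alternation})$).*$")


# Hypothetical include list; replace with the suffixes worth indexing.
pattern = exclude_all_but(["doc", "docx", "pdf", "txt"])


def is_excluded(name):
    """Return True if the exclude pattern matches the whole file name."""
    return pattern.fullmatch(name) is not None


if __name__ == "__main__":
    for name in ["report.PDF", "build.jar", "notes.txt", "README"]:
        print(name, "excluded" if is_excluded(name) else "kept")
    print(pattern.pattern)  # the regex text that would go into the exclude field
```

    With an include list this short, the resulting exclude pattern stays a single line, instead of an alternation that has to name every unwanted suffix.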

     
  • Nam-Quang Tran

    Nam-Quang Tran - 2021-06-08
    • status: open --> closed
     
  • Nam-Quang Tran

    Nam-Quang Tran - 2021-06-08

    The include rule has been added in DocFetcher Pro.

     
