The Python Oracle

.gitignore style fnmatch()

Become part of the top 3% of the developers by applying to Toptal https://topt.al/25cXVn

--

Music by Eric Matyas
https://www.soundimage.org
Track title: Romantic Lands Beckon

--

Chapters
00:00 Question
00:47 Accepted answer (Score 9)
01:22 Answer 2 (Score 22)
01:53 Thank you

--

Full question
https://stackoverflow.com/questions/1004...

Question links:
[http://docs.python.org/library/fnmatch.h...]: http://docs.python.org/library/fnmatch.h...
[http://docs.python.org/py3k/library/glob...]: http://docs.python.org/py3k/library/glob...
https://github.com/miohtama/Krusovice/bl...
http://linux.die.net/man/5/gitignore

Answer 1 links:
[pathspec]: https://pypi.python.org/pypi/pathspec
[documentation]: http://python-path-specification.readthe...
[code]: https://github.com/cpburnz/python-path-s...

--

Content licensed under CC BY-SA
https://meta.stackexchange.com/help/lice...

--

Tags
#python #regex #filenames #gitignore #fnmatch

#avk47



ANSWER 1

Score 26


There's a library called pathspec which implements the full .gitignore specification, including things like **/*.py; the documentation describes how to handle Git pattern matching (you can also see code).

>>> import pathspec
>>> spec_src = '**/*.pyc'
>>> spec = pathspec.PathSpec.from_lines(pathspec.patterns.GitWildMatchPattern, spec_src.splitlines())
>>> set(spec.match_files({"test.py", "test.pyc", "deeper/file.pyc", "even/deeper/file.pyc"}))
set(['test.pyc', 'even/deeper/file.pyc', 'deeper/file.pyc'])
>>> set(spec.match_tree("pathspec/"))
set(['__init__.pyc', 'gitignore.pyc', 'util.pyc', 'pattern.pyc', 'tests/__init__.pyc', 'tests/test_gitignore.pyc', 'compat.pyc', 'pathspec.pyc'])



ACCEPTED ANSWER

Score 9


If you want to use mixed UNIX wildcard patterns as listed in your .gitignore example, why not just take each pattern and use fnmatch.translate with re.search?

import fnmatch
import re

s = '/path/eggs/foo/bar'
pattern = "eggs/*"

re.search(fnmatch.translate(pattern), s)
# <_sre.SRE_Match object at 0x10049e988>

translate turns the wildcard pattern into a re pattern

Hidden UNIX files:

s = '/path/to/hidden/.file'
isHiddenFile = re.search(fnmatch.translate('.*'), s)
if not isHiddenFile:
    # do something with it