Mike Gerber
cf130fe3a8
🚧 GitHub Actions: Disable matrix fail-fast
1 year ago
Mike Gerber
915a647949
🚧 GitHub Actions: Disable matrix fail-fast
1 year ago
Mike Gerber
6b05290640
🚧 GitHub Actions: Test on multiple Python versions
1 year ago
Mike Gerber
f909806f9e
🚧 GitHub Actions: Test report
1 year ago
Mike Gerber
f61b81c98e
🚧 GitHub Actions: Test report
1 year ago
Mike Gerber
44d4742570
🚧 GitHub Actions: Test report
1 year ago
Mike Gerber
f4dff64066
🚧 GitHub Actions: Test report
1 year ago
Mike Gerber
043dbb1749
🚧 GitHub Actions: Test report
1 year ago
Mike Gerber
207fcd4054
🚧 GitHub Actions: Test report
1 year ago
Mike Gerber
f7c280e59c
🚧 GitHub Actions: Try shell for loop to install from all requirements*.txt
1 year ago
Mike Gerber
d42e59846c
🚧 GitHub Actions: Try shell for loop to install from all requirements*.txt
1 year ago
Mike Gerber
ab1359c7e6
🚧 GitHub Actions: Rework test, run in src/
1 year ago
Mike Gerber
77bcecd2d0
🚧 GitHub Actions: Allow running test manually
1 year ago
Mike Gerber
c1b8d983e6
🚧 GitHub Actions: Rename test workflow, also run on schedule
1 year ago
Mike Gerber
674d833b15
🚧 GitHub Actions: Add test worklow
1 year ago
Mike Gerber
caef84cebb
🚧 GitHub Actions: Add release workflow
1 year ago
Mike Gerber
1dad18909c
🧹 Make dinglehopper.* exports explicit
1 year ago
Mike Gerber
bc6754d0cb
⚙ ruff: Ignore F811 (no redefinitions) for now, as ruff considers the multimethods redefinitions
1 year ago
Mike Gerber
e4431797e6
🎨 Reformat comments + strings manually (not auto-fixed by Black)
1 year ago
Mike Gerber
704e7cca1c
⬆ Use f-strings
1 year ago
Mike Gerber
bea56117ae
🎨 Reformat using Black
1 year ago
Mike Gerber
d50d624554
🎨 Sort imports (auto-fixed by ruff)
1 year ago
Mike Gerber
5b20fb24a1
⚙ Add pre-commit
1 year ago
Mike Gerber
32bd1896e0
🛠 Replace flake8 + pylint with ruff
...
As ruff is a lot faster than the other options, use this for code style checks etc. This
change also removes setup.cfg (See also: gh-85).
1 year ago
Mike Gerber
e8e58e76c4
⚙ Move mypy settings to pyproject.toml
1 year ago
Mike Gerber
84a05170ba
⚙ pytest.ini → pyproject.toml
1 year ago
Mike Gerber
69325facf2
🐛 Detect encoding (incl BOM) when reading files
...
As @imlabormitlea-code reported in gh-79, dinglehopper did not handle text files with
BOM well. Fix this by using chardet to detect an encoding, which also detects the BOM
and use the proper encoding to read the files, not including the BOM in the resulting
extracted text.
Fixes gh-80.
1 year ago
Mike Gerber
325e5af5f5
🐛 Move source into src/ to fix install
...
Installing was broken since moving to pyproject.toml, which we didn't notice because of
leftover files in build/. Fix this by using the convention of having the source files
in src/ and adjusting pyproject.toml accordingly.
Fixes gh-86. 🤞
1 year ago
Mike Gerber
db7c051b22
⚙ Migrate to pyproject.toml
1 year ago
Mike Gerber
fc81233a0e
🚧 CircleCI: Run black
1 year ago
Mike Gerber
cb0134d2db
🚧 CircleCI: Run black
1 year ago
Mike Gerber
55d534b981
🚧 CircleCI: Run black
1 year ago
Mike Gerber
2632cb09b8
🚧 CircleCI: Run black
1 year ago
Mike Gerber
35be58cb94
Merge pull request #83 from INL/feat/batch-processing
...
Add batch processing and report summaries
2 years ago
Mike Gerber
6d3a8cecd2
Merge pull request #82 from CircleCI-config-suggestions-bot/StoreTestResults
...
Update .circleci/config.yml to use store_test_results
2 years ago
Ruud de Jong
207804e6a6
Add batch processing and report summaries
2 years ago
CircleCI Config Suggestions Bot
89814cbe4b
Upload test results to CircleCI
2 years ago
neingeist
dd9303b429
🧹 .gitignore .python-version (for pyenv)
2 years ago
Mike Gerber
f1fc3f1880
🧹 Remove qurator. namespace prefix
2 years ago
Mike Gerber
f668963a2e
🐛 Fix installing by calling find_namespace_packages in setup.py
...
Turns out just removing __init__.py is not enough for native namespace
packages. We also need to (explicitly) call setuptools.find_namespace_packages()
for setup.py to find the package...
https://packaging.python.org/en/latest/guides/packaging-namespace-packages/#native-namespace-packages
Fixes gh-77.
2 years ago
Mike Gerber
c4ab7c9a7c
🕸Do not use deprecated ID, pageId options
...
See gh-75.
2 years ago
Mike Gerber
b4ac24ac9d
🔧 Remove explicit namespace_packages
...
Fixes gh-76.
2 years ago
Mike Gerber
2a090c9b5a
✔ CircleCI: Explicitly install binary opencv-python-headless (dep of OCR-D?) to avoid compilation
2 years ago
Mike Gerber
833efa37da
🐛 Remove deprecated declare_namespace call
...
Remove depecreated declare_namespace call and use implicit namespace (PEP-0420).
Fixes gh-76.
2 years ago
Gerber, Mike
0fd4ea1973
✔ Add @cneud's former 40 GB problem files to the test suite
2 years ago
Gerber, Mike
0f0819512e
🎨 Reformat using Black
2 years ago
Gerber, Mike
2268f32a78
✔ CircleCI: Test on Python 3.11
2 years ago
Gerber, Mike
a18b25b163
🐛 Update tests for ExtractedText
...
In PR gh-72, @maxbachmann introduced a new argument for ExtractedText(). Update the
corresponding tests.
2 years ago
Max Bachmann
f48e305347
use uniseg again
2 years ago
Max Bachmann
d2bbc8a6c7
update rapidfuzz version
2 years ago