Commit Graph

320 Commits (44d4742570c1511e7ce304e2d1e4fad4e077111b)
 

Author SHA1 Message Date
Mike Gerber 44d4742570 🚧 GitHub Actions: Test report 11 months ago
Mike Gerber f4dff64066 🚧 GitHub Actions: Test report 11 months ago
Mike Gerber 043dbb1749 🚧 GitHub Actions: Test report 11 months ago
Mike Gerber 207fcd4054 🚧 GitHub Actions: Test report 11 months ago
Mike Gerber f7c280e59c 🚧 GitHub Actions: Try shell for loop to install from all requirements*.txt 11 months ago
Mike Gerber d42e59846c 🚧 GitHub Actions: Try shell for loop to install from all requirements*.txt 11 months ago
Mike Gerber ab1359c7e6 🚧 GitHub Actions: Rework test, run in src/ 11 months ago
Mike Gerber 77bcecd2d0 🚧 GitHub Actions: Allow running test manually 11 months ago
Mike Gerber c1b8d983e6 🚧 GitHub Actions: Rename test workflow, also run on schedule 11 months ago
Mike Gerber 674d833b15 🚧 GitHub Actions: Add test worklow 11 months ago
Mike Gerber caef84cebb 🚧 GitHub Actions: Add release workflow 11 months ago
Mike Gerber 1dad18909c 🧹 Make dinglehopper.* exports explicit 11 months ago
Mike Gerber bc6754d0cb ⚙ ruff: Ignore F811 (no redefinitions) for now, as ruff considers the multimethods redefinitions 11 months ago
Mike Gerber e4431797e6 🎨 Reformat comments + strings manually (not auto-fixed by Black) 11 months ago
Mike Gerber 704e7cca1c ⬆ Use f-strings 11 months ago
Mike Gerber bea56117ae 🎨 Reformat using Black 11 months ago
Mike Gerber d50d624554 🎨 Sort imports (auto-fixed by ruff) 11 months ago
Mike Gerber 5b20fb24a1 ⚙ Add pre-commit 11 months ago
Mike Gerber 32bd1896e0 🛠 Replace flake8 + pylint with ruff
As ruff is a lot faster than the other options, use this for code style checks etc. This
change also removes setup.cfg (See also: gh-85).
11 months ago
Mike Gerber e8e58e76c4 ⚙ Move mypy settings to pyproject.toml 11 months ago
Mike Gerber 84a05170ba ⚙ pytest.ini → pyproject.toml 11 months ago
Mike Gerber 69325facf2 🐛 Detect encoding (incl BOM) when reading files
As @imlabormitlea-code reported in gh-79, dinglehopper did not handle text files with
BOM well. Fix this by using chardet to detect an encoding, which also detects the BOM
and use the proper encoding to read the files, not including the BOM in the resulting
extracted text.

Fixes gh-80.
11 months ago
Mike Gerber 325e5af5f5 🐛 Move source into src/ to fix install
Installing was broken since moving to pyproject.toml, which we didn't notice because of
leftover files in build/. Fix this by using the convention of having the source files
in src/ and adjusting pyproject.toml accordingly.

Fixes gh-86. 🤞
11 months ago
Mike Gerber db7c051b22 ⚙ Migrate to pyproject.toml 11 months ago
Mike Gerber fc81233a0e 🚧 CircleCI: Run black 11 months ago
Mike Gerber cb0134d2db 🚧 CircleCI: Run black 11 months ago
Mike Gerber 55d534b981 🚧 CircleCI: Run black 11 months ago
Mike Gerber 2632cb09b8 🚧 CircleCI: Run black 11 months ago
Mike Gerber 35be58cb94
Merge pull request #83 from INL/feat/batch-processing
Add batch processing and report summaries
1 year ago
Mike Gerber 6d3a8cecd2
Merge pull request #82 from CircleCI-config-suggestions-bot/StoreTestResults
Update .circleci/config.yml to use store_test_results
1 year ago
Ruud de Jong 207804e6a6 Add batch processing and report summaries 1 year ago
CircleCI Config Suggestions Bot 89814cbe4b Upload test results to CircleCI 1 year ago
neingeist dd9303b429 🧹 .gitignore .python-version (for pyenv) 1 year ago
Mike Gerber f1fc3f1880 🧹 Remove qurator. namespace prefix 1 year ago
Mike Gerber f668963a2e 🐛 Fix installing by calling find_namespace_packages in setup.py
Turns out just removing __init__.py is not enough for native namespace
packages. We also need to (explicitly) call setuptools.find_namespace_packages()
for setup.py to find the package...

https://packaging.python.org/en/latest/guides/packaging-namespace-packages/#native-namespace-packages

Fixes gh-77.
1 year ago
Mike Gerber c4ab7c9a7c 🕸Do not use deprecated ID, pageId options
See gh-75.
1 year ago
Mike Gerber b4ac24ac9d 🔧 Remove explicit namespace_packages
Fixes gh-76.
1 year ago
Mike Gerber 2a090c9b5a ✔ CircleCI: Explicitly install binary opencv-python-headless (dep of OCR-D?) to avoid compilation 1 year ago
Mike Gerber 833efa37da 🐛 Remove deprecated declare_namespace call
Remove depecreated declare_namespace call and use implicit namespace (PEP-0420).

Fixes gh-76.
1 year ago
Gerber, Mike 0fd4ea1973 ✔ Add @cneud's former 40 GB problem files to the test suite 1 year ago
Gerber, Mike 0f0819512e 🎨 Reformat using Black 1 year ago
Gerber, Mike 2268f32a78 ✔ CircleCI: Test on Python 3.11 1 year ago
Gerber, Mike dcc10c5389 ✔️ Skip test_lines_similar() for now
test_lines_similar() fails with rapidfuzz 2.5 and is flawed anyway:

The test was based on our own implementation that used __eq__ and not __hash__ as
rapidfuzz does. Need to review this in the future.
2 years ago
Gerber, Mike 555f586775 📝 Note that old terminals might not render the Unicode characters correctly 2 years ago
Gerber, Mike c4e85da5ab 🐛 Update editops() and seq_align() due to RapidFuzz API changes 2 years ago
Gerber, Mike 15dfbac3a7 Revert "Revert "Merge pull request #67 from maxbachmann/rapidfuzz""
This reverts commit 76bd50f1db.
2 years ago
Gerber, Mike ede9402a6c Revert "💩 Stick with rapidfuzz < 2.1.0 for now"
This reverts commit 0e153db9ca.
2 years ago
Gerber, Mike 0e153db9ca 💩 Stick with rapidfuzz < 2.1.0 for now 2 years ago
Gerber, Mike 76bd50f1db Revert "Merge pull request #67 from maxbachmann/rapidfuzz"
This reverts commit 85f751aacc, reversing
changes made to 1febea8c92.
2 years ago
Mike Gerber 85f751aacc
Merge pull request #67 from maxbachmann/rapidfuzz
replace usage of deprecated rapidfuzz APIs
2 years ago