Commit graph

9 commits

Author SHA1 Message Date
Robert Sachunsky
27f43c175f Merge branch 'main' into ro-fixes and resolve conflicts…
major conflicts resolved manually:

- branches for non-`light` segmentation already removed in main
- Keras/TF setup and no TF1 sessions, esp. in new ModelZoo
- changes to binarizer and its CLI (`mode`, `overwrite`, `run_single()`)
- writer: `build...` w/ kwargs instead of positional
- training for segmentation/binarization/enhancement tasks:
  * drop unused `generate_data_from_folder()`
  * simplify `preprocess_imgs()`: turn `preprocess_img()`, `get_patches()`
    and `get_patches_num_scale_new()` into generators, only writing
    result files in the caller (top-level loop) instead of passing
    output directories and file counter
- training for new OCR task:
  * `train`: put keys into additional `config_params` where they belong
    (or conditioned under existing keys, respectively), and w/ better
    documentation
  * `train`: add new keys as kwargs to `run()` to make them usable
  * `utils`: instead of the custom data loader `data_gen_ocr()`, re-use
    the existing `preprocess_imgs()` (for cfg capture and top-level loop),
    but extended w/ new kwargs and calling the new `preprocess_img_ocr()`;
    the latter is now a single-image generator (also much simplified)
  * `train`: use tf.data loader pipeline from that generator w/ standard
    mechanisms for batching, shuffling, prefetching etc.
  * `utils` and `train`: instead of `vectorize_label`, use `Dataset.padded_batch`
  * add TensorBoard callback and re-use our checkpoint callback
  * also use standard Keras top-level loop for training
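
  The generator-plus-`tf.data` pipeline described above can be sketched
  roughly as follows. This is a minimal sketch, not the actual code: the
  stand-in generator, tensor shapes, and batch size are assumptions for
  illustration only.

```python
import tensorflow as tf

def sample_generator():
    # Stand-in for the single-image generator (cf. preprocess_img_ocr()):
    # yields a fixed-size line image and a variable-length label sequence.
    for length in (3, 5, 2):
        image = tf.zeros((32, 32, 1))
        label = tf.range(length, dtype=tf.int64)
        yield image, label

dataset = (
    tf.data.Dataset.from_generator(
        sample_generator,
        output_signature=(
            tf.TensorSpec(shape=(32, 32, 1), dtype=tf.float32),
            tf.TensorSpec(shape=(None,), dtype=tf.int64),
        ),
    )
    .shuffle(buffer_size=16)
    # padded_batch pads the variable-length labels per batch,
    # replacing the custom vectorize_label step
    .padded_batch(2)
    .prefetch(tf.data.AUTOTUNE)
)

for images, labels in dataset:
    print(images.shape, labels.shape)
```

  In `train`, such a dataset would then feed the standard Keras top-level
  loop, with the TensorBoard and checkpoint callbacks passed via the
  `callbacks` argument of `model.fit`.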

still problematic (substantially unresolved):
- `Patches` now only w/ fixed implicit size
  (ignoring training config params)
- `PatchEncoder` now only w/ fixed implicit num patches and projection dim
  (ignoring training config params)
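
  To illustrate the unresolved issue: a `Patches` layer of the usual
  ViT-style shape, with the patch size effectively hardcoded rather than
  taken from the training config. Class and parameter names here are
  assumptions for illustration, not the actual implementation.

```python
import tensorflow as tf

class Patches(tf.keras.layers.Layer):
    def __init__(self, patch_size=16):
        # patch_size is currently an implicit fixed default;
        # wiring it to the training config params would resolve the issue
        super().__init__()
        self.patch_size = patch_size

    def call(self, images):
        patches = tf.image.extract_patches(
            images=images,
            sizes=[1, self.patch_size, self.patch_size, 1],
            strides=[1, self.patch_size, self.patch_size, 1],
            rates=[1, 1, 1, 1],
            padding="VALID",
        )
        # flatten the spatial grid of patches into a sequence
        return tf.reshape(
            patches, [tf.shape(images)[0], -1, patches.shape[-1]]
        )

x = tf.zeros((1, 64, 64, 3))
print(Patches(16)(x).shape)  # (1, 16, 768)
```

  A `PatchEncoder` would have the analogous problem with its implicit
  number of patches and projection dimension.
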
2026-02-07 14:05:56 +01:00
Robert Sachunsky
0d3a8eacba improve/update docs/train.md 2026-02-05 17:12:48 +01:00
Robert Sachunsky
6a81db934e improve docs/train.md 2026-01-29 03:01:57 +01:00
cneud
e5254dc6c5 integrate training docs 2025-10-20 22:39:54 +02:00
kba
f60e0543ab training: update docs 2025-10-01 19:16:58 +02:00
kba
733af1e9a7 📝 update train/README.md, align with docs/train.md 2025-10-01 17:43:32 +02:00
kba
56c4b7af88 📝 align pre-merge docs/train.md with former upstream train.md syntactically 2025-09-29 14:59:41 +02:00
kba
3123add815 📝 update README 2025-09-26 15:07:32 +02:00
cneud
e9fa691308 add model and training documentation 2025-03-27 22:41:10 +01:00