[CORE-8933] Setup for translation scheduling port #25126

oleiman · 2025-02-21T00:07:35Z

Datalake API changes setting us up for porting the translation path to the new scheduler infrastructure. As written, should be functionally equivalent to current world, but some APIs have changes shape significantly.

Pulled form #25077

Backports Required

Release Notes

none

Signed-off-by: Oren Leiman <[email protected]>

This method loops through the column writers to check if any of them are flush worthy, computes the memory usage in the same loop. Useful for a latter commit that avoids this loop again and needs stats right after append. Signed-off-by: Oren Leiman <[email protected]>

The interface implementations keep track of the current memory used by the writer and related reservations. Signed-off-by: Oren Leiman <[email protected]>

Adds the following - flush() - flushes all the buffered bytes to the output stream - methods to fetch buffered/flushed bytes Signed-off-by: Oren Leiman <[email protected]>

Signed-off-by: Oren Leiman <[email protected]>

.. instead of lazy_abort_source. To be used later, they are both connected anyway. Signed-off-by: Oren Leiman <[email protected]>

oleiman · 2025-02-21T00:54:44Z

/ci-repeat 1

vbotbuildovich · 2025-02-21T04:45:41Z

CI test results

test results on build#62085

test_id	test_kind	job_url	test_status	passed
rptest.tests.compaction_recovery_test.CompactionRecoveryTest.test_index_recovery	ducktape	https://buildkite.com/redpanda/redpanda/builds/62085#01952647-e283-4c18-906c-3c6f633be00a	FLAKY	1/3
rptest.tests.compaction_recovery_test.CompactionRecoveryUpgradeTest.test_index_recovery_after_upgrade	ducktape	https://buildkite.com/redpanda/redpanda/builds/62085#01952647-e280-4288-a8da-1db982070470	FLAKY	1/2
rptest.tests.consumer_group_recovery_test.ConsumerOffsetsRecoveryTest.test_consumer_offsets_partition_recovery	ducktape	https://buildkite.com/redpanda/redpanda/builds/62085#01952652-52c1-4643-b33a-6af7c6e0b332	FLAKY	1/2
rptest.tests.datalake.mount_unmount_test.MountUnmountIcebergTest.test_simple_unmount.cloud_storage_type=CloudStorageType.S3	ducktape	https://buildkite.com/redpanda/redpanda/builds/62085#01952647-e282-4087-b4b4-d9a1880002a6	FLAKY	1/2
rptest.tests.e2e_shadow_indexing_test.ShadowIndexingWhileBusyTest.test_create_or_delete_topics_while_busy.short_retention=True.cloud_storage_type=CloudStorageType.ABS	ducktape	https://buildkite.com/redpanda/redpanda/builds/62085#01952647-e283-4c18-906c-3c6f633be00a	FLAKY	1/2
rptest.transactions.consumer_offsets_test.VerifyConsumerOffsetsThruUpgrades.test_consumer_group_offsets.versions_to_upgrade=1	ducktape	https://buildkite.com/redpanda/redpanda/builds/62085#01952652-52c2-4cc3-8e9e-0a766971b2f6	FLAKY	1/2

test results on build#62166

test_id	test_kind	job_url	test_status	passed
rptest.tests.compaction_recovery_test.CompactionRecoveryUpgradeTest.test_index_recovery_after_upgrade	ducktape	https://buildkite.com/redpanda/redpanda/builds/62166#0195352a-f192-4d4e-a367-861d49d7f92e	FLAKY	1/2
rptest.tests.data_migrations_api_test.DataMigrationsApiTest.test_migrated_topic_data_integrity.transfer_leadership=True.params=.cancellation.None.use_alias.False	ducktape	https://buildkite.com/redpanda/redpanda/builds/62166#0195353f-12fa-4368-bee5-352243506ae8	FLAKY	1/2

oleiman · 2025-02-21T05:06:17Z

CI Failure:

ducktape-build-release looks spurious? I don't see any test failures

andrwng

Nice! Thanks for pulling this out

andrwng · 2025-02-21T06:44:14Z

src/v/datalake/translation/deps.h

@@ -0,0 +1,24 @@
+/*


Curious, what's the intuition for things that go in here?

Not 100% on details to give a one sentence description, but the bulk of it is here: 82bcd17

andrwng · 2025-02-21T21:12:50Z

src/v/datalake/translation/partition_translator.cc

@@ -303,11 +303,6 @@ partition_translator::do_translation_for_range(
    const auto& ntp = _partition->ntp();
    auto remote_path_prefix = remote_path{
      fmt::format("{}/{}/{}", iceberg_data_path_prefix, ntp.path(), _term)};
-    lazy_abort_source las{[this] {
-        return can_continue() ? std::nullopt


Just curious, this used to account for term changes. Is that still covered with _as?

I think the answer is that it'll be up to the scheduler/executor to abort during term changes via this abort source

yeah i believe that's right

src/v/datalake/record_multiplexer.h

src/v/datalake/tests/translation_task_test.cc

rockwotj

Looks great, just one comment which could be a followup

rockwotj · 2025-02-22T18:17:08Z

src/v/datalake/serde_parquet_writer.h

+    size_t buffered_bytes() const final;
+
+    size_t flushed_bytes() const final;


Should we just amend the interface of the parquet writer to be the same? So we don't have to duplicate the state in both layers?

yeah makes sense to me. will do in a follow up so I can boy scout it a little bit.

Currently multiplexer is a one shot class with pattern as follows mux = create_mux(); co_await reader.consume(mux...) With the new changes, we want multiplexer to multiplex across scheduling iterations and release resouces inbetween. This commit makes changes to the API support this port. The new pattern would look something like this.. mux = create_mux(); mux.multiplex(reader1...) mux.flush_writers(); // optional mux.multiplex(reader2..) mux.flush_writers(); // optional ... ... result = co_await std::move(mux).finish(); The ability to temporarily flush all the intermediate state and multiplex across multiple readers enables porting to the new scheduler API. Signed-off-by: Oren Leiman <[email protected]>

Make the task long running to support batching of data across multiple iterations of scheduler Signed-off-by: Oren Leiman <[email protected]>

Signed-off-by: Oren Leiman <[email protected]>

oleiman self-assigned this Feb 21, 2025

github-actions bot added area/build area/redpanda labels Feb 21, 2025

bharathv added 5 commits February 20, 2025 16:07

utils/null_output_stream: move definitions into source file

8f4489c

Signed-off-by: Oren Leiman <[email protected]>

dl/writer: introduce writer_mem_tracker interface

7ded21e

The interface implementations keep track of the current memory used by the writer and related reservations. Signed-off-by: Oren Leiman <[email protected]>

dl/parquet_writer: enhance writer interface

961d4e3

Adds the following - flush() - flushes all the buffered bytes to the output stream - methods to fetch buffered/flushed bytes Signed-off-by: Oren Leiman <[email protected]>

dl/writer: add a temporary noop_mem_tracker

8796c9a

Signed-off-by: Oren Leiman <[email protected]>

oleiman force-pushed the dlib/core-8933/translator-scheduling-port-foundation branch from b929834 to 2098d37 Compare February 21, 2025 00:11

oleiman marked this pull request as ready for review February 21, 2025 00:26

bharathv added 2 commits February 20, 2025 16:51

dl/writer: wireup writer_mem_tracker with local writer

41bb390

Signed-off-by: Oren Leiman <[email protected]>

dl/translation/task: pass an abort_source

c49c123

.. instead of lazy_abort_source. To be used later, they are both connected anyway. Signed-off-by: Oren Leiman <[email protected]>

oleiman force-pushed the dlib/core-8933/translator-scheduling-port-foundation branch from 2098d37 to f2b3275 Compare February 21, 2025 00:51

oleiman requested review from andrwng, nvartolomei, rockwotj and bharathv February 21, 2025 04:02

andrwng previously approved these changes Feb 21, 2025

View reviewed changes

rockwotj reviewed Feb 22, 2025

View reviewed changes

rockwotj previously approved these changes Feb 22, 2025

View reviewed changes

bharathv added 3 commits February 22, 2025 19:46

dl/partition_translator/task: make translation_task long running

a2330f4

Make the task long running to support batching of data across multiple iterations of scheduler Signed-off-by: Oren Leiman <[email protected]>

dl/translation/tests: coverage for long running translation/mux

ffdcfe0

Signed-off-by: Oren Leiman <[email protected]>

oleiman dismissed stale reviews from rockwotj and andrwng via ffdcfe0 February 23, 2025 22:14

oleiman force-pushed the dlib/core-8933/translator-scheduling-port-foundation branch from f2b3275 to ffdcfe0 Compare February 23, 2025 22:14

oleiman requested a review from andrwng February 24, 2025 05:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CORE-8933] Setup for translation scheduling port #25126

[CORE-8933] Setup for translation scheduling port #25126

oleiman commented Feb 21, 2025

oleiman commented Feb 21, 2025

vbotbuildovich commented Feb 21, 2025 •

edited

Loading

oleiman commented Feb 21, 2025

andrwng left a comment

andrwng Feb 21, 2025

oleiman Feb 23, 2025

andrwng Feb 21, 2025

andrwng Feb 21, 2025

oleiman Feb 23, 2025

rockwotj left a comment

rockwotj Feb 22, 2025

oleiman Feb 23, 2025

		size_t buffered_bytes() const final;

		size_t flushed_bytes() const final;

[CORE-8933] Setup for translation scheduling port #25126

Are you sure you want to change the base?

[CORE-8933] Setup for translation scheduling port #25126

Conversation

oleiman commented Feb 21, 2025

Backports Required

Release Notes

oleiman commented Feb 21, 2025

vbotbuildovich commented Feb 21, 2025 • edited Loading

CI test results

oleiman commented Feb 21, 2025

andrwng left a comment

Choose a reason for hiding this comment

andrwng Feb 21, 2025

Choose a reason for hiding this comment

oleiman Feb 23, 2025

Choose a reason for hiding this comment

andrwng Feb 21, 2025

Choose a reason for hiding this comment

andrwng Feb 21, 2025

Choose a reason for hiding this comment

oleiman Feb 23, 2025

Choose a reason for hiding this comment

rockwotj left a comment

Choose a reason for hiding this comment

rockwotj Feb 22, 2025

Choose a reason for hiding this comment

oleiman Feb 23, 2025

Choose a reason for hiding this comment

vbotbuildovich commented Feb 21, 2025 •

edited

Loading