Implement custom memory management for internal and output allocations #94

Draft: jhalakpatel wants to merge 6 commits into main from jhalakp-igpu-allocator

Conversation

jhalakpatel (Collaborator):

No description provided.

std::unique_ptr<nvinfer1::IGpuAllocator> gpuAllocator(
new TensorRTLinearAllocator(kCURR_ALLOC_SIZE));
if (opts.useCustomGpuAllocator) {
builder->setGpuAllocator(gpuAllocator.get());
jhalakpatel (Collaborator, Author):

@christopherbate Here are the builder-side changes that allow a GPU allocator to be set on the builder. We probably do not need this; a custom allocator should only be required for runtime allocations.
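
For context, a linear (bump) allocator is the simplest way to back builder allocations from one upfront reservation. Below is a minimal sketch of what a TensorRTLinearAllocator could look like; it assumes the TensorRT 8.x-style nvinfer1::IGpuAllocator virtuals (allocate/deallocate) and power-of-two alignments no larger than cudaMalloc's 256-byte guarantee. It is an illustration, not this PR's implementation.

#include <NvInferRuntime.h>
#include <cuda_runtime_api.h>
#include <cstdint>

// Hypothetical sketch: one upfront cudaMalloc, bump-pointer allocation,
// and no individual frees until the allocator itself is destroyed.
class TensorRTLinearAllocator : public nvinfer1::IGpuAllocator {
public:
  explicit TensorRTLinearAllocator(uint64_t poolSize) : capacity(poolSize) {
    cudaMalloc(&pool, capacity);
  }
  ~TensorRTLinearAllocator() override { cudaFree(pool); }

  void *allocate(uint64_t size, uint64_t alignment,
                 nvinfer1::AllocatorFlags /*flags*/) noexcept override {
    // Assumes `alignment` is a power of two >= 1 and no larger than the
    // base alignment cudaMalloc already provides for `pool`.
    uint64_t aligned = (offset + alignment - 1) & ~(alignment - 1);
    if (pool == nullptr || aligned + size > capacity)
      return nullptr; // TensorRT interprets nullptr as allocation failure.
    offset = aligned + size;
    return static_cast<char *>(pool) + aligned;
  }

  // Individual frees are no-ops; the whole pool is released in the destructor.
  bool deallocate(void * /*memory*/) noexcept override { return true; }

private:
  void *pool = nullptr;
  uint64_t capacity = 0;
  uint64_t offset = 0;
};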

@jhalakpatel force-pushed the jhalakp-igpu-allocator branch 5 times, most recently from 842ad29 to dec20fe on August 15, 2024 22:54
private:
RuntimeSessionOptions options;
ExecutableView executable;

std::unique_ptr<PinnedMemoryAllocator> pinnedMemoryAllocator;
std::unique_ptr<AllocTracker> allocTracker;
std::unique_ptr<ResourceTracker> resourceTracker;

GpuAllocator* gpuAllocator;
jhalakpatel (Collaborator, Author):

It is unclear how to manage the allocator's lifetime here. Should I use a unique_ptr instead of the raw pointer?
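
In case it helps the discussion: a minimal sketch of the unique_ptr variant, assuming RuntimeSession should own the allocator outright (the constructor below is illustrative, not the actual one):

#include <memory>
#include <utility>

class RuntimeSession {
public:
  // Owning variant: the session takes the allocator and controls its
  // lifetime, so the pointer can never dangle relative to the session.
  RuntimeSession(RuntimeSessionOptions opts,
                 std::unique_ptr<GpuAllocator> allocator)
      : options(std::move(opts)), gpuAllocator(std::move(allocator)) {}

private:
  RuntimeSessionOptions options;
  std::unique_ptr<GpuAllocator> gpuAllocator; // owned; destroyed with the session
};

If callers need to share one allocator across several sessions, a non-owning raw pointer with a documented lifetime contract (or a std::shared_ptr) is the usual alternative.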

@@ -36,7 +36,7 @@ namespace mlirtrt::runtime {
 /// `main` function. It is assumed that `main` takes no arguments and returns an
 /// integer result (which is returned if the execution is successful).
 /// TODO: this should take a handle to a function for streaming output/errors.
-StatusOr<int64_t> runExecutorLuaScript(std::string_view luaScript);
+StatusOr<int64_t> runExecutorLuaScript(std::string_view luaScript, GpuAllocator* allocator);
jhalakpatel (Collaborator, Author):

Is it OK to pass the allocator through here?
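
A hypothetical call site under the new signature, for illustration (CustomGpuAllocator stands in for any concrete GpuAllocator subclass; it is not a name from this PR):

#include <memory>

// The caller keeps ownership and must keep the allocator alive until the
// script has finished executing; the pointer passed in is purely non-owning.
std::unique_ptr<GpuAllocator> allocator = std::make_unique<CustomGpuAllocator>();
StatusOr<int64_t> result = runExecutorLuaScript(luaScript, allocator.get());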

//! Class to allocate memory for outputs with data-dependent shapes. The sizes
//! of those are unknown, so pre-allocation is not possible.
//!
class OutputAllocator : public nvinfer1::IOutputAllocator {
jhalakpatel (Collaborator, Author):

I might add this in a separate PR.
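
For reference, a hedged sketch of how this pattern usually looks against nvinfer1::IOutputAllocator (available since TensorRT 8.5): TensorRT calls reallocateOutput at enqueue time once the data-dependent size is known, and notifyShape afterwards with the final dimensions. The grow-only buffer below is an illustration, not this PR's code; members are public only for brevity.

#include <NvInferRuntime.h>
#include <cuda_runtime_api.h>
#include <cstdint>

class OutputAllocator : public nvinfer1::IOutputAllocator {
public:
  void *reallocateOutput(char const * /*tensorName*/, void * /*currentMemory*/,
                         uint64_t size, uint64_t /*alignment*/) noexcept override {
    // Grow-only: reuse the buffer if it is already large enough. Assumes
    // cudaMalloc's 256-byte alignment satisfies TensorRT's request.
    if (size <= capacity)
      return buffer;
    cudaFree(buffer);
    if (cudaMalloc(&buffer, size) != cudaSuccess) {
      buffer = nullptr;
      capacity = 0;
      return nullptr; // nullptr tells TensorRT the allocation failed.
    }
    capacity = size;
    return buffer;
  }

  // Called after execution with the tensor's final, data-dependent shape.
  void notifyShape(char const * /*tensorName*/,
                   nvinfer1::Dims const &dims) noexcept override {
    finalDims = dims;
  }

  ~OutputAllocator() override { cudaFree(buffer); }

  void *buffer = nullptr;
  uint64_t capacity = 0;
  nvinfer1::Dims finalDims{};
};

An instance would be attached per output tensor via IExecutionContext::setOutputAllocator before enqueue.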

@jhalakpatel force-pushed the jhalakp-igpu-allocator branch 3 times, most recently from 1ddb4b0 to 7b9a0f7 on August 16, 2024 22:10
class GpuAllocator {
public:
GpuAllocator() = default;
virtual ~GpuAllocator() = default;

virtual StatusOr<void *> reallocate(void *baseAddr, uint64_t alignment,
jhalakpatel (Collaborator, Author):

Temporarily remove it to simplify testing.
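
If reallocate is dropped, the interface presumably reduces to an allocate/deallocate pair. A sketch of that simplified surface, with the caveat that the exact parameters below are assumptions rather than the PR's real signatures:

class GpuAllocator {
public:
  GpuAllocator() = default;
  virtual ~GpuAllocator() = default;

  // Return a device pointer of at least `size` bytes at the requested
  // power-of-two `alignment`, or an error Status on failure.
  virtual StatusOr<void *> allocate(uint64_t size, uint64_t alignment) = 0;

  // Release memory previously handed out by allocate().
  virtual Status deallocate(void *memory) = 0;
};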

@jhalakpatel force-pushed the jhalakp-igpu-allocator branch 3 times, most recently from 7b48086 to 0e094ec on August 20, 2024 20:28
jhalakpatel (Collaborator, Author):

ninja -C build/mlir-tensorrt check-mlir-executor

ninja: Entering directory `build/mlir-tensorrt'
[0/2] Re-checking globbed directories...
[0/2] Running the mlir-executor regression tests

Testing Time: 1.60s

Total Discovered Tests: 26
  Unsupported:  1 (3.85%)
  Passed     : 25 (96.15%)

ninja -C build/mlir-tensorrt check-mlir-tensorrt-dialect

ninja: Entering directory `build/mlir-tensorrt'
[0/2] Re-checking globbed directories...
[0/2] Running the mlir-tensorrt-dialect regression tests

Testing Time: 0.10s

Total Discovered Tests: 16
  Unsupported:  1 (6.25%)
  Passed     : 15 (93.75%)

ninja -C build/mlir-tensorrt check-mlir-tensorrt

ninja: Entering directory `build/mlir-tensorrt'
[0/2] Re-checking globbed directories...
[2/4] Running the mlir-tensorrt regression tests
Parallelism Groups: {'non-collective': 5, 'collective': 1, 'models': 1}
llvm-lit: /workspaces/TensorRT-Incubator/mlir-tensorrt/llvm-project/llvm/utils/lit/lit/discovery.py:250: warning: test suite 'MLIR-TensorRT-Unit' contained no tests

Testing Time: 10.97s

Total Discovered Tests: 193
  Unsupported: 99 (51.30%)
  Passed     : 94 (48.70%)

1 warning(s) in tests

@jhalakpatel force-pushed the jhalakp-igpu-allocator branch 5 times, most recently from 06102b6 to 94cbb5a on August 22, 2024 20:46
@jhalakpatel changed the title from "Add IGpuAllocator to MLIR-TensorRT" to "Implement custom memory management for internal and output allocations" on Aug 26, 2024