Add allocated CPU and GPU memory reporting #81

Open · wants to merge 1 commit into main

Conversation

jhalakpatel (Collaborator) commented Aug 9, 2024:

Add RuntimeClient Python bindings to report allocated CPU and GPU memory usage.

jhalakpatel marked this pull request as ready for review on August 9, 2024 at 21:52.
jhalakpatel added the enhancement (New feature or request) and mlir-tensorrt (Pull request for the mlir-tensorrt project) labels on Aug 9, 2024.
christopherbate (Collaborator) commented:

It looks fine, but without knowledge of the use case it's hard to review. This only gives you the amount of memory tracked by the RuntimeClient at any given point in time. I hope you wouldn't be calling this often, since it's being added up on the fly. If you need specific memory stats, we could build something a bit more robust.
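
One possible shape for the "something a bit more robust" mentioned above would be to keep running totals that are updated when the tracker starts and stops tracking an allocation, so a query does not have to walk the map. The sketch below is purely illustrative; the MemoryStats type, its onTrack/onUntrack hooks, and the bytes/onDevice parameters are assumptions, not code from this PR.

#include <cstdint>

// Hypothetical running-total bookkeeping, not part of this PR. The idea is to
// update counters on the tracker's track/untrack paths so that reporting does
// not need to sum the whole allocation map on every call.
struct MemoryStats {
  uint64_t cpuBytes = 0; // bytes currently tracked in host allocations
  uint64_t gpuBytes = 0; // bytes currently tracked in device allocations

  // Called when the tracker begins tracking an internally managed allocation.
  void onTrack(uint64_t bytes, bool onDevice) {
    (onDevice ? gpuBytes : cpuBytes) += bytes;
  }

  // Called when the tracker releases/untracks that allocation.
  void onUntrack(uint64_t bytes, bool onDevice) {
    (onDevice ? gpuBytes : cpuBytes) -= bytes;
  }
};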

@@ -773,6 +773,9 @@ class AllocTracker {
/// Return true if the tracker's map contains `ptr`.
bool contains(uintptr_t ptr) const;

/// Report total CPU and GPU memory allocated by runtime client.
std::pair<int64_t, int64_t> reportAllocatedMemory() const;
christopherbate (Collaborator) commented:

There actually could be more types than just these two, so I'd prefer that we separate it into a struct or an array. The array could be indexed by all the potential values of PointerType.
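
A rough sketch of the struct/array shape suggested here. The PointerType enumerators and kNumPointerTypes below are stand-ins for the project's real definitions, which may differ.

#include <array>
#include <cstdint>

// Hypothetical stand-in for the project's PointerType enum.
enum class PointerType : unsigned { host = 0, pinned_host, device, unified, unknown };
constexpr unsigned kNumPointerTypes = 5;

// A report indexed by pointer type rather than a fixed (cpu, gpu) pair, so
// adding a new memory kind does not force a signature change.
struct AllocatedMemoryReport {
  std::array<uint64_t, kNumPointerTypes> bytes{}; // bytes per PointerType

  uint64_t &operator[](PointerType type) {
    return bytes[static_cast<unsigned>(type)];
  }
};

// The tracker method could then become:
//   AllocatedMemoryReport AllocTracker::reportAllocatedMemory() const;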

@@ -429,6 +429,24 @@ PointerInfo AllocTracker::lookupOrDefault(uintptr_t ptr) const {
return map.at(ptr);
}

std::pair<int64_t, int64_t> AllocTracker::reportAllocatedMemory() const {
int64_t totalCpuMemory = 0;
christopherbate (Collaborator) commented:

We should use uint64_t here.

MTRT_Status s = mtrtReportAllocatedMemory(self, &totalCpuMemory, &totalGpuMemory);
THROW_IF_MTRT_ERROR(s);
py::object namedtuple = py::module::import("collections").attr("namedtuple");
py::object MemoryUsage = namedtuple("MemoryUsage", "cpu_memory gpu_memory");
christopherbate (Collaborator) commented:

You'll need to update the stubs so users can see this type information in the IDE.

MLIR_CAPI_EXPORTED MTRT_RuntimeClient
mtrtMemRefGetClient(MTRT_MemRefValue memref);

/// Retrieve the runtime client allocated cpu and gpu memory.
christopherbate (Collaborator) commented:

This isn't quite accurate. You're reporting the CPU/GPU memory that is being tracked by the RuntimeClient. It can track buffers that are externally allocated.

/// Retrieve the runtime client allocated cpu and gpu memory.
MTRT_Status mtrtReportAllocatedMemory(MTRT_RuntimeClient client,
int64_t *totalCpuMemory,
int64_t *totalGpuMemory);
christopherbate (Collaborator) commented:

Let's use uint64_t or size_t here.
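
For illustration, the declaration with both review points applied (unsigned output parameters and a doc comment phrased in terms of memory tracked by the client) might look like the following; this is a sketch, not the final signature.

/// Report the total CPU and GPU memory currently tracked by the runtime client.
MTRT_Status mtrtReportAllocatedMemory(MTRT_RuntimeClient client,
                                      uint64_t *totalCpuMemory,
                                      uint64_t *totalGpuMemory);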


for (const auto &entry : map) {
const PointerInfo &info = entry.second;
if (info.isExternallyManaged())
jhalakpatel (Collaborator, Author) commented:

@christopherbate Is this sufficient for tracking only internally managed/allocated pointers?
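
For context, a hedged sketch of how the full accumulation could look with the externally-managed skip and unsigned totals. The PointerInfo members `size` and `type` and the PointerType::device enumerator are assumptions about the tracker's internals rather than code quoted from this PR.

// Hypothetical complete loop: only internally managed allocations are counted.
std::pair<uint64_t, uint64_t> AllocTracker::reportAllocatedMemory() const {
  uint64_t totalCpuMemory = 0;
  uint64_t totalGpuMemory = 0;
  for (const auto &entry : map) {
    const PointerInfo &info = entry.second;
    // Skip buffers the client merely tracks but does not own.
    if (info.isExternallyManaged())
      continue;
    if (info.type == PointerType::device)
      totalGpuMemory += info.size; // assumed byte-size field
    else
      totalCpuMemory += info.size;
  }
  return {totalCpuMemory, totalGpuMemory};
}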

Labels: enhancement (New feature or request), mlir-tensorrt (Pull request for the mlir-tensorrt project)
2 participants