Load onnx model from Stream of bytes #7254

michaelgsharp · 2024-10-02T20:27:34Z

Fixes #6591 by adding an overload API to allow the ONNX model to be passed in as a Stream of bytes.

src/Microsoft.ML.OnnxTransformer/OnnxTransform.cs

tarekgh · 2024-10-02T21:06:10Z

src/Microsoft.ML.OnnxTransformer/OnnxUtils.cs

+            var tempModelFile = Path.Combine(((IHostEnvironmentInternal)env).TempFilePath, Path.GetRandomFileName());
+            using (var fileStream = File.Create(tempModelFile))
+            {
+                modelBytes.Seek(0, SeekOrigin.Begin);


Seek

Wouldn't you support streams that cannot seek?

tarekgh · 2024-10-02T21:08:30Z

src/Microsoft.ML.OnnxTransformer/OnnxUtils.cs

+                modelBytes.Seek(0, SeekOrigin.Begin);
+                modelBytes.CopyTo(fileStream);
+            }
+            return new OnnxModel(tempModelFile, gpuDeviceId, fallbackToCpu,


OnnxModel

I am wondering if we can have an OnnxModel constructor that can work directly with Streams? Or the native will not allow that? I am thinking if we can avoid writing to a temp file.

Thats something I want to do as well, but this will at least unblock people and then I can come back and look into it. I am not sure if ONNX supports that or not (though they probably do).

just a suggestion you may decide not to apply :-)

Instead of exposing CreateFromStream, would it make sense to expose a new constructor for OnnxModel which take the stream and make the implementation inside this constructor as the one provided here? This will avoid exposing extra method if we decided later to support streams in OnnxModel. This is minor point though. I am fine either way.

codecov · 2024-10-07T21:17:05Z

Codecov Report

Attention: Patch coverage is 64.22764% with 44 lines in your changes missing coverage. Please review.

Project coverage is 68.78%. Comparing base (be1e428) to head (9f5e86c).
Report is 7 commits behind head on main.

Files with missing lines	Patch %	Lines
src/Microsoft.ML.OnnxTransformer/OnnxCatalog.cs	13.79%	25 Missing ⚠️
src/Microsoft.ML.OnnxTransformer/OnnxTransform.cs	61.36%	16 Missing and 1 partial ⚠️
...osoft.ML.OnnxTransformerTest/OnnxTransformTests.cs	95.00%	0 Missing and 2 partials ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #7254      +/-   ##
==========================================
+ Coverage   68.77%   68.78%   +0.01%     
==========================================
  Files        1462     1463       +1     
  Lines      272261   272407     +146     
  Branches    28176    28183       +7     
==========================================
+ Hits       187254   187386     +132     
- Misses      77764    77778      +14     
  Partials     7243     7243

Flag	Coverage Δ
Debug	`68.78% <64.22%> (+0.01%)`	⬆️
production	`63.29% <49.39%> (+<0.01%)`	⬆️
test	`89.05% <95.00%> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Files with missing lines	Coverage Δ
src/Microsoft.ML.OnnxTransformer/OnnxUtils.cs	`84.82% <100.00%> (+0.61%)`	⬆️
...osoft.ML.OnnxTransformerTest/OnnxTransformTests.cs	`95.54% <95.00%> (-0.04%)`	⬇️
src/Microsoft.ML.OnnxTransformer/OnnxTransform.cs	`87.35% <61.36%> (-2.48%)`	⬇️
src/Microsoft.ML.OnnxTransformer/OnnxCatalog.cs	`58.90% <13.79%> (-29.74%)`	⬇️

... and 9 files with indirect coverage changes

Load onnx model from Stream working

9f5e86c

dotnet-policy-service bot assigned michaelgsharp Oct 2, 2024

michaelgsharp requested review from tarekgh and LittleLittleCloud October 2, 2024 20:28

LittleLittleCloud reviewed Oct 2, 2024

View reviewed changes

src/Microsoft.ML.OnnxTransformer/OnnxTransform.cs Show resolved Hide resolved

LittleLittleCloud approved these changes Oct 2, 2024

View reviewed changes

tarekgh reviewed Oct 2, 2024

View reviewed changes

ericstj mentioned this pull request Oct 2, 2024

Microsoft.ML.Tokenizers.Tests.TiktokenTests.TestTokenizerUsingExternalVocab failing to download gpt2.tiktoken #7256

Closed

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Load onnx model from Stream of bytes #7254

Load onnx model from Stream of bytes #7254

michaelgsharp commented Oct 2, 2024 •

edited

Loading

tarekgh Oct 2, 2024

tarekgh Oct 2, 2024

michaelgsharp Oct 7, 2024

tarekgh Oct 7, 2024 •

edited

Loading

codecov bot commented Oct 7, 2024

Load onnx model from Stream of bytes #7254

Are you sure you want to change the base?

Load onnx model from Stream of bytes #7254

Conversation

michaelgsharp commented Oct 2, 2024 • edited Loading

tarekgh Oct 2, 2024

Choose a reason for hiding this comment

tarekgh Oct 2, 2024

Choose a reason for hiding this comment

michaelgsharp Oct 7, 2024

Choose a reason for hiding this comment

tarekgh Oct 7, 2024 • edited Loading

Choose a reason for hiding this comment

codecov bot commented Oct 7, 2024

Codecov Report

michaelgsharp commented Oct 2, 2024 •

edited

Loading

tarekgh Oct 7, 2024 •

edited

Loading