[QNN EP] Fix quantized model graph I/O name mismatch #26998

qti-yuduo · 2026-01-13T18:32:26Z

Preserve original ONNX graph I/O names in DLC when offload_graph_io_quantization=1

When offload_graph_io_quantization=1, Q/DQ nodes at graph boundaries are
filtered out, causing fused node I/O names to differ from original ONNX
graph I/O names. This change maps them back so DLC files use user-expected
tensor names.

Changes:

Add BuildQuantizedIoNameMap() in qnn_model.cc to trace Q/DQ nodes and
build fused-to-original name mappings
Add tensor_name_override_ to QnnTensorWrapper for overriding QNN tensor
name without changing ORT lookup name
Apply overrides in QnnModelWrapper::AddTensorWrapper() for graph I/O tensors
Add QuantizedGraphInOutNamesPreserved test

qti-yuduo added 2 commits January 13, 2026 10:31

[QNN EP] Fix quantized model graph I/O name mismatch

bac5188

Merge branch 'main' into dev/yuduo/quantized-graph-io-cl

d5fd9b2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[QNN EP] Fix quantized model graph I/O name mismatch #26998

[QNN EP] Fix quantized model graph I/O name mismatch #26998

Uh oh!

qti-yuduo commented Jan 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

[QNN EP] Fix quantized model graph I/O name mismatch #26998

Are you sure you want to change the base?

[QNN EP] Fix quantized model graph I/O name mismatch #26998

Uh oh!

Conversation

qti-yuduo commented Jan 13, 2026

Changes:

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant