CoreML Execution Provider
Core ML is a machine learning framework introduced by Apple. It is designed to seamlessly take advantage of powerful hardware technology including CPU, GPU, and Neural Engine, in the most efficient way in order to maximize performance while minimizing memory and power consumption.
Contents
Requirements
The CoreML Execution Provider (EP) requires iOS devices with iOS 13 or higher, or Mac computers with macOS 10.15 or higher.
It is recommended to use Apple devices equipped with Apple Neural Engine to achieve optimal performance.
Install
Pre-built binaries of ONNX Runtime with CoreML EP for iOS are published to CocoaPods.
See here for installation instructions.
Build
For build instructions for iOS devices, please see Build for iOS.
Usage
The ONNX Runtime API details are here.
The CoreML EP can be used via the C, C++, Objective-C, C# and Java APIs.
The CoreML EP must be explicitly registered when creating the inference session. For example:
Ort::Env env = Ort::Env{ORT_LOGGING_LEVEL_ERROR, "Default"};
Ort::SessionOptions so;
uint32_t coreml_flags = 0;
Ort::ThrowOnError(OrtSessionOptionsAppendExecutionProvider_CoreML(so, coreml_flags));
Ort::Session session(env, model_path, so);
Configuration Options
There are several run time options available for the CoreML EP.
To use the CoreML EP run time options, create an unsigned integer representing the options, and set each individual option by using the bitwise OR operator.
uint32_t coreml_flags = 0;
coreml_flags |= COREML_FLAG_ONLY_ENABLE_DEVICE_WITH_ANE;
Available Options
COREML_FLAG_USE_CPU_ONLY
Limit CoreML to running on CPU only.
This may decrease the performance but will provide reference output value without precision loss, which is useful for validation.
COREML_FLAG_ENABLE_ON_SUBGRAPH
Enable CoreML EP to run on a subgraph in the body of a control flow operator (i.e. a Loop, Scan or If operator).
COREML_FLAG_ONLY_ENABLE_DEVICE_WITH_ANE
By default the CoreML EP will be enabled for all compatible Apple devices.
Setting this option will only enable CoreML EP for Apple devices with a compatible Apple Neural Engine (ANE). Note, enabling this option does not guarantee the entire model to be executed using ANE only.
For more information, see Which devices have an ANE?
Supported ops
Following ops are supported by the CoreML Execution Provider,
Operator | Note |
---|---|
ai.onnx:Add | |
ai.onnx:ArgMax | |
ai.onnx:AveragePool | Only 2D Pool is supported. |
ai.onnx:BatchNormalization | |
ai.onnx:Cast | |
ai.onnx:Clip | |
ai.onnx:Concat | |
ai.onnx:Conv | Only 1D/2D Conv is supported. Weights and bias should be constant. |
ai.onnx:DepthToSpace | Only DCR mode DepthToSpace is supported. |
ai.onnx:Div | |
ai.onnx:Flatten | |
ai.onnx:Gather | Input indices with scalar value is not supported. |
ai.onnx:Gemm | Input B should be constant. |
ai.onnx:GlobalAveragePool | Only 2D Pool is supported. |
ai.onnx:GlobalMaxPool | Only 2D Pool is supported. |
ai.onnx:LeakyRelu | |
ai.onnx:LRN | |
ai.onnx:MatMul | Input B should be constant. |
ai.onnx:MaxPool | Only 2D Pool is supported. |
ai.onnx:Mul | |
ai.onnx:Pad | Only constant mode and last two dim padding is supported. Input pads and constant_value should be constant. If provided, axes should be constant. |
ai.onnx:Pow | Only supports cases when both inputs are fp32. |
ai.onnx:PRelu | Input slope should be constant. Input slope should either have shape [C, 1, 1] or have 1 element. |
ai.onnx:Reciprocal | |
ai.onnx.ReduceSum | |
ai.onnx:Relu | |
ai.onnx:Reshape | |
ai.onnx:Resize | |
ai.onnx:Shape | Attribute start with non-default value is not supported.Attribute end is not supported. |
ai.onnx:Sigmoid | |
ai.onnx:Slice | Inputs starts , ends , axes , and steps should be constant. Empty slice is not supported. |
ai.onnx:Squeeze | |
ai.onnx:Sqrt | |
ai.onnx:Sub | |
ai.onnx:Tanh | |
ai.onnx:Transpose |