WebニューラルネットワークAPI

1. はじめに

Web Neural Network APIは、プラットフォーム固有の能力に縛られることなく、オペレーティングシステムおよび基盤となるハードウェアプラットフォームの機械学習能力を利用する、 Web向けのハードウェア非依存の抽象化レイヤーを定義する。この抽象化レイヤーは、主要な機械学習JavaScriptフレームワークの要件に対応し、さらにML分野に精通したWeb開発者がライブラリの助けを借りずにカスタムコードを書けるようにもする。

図解付きの導入については、解説を参照されたい。

2. ユースケース

2.1. アプリケーションユースケース

この節では、ニューラルネットワーク推論ハードウェアアクセラレーションのアプリケーションレベルのユースケースを例示する。これらのユースケースにおけるすべてのアプリケーションは、事前学習済みの深層ニューラルネットワーク（DNN）[models]上に構築できる。

注記: ここで説明するユースケースの一部は、その性質上、プライバシー侵害的であることに注意されたい。そのようなユースケースにAPIを使用することを計画している開発者は、そのAPIがユーザーの利益のために、ユーザーが理解し承認する目的で使用されることを保証すべきである。開発者は、 Ethical Principles for Web Machine Learning [webmachinelearning-ethics]を適用し、透明性、データ最小化、ユーザー制御などの適切なプライバシーリスク緩和策を実装すべきである。

注記: § 3 アクセシビリティの考慮事項は、これらのユースケースのアクセシビリティを改善する方法についての指針を提供する。

2.1.1. 人物検出

ユーザーがWebベースのビデオ会議アプリケーションを開いているが、一時的に部屋から離れている。アプリケーションは、カメラ入力フレーム内で人物を含む領域を検出するために、（例えば、単一のDNNを使用する[SSD] や[YOLO]などの物体検出手法を用いた）物体検出を使用して、ユーザーがPCの前にいるかどうかを監視している。

彼女が戻ってくると、アプリケーションは自動的に彼女を検出し、他のオンラインユーザーに彼女が現在アクティブであることを通知する。

2.1.2. セマンティックセグメンテーション

ユーザーは、オフィスに空いている会議室がないため、自席からWebベースのビデオ会議アプリケーションで電話会議に参加する。電話会議中、彼女は自分の部屋や背景にいる人々が見えることを望まない。他の人々および周囲のプライバシーを保護するために、アプリケーションは[DeepLabv3+]、 [MaskR-CNN] または[SegAny]などの機械学習モデルを実行して、画像を意味的にセグメントへ分割し、他の人々および背景を表すセグメントを別の画像で置き換える。

2.1.3. 骨格検出

Webベースのビデオ会議アプリケーションは、[PoseNet]などのリアルタイム人体姿勢推定を可能にする機械学習モデルを実行して、ユーザーの骨格姿勢を追跡し、彼女のジェスチャーおよびボディランゲージを認識する。彼女が手を挙げると、マイクは自動的にミュート解除され、電話会議で発言を開始できる。

2.1.4. 顔認識

会議室には複数の人がおり、彼らはWebベースのビデオ会議アプリケーションを使用してオンライン会議に参加する。アプリケーションは、（例えば[SSD]などの物体検出手法を用いた）物体検出を使用して参加者の顔を検出し、2つの顔が同一であるかどうかを検証する[FaceNet]などの機械学習モデルを実行して、各顔が前回の会議に存在していたかどうかを確認する。

2.1.5. 顔ランドマーク検出

ユーザーは、オンライン眼鏡店で自分に美しく合う新しい眼鏡を探したい。オンラインストアは、 [FAN]のような Face Alignment Networkなどの機械学習モデルを実行して、目、鼻、口などの顔ランドマークを検出する Webベースの試着シミュレーターを提供する。彼女が眼鏡を選択すると、シミュレーターは彼女の顔画像上で検出された目の位置に、選択された眼鏡を適切にレンダリングする。

2.1.6. スタイル転送

ユーザーはオンラインストアで化粧品を探しており、どの色が自分の顔に合うかを考えている。オンラインストアは化粧品のサンプル顔メイク画像を表示し、 [ContextualLoss]または[PairedCycleGAN] のような機械学習モデルを実行して、サンプルメイク画像のメイクスタイルを彼女の顔画像に転送するメイクシミュレーターを提供する。彼女はシミュレーターにより、選択したメイクが自分の顔でどのように見えるかを確認できる。

2.1.7. 超解像

Webベースのビデオ会議は相手からビデオストリームを受信しているが、ネットワーク輻輳によりビデオの解像度が低下している。知覚される映像品質の低下を防ぐために、アプリケーションは[SRGAN]などの超解像用機械学習モデルを実行し、より高解像度のビデオフレームを生成する。

2.1.8. 画像キャプション生成

アクセシビリティを向上させるために、Webベースのプレゼンテーションアプリケーションは、プレゼンテーションスライドの説明語を予測する [im2txt] などの機械学習モデルを実行して、自動画像キャプション生成を提供する。

2.1.9. テキストから画像への生成

画像は現代のWeb体験の中核をなす。プライバシーを保護する方法でテキスト入力に基づいて画像を生成する能力は、 Webアプリケーションおよびコンテンツの視覚的なパーソナライズと適応を可能にする。例えば、Web アプリケーションは、Webページ上の自然言語による説明、またはテキストプロンプト内でユーザーが提供した説明を入力として使用し、そのテキスト説明に一致する画像を生成できる。潜在拡散モデルアーキテクチャ [LDM]によって可能になるこのテキストから画像への生成ユースケースは、追加のテキストから画像への生成ユースケースの基礎を形成する。例えば、Webページ上の既存画像の一部を新たに生成されたコンテンツを用いて選択的に変更するインペインティング、またはその逆で、元画像を本来の寸法を超えて拡張し、空いた領域を生成されたコンテンツで埋めるアウトペインティングがある。

2.1.10. 機械翻訳

さまざまな国の複数の人々が、Webベースのリアルタイムテキストチャットアプリケーションを通じて会話している。アプリケーションは、すべてのテキストを別の言語に翻訳する[GNMT]または[OpenNMT]などの機械学習モデルを使用して、彼らの会話を翻訳する。

2.1.11. 感情分析

ユーザーはWebベースのリアルタイムテキストチャットアプリケーションを通じて友人と会話しており、友人の顔を見ることができないため、友人がどのように感じているかを知りたいと思っている。アプリケーションは、入力テキストから感情を推論する[DeepMoji] などの機械学習モデルを使用して友人の感情を分析し、推定された感情を表す絵文字を表示する。

2.1.12. 動画要約

Webベースのビデオ会議アプリケーションは受信したビデオストリームを録画しており、保存する録画ビデオデータを削減する必要がある。アプリケーションは、 [Video-Summarization-with-LSTM]などの動画要約用機械学習モデルを使用して、録画ビデオの短縮版を生成する。

2.1.13. ノイズ抑制

Webベースのビデオ会議アプリケーションは受信した音声ストリームを録音しているが、通常、背景ノイズは至るところに存在する。アプリケーションは、ビデオ会議における音声体験を向上させるために、赤ちゃんの泣き声や犬の吠え声のような背景の動的ノイズを抑制する目的で、 [RNNoise]などの再帰型ニューラルネットワークを用いたリアルタイムノイズ抑制を活用する。

2.1.14. 音声認識

音声認識はspeech to textとしても知られ、話された言語を認識してテキストへ変換することを可能にする。音声認識のアプリケーション例には、文字起こし、自動翻訳、マルチモーダルインタラクション、リアルタイムキャプション、およびバーチャルアシスタントが含まれる。音声認識は聴覚コンテンツのアクセシビリティを向上させ、そのようなコンテンツをプライバシーを保護する方法でテキスト形式により操作することを可能にする。一般的なユースケースの例には、リアルタイムキャプションを使用した動画視聴やオンライン会議への参加が含まれる。 [Whisper]などのモデルは、精度と堅牢性の点で人間に近づいており、そのようなユースケースのアクセシビリティを向上させる上で適している。

2.1.15. テキスト生成

大規模言語モデル（LLM）により、テキスト列における次の項目を予測する一般的能力が必要とされるタスクを実行できる、さまざまなテキスト生成ユースケースが可能になる。この種のモデルは、テキストを翻訳し、テキスト入力に基づいて質問に答え、より大きな本文を要約し、またはテキスト入力に基づいてテキスト出力を生成できる。LLMは、RNN、CNN、またはLSTMアーキテクチャに基づく古いモデルと比較してより良い性能を可能にし、この節で説明する他の多くのユースケースの性能をさらに向上させる。 LLMの例には、[t5-small]、 [m2m100_418M]、[gpt2]、および[llama-2-7b]が含まれる。

2.1.16. フェイク動画の検出

ユーザーはWeb上で“deepfake”によって生成されたリアルなフェイク動画にさらされる。フェイク動画は、話者の顔を大統領の顔に入れ替えて、ユーザーを政治的に扇動したり、ユーザーの意見を操作したりする可能性がある。[FaceForensics++]などのdeepfake検出アプリケーションは動画を分析し、ユーザーをフェイク動画または画像から保護する。彼女がWeb上でフェイク動画を視聴したとき、検出アプリケーションはリアルタイムでその不正な動画を警告する。

2.2. フレームワークユースケース

この節では、ニューラルネットワーク推論ハードウェアアクセラレーションのための専用低レベルAPIに関するフレームワークレベルのユースケースを収集する。機械学習フレームワークがWeb Neural Network API（WebNN API）の主要な利用者となり、WebNN APIを通じて公開される低レベルの詳細は一般的なWeb開発者から抽象化されることが期待される。しかし、機械学習に特定の関心と能力を持つWeb開発者が、より高レベルのMLフレームワークではなくWebNN APIと直接やり取りしたいと考えることも期待される。

2.2.1. カスタムレイヤー

Webアプリケーション開発者は、WebNN API上でDNNモデルを実行したい。しかし、 [LeakyReLU]、[ELU]などの一部の活性化関数がWebNN APIに含まれていないことに気付いた。この問題に対処するために、彼女は WebNN API上に追加の活性化関数のカスタムレイヤーを構築する。カスタムレイヤーの範囲には、活性化だけでなく畳み込み、正規化なども含まれ得ることに注意されたい。

2.2.2. ネットワーク結合

WebアプリケーションはDNNモデルを使用し、その上位の畳み込み層および下位の全結合層のモデルデータは別々のファイルに保存される。これは、全結合層のモデルデータがサーバー側でのファインチューニングにより定期的に更新されるためである。

したがって、アプリケーションは最初に両方の部分モデルファイルをダウンロードし、それらを単一のモデルに結合する。モデルが更新されると、アプリケーションはモデルのファインチューニング済み部分をダウンロードし、全結合層のみをそれで置き換える。

2.2.3. 性能適応

Webアプリケーション開発者は、モバイルデバイス上での自分のDNNモデルの性能について懸念している。彼女は、GPUアクセラレーションを備えていないモバイルデバイスでは実行が遅すぎる可能性があることを確認した。この問題に対処するために、彼女のWebアプリケーションはWebNN APIを参照してアクセラレーションが利用可能かどうかを確認し、アクセラレーションのないデバイスに対して警告を表示できるようにする。

数週間後、彼女はCPU上でも実行できる小型のDNNモデルを開発した。 CPU実行に対応するために、彼女はアプリケーションを修正し、 CPUのみのデバイスの場合にその小型モデルをロードするようにする。

2.2.4. 演算レベル実行

JavaScript MLフレームワークは、MLモデルのロード、解釈、および実行を担う。モデルの実行フェーズ中、フレームワークはモデルの演算を反復し、各演算を CPU、GPU、またはMLアクセラレーターのようなハードウェアデバイス上で実行する。デバイス間の不要なデータコピーを避けるため、フレームワークは演算を実行する同じデバイスを選択する。畳み込み2Dや行列乗算などの計算集約的な演算では、フレームワークはWebNN APIを使用して、選択されたデバイスで利用可能な ML固有のアクセラレーションによりそれを実行する。

2.2.5. リアルタイム動画処理との統合

WebRTCベースのビデオ会議のユーザー体験は、リアルタイム動画処理を使用して強化される。例えば、 § 2.1.2 セマンティックセグメンテーションモデルを使用して実装された背景ぼかしは、ユーザーのライブカメラ映像内の背景をぼかす。このユースケースの性能要件を満たすために、 WebNN APIは、メディアパイプラインを構成する他のWeb APIのプリミティブと統合し、 WebNN APIベースのリアルタイム動画ストリーム変換を可能にする。

3. アクセシビリティの考慮事項

この節は、ニューラルネットワーク推論ハードウェアアクセラレーションによって可能になる § 2.1 アプリケーションユースケースのアクセシビリティを改善する方法について、 Web作者に指針を提供する。この指針は、本仕様で概説される特定のユースケースを超えて一般化されるものであり、 Web作者は、さらなるアクセシビリティ指針として[wcag]を、倫理原則の文脈におけるデジタルアクセシビリティについては§ 6 倫理的考慮事項を参照することが推奨される。

§ 2.1.8 画像キャプション生成は、キャプションがスクリーンリーダーおよびその他の支援技術（AT）ユーザーに提示されることを保証することで改善できる。 Web作者は、生成された画像キャプションが、それぞれの画像に意味的に関連付けられることを保証することが推奨される。これは、標準のalt属性を介して、または説明が初期ページロード時に更新されるか、あるいは後からユーザー操作の結果として更新されるかに応じた他の手段によって行うことができる。

§ 2.1.11 感情分析は、ユーザーを誤ってラベル付けし、したがって誤分類する可能性があり、差別的な体験につながり得る。Web作者は、信頼度スコアを公開し、ユーザーにその機能をオフにする選択肢を与えることが推奨される。

§ 2.1.13 ノイズ抑制は、強いフィルターを用いると構音障害のあるユーザーの発話を消し去り、キャプションおよび認識を失敗させる可能性がある。Web作者は、バイパスまたは感度制御を公開し、ライブキャプションが有効な場合にノイズ抑制を固定的に組み込まないことが推奨される。

§ 2.2.5 リアルタイム動画処理との統合では、セグメンテーションにより実現される背景ぼかしが気を散らす要素を取り除くのに役立つが、読唇やライブキャプションを損なうほどの遅延を追加する可能性がある。Web作者は、ユーザー向けにキーボードおよびスクリーンリーダーで操作可能な“背景ぼかしオン/オフ”コントロールを、他のアクセシビリティ/メディア設定の近くに提示する機能を提供することが推奨される。

§ 7.2 デバイス選択により、Web作者は実行速度および消費電力に関する設定を示すことができる。実装者は、ブラウザーUIで Web作者のヒントをユーザーが上書きできるようにすることが推奨される。これにより、特に携帯型AACや視線入力の環境において、低性能またはバッテリーに敏感なデバイス上の人々が、キャプションおよびその他の重要なアクセシビリティ機能を応答性の高い状態に保てるようにする。

4. セキュリティの考慮事項

この仕様は、ニューラルネットワーク推論ハードウェアアクセラレーションのための低レベルAPIを定義する。このAPIは、ユーザーのコンピューターへの低レベルアクセスを与えるため、強力な機能[POWERFUL-FEATURES]と見なされる。強力な機能に対する認証および機密性の期待を満たし、中間者攻撃を防止するために、この仕様で定義されるすべてのインターフェイスはセキュアコンテキストでのみ利用可能である。

このAPIは、§ 7.5 権限ポリシー統合を用いて、すべてのクロスオリジンフレームでデフォルトで無効化される。これにより、埋め込みページが許可を与えるポリシーを明示的に設定しない限り、サードパーティコンテンツがこのAPIを使用することを防止する。

このAPIは、WebGPU仕様で定義されるGPUDeviceから MLContextを作成できるようにする。このコンテキストのセキュリティ特性に関する詳細については、WebGPU Security Considerationsを参照。

このAPIは、GPU、CPU、および専用MLアクセラレーターハードウェアにまたがる抽象化を提供する。GPUを使用する場合、WebGPUと同様のサービス拒否に関する考慮事項が適用される。 CPUまたは専用MLアクセラレーターを使用する場合、潜在的なリソース競合の種類は異なり、緩和策は実装および構成に依存する。実装は、サイトが不公平な量のシステムリソースを使用することを防ぐために、プラットフォームで利用可能なあらゆる仕組みを使用すべきである。これらの計算ユニットは共有リソースであり、あらゆる計算APIの使用は、高負荷状態のシステム全体の性能に影響する。

グラフが完全に構築されコンパイルされると、グラフ内の各演算への入力形状が推論され、確定される。境界チェックは、実データに対してグラフを実行する計算メソッドが呼び出されたときに行われる。この段階より前に、実データがコンパイル済みグラフに束縛されることはない。その時点までに推論済みのデータ形状に対して適切な境界チェックが行われるよう保証することは、実装の責任である。

実装者への指針として、境界外アクセスを受けやすい演算を文書化する。

実装は、定数と見なされるデータの変更に基づく制御フロー攻撃を防御しなければならない。例えば、基盤プラットフォームの最適化は、計算全体を通じて重みが変更されないと仮定する場合がある。 APIが計算中に重みを保持するバッファの内容変更を許す場合、それらの最適化の仮定は無効になり、基盤プラットフォームで未定義の動作を引き起こすことになる。このAPIは、常にバッファをコピーまたは転送することで、スクリプトからのこの種の攻撃を緩和するが、実装は、定数と仮定されるデータのプロセス分離など、追加の防御策を検討すべきである。

将来への備えとして、API設計は、互換性を壊すことなく、汎用的にエミュレート可能な特定の演算をセキュリティ、性能、またはその他の理由により非推奨にできるようにしている。これは、この仕様で定義されるより小さなプリミティブ演算の観点から定義される高レベル関数によって可能になる。これにより、高レベル関数のネイティブ実装をポリフィル実装で置き換えることが可能になる。

レンダラーを実行するプロセス間で CPUが共有されている現在の状態を考慮し、サイドチャネル攻撃の実現可能性を調査する。

攻撃者が欠陥を含む可能性のある特定の実装を標的にできないようにするために、§ 7.2 デバイス選択メカニズムはヒントにすぎず、具体的なデバイス選択は実装に委ねられる。例えば、ユーザーエージェントは、既知の脆弱性を持つデバイス上ではモデルを決して実行しないことを選択できる。さらなる緩和策として、デバイス列挙メカニズムは定義されていない。

ヒント付けは懸念を部分的に緩和する。追加の緩和策を調査する。

API設計は、コンパイル済み計算グラフの攻撃対象領域を最小化する。各種演算をホストするMLGraphBuilder インターフェイスはデータ定義APIであり、そのため何も実行せず、データを構築するだけである。その結果、攻撃の可能性は、MLContext.dispatch() メソッドを呼び出して実行する前に、データをグラフに束縛するときに限定される。これにより、実装者はMLContext.dispatch() メソッドの堅牢化に集中できる。例えば、データの境界を尊重し、境界が守られない場合に適切に失敗することを確実にすることである。

高解像度時刻を測定するための専用Web APIは、解像度の低減、ジッターの追加、濫用の検出、API呼び出しのスロットリングなどの技術を用いて、タイミング攻撃を緩和する[hr-time-3]。 WebNN実装の実際の展開では、タイミング攻撃を実用的でないものにするのに十分なジッターがもたらされる可能性が高い（例えばIPCを使用するため）が、実装者はタイミング攻撃について検討し、自らの実装に対してテストすることが推奨される。

注記: Unicodeシーケンスに関連するセキュリティリスクは、 label USVString 定義の文脈で論じられる。

4.1. 新しい演算に関するガイドライン

この節は非規範的である。

この仕様で定義される演算が安全に実装できる形で形成されることを保証するために、この節には、実装上の問題の可能性を低減するために演算がどのように定義されることが期待されるかについてのガイドラインが含まれる。これらのガイドラインは、業界のベストプラクティスに合わせて時間とともに進化することが期待される。

引数の単純さを優先する
複雑なデータ形式にパーサーを使用しない
演算を低レベルプリミティブに分解できる場合:
- 参考情報としてエミュレーション経路を追加する
- 新しい高レベル演算よりもプリミティブを優先するが、性能への影響を考慮する
演算入力および属性について一貫したスタイルに従う
プーリングやリダクションなどの演算ファミリーについて、API形状およびオプションを共有する
可能な限り、失敗ケースをテストケースとして形式化する
迷ったら含めない。ユースケースを満たすためにAPIサーフェスを可能な限り小さく保つが、小さすぎないようにする
将来の進化を妨げる可能性のある実装詳細をAPIから排除するよう努め、過剰に規定しない
早期に失敗する。Web開発者が問題を早く知るほどよい

一般に、新機能を追加する際は、Technical Architecture GroupおよびPrivacy Interest Groupによる[security-privacy-questionnaire]に文書化されているセキュリティおよびプライバシーへの影響を常に考慮すること。

5. プライバシーの考慮事項

このAPIは、機密性の高いユーザーデータをブラウザーのサンドボックス内に保持することにより、クラウドベースの推論代替手段に比べてプライバシーを向上させる。画像、音声、動画ストリーム、その他の個人情報などの入力データはユーザーのデバイスを離れることがなく、リモートサーバーへのデータ送信やサードパーティによるデータ処理に伴うリスクを排除する。

しかし、ハードウェアアクセラレーション能力と密接に相互作用する強力なローカル計算APIとして、 WebNN APIは性能最適化とプライバシー保護のバランスを取る必要がある。このAPIには、有効な機械学習推論能力を引き続き可能にしつつ、フィンガープリンティングを緩和するための複数のプライバシー保護措置が含まれる。

5.1. フィンガープリンティング

設計上、このAPIは、特定された§ 2 ユースケースに、最良の性能と結果の信頼性で対応するために必要な最小限の情報を公開することを目指す。第一に、このAPIは標準化を通じてフィンガープリンティングを緩和する。すなわち、多様なプラットフォームAPI間で一貫した動作を定義し、適合実装間で基盤ハードウェアの差異に関する情報漏えいを最小化することによってである。これは次のものにより達成される。

データ最小化の原則に沿って、ハードウェア非依存であり、基盤プラットフォームの低レベル詳細の公開を最小化する§ 7.3 演算子。
Web開発者が実行速度および消費電力の設定を示せるが、実行のために選択された実際のデバイスを公開せず、またWeb開発者が特定のデバイスを列挙または選択することも許さない§ 8.2.1 MLContextOptions API。このヒント付けメカニズムはエントロピーを増加させない。
Web開発者がサイドチャネルを用いてこの情報を推論する代わりに、明示的な照会APIを使用して特定の演算子のサポートを照会できる§ 8.3.7 opSupportLimits() API。このAPIはフィンガープリンタビリティに寄与し得るが、必要に応じてバケットを使用して、このAPIを通じて公開される区別可能な構成数を制限することにより、そのエントロピーを低減できる。
プラットフォーム間で一貫して動作する標準化されたデータ型およびテンソル演算。
異なるバックエンド実装間で一貫したエラー処理。

全体的な設計は、必要な機能を提供しながら、実装が異なるプラットフォーム間で一貫したインターフェイスを維持することを保証する。プラットフォーム固有の詳細を抽象化することにより、基盤となるアクセラレーションがCPU、GPU、または専用MLハードウェアのいずれによって提供されるかにかかわらず、APIはプライバシーを保護する予測可能な動作を提供できる。

注記: MLContextOptions は活発に開発中であり、その設計は、さらなる実装経験およびより広いWebコミュニティからの新しいユースケースに基づいて変更されることが期待される。

MLGraph.devices API拡張は、グラフが完全に構築されコンパイルされた後に、実行のために選択された実際のデバイスを公開するものとして提案されている。このAPI拡張のプライバシーへの影響は調査中である。[Issue #836]

5.2. 実行時間分析

演算のタイミング特性は、基盤ハードウェア性能に関する間接的な情報を提供し得る。これはあらゆる計算APIに固有の特徴である。特定の状況では、実行時間分析により、別の基盤プラットフォームと比較した基盤プラットフォームのニューラルネットワークハードウェアアクセラレーション能力の性能を間接的に明らかにできる場合がある。タイミング攻撃に関するさらなる議論については、 § 4 セキュリティの考慮事項も参照。

注記: このグループは、提案されている実行時間分析フィンガープリンティングベクトルおよび緩和策について、さらなる意見を歓迎している。

5.3. WebGPUとの比較

WebGPUとは異なり、このAPIはカスタムシェーダー記述を本質的にサポートしない。その結果、シェーダーキャッシュやその他の永続データに依存するタイミング攻撃を受けにくい。APIは、ブラウザーまたは基盤OSの既存のシェーダーおよびより低レベルのプリミティブの上に構築される。GPUDevice とやり取りするWeb開発者は、WebGPU コンパイルキャッシュに関する考慮事項を認識していることが期待される。

WebGPU APIは、プライバシー上の考慮事項としてマシン固有のアーティファクトを特定している。同様に、WebNN APIの計算ユニットスケジューリングは、特定の状況下でフィンガープリントを導入する可能性がある。しかし、WebGPUと同様に、そのようなフィンガープリントは各ベンダーのほとんどまたはすべてのデバイスで同一であり、懸念を緩和する。さらに、そのようなアーティファクトをさらに排除するためにソフトウェア実装を使用できる。

一般に、このAPIの実装者は、該当する場合、WebGPU Privacy Considerationsを自らの実装に適用することが期待される。

6. 倫理的考慮事項

ワーキンググループは、Web上で機械学習を使用することに関連する倫理的問題の文書化を開始しており、その規範仕様が考慮すべき緩和策を特定する助けとしている。ワーキンググループは、専用のGitHubリポジトリを通じてより広いコミュニティからの貢献を受け入れる、Ethical Principles for Web Machine Learning文書[webmachinelearning-ethics]を公開し、維持している。

7. プログラミングモデル

7.1. 概要

ニューラルネットワークの中心には、数学的演算からなる計算グラフがある。これらの演算は、コンピュータービジョン、自然言語処理、およびロボティクスにおける現代の機械学習技術の構成要素である。 WebNN APIは、ニューラルネットワークの計算グラフを構築、コンパイル、および実行するための仕様である。

MLGraph インターフェイスは、不変であるコンパイル済み計算グラフ（すなわちモデル）を表す。

MLGraphBuilder インターフェイスは、計算グラフ（そのグラフ）を構築するビルダー（ファクトリー）として機能し、その後コンパイルされてMLGraphが作成される。

WebNNにおいて、計算グラフは、データに作用し、グラフのノードである演算子から構成される。MLOperandは、計算グラフ内を流れるデータの表現であり、グラフの辺である。MLOperandには、推論のための計算グラフの入力値、推論に使用される定数（学習済み重みを含む）、推論中に計算される中間値（しばしば活性化と呼ばれる）、および推論の出力値が含まれる。演算子の入力は、1つ以上のMLOperandである。演算子の出力は、1つ以上のMLOperandである。演算子には、その動作を制御する演算子固有のパラメーターがあり、これには0個以上の活性化関数が含まれ得る。

MLGraphBuilder インターフェイスの重要な部分は、gemm() やrelu() のようなメソッドであり、これらは計算が実行されるときに入力データに対して実行する実際の演算を表す演算子を作成し、その演算子を保持する新しいMLOperandを返す。MLOperandを作成するメソッドは、任意の入力および活性化を演算子に接続する。各メソッド呼び出しは、他のいかなるMLOperandの値も変更せずに、個別の新しい値を返す。

演算子は、ラベル、すなわち例外メッセージなどの診断に含まれ得る文字列を持つ。演算子が作成されると、そのラベルは実装定義の方法で初期化され、渡されたlabelを含む場合がある。

dispatch()中にエラーを報告する仕組みの追加を検討する。 [Issue #778]

推論時には、すべてのMLOperandがテンソル（実データ）に束縛される。テンソルは本質的に多次元配列である。テンソルの表現は実装依存であるが、通常は何らかのバッファ（メモリ）に格納された配列データと、配列データを記述するメタデータ（その形状など）を含む。

計算グラフ内の演算は関数的意味論を持つ。これにより、実装は複数のテンソル間で配列データを共有できる可能性がある。例えば、reshapeやsliceなどの演算の実装は、入力テンソルと同じバッファを共有する入力テンソルのビューを返す場合がある。（reshapeの場合はデータ全体が共有される一方、sliceの場合は入力データの一部が共有される。）実装は、上記のようなビューを中間値に使用してもよい。

実行の前に、1つ以上の指定された出力を計算するために使用される計算グラフは、変換、コンパイル、および最適化される必要がある。コンパイル段階の主な目的は、演算やループの融合など、2つ以上の演算にまたがる最適化を可能にすることである。ユーザーエージェントは、グラフ変換中にこれらの最適化を行ってもよい。

MLGraphBuilder.build() メソッドは、呼び出しスレッドをブロックせずにバックグラウンドでグラフをコンパイルし、Promiseを返す。そのPromiseはMLGraphに解決される。各MLGraphBuilderは、最大1つのMLGraphを構築できる。

MLGraphの基盤実装は、MLGraphBuilderの演算子およびMLOperandに対応する、演算子およびオペランドのプラットフォーム固有表現から構成されるが、それらはスクリプトからは見えず、スクリプトにより構築されたグラフの合成または分解である場合がある。

MLGraphが構築されると、MLContext.dispatch() メソッドは、CPU実行の場合は別個のワーカースレッド内の並列タイムライン上で、またはGPUコマンドキュー内のGPUタイムライン上で、グラフの実行を非同期に行う。このメソッドは、実際の実行が別のタイムラインにオフロードされる間、呼び出しスレッドをブロックせずに直ちに返る。呼び出し側はMLNamedTensorsを使用して入力値を供給し、入力MLOperandをその値に束縛する。呼び出し側はまた、出力MLOperand用のMLNamedTensorsも供給し、成功した場合にはそこにグラフ実行の結果が格納される。その結果は、MLContext.readTensor(tensor) メソッドを使用してスクリプトへ読み戻すことができる。この種類の実行は、CPU、GPU、およびNPUデバイスをサポートする。

7.2. デバイス選択

MLContext インターフェイスは、ニューラルネットワーク実行のグローバル状態を表す。重要なコンテキスト状態の1つは、リソースを管理し、ニューラルネットワークグラフのコンパイルおよび最終的な実行を容易にする基盤実行デバイスである。 MLContextOptionsを用いるデフォルトの作成方法に加えて、MLContextは、アプリケーションで既に使用されている特定のGPUDeviceからも作成できる。

GPUコンテキストが、システムメモリ内の定数または入力をArrayBufferViewとして持つグラフを実行する状況では、入力内容はシステムメモリからGPUメモリへ自動的にアップロードされ、グラフ実行の最後にArrayBufferView 出力バッファのシステムメモリへダウンロードされる。このデータのアップロードおよびダウンロードサイクルは、 GPUの場合のように、実行デバイスがデータをシステムメモリから外へ、そして再びシステムメモリ内へコピーすることを必要とする場合にのみ発生する。デバイスがCPUデバイスである場合には発生しない。さらに、グラフ実行の結果は既知のレイアウト形式である。グラフ内の中間結果では、ネイティブなメモリアクセスパターン向けに実行が最適化される場合があるが、呼び出し側の観点から期待される動作を維持するために、グラフの最後の演算の出力は、グラフの最後で内容を既知のレイアウト形式に戻さなければならない。

MLContextが MLContextOptionsで作成される場合、ユーザーエージェントはこれらのオプションを考慮して、基盤実行デバイスを選択し作成する。

基盤プラットフォームに応じて、ユーザーエージェントはCPU、NPU、およびGPUデバイスの異なる組み合わせを選択してもよい。

この設計の履歴および根拠については、デバイス選択の解説を参照。

7.3. 演算子

この節は非規範的である。

WebNN APIは、主要な§ 2.1 アプリケーションユースケースに対応する、よく知られたCNNおよびRNN、transformer、生成モデルに必要な一連の演算子を定義する。各演算子の詳細は、この仕様の規範的な節で、演算子名のアルファベット順に定義される。これらの演算子は、APIサーフェスの機能的概要を示すため、次の非規範的な表において、その機能に基づくカテゴリに分類される。

注記: 一部の演算子は複数のカテゴリに属する。例えば、clamp() は数学関数であると同時に、活性化としても使用される。

カテゴリ別の演算子
カテゴリ	演算子
テンソル作成	`input()`, `constant()`
テンソル操作	`concat()`, `expand()`, `gather()`, `gatherElements()`, `scatterElements()`, `gatherND()`, `scatterND()`, `where()`, `pad()`, `reshape()`, `slice()`, `split()`, `transpose()`, `resample2d()`, `reverse()`, `tile()`, `triangular()`
テンソル量子化	`quantizeLinear()`, `dequantizeLinear()`
テンソルキャスト	`cast()`
数学	`add()`, `sub()`, `mul()`, `div()`, `max()`, `min()`, `clamp()`, `pow()`, `abs()`, `ceil()`, `cos()`, `erf()`, `exp()`, `floor()`, `identity()`, `log()`, `neg()`, `reciprocal()`, `sin()`, `sqrt()`, `tan()`, `tanh()`, `sign()`, `clamp()`
論理	`equal()`, `notEqual()`, `greater()`, `greaterOrEqual()`, `lesser()`, `lesserOrEqual()`, `logicalNot()`, `logicalAnd()`, `logicalOr()`, `logicalXor()`
行列乗算	`matmul()`, `gemm()`
畳み込み	`conv2d()`, `convTranspose2d()`
プーリング	`averagePool2d()`, `l2Pool2d()`, `maxPool2d()`
活性化	`clamp()`, `elu()`, `gelu()`, `hardSigmoid()`, `hardSwish()`, `leakyRelu()`, `linear()`, `prelu()`, `relu()`, `sigmoid()`, `softmax()`, `softplus()`, `softsign()`, `tanh()`
正規化	`batchNormalization()`, `instanceNormalization()`, `layerNormalization()`
リダクション	`argMin()`, `argMax()`, `reduceL1()`, `reduceL2()`, `reduceLogSum()`, `reduceLogSumExp()`, `reduceMax()`, `reduceMean()`, `reduceMin()`, `reduceProduct()`, `reduceSum()`, `reduceSumSquare()`, `cumulativeSum()`
再帰型ニューラルネットワーク	`gruCell()`, `gru()`, `lstmCell()`, `lstm()`

7.4. タスクソース

MLタスクソースは、 MLGraphの非同期コンパイルおよび実行、ならびに MLContextの作成に関連するすべてのタスクに使用されるタスクソースである。

グローバルオブジェクトglobalおよび一連の手順 stepsが与えられたとき、MLタスクをキューに入れるには、globalおよび stepsとともに、MLタスクソース上でグローバルタスクをキューに入れる。

7.5. 権限ポリシー統合

この仕様は、文字列 "webnn"により識別されるポリシー制御機能を定義する。そのデフォルト許可リストは'self'である。

8. API

8.1. navigator.mlインターフェイス

MLオブジェクトは、 Window およびWorkerGlobalScope コンテキストにおいて、それぞれNavigator およびWorkerNavigator インターフェイスを通じて利用可能であり、navigator.mlを介して公開される。

interface mixin NavigatorML {
  [SecureContext, SameObject] readonly attribute ML ml;
};
Navigator includes NavigatorML;
WorkerNavigator includes NavigatorML;

8.2. `ML` インターフェイス

enum MLPowerPreference {
  "default",
  "high-performance",
  "low-power"
};

dictionary MLContextOptions {
  MLPowerPreference powerPreference = "default";
  boolean accelerated = true;
};

[SecureContext, Exposed=(Window, Worker)]
interface ML {
  Promise<MLContext> createContext(optional MLContextOptions options = {});
  Promise<MLContext> createContext(GPUDevice gpuDevice);
};

8.2.1. `MLContextOptions`

注記: MLContextOptions は活発に開発中であり、その設計は、さらなる実装経験およびより広いWebコミュニティからの新しいユースケースに基づいて変更されることが期待される。ワーキンググループは、フォールバックデバイス、優先順位付きの複数デバイス、または特定デバイスの除外の定義を可能にする追加のAPI制御を検討している。議論中のその他の考慮事項には、エラー処理、最終的なフォールバック、および量子化演算子が含まれる。これらの設計上の考慮事項について、 Web開発者、ライブラリ作者、OSおよびハードウェアベンダー、その他の利害関係者からのフィードバックをGitHubを通じて歓迎する。フィンガープリンティングに関する考慮事項の追加の議論については、§ 5 プライバシーの考慮事項を参照。

powerPreferenceオプションはMLPowerPreferenceであり、消費電力に関連するアプリケーションの設定を示す。これは次のいずれかである。

"default": ユーザーエージェントに最も適切な動作を選択させる。
"high-performance": 消費電力よりも実行速度を優先する。
"low-power": 実行速度などの他の考慮事項よりも消費電力を優先する。

acceleratedオプションは、大規模並列アクセラレーションに関連するアプリケーションの設定を示す。このオプションの優先度はpowerPreferenceよりも低い。 true（デフォルト）に設定されている場合、基盤プラットフォームは、powerPreferenceにも依存して、 GPUやNPUなど、利用可能な大規模並列アクセラレーターの使用を試みる。 falseに設定されている場合、アプリケーションはCPU推論を優先することを示す。例えばpowerPreference が"high-performance"であり、 accelerated がfalseであるような矛盾する入力がある場合、実装は基盤プラットフォームで利用可能な最良の一致を選択する（例えば高性能CPUモード、またはaccelerated はpowerPreferenceより優先度が低いため、それを無視する）。

8.2.2. `createContext()`

引数:

options: MLContextOptions。コンテキストに対するアプリケーションの設定を提供する。
gpuDevice: GPUDevice。コンテキストで使用する特定のデバイス。

戻り値: MLContext。

realm realmおよびoptions（GPUDevice またはMLContextOptions）が与えられたとき、コンテキストを作成するには、次の手順を実行する。

contextを、realm内の新しいMLContextとする。
optionsがGPUDevice オブジェクトである場合:
1. context.[[contextType]] を"webgpu"に設定する。
2. context.[[powerPreference]] を"default"に設定する。
3. context.[[accelerated]] をtrueに設定する。
それ以外の場合:
1. context.[[contextType]] を"default"に設定する。
2. context.[[lost]] をrealm内の新しいpromiseに設定する。
3. options["powerPreference"] が存在する場合、context.[[powerPreference]] をoptions["powerPreference"]に設定する。
4. それ以外の場合、context.[[powerPreference]] を"default"に設定する。
5. options["accelerated"] が存在する場合、context.[[accelerated]] をoptions["accelerated"]に設定する。
6. それ以外の場合、context.[[accelerated]] をtrueに設定する。
ユーザーエージェントがcontext.[[contextType]]をサポートできない場合、失敗を返す。
contextを返す。

createContext(options)手順は次のとおりである。

globalを、thisの関連グローバルオブジェクトとする。
realmを、thisの関連realmとする。
globalの関連Documentが使用を許可されていない webnn機能である場合、 "SecurityError" DOMExceptionで rejectされた、 realm内の新しいpromiseを返す。
promiseをrealm内の新しい promiseとする。
次の手順を並列に実行する。
1. contextを、realmおよびoptionsが与えられてコンテキストを作成する結果とする。それが失敗を返す場合、globalとともに MLタスクをキューに入れ、 promiseを"NotSupportedError" DOMExceptionで rejectし、これらの手順を中止する。
2. globalとともにMLタスクをキューに入れ、promiseをcontextで解決する。
promiseを返す。

createContext(gpuDevice)メソッドの手順は次のとおりである。

globalを、thisの関連グローバルオブジェクトとする。
realmを、thisの関連realmとする。
globalの関連Documentが使用を許可されていない webnn機能である場合、 "SecurityError" DOMExceptionで rejectされた、 realm内の新しいpromiseを返す。
promiseをrealm内の新しい promiseとする。
次の手順を並列に実行する。
1. contextを、realmおよびgpuDeviceが与えられてコンテキストを作成する結果とする。それが失敗を返す場合、globalとともに MLタスクをキューに入れ、 promiseを"NotSupportedError" DOMExceptionで rejectし、これらの手順を中止する。
2. globalとともにMLタスクをキューに入れ、promiseをcontextで解決する。
promiseを返す。

8.3. `MLContext` インターフェイス

MLContext インターフェイスは、ニューラルネットワーク計算ワークロードおよび実行プロセスのグローバル状態を表す。各MLContextオブジェクトは、関連付けられたコンテキスト型およびMLPowerPreferenceを持つ。

typedef record<USVString, MLTensor> MLNamedTensors;

dictionary MLContextLostInfo {
  DOMString message;
};

[SecureContext, Exposed=(Window, Worker)]
interface MLContext {
  undefined dispatch(MLGraph graph, MLNamedTensors inputs, MLNamedTensors outputs);

  Promise<MLTensor> createTensor(MLTensorDescriptor descriptor);
  Promise<MLTensor> createConstantTensor(
    MLOperandDescriptor descriptor, AllowSharedBufferSource inputData);

  Promise<ArrayBuffer> readTensor(MLTensor tensor);
  Promise<undefined> readTensor(MLTensor tensor, AllowSharedBufferSource outputData);

  undefined writeTensor(MLTensor tensor, AllowSharedBufferSource inputData);

  MLOpSupportLimits opSupportLimits();

  undefined destroy();

  readonly attribute boolean accelerated;
  readonly attribute Promise<MLContextLostInfo> lost;
};

MLContext は次の内部スロットを持つ。

[[contextType]]、型はコンテキスト型。

MLContextのコンテキスト型。

[[powerPreference]]、型はMLPowerPreference。

MLContextの MLPowerPreference。

[[accelerated]]、型はboolean。

MLContextの処理型（CPUまたは大規模並列処理）。

[[lost]]、型はPromise<MLContextLostInfo>。

Promise。 MLContextの基盤実行デバイスが利用できなくなったときに解決される。

[[timeline]]

MLContextの計算ユニット上での演算の実行に関連付けられたタイムライン。これらの演算には、計算グラフ上の推論、およびMLTensorの[[data]] の変更が含まれる。

このタイムラインをより厳密に定義する。[Issue #529]

コンテキスト型は、リソースを管理し、ニューラルネットワークグラフのコンパイルおよび実行を容易にする実行コンテキストの型である。

"default": ユーザー設定オプションに従って作成されたコンテキスト。
"webgpu": WebGPUデバイスから作成されたコンテキスト。

accelerated取得手順は、this.[[accelerated]]を返すことである。

AllowSharedBufferSource bufferSourceおよびMLOperandDescriptor descriptorが与えられたとき、記述子でバッファを検証するには、次の手順を実行する。

bufferSourceのバイト長が descriptorのバイト長と等しくない場合、falseを返す。
bufferSourceの型で分岐する。
ArrayBuffer

trueを返す。

SharedArrayBuffer

trueを返す。

ArrayBufferView
1. bufferSourceがUint8Array オブジェクトである場合、trueを返す。
2. bufferSourceが、この表に従ってdescriptorのdataType と一致する場合、trueを返す。
3. falseを返す。

注記: descriptorのdataType にかかわらずUint8Arrayを使用することは、例えばWebAssembly.Memory インスタンスの一部など、ArrayBufferのスライスを表現する汎用的な方法としてサポートされる。開発者は、読みやすさと保守性のために、 WebNNコードを作成する際には、より具体的なビュー型を使用することが推奨される。

MLNamedTensors namedTensorsおよびrecord<USVString, MLOperandDescriptor> namedDescriptorsが与えられたとき、記述子でテンソルを検証するには:

namedTensorsのサイズが namedDescriptorsのサイズと等しくない場合、 falseを返す。
namedTensorsのname → tensorごとに反復する。
1. tensor.[[isConstant]] がtrueである場合、falseを返す。
2. namedDescriptors[name]が存在しない場合、 falseを返す。
3. tensor.[[descriptor]] がnamedDescriptors[name]と等しくない場合、falseを返す。
trueを返す。

8.3.1. `dispatch()`

コンパイル済みMLGraphの計算ワークロードを、 MLContextの [[timeline]]上にスケジュールする。

引数:

graph: MLGraph。実行される計算グラフ。
inputs: MLNamedTensors。計算グラフへの入力。
outputs: MLNamedTensors。計算グラフの出力。

戻り値: undefined。

注記: dispatch()自体は、グラフ実行が完了したことを示すシグナルを提供しない。代わりに、呼び出し側は出力テンソルを読み戻した結果を awaitできる。下記の§ 8.3.1.1 例を参照。

dispatch(graph, inputs, outputs) メソッドの手順は次のとおりである。

graph.[[context]] がthisでない場合、TypeErrorを throwする。
graph.[[isDestroyed]] がtrueである場合、"InvalidStateError" DOMExceptionを throwする。
allTensorsを、inputsの値にoutputsの値を拡張したものからなる MLTensorのリストとする。
allTensorsに重複する項目が含まれる場合、 TypeErrorを throwする。
allTensorsの各tensorについて反復する。
1. tensor.[[context]] がthisでない場合、TypeErrorを throwする。
2. tensor.[[isDestroyed]] がtrueである場合、TypeErrorを throwする。
inputsおよびgraph.[[inputDescriptors]]が与えられて記述子でテンソルを検証する結果がfalseを返す場合、 TypeErrorを throwする。
outputsおよびgraph.[[outputDescriptors]]が与えられて記述子でテンソルを検証する結果がfalseを返す場合、 TypeErrorを throwする。
次の手順をgraph.[[context]].[[timeline]]へエンキューする。
1. これらの手順を実行する。ただし、thisがlostであるときは中止する。
  1. inputsおよびoutputsが与えられて、 graph.[[implementation]] に計算要求を発行する。
    
    グラフ実行中にエラーを報告する仕組みを追加する。 [Issue #778]

定数オペランドがテンソルを使用して作成される場合、buildの完了後にそのテンソルを破棄することは合法である。実装は、コンパイル済みグラフがそのような破棄によって有効なままで影響を受けないことを保証することが期待される。

8.3.1.1. 例

次のコードは、MLGraphを MLTensorを使用して実行する例を示す。

const descriptor = {
  dataType: 'float32',
  shape: [2, 2]
};
const context = await navigator.ml.createContext();
const builder = new MLGraphBuilder(context);

// 1. Create a computational graph 'C = 0.2 * A + B'.
const constant = builder.constant(descriptor, new Float32Array(4).fill(0.2));
const A = builder.input('A', descriptor);
const B = builder.input('B', descriptor);
const C = builder.add(builder.mul(A, constant), B);

// 2. Compile the graph.
const graph = await builder.build({'C': C});

// 3. Create reusable input and output tensors.
const [inputTensorA, inputTensorB, outputTensorC] = await Promise.all([
  context.createTensor({dataType: A.dataType, shape: A.shape, writable: true}),
  context.createTensor({dataType: B.dataType, shape: B.shape, writable: true}),
  context.createTensor({dataType: C.dataType, shape: C.shape, readable: true})
]);

// 4. Initialize the inputs.
context.writeTensor(inputTensorA, new Float32Array(4).fill(1.0));
context.writeTensor(inputTensorB, new Float32Array(4).fill(0.8));

// 5. Execute the graph.
const inputs = {
  'A': inputTensorA,
  'B': inputTensorB
};
const outputs = {
  'C': outputTensorC
};
context.dispatch(graph, inputs, outputs);

// 6. Read back the computed result.
const result = await context.readTensor(outputTensorC);
console.log('Output value:', new Float32Array(result));  // [1, 1, 1, 1]

8.3.2. `createTensor()`

このMLContextに関連付けられた MLTensorを作成する。

引数:

descriptor: MLTensorDescriptor。

戻り値: Promise<MLTensor>。

createTensor(descriptor)メソッドの手順は次のとおりである。

globalを、thisの関連グローバルオブジェクトとする。
realmを、thisの関連realmとする。
thisがlostである場合、"InvalidStateError" DOMExceptionで rejectされた、 realm内の新しい promiseを返す。
tensorを、MLTensorを作成する結果とする。これはthisおよび descriptorが与えられる。
promiseをrealm内の新しい promiseとする。
次の手順をthis.[[timeline]]へエンキューする。
1. これらの手順を実行する。ただし、thisがlostであるときは中止する。
  1. descriptorが与えられて、tensor.[[data]] を作成し、すべてのバイトをゼロで初期化する。
  2. それが失敗した場合、globalとともにMLタスクをキューに入れ、 promiseを"UnknownError" DOMExceptionで rejectし、これらの手順を中止する。
  3. それ以外の場合、globalとともにMLタスクをキューに入れ、 promiseをtensorで解決する。
2. 中止された場合、globalとともにMLタスクをキューに入れ、 promiseを"InvalidStateError" DOMExceptionで rejectする。
promiseを返す。

8.3.3. `createConstantTensor()`

このMLContextに関連付けられた定数MLTensorを作成する。

引数:

descriptor: MLOperandDescriptor。
inputData: AllowSharedBufferSource。そのバイトがテンソルに書き込まれるバッファ。

戻り値: Promise<MLTensor>。

createConstantTensor(descriptor, inputData) メソッドの手順は次のとおりである。

globalを、thisの関連グローバルオブジェクトとする。
realmを、thisの関連realmとする。
thisがlostである場合、"InvalidStateError" DOMExceptionで rejectされた、 realm内の新しい promiseを返す。
descriptorが与えられて次元をチェックする結果がfalseを返す場合、 TypeErrorで rejectされた、 realm内の新しい promiseを返す。
inputDataおよびdescriptorが与えられて記述子でバッファを検証する結果がfalseを返す場合、 TypeErrorで rejectされた、 realm内の新しいpromiseを返す。
bytesを、inputDataが与えられてバッファソースが保持するバイトのコピーを取得する結果とする。
Assert: bytesの長さは、descriptorのバイト長と等しい。
tensorを、thisおよびdescriptorが与えられて定数MLTensorを作成する結果とする。
promiseをrealm内の新しい promiseとする。
次の手順をthis.[[timeline]]へエンキューする。
1. これらの手順を実行する。ただし、thisがlostであるときは中止する。
  1. descriptorが与えられて、tensor.[[data]] を作成する。
  2. それが失敗した場合、globalとともにMLタスクをキューに入れ、 promiseを"UnknownError" DOMExceptionで rejectし、これらの手順を中止する。
  3. bytesをtensor.[[data]]へコピーする。
  4. それが失敗した場合、globalとともにMLタスクをキューに入れ、 promiseを"UnknownError" DOMExceptionで rejectし、これらの手順を中止する。
  5. それ以外の場合、globalとともにMLタスクをキューに入れ、 promiseをtensorで解決する。
2. 中止された場合、globalとともにMLタスクをキューに入れ、 promiseを"InvalidStateError" DOMExceptionで rejectする。
promiseを返す。

8.3.4. `readTensor(tensor)`

MLTensorの [[data]]を、 MLContext.[[timeline]] からスクリプトへ読み戻す。

引数:

tensor: MLTensor。読み取るテンソル。

戻り値: Promise<ArrayBuffer>。読み取り結果を含むバッファ。

readTensor(tensor)メソッドの手順は次のとおりである。

globalを、thisの関連グローバルオブジェクトとする。
realmを、thisの関連realmとする。
tensor.[[context]] がthisでない場合、TypeErrorで rejectされた、 realm内の新しい promiseを返す。
tensor.[[isDestroyed]] がtrueである場合、TypeErrorで rejectされた、 realm内の新しい promiseを返す。
tensor.[[descriptor]].readable がfalseである場合、TypeErrorで rejectされた、 realm内の新しい promiseを返す。
promiseをrealm内の新しい promiseとする。
promiseをtensor.[[pendingPromises]]へ追加する。
次の手順をtensor.[[context]].[[timeline]]へエンキューする。
1. これらの手順を実行する。ただし、thisがlostであるときは中止する。
  1. bytesを、tensor.[[data]]のコピーを含むバイト列とする。
  2. それが失敗した場合、globalと次の手順でMLタスクをキューに入れる。
    1. tensor.[[pendingPromises]]から promiseを削除する。
    2. promiseを"UnknownError" DOMExceptionで Rejectし、これらの手順を中止する。
  3. それ以外の場合、globalと次の手順でMLタスクをキューに入れる。
    1. tensor.[[pendingPromises]]から promiseを削除する。
    2. bufferを、realm内でbytesからArrayBufferを作成する結果とする。
    3. promiseをbufferでResolveする。
2. 中止された場合、globalとともにMLタスクをキューに入れ、 promiseを"InvalidStateError" DOMExceptionで rejectする。
promiseを返す。

8.3.5. `readTensor(tensor, outputData)`

readTensor(tensor)の bring-your-own-bufferバリアント。 [[data]] を、MLTensor から、提供されたバッファへ読み戻す。

引数:

tensor: MLTensor。読み取るテンソル。
outputData: AllowSharedBufferSource。結果を読み込む先のバッファ。

戻り値: Promise<undefined>。

readTensor(tensor, outputData) メソッドの手順は次のとおりである。

globalを、thisの関連グローバルオブジェクトとする。
realmを、thisの関連realmとする。
tensor.[[context]] がthisでない場合、TypeErrorで rejectされた、 realm内の新しい promiseを返す。
tensor.[[isDestroyed]] がtrueである場合、TypeErrorで rejectされた、 realm内の新しい promiseを返す。
tensor.[[descriptor]].readable がfalseである場合、TypeErrorで rejectされた、 realm内の新しい promiseを返す。
outputDataおよびtensor.[[descriptor]] が与えられて記述子でバッファを検証する結果がfalseを返す場合、 TypeErrorで rejectされた、 realm内の新しい promiseを返す。
promiseをrealm内の新しい promiseとする。
promiseをtensor.[[pendingPromises]]へ追加する。
次の手順をtensor.[[context]].[[timeline]]へエンキューする。
1. これらの手順を実行する。ただし、thisがlostであるときは中止する。
  1. bytesを、tensor.[[data]]のコピーを含むバイト列とする。
  2. それが失敗した場合、次の手順を実行するため、globalとともにMLタスクをキューに入れる。
    1. tensor.[[pendingPromises]]から promiseを削除する。
    2. promiseを"UnknownError" DOMExceptionで Rejectし、これらの手順を中止する。
  3. それ以外の場合、次の手順を実行するため、globalとともにMLタスクをキューに入れる。
    1. tensor.[[pendingPromises]]から promiseを削除する。
    2. outputDataがdetachされている場合、 promiseをTypeErrorで rejectし、これらの手順を中止する。
      
      注記: 上記の記述子でバッファを検証するは、 outputDataがdetachされている場合に失敗するが、その手順とこの手順の間に outputDataがdetachされる可能性がある。
    3. bytesをoutputDataへ書き込む。
    4. promiseをundefinedでResolveする。
2. 中止された場合、globalとともにMLタスクをキューに入れ、 promiseを"InvalidStateError" DOMExceptionで rejectする。
promiseを返す。

8.3.6. `writeTensor()`

MLTensorの [[data]] へ、MLContextの [[timeline]]上でデータを書き込む。

引数:

tensor: MLTensor。書き込み先のテンソル。
inputData: AllowSharedBufferSource。そのバイトがテンソルに書き込まれるバッファ。

戻り値: undefined。

writeTensor(tensor, inputData) メソッドの手順は次のとおりである。

tensor.[[context]] がthisでない場合、TypeErrorを throwする。
tensor.[[isDestroyed]] がtrueである場合、TypeErrorを throwする。
tensor.[[descriptor]].writable がfalseである場合、TypeErrorを throwする。
inputDataおよびtensor.[[descriptor]] が与えられて記述子でバッファを検証する結果がfalseを返す場合、 TypeErrorを throwする。
bytesを、inputDataが与えられてバッファソースが保持するバイトのコピーを取得する結果とする。
Assert: bytesの長さは、tensor.[[descriptor]]のバイト長と等しい。
次の手順をtensor.[[context]].[[timeline]]へエンキューする。
1. これらの手順を実行する。ただし、thisがlostであるときは中止する。
  1. bytesをtensor.[[data]]へコピーする。
    
    テンソルへの書き込み中にエラーを報告する仕組みを追加する。 [Issue #778]

注記: dispatch()と同様に、 writeTensor()自体は、書き込みが完了したことを示すシグナルを提供しない。テンソルの内容を調べるために、呼び出し側はテンソルを読み戻した結果をawaitできる。

8.3.7. `opSupportLimits()`

opSupportLimits()は、実装間で演算子レベルにおいて異なるサポートレベルを公開する。WebNN APIの利用者は、各ターゲットプラットフォームに配備する最適なモデルアーキテクチャを決定するために、opSupportLimits()を使用して機能サポートレベルを調べることが推奨される。

注記: opSupportLimits() APIは、ブラウザーフィンガープリンティングのために追加のエントロピーを提供することを意図していない。現在の実装では、この機能サポート情報はOSおよびブラウザーバージョンだけから推論できる。将来の実装の多様性がそれを正当化する場合、このAPIにより、将来の実装は、エントロピーを低減するためWebGPUと同様に能力をバケット化するなど、新しいプライバシー緩和策を追加できる。

フィンガープリンティングに関する考慮事項の追加の議論については、§ 5 プライバシーの考慮事項を参照。

8.3.7.1. `MLOpSupportLimits` 辞書

MLOpSupportLimitsは次のトップレベルメンバーを持つ。これらに加えて、各演算子には、そのビルダーメソッドで定義される対応するメンバーがある。

dictionary MLOpSupportLimits {
  MLInputOperandLayout preferredInputLayout;
  [EnforceRange] unsigned long long maxTensorByteLength;
  MLTensorLimits input;
  MLTensorLimits constant;
  MLTensorLimits output;
};

preferredInputLayout, 型はMLInputOperandLayout: conv2d()など、レイアウト依存演算子に対する推奨入力レイアウト。
maxTensorByteLength, 型はunsigned long long: テンソルのサポートされる最大長（バイト単位）。
input, 型はMLTensorLimits: MLGraphに対する入力MLOperandのサポート制限。
constant, 型はMLTensorLimits: MLGraphに対する定数MLOperandのサポート制限。
output, 型はMLTensorLimits: MLGraphに対する出力MLOperandのサポート制限。

8.3.7.2. `MLRankRange` 辞書

dictionary MLRankRange {
  unsigned long min;
  unsigned long max;
};

min, 型はunsigned long: サポートされる最小ランク。
max, 型はunsigned long: サポートされる最大ランク。

8.3.7.3. `MLTensorLimits` 辞書

typedef sequence<MLOperandDataType> MLDataTypeList;

dictionary MLTensorLimits {
  MLDataTypeList dataTypes;
  MLRankRange rankRange;
};

dataTypes, 型はMLDataTypeList: サポートされるデータ型。
rankRange, 型はMLRankRange: サポートされる最小および最大ランク。

8.3.7.4. `MLBinarySupportLimits` 辞書

dictionary MLBinarySupportLimits {
  MLTensorLimits a;
  MLTensorLimits b;
  MLTensorLimits output;
};

a, 型はMLTensorLimits: aオペランド用のMLTensorLimits。
b, 型はMLTensorLimits: bオペランド用のMLTensorLimits。
output, 型はMLTensorLimits: 出力オペランド用のMLTensorLimits。

8.3.7.5. `MLSingleInputSupportLimits` 辞書

dictionary MLSingleInputSupportLimits {
  MLTensorLimits input;
  MLTensorLimits output;
};

input, 型はMLTensorLimits: 入力オペランド用のMLTensorLimits。
output, 型はMLTensorLimits: 出力オペランド用のMLTensorLimits。

8.3.8. `destroy()`

destroy() メソッドは、コンテキストに関連付けられたすべてのリソースを解放するために呼び出すことができる。未完了の計算要求、およびMLTensorの作成/読み取り/書き込み要求は失敗する。

destroy()メソッドの手順は次のとおりである。

thisがlostである場合、これらの手順を中止する。
実装定義のメッセージで、thisをloseする手順を実行する。

注記: destroy()が呼び出されたことを示すメッセージは、開発者がコンテキスト喪失の原因を区別するのに役立つ。

8.3.9. エラー

ユーザーエージェントが、MLContextが要求を満たすために利用できなくなったと判断した場合、そのためのコンテキスト喪失手順を実行しなければならない。

MLContext contextに対するコンテキスト喪失手順は次のとおりである。

globalを、contextの関連グローバルオブジェクトとする。
次の手順を実行するため、globalとともにMLタスクをキューに入れる。
1. Lose contextする。メッセージは実装定義とする。

MLContext contextを、DOMString messageでloseするには:

infoを新しいMLContextLostInfoとする。
info.message をmessageに設定する。
context.[[lost]] をinfoでResolveする。
graph.[[context]] がthisと等しい各MLGraph graphについて:
1. graphをthisとして、 graphに対するdestroy() メソッド手順を実行する。
tensor.[[context]] がthisと等しい各MLTensor tensorについて:
1. tensorをthisとして、 tensorに対するdestroy() メソッド手順を実行する。

message, 型はDOMString: 発生したエラーに関する情報を提供する実装定義のメッセージ。

lost取得手順は、thisの[[lost]] Promiseを返すことである。

MLContextは、その[[lost]] Promise がsettledしている場合、lostである。

8.4. `MLGraph` インターフェイス

MLGraph インターフェイスは、コンパイル済み計算グラフを表す。コンパイル済みグラフは、一度構築されると不変であり、その後変更できない。

[SecureContext, Exposed=(Window, Worker)]
interface MLGraph {
  undefined destroy();
};

MLGraphは次の内部スロットを持つ。

[[context]]、型はMLContext: このMLGraphに関連付けられた MLContext型のコンテキスト。
[[inputDescriptors]]、型は record<USVString, MLOperandDescriptor>: このMLGraphのすべての入力MLOperandについて、入力MLOperandの名前をそのMLOperandDescriptorに対応付ける。
[[outputDescriptors]]、型は record<USVString, MLOperandDescriptor>: このMLGraphのすべての出力MLOperandについて、出力MLOperandの名前をそのMLOperandDescriptorに対応付ける。
[[implementation]]: ユーザーエージェントにより提供される基盤実装。
[[isDestroyed]]、型はboolean: MLGraph.destroy() メソッド手順が実行されたかどうか。いったん破棄されると、MLGraph はもはや使用できない。

8.4.1. `destroy()`

destroy() メソッドは、グラフに関連付けられたすべてのリソースを解放するために呼び出すことができる。

destroy()メソッドの手順は次のとおりである。

this.[[isDestroyed]] がtrueである場合、これらの手順を中止する。
this.[[isDestroyed]] をtrueに設定する。
このグラフが所有するリソースを解放可能としてマークするために、this.[[context]].[[timeline]] 上にタスクをキューに入れる。

注記: このグラフを使用してこれ以上ワークロードをキューに入れることはできないため、実装は、このグラフを使用して以前に送信されたすべてのワークロードが完了した後、このグラフに関連付けられた追加のリソース割り当てを解放できる。

8.5. `MLOperandDescriptor` 辞書

MLOperandDescriptorは、オペランドの形状（次元）およびデータ型を記述する。これはMLGraphの入力および定数を記述するために使用され、各MLOperandは内部MLOperandDescriptorを持つ。

enum MLInputOperandLayout {
  "nchw",
  "nhwc"
};

enum MLOperandDataType {
  "float32",
  "float16",
  "int32",
  "uint32",
  "int64",
  "uint64",
  "int8",
  "uint8"
};

dictionary MLOperandDescriptor {
  required MLOperandDataType dataType;
  required sequence<[EnforceRange] unsigned long> shape;
};

dataType, 型はMLOperandDataType: オペランドのデータ型。
shape, 型はsequence<[EnforceRange] unsigned long>: オペランドの次元のリスト。スカラーオペランドの場合は空である。

MLOperandDescriptor Aは、A.dataTypeが B.dataTypeと等しく、かつA.shapeが B.shapeと等しい場合、MLOperandDescriptor Bと等しい。

MLOperandDataType dataTypeおよびリストshapeが与えられたとき、MLOperandDescriptorを作成するには、次の手順を実行する。

descriptorを新しいMLOperandDescriptorとする。
descriptor.dataType をdataTypeに設定する。
descriptor.shape をshapeのクローンに設定する。
descriptorを返す。

MLOperandDescriptor descのバイト長は、次の手順により返される値である。

elementLengthを1とする。
desc.shapeの各dimensionについて反復する。
1. elementLengthをelementLength * dimensionに設定する。
elementSizeを、この表に従ってdesc.dataTypeと一致するArrayBufferView 型の1つの要素サイズとする。
elementLength * elementSizeを返す。

MLOperandDescriptor descの要素数は、次の手順により返される値である。

elementCountを1とする。
desc.shapeの各dimensionについて反復する。
1. elementCountをelementCount * dimensionに設定する。
elementCountを返す。

妥当な次元は、 0より大きく、longの範囲内にある整数である。実装はより小さな上限を課してもよい。

妥当なテンソル数は、0より大きく8192以下の整数である。実装はより小さな上限を課してもよい。

サイズ0の次元はサポートされるべきか？ [Issue #391]

MLOperandDescriptor descriptorが与えられたとき、次元をチェックするには、次の手順を実行する。

descriptor.shapeのいずれかの項目が妥当な次元でない場合、falseを返す。
descriptor.shapeのサイズが実装でサポートするには大きすぎる場合、 falseを返す。

オペランド次元の最大数は定義されていないが、ネイティブML APIは通常、サポートされる最大サイズを持つ。[Issue #456]
descriptorの要素数が妥当な次元でない場合、falseを返す。
descriptorのバイト長が実装でサポートされていない場合、 falseを返す。
trueを返す。

8.6. `MLOperand` インターフェイス

MLOperandは、演算の一部を完全に合成された演算へ合成した結果として構築される、中間グラフを表す。

例えば、MLOperandは、演算に供給される定数、または複数の定数を1つの演算へ結合した結果を表すことができる。 § 7 プログラミングモデルも参照。

[SecureContext, Exposed=(Window, Worker)]
interface MLOperand {
  readonly attribute MLOperandDataType dataType;
  readonly attribute FrozenArray<unsigned long> shape;
};

dictionary MLOperatorOptions {
  USVString label = "";
};

typedef (bigint or unrestricted double) MLNumber;

MLOperand は次の内部スロットを持つ。

[[builder]]、型はMLGraphBuilder: MLOperandの関連ビルダーオブジェクト。
[[descriptor]]、型はMLOperandDescriptor: MLOperandの記述子。
[[name]]、型は文字列: MLOperandの名前（入力オペランドの場合のみ）。
[[operator]]、型は演算子: MLOperandに対応する演算子への参照。
[[constantTensor]]、型は MLTensor: MLOperandのテンソル（定数オペランドの場合のみ）。

An MLOperandの dataTypeは、その[[descriptor]].dataTypeである。

An MLOperandの shapeは、その[[descriptor]].shapeである。

An MLOperandの rankは、そのshapeのsizeである。

dataType getter stepsは、 thisのdataTypeを返す。

shape getter stepsは、 thisのshapeを返す。

[[builder]] オブジェクトは、MLGraphBuilder() コンストラクターによってMLContext オブジェクトに束縛されるため、MLOperandも、常に同じMLContext オブジェクトに束縛される。

ある操作がMLOperandDataTypeの一部のみをサポートする場合、位置引数とオプションの両方を含む、その操作の各入力オペランドの許可されるデータ型は、MLOperandDataTypeの明示的なリスト、または、そのオペランドのdataTypeが別の入力オペランドのdataTypeと同じでなければならないという制約、または任意のMLOperandDataTypeを許可する任意として与えられる。

実装は、オペランドについて指定されたものより少ないデータ型をサポートしてもよい（MAY）が、少なくとも指定された必須データ型をサポートしなければならない（MUST）。サポート状況は、opSupportLimits() メソッドをMLContext上で使用し、 Chromium プロトタイプにおける ONNX Runtime、LiteRT、および CoreML バックエンド全体で、操作に対応するメンバーのdataTypes 値を調べることで問い合わせることができる。

必須データ型の集合は、開発者がこれらのデータ型のみを使用するようにモデルを設計することで相互運用可能なコンテンツを作成できるよう、幅広いプラットフォームにわたる実装経験に基づいて決定されている。この仕様の Web Platform Tests は、この機能検出メカニズムを使用して、すべての許可されるデータ型について正しい動作を検証するが、必須データ型のみのサポートでも合格できる。

ある操作が、特定のrankを持つ入力オペランドを必要とする場合、位置引数とオプションの両方を含む、その操作の各入力オペランドの許可される rankは、明示的な rank（例: 1）、または任意の次元数を許可するN、または別のオペランドと同じとして与えられる。入力オペランドの形状が別の入力オペランドに対して単方向にブロードキャスト可能でなければならない、または別の入力オペランドと双方向にブロードキャスト可能でなければならない場合など、より具体的な制約は一般的である。このような場合、許可される rankは範囲として列挙され、具体的な検証は操作内のステップとして与えられる。

実装は、指定されたものよりも制限された下限および/または上限を、オペランドのrankに課してもよい（MAY）が、少なくとも指定された必須 rankをサポートしなければならない（MUST）。サポート状況は、opSupportLimits() メソッドをMLContext上で使用し、 Chromium プロトタイプにおける ONNX Runtime、LiteRT、および CoreML バックエンド全体で、操作に対応するメンバーのrankRange.min およびrankRange.max 値を調べることで問い合わせることができる。

必須 rankの集合は、開発者がこれらの rank のみを持つ入力オペランドで構成されるようにモデルを設計することで相互運用可能なコンテンツを作成できるよう、幅広いプラットフォームにわたる実装経験に基づいて決定されている。

MLOperatorOptions は、次のメンバーを持つ。

label, of type USVString, defaulting to "": MLGraphBuilder メソッドを使用してoperatorが作成され、MLOperandが作成されるときに任意で提供される。実装は、この値を使用して、operatorのlabelを初期化してもよい。

Note: label は自然言語の文字列を意図したものではない。これは、変数名やエラーコードに類似した、言語に依存しない識別子であり、 "mul#1234"のようなものである。

Note: 実装には、開発者が提供するlabel を使用して、グラフ構築中の同期的なエラーと、非同期のbuild() メソッド中に発生するエラーの両方を含め、エラーメッセージを強化し、デバッグしやすさを向上させることが推奨される。

開発者がlabel を介して提供した label を、デバッグツール、ログ、またはエラーメッセージで表示する場合、実装は、悪意ある Unicode シーケンスの注入などのセキュリティリスクを防ぐために、出力をサニタイズするべきである。たとえば、双方向テキストスプーフィング [UTR36]、ソースコードスプーフィング [UTS55]およびその他の懸念がある。たとえば、実装は制御文字（例: U+202A から U+202E、U+2066 から U+2069）をエスケープまたはフィルターするか、安全なレンダリングメカニズムを使用して潜在的なスプーフィングを無効化するべきである。

8.6.1. `MLOperand`の作成

MLOperand オブジェクトは、MLGraphBuilderのメソッドにより、内部的に次のアルゴリズムを使用して作成される。

MLGraphBuilder builderおよびMLOperandDescriptor descが与えられたとき、MLOperandを作成するには、次の手順を実行する。

realmを、builderの関連realmとする。
operandをrealm内の新しいMLOperand とする。
operand.[[builder]] をbuilderに設定する。
operand.[[descriptor]] をdescに設定する。
operandを返す。

MLOperand operandが与えられたとき、MLOperandをコピーするには、次の手順を実行する。

builderをoperand.[[builder]]とする。
realmを、builderの関連realmとする。
resultをrealm内の新しいMLOperand とする。
result.[[builder]] をbuilderに設定する。
result.[[descriptor]] をoperand.[[descriptor]]に設定する。
operand.[[name]] が存在する場合、result.[[name]] をoperand.[[name]]に設定する。
resultを返す。

MLGraphBuilder builderおよびMLOperand operandが与えられたとき、オペランドを検証するには、operand.[[builder]] がbuilderであればtrueを返し、そうでなければfalseを返す。

8.6.1.1. `MLNumber`

MLNumberは、64ビット整数型（"uint64" および"int64"）と 32ビット浮動小数点（"float32"）の両方を含む、任意のMLOperandDataTypeであり得る MLOperandに対する数値オプションの型を指定するときに使用される。実装は、対応するMLOperandDataTypeに従って値を処理する。例えば、clamp(input, options)が、 dataTypeが"uint32"である MLOperandで呼び出された場合、 MLNumber パラメーターは明示的にunsigned longへキャストされる。

オプションをdoubleとして指定すると、 2⁵³を超える値を渡すときに精度が失われ、long longを指定すると2⁶³を超える値が許可されなくなる。

bigint と数値型のunionのサポートは[WEBIDL]で新しく、実装サポートも限定的である。プロトタイプ実装には、このアプローチについてフィードバックを提供することが推奨される。[whatwg/webidl Issue #1388]

8.7. `MLTensorDescriptor` 辞書

MLTensorDescriptorは、 MLTensorの特性および能力を記述する。

dictionary MLTensorDescriptor : MLOperandDescriptor {
  boolean readable = false;
  boolean writable = false;
};

readable, 型はboolean、デフォルトはfalse: テンソルの内容をreadTensor(tensor) またはreadTensor(tensor, outputData)を通じて読み取れるかどうか。
writable, 型はboolean、デフォルトはfalse: テンソルの内容をwriteTensor()を通じて書き込めるかどうか。

8.8. `MLTensor` インターフェイス

MLTensor インターフェイスは、MLGraphへの入力または出力として使用できるテンソルを表す。MLTensorを裏付けるメモリは、それを作成するために使用されたMLContextおよび MLTensorDescriptorの要件に従って、実装定義の方法で割り当てられるべきである。MLTensorの [[data]] に関わる演算は、関連するMLContextの [[timeline]] 上で発生する。

MLTensorがどのように割り当てられるかについての実装定義の要件には、メモリが特定のバイトアラインメントで割り当てられる、または特定のメモリプール内で割り当てられるといった制約が含まれ得る。

[SecureContext, Exposed=(Window, Worker)]
interface MLTensor {
  readonly attribute MLOperandDataType dataType;
  readonly attribute FrozenArray<unsigned long> shape;
  readonly attribute boolean readable;
  readonly attribute boolean writable;
  readonly attribute boolean constant;

  undefined destroy();
};

MLTensorは次の内部スロットを持つ。

[[context]]、型はMLContext: MLTensorの関連コンテキスト。
[[descriptor]]、型はMLTensorDescriptor: MLTensorの記述子。
[[pendingPromises]]、型は Promiseの集合: 進行中でまだ解決されていないMLContext.readTensor(tensor) メソッド呼び出しに対応するPromise。MLTensor が破棄されると、すべての保留中のpromiseはrejectされる。
[[isDestroyed]]、型はboolean: MLTensor.destroy() 手順が実行されたかどうか。いったん破棄されると、MLTensor はもはや使用できない。
[[data]]、型は実装定義の型: MLTensorを裏付けるバイト。このデータは[[context]].[[timeline]] からのみアクセスまたは変更できる。
[[isConstant]]、型はboolean: MLTensorが定数MLTensorを作成するにより作成されたかどうか。

MLTensorの dataTypeは、その[[descriptor]]の dataTypeである。

MLTensorの shapeは、その[[descriptor]]の shapeである。

dataTypeの取得手順は、 thisのdataTypeを返すことである。

shapeの取得手順は、 thisのshapeを返すことである。

readableの取得手順は、 this.[[descriptor]].readableを返すことである。

writableの取得手順は、 this.[[descriptor]].writableを返すことである。

constantの取得手順は、 thisの [[isConstant]]を返すことである。

8.8.1. `MLTensor`の作成

MLTensorは、関連付けられたMLContextにより作成される。

MLContext contextおよびMLTensorDescriptor descriptorが与えられたとき、MLTensorを作成するには、次の手順を実行する。

realmを、contextの関連realmとする。
tensorをrealm内の新しいMLTensor とする。
tensor.[[context]] をcontextに設定する。
tensor.[[descriptor]] をdescriptorに設定する。
tensor.[[isDestroyed]] をfalseに設定する。
tensor.[[isConstant]] をfalseに設定する。
tensorを返す。

8.8.2. `destroy()`

MLTensorに関連付けられたリソースを解放する。このメソッドは冪等である。

戻り値: undefined。

destroy()メソッドの手順は次のとおりである。

this.[[isDestroyed]] をtrueに設定する。
各promiseについて、this.[[pendingPromises]]内で:
1. this.[[pendingPromises]]から promiseを削除する。
2. promiseを"InvalidStateError" DOMExceptionで Rejectする。
次の手順をthis.[[context]].[[timeline]]へエンキューする。
1. this.[[data]]を解放する。

注記: このテンソルを使用してこれ以上演算をキューに入れることはできないため、実装は、このテンソルを使用して以前に送信されたすべての演算が完了した後、このテンソルに関連付けられた追加のリソース割り当てを解放できる。

8.8.3. 定数`MLTensor`の作成

定数MLTensorは、関連付けられたMLContextにより作成される。

MLContext context、MLOperandDescriptor inputDescriptorが与えられたとき、定数MLTensorを作成するには、次の手順を実行する。

realmを、contextの関連realmとする。
tensorをrealm内の新しいMLTensor とする。
tensor.[[context]] をcontextに設定する。
tensorDescriptorを新しいMLTensorDescriptorとする。
tensorDescriptor.readable をfalseに設定する。
tensorDescriptor.writable をfalseに設定する。
tensorDescriptor.dataType をinputDescriptor.dataTypeに設定する。
tensorDescriptor.shape をinputDescriptor.shapeに設定する。
tensor.[[descriptor]] をtensorDescriptorに設定する。
tensor.[[isDestroyed]] をfalseに設定する。
tensor.[[isConstant]] をtrueに設定する。
tensorを返す。

8.9. `MLGraphBuilder` インターフェイス

MLGraphBuilder インターフェイスは、§ 2 ユースケースで特定される、計算グラフへ合成できる一連の演算を定義する。また、グラフ構築セッションの中間状態も表す。

typedef record<USVString, MLOperand> MLNamedOperands;

[SecureContext, Exposed=(Window, Worker)]
interface MLGraphBuilder {
  // Construct the graph builder from the context.
  constructor(MLContext context);

  // Create an operand for a graph input.
  MLOperand input(USVString name, MLOperandDescriptor descriptor);

  // Create an operand for a graph constant.
  MLOperand constant(MLOperandDescriptor descriptor,
                     AllowSharedBufferSource buffer);

  // Create a scalar operand from the specified number of the specified type.
  MLOperand constant(MLOperandDataType dataType, MLNumber value);

  // Create an operand from a specified constant tensor.
  MLOperand constant(MLTensor tensor);

  // Compile the graph up to the specified output operands asynchronously.
  Promise<MLGraph> build(MLNamedOperands outputs);
};

MLGraphBuilder.build() メソッドは、グラフビルダー状態を、指定された出力オペランドまで、作成元のMLContextの型に従ってコンパイル済みグラフへコンパイルする。MLContextの [[contextType]] が"default"に設定されている場合、コンパイル済みグラフは MLGraphが返される直前に初期化される。このグラフ初期化段階は、後続のグラフ実行の最適な性能にとって重要である。通常これは"weight preprocessing"として知られるプロセスを伴い、グラフへのすべての定数入力が前処理され、後続のグラフ実行呼び出しのためにオペレーティングシステムレベルでキャッシュされる。初期化入力は通常、グラフ構築時にconstant() メソッドを通じて定数オペランドとして指定された定数重みデータである。

MLGraphBuilder は次の内部スロットを持つ。

[[context]]、型はMLContext: このMLGraphBuilderに関連付けられた MLContext型のコンテキスト。
[[hasBuilt]]、型はboolean: MLGraphBuilder.build() が呼び出されたかどうか。いったん構築されると、MLGraphBuilder はもはや演算子を作成したり、MLGraphをコンパイルしたりできない。

MLGraphBuilderは、その[[hasBuilt]] がfalseであり、かつその[[context]] がlostでない場合、 buildできる。

8.9.1. `MLGraphBuilder` コンストラクター

引数:

context: MLContext。 MLGraphBuilderに関連付けるコンテキスト。

new MLGraphBuilder(context) コンストラクターの手順は次のとおりである。

thisの関連グローバルオブジェクトの関連Documentが webnn機能の使用を許可されていない場合、"SecurityError" DOMExceptionを throwする。
contextがlostである場合、"InvalidStateError" DOMExceptionを throwする。
this.[[context]] をcontextに設定する。
this.[[hasBuilt]] をfalseに設定する。

8.9.2. 入力オペランド

入力として使用できる、記述子に基づく名前付きMLOperandを作成する。

引数:

name: 入力の文字列名。
descriptor: MLOperandDescriptor オブジェクト。

戻り値: MLOperand。

input(name, descriptor) メソッドの手順は次のとおりである。

thisがbuildできない場合、"InvalidStateError" DOMExceptionを throwする。
nameが空である場合、TypeErrorを throwする。
thisのグラフの入力内のいずれかの MLOperandが、 nameと等しい[[name]]を持つ場合、TypeErrorを throwする。
descriptorが与えられて次元をチェックする結果がfalseを返す場合、 TypeErrorを throwする。
グラフ接続を作成する:
1. operandを、thisおよび descriptorが与えられてMLOperandを作成する結果とする。
2. operand.[[name]] をnameに設定する。
3. operandをthisのグラフの入力へ追加する。
operandを返す。

MLGraphBuilder APIは、入力オペランドなしでMLGraphを作成することを許可する。基盤プラットフォームがそれをサポートしていない場合、実装はスタブ入力を追加するか、定数をグラフへの入力として渡すことができる。

8.9.3. 定数オペランド

MLGraphBuilder メソッドで使用できる定数MLOperandを作成する。

8.9.3.1. `constant(descriptor, buffer)`

初期化データを含む、指定されたデータ型および形状の定数MLOperandを作成する。

引数:

descriptor: MLOperandDescriptor。出力テンソルの記述子。
buffer: AllowSharedBufferSource。初期化データを含むバッファ。

戻り値: MLOperand。定数出力テンソル。

constant(descriptor, buffer) メソッドの手順は次のとおりである。

thisがbuildできない場合、"InvalidStateError" DOMExceptionを throwする。
descriptorが与えられて次元をチェックする結果がfalseを返す場合、 TypeErrorを throwする。
bufferおよびdescriptorが与えられて記述子でバッファを検証する結果がfalseを返す場合、 TypeErrorを throwする。
グラフ接続を作成する:
1. operandを、thisおよび descriptorが与えられてMLOperandを作成する結果とする。
2. bytesを、bufferが与えられてバッファソースが保持するバイトのコピーを取得する結果とする。
3. operandを、bytesを値としてthisのグラフの定数へ追加する。
operandを返す。

8.9.3.2. `constant(tensor)`

初期化済みデータを含む、指定されたデータ型および形状の定数MLOperandを作成する。

引数:

tensor: MLTensor。初期化済みデータを含む定数テンソル。

戻り値: MLOperand。定数出力テンソル。

constant(tensor)メソッドの手順は次のとおりである。

tensor.[[context]] がthis.[[context]]でない場合、 TypeErrorを throwする。
tensor.[[isDestroyed]] がtrueである場合、TypeErrorを throwする。
tensor.[[isConstant]] がfalseである場合、TypeErrorを throwする。
thisがbuildできない場合、"InvalidStateError" DOMExceptionを throwする。
グラフ接続を作成する:
1. operandを、thisおよび tensor.[[descriptor]]が与えられてMLOperandを作成する結果とする。
2. operand.[[constantTensor]] をtensorに設定する。
3. operandを、tensorを値としてthisのグラフの定数へ追加する。
operandを返す。

8.9.3.3. `constant(dataType, value)`

指定された値およびデータ型のスカラー定数MLOperandを作成する。

指定された値が指定された出力データ型の範囲を超える場合、例えば浮動小数点値が"int8" データ型に割り当てられる場合などには、データの切り捨てが発生する。

引数:

dataType: MLOperandDataType。
value: MLNumber。定数の値。

戻り値: MLOperand。定数出力。

constant(dataType, value) メソッドの手順は次のとおりである。

thisがbuildできない場合、"InvalidStateError" DOMExceptionを throwする。
valueを、valueをdataTypeへキャストする結果に設定する。
descriptorを、dataTypeおよび« »が与えられてMLOperandDescriptorを作成する結果とする。
グラフ接続を作成する:
1. operandを、thisおよび descriptorが与えられてMLOperandを作成する結果とする。
2. operandを、valueを値としてthisのグラフの定数へ追加する。
operandを返す。

8.9.4. buildメソッド

与えられた出力オペランドまでの合成済みグラフを、非同期に計算グラフへ構築する。

引数:

outputs: MLNamedOperands。グラフの出力となるMLOperandを特定する。

戻り値: Promise<MLGraph>。

build(outputs)メソッドの手順は次のとおりである。

realmを、thisの関連realmとする。
thisがbuildできない場合、"InvalidStateError" DOMExceptionで rejectされた、 realm内の新しいpromiseを返す。
outputsが空である場合、TypeErrorで rejectされた、 realm内の新しい promiseを返す。
outputsの各name → operandについて反復する。
1. nameが空である場合、TypeErrorで rejectされた、 realm内の新しい promiseを返す。
2. thisおよびoperandが与えられてオペランドを検証する結果がfalseを返す場合、TypeErrorで rejectされた、realm内の新しいpromiseを返す。
3. operandがthisのグラフの入力または定数内にある場合、TypeErrorで rejectされた、realm内の新しいpromiseを返す。
4. operand.[[constantTensor]] が存在し、かつoperand.[[constantTensor]].[[isDestroyed]] がtrueである場合、TypeErrorで rejectされた、 realm内の新しい promiseを返す。
operandsを新しい空の集合とする。
operatorsを新しい空の集合とする。
inputsを新しい空の集合とする。
queueを、outputsの値を含む新しいキューとする。
queueが空でない間:
1. queueからoperandをデキューする。
2. operandをoperandsへ追加する。
3. operand.[[operator]] をoperatorsへ追加する。
4. operandがthisのグラフの入力内にある場合、operandを inputsへ追加する。
5. operand.[[operator]]の入力の各inputについて反復する。
  1. inputをqueueへエンキューする。
globalを、thisの関連グローバルオブジェクトとする。
graphをrealm内の新しいMLGraph とする。
graph.[[context]] をthis.[[context]]に設定する。
graph.[[isDestroyed]] をfalseに設定する。
inputs内の各operandについて反復する。
1. graph.[[inputDescriptors]][operand.[[name]]] をoperand.[[descriptor]]に設定する。
outputsの各name → operandについて反復する。
1. graph.[[outputDescriptors]][name] をoperand.[[descriptor]]に設定する。
this.[[hasBuilt]] をtrueに設定する。
promiseをrealm内の新しい promiseとする。
次の手順をgraph.[[context]].[[timeline]]へエンキューする。
1. これらの手順を実行する。ただし、graph.[[context]]が lostであるときは中止する。
  1. graphImplを、thisのグラフを、operands、 operators、inputs、およびoutputsの値、さらに graph.[[context]].[[powerPreference]] およびgraph.[[context]].[[accelerated]] とともに、基盤プラットフォームにより解釈できる実装定義の形式へ変換した結果とする。
  2. 前の手順が失敗した場合、globalとともにMLタスクをキューに入れ、promiseを"OperationError" DOMExceptionで rejectし、これらの手順を中止する。
  3. graph.[[implementation]] をgraphImplに設定する。
  4. globalとともにMLタスクをキューに入れ、 promiseをgraphでresolveする。
2. 中止された場合、globalとともにMLタスクをキューに入れ、 promiseを"InvalidStateError" DOMExceptionで rejectする。
promiseを返す。

注記: 入力オペランドまたは定数オペランドをグラフの出力として指定すると、これは通常APIの不正な使用であるため、エラーになる。呼び出し側はidentity() 演算子を導入することでこれを回避できる。

8.9.5. argMin/argMax演算

軸に沿ったすべての入力値の最小値または最大値のインデックス位置を返す。同値の場合、返り値の同一性は実装依存である。

dictionary MLArgMinMaxOptions : MLOperatorOptions {
  boolean keepDimensions = false;
  MLOperandDataType outputDataType = "int32";
};

partial interface MLGraphBuilder {
  MLOperand argMin(MLOperand input, [EnforceRange] unsigned long axis,
                   optional MLArgMinMaxOptions options = {});
  MLOperand argMax(MLOperand input, [EnforceRange] unsigned long axis,
                   optional MLArgMinMaxOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits argMin;
  MLSingleInputSupportLimits argMax;
};

MLArgMinMaxOptions は次のメンバーを持つ。

keepDimensions, 型は boolean、デフォルトはfalse: trueの場合、縮約された次元をサイズ1で保持する。
outputDataType, 型は MLOperandDataType、デフォルトは "int32": MLOperandDataType。出力データ型。

引数:

input: MLOperand。入力N次元テンソル。
axis: 縮約する次元。値は、Nが入力テンソルのrankであるとき、 [0, N-1]の範囲内でなければならない。
options: 任意の MLArgMinMaxOptions。演算の任意パラメーター。

戻り値: MLOperand。 keepDimensions がtrueの場合はinputの rankと等しいrank、keepDimensions がfalseの場合はinputの rank - 1である出力N次元テンソル。値は、Nがaxisにより指定される入力次元のサイズであるとき、 [0, N-1]の範囲内のoutputDataType 型でなければならない。

`argMin()`/`argMax()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	任意	`"float32"`, `"float16"`, `"int32"`	1 から N	1 から 5
出力	`"int32"`, `"int64"`	`"int32"`	N	0 から 5

MLOpSupportLimits はargMin() およびargMax()について次のメンバーを持つ。

argMin, 型はMLSingleInputSupportLimits: argMin()演算子のサポート制限。
argMax, 型はMLSingleInputSupportLimits: argMax()演算子のサポート制限。

文字列 op、MLOperand input、unsigned long axis、およびMLArgMinMaxOptions optionsが与えられたとき、argMin/argMax演算を作成するには、次の手順を実行する。

Assert: opは"argMin", "argMax"のいずれかである。
thisがbuildできない場合、"InvalidStateError" DOMExceptionを throwする。
thisおよびinputとともにオペランドを検証する結果がfalseを返す場合、TypeErrorを throwする。
axisがinputのrank以上である場合、TypeErrorを throwする。
options.outputDataType が（この表に従う）出力テンソルの許可されるデータ型でない場合、TypeErrorを throwする。
inputのshape[axis]が options.outputDataTypeの最大値より大きい場合、TypeErrorを throwする。
outputShapeを、inputのshape、« axis »、およびoptions.keepDimensions が与えられて縮約出力サイズを計算する結果とする。それが失敗を返す場合、TypeErrorを throwする。
descを、options.outputDataType およびoutputShapeが与えられてMLOperandDescriptorを作成する結果とする。
グラフ接続を作成する:
1. operatorを、optionsが与えられた、op演算用の演算子とする。
2. outputを、thisおよび descが与えられてMLOperandを作成する結果とする。
3. output.[[operator]] をoperatorに設定する。
4. operatorのinputをinputに設定する。
5. operatorのoutputをoutputに設定する。
outputを返す。

次のargMin/argMaxアルゴリズムがサポートされる。

argMin(input, axis, options) メソッドの手順は次のとおりである。

outputを、"argMin"、input、axis、および optionsが与えられてargMin/argMax演算を作成する結果とする。
1. それがエラーをthrowする場合、そのエラーを再throwする。
outputを返す。

argMax(input, axis, options) メソッドの手順は次のとおりである。

outputを、"argMax"、input、axis、および optionsが与えられてargMin/argMax演算を作成する結果とする。
1. それがエラーをthrowする場合、そのエラーを再throwする。
outputを返す。

8.9.6. batchNormalization

[Batch-Normalization]を用いて入力テンソルの値を正規化する。入力特徴量ごとに、モデルの訓練中に、その特徴量の平均値および分散値がバッチ次元内のすべてのサンプルにわたって計算される。これらの平均値および分散値は、その後、モデル推論時にこの演算へ与えられる。

dictionary MLBatchNormalizationOptions : MLOperatorOptions {
  MLOperand scale;
  MLOperand bias;
  [EnforceRange] unsigned long axis = 1;
  double epsilon = 1e-5;
};

partial interface MLGraphBuilder {
  MLOperand batchNormalization(MLOperand input, MLOperand mean, MLOperand variance,
                               optional MLBatchNormalizationOptions options = {});
};

dictionary MLBatchNormalizationSupportLimits {
  MLTensorLimits input;
  MLTensorLimits mean;
  MLTensorLimits variance;
  MLTensorLimits scale;
  MLTensorLimits bias;
  MLTensorLimits output;
};

partial dictionary MLOpSupportLimits {
  MLBatchNormalizationSupportLimits batchNormalization;
};

MLBatchNormalizationOptions は次のメンバーを持つ。

scale, 型はMLOperand: スケーリング値の1次元テンソルであり、そのサイズは、axisにより示される入力次元のサイズに等しい。
bias, 型はMLOperand: バイアス値の1次元テンソルであり、そのサイズは、axisにより示される入力次元のサイズに等しい。
axis, 型はunsigned long、デフォルトは1: 平均値および分散値が対応する、入力形状の特徴量数次元へのインデックス。その値は、Nが入力テンソルのrankであるとき、[0, N-1]の範囲内でなければならない。デフォルト値は1であり、"nchw" データレイアウトにおけるチャンネル（"c"）次元に対応する。
epsilon, 型はdouble、デフォルトは1e-5: ゼロ除算による計算エラーを防ぐための小さい値。

引数:

input: MLOperand。入力N次元テンソル。
mean: MLOperand。バッチ全体にわたる入力特徴量の平均値の1次元テンソルを指定する。そのサイズは、axisにより示される入力次元のサイズに等しい。
variance: MLOperand。バッチ全体にわたる入力特徴量の分散値の1次元テンソルであり、そのサイズは、axisにより示される入力次元のサイズに等しい。
options: 任意のMLBatchNormalizationOptions。演算の任意パラメーターを指定する。

戻り値: MLOperand。 inputと同じ形状を持つ、バッチ正規化済みN次元テンソル。

`batchNormalization()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	1 から N	3 から 5
`mean`	同じ `input`	`"float32"`, `"float16"`	1	1
`variance`	同じ `input`	`"float32"`, `"float16"`	1	1
`scale`	同じ `input`	`"float32"`, `"float16"`	1	1
`bias`	同じ `input`	`"float32"`, `"float16"`	1	1
出力	同じ `input`	`"float32"`, `"float16"`	同じ `input`	3 から 5

MLBatchNormalizationSupportLimits は次のメンバーを持つ。

input, 型はMLTensorLimits: 入力オペランド用のMLTensorLimits。
mean, 型はMLTensorLimits: meanオペランド用のMLTensorLimits。
variance, 型はMLTensorLimits: varianceオペランド用のMLTensorLimits。
scale, 型はMLTensorLimits: scaleオペランド用のMLTensorLimits。
bias, 型はMLTensorLimits: biasオペランド用のMLTensorLimits。
output, 型はMLTensorLimits: 出力オペランド用のMLTensorLimits。

MLOpSupportLimits はbatchNormalization()について次のメンバーを持つ。

batchNormalization, 型はMLBatchNormalizationSupportLimits: batchNormalization()演算子のサポート制限。

batchNormalization(input, mean, variance, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input、mean、 variance、options.scale （それが存在する場合）、および options.bias （それが存在する場合）のいずれかを用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
inputのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
options.axis が、0 から inputのrank までの範囲（排他的）に含まれない場合、TypeErrorを投げる。
meanのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
meanのshapeが、« inputのshape[options.axis] » に等しくない場合、TypeErrorを投げる。
varianceのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
varianceのshapeが、« inputのshape[options.axis] » に等しくない場合、TypeErrorを投げる。
options.epsilon を、options.epsilon を inputのdataTypeへキャストした結果に設定する。
options.scale が存在する場合:
1. そのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
2. そのshapeが、« inputのshape[options.axis] » に等しくない場合、TypeErrorを投げる。
options.bias が存在する場合:
1. そのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
2. そのshapeが、« inputのshape[options.axis] » に等しくない場合、TypeErrorを投げる。
グラフ接続を作成する:
1. operatorを、input、mean、varianceおよび optionsが与えられた、"batchNormalization" 操作のoperatorとする。
2. outputを、this と input.[[descriptor]] が与えられてMLOperand を作成する結果とする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputsを、input、mean、および varianceに設定する。
5. options.scale が存在する場合、それを operatorのinputsに追加する。
6. options.bias が存在する場合、それを operatorのinputsに追加する。
7. operatorのoutputを output に設定する。
outputを返す。

入力テンソルが"nchw" レイアウトの4次元である場合のこの演算の動作は、次のように他の演算の使用から汎用的にエミュレートできる。ただし、ユーザーエージェントは通常、より効率的な実装を持つ。基盤プラットフォームがある演算を直接サポートしない場合、この分解は実装を導くテンプレートとして使用できる。

function batchNormalization(builder, input, mean, variance, options) {
  const shape = [1, input.shape[options.axis], 1, 1];
  return builder.add(
    builder.mul(
      builder.reshape(options.scale, shape),
      builder.div(
        builder.sub(input, builder.reshape(mean, shape)),
        builder.sqrt(builder.add(
          builder.reshape(variance, shape),
          builder.constant(input.dataType, options.epsilon))))),
    builder.reshape(options.bias, shape));
}

8.9.7. cast

入力テンソル内の各要素を対象データ型へキャストする。

partial interface MLGraphBuilder {
  MLOperand cast(MLOperand input,
                 MLOperandDataType dataType,
                 optional MLOperatorOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits cast;
};

引数:

input: MLOperand。入力N次元テンソル。
dataType: MLOperandDataType。対象データ型。
options: MLOperatorOptions。演算の任意パラメーターを指定する。

戻り値: MLOperand。 inputと同じ形状を持ち、各要素が対象データ型へキャストされたN次元テンソル。

`cast()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	任意	`"float32"`, `"float16"`, `"int32"`, `"int8"`, `"uint8"`	N	0 から 5
出力	任意	`"float32"`, `"float16"`, `"int32"`, `"int8"`, `"uint8"`	同じ `input`	0 から 5

MLOpSupportLimits は、cast()について次のメンバーを持つ:

cast, 型は MLSingleInputSupportLimits: cast()演算子のサポート制限。

MLOperandDataType間のキャストは、次の表に従って、一部の場合には規定され、その他の場合には実装定義である:

`cast()` 操作の振る舞い。これは、`input`の dataType（行）と、対象の`dataType` （列）が与えられた場合のものである。
対象型入力型	`"float32"`, `"float16"`	`"int32"`, `"uint32"`, `"int64"`, `"uint64"`, `"int8"`, `"uint8"`
`"float32"`, `"float16"`	範囲内であれば、最も近い表現可能な値。範囲外であれば、+/-Infinity。	範囲内であれば、切り捨てられる。範囲外であれば、実装定義。
`"int32"`, `"uint32"`, `"int64"`, `"uint64"`, `"int8"`, `"uint8"`	範囲内であれば、最も近い表現可能な値。範囲外であれば、+/-Infinity。	範囲内であれば、同じ値。範囲外であれば、符号付き型については二の補数を仮定して、下位 N ビットを対象型として再解釈する。

NOTE: たとえば、"int8" から "uint8" へ -1 をキャストすると、255 になると規定されている。しかし、"float32" から "uint8" へ -1 をキャストすることは実装定義である。

cast(input, dataType, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input を用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
dataTypeが、（この表に従って）出力テンソルの許可されるデータ型でない場合、TypeErrorを投げる。
グラフ接続を作成する:
1. operatorを、dataType と options が与えられた、 "cast" 操作のoperatorとする。
2. outputを、input が与えられてMLOperand をコピーする結果とする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputを input に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

8.9.8. clamp

最小値および最大値により指定される範囲内に、入力テンソルを要素ごとにクランプする。

dictionary MLClampOptions : MLOperatorOptions {
  MLNumber minValue;
  MLNumber maxValue;
};

partial interface MLGraphBuilder {
  MLOperand clamp(MLOperand input, optional MLClampOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits clamp;
};

MLClampOptions は次のメンバーを持つ。

minValue, 型はMLNumber: 範囲の最小値。指定されていない場合、範囲の下限でクランプは行われない。
maxValue, 型はMLNumber: 範囲の最大値。指定されていない場合、範囲の上限でクランプは行われない。

引数:

input: MLOperand。入力テンソル。
options: 任意の MLClampOptions。演算の任意パラメーター。

戻り値:

MLOperand。 inputと同じ形状の出力テンソル。

`clamp()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	任意	`"float32"`, `"float16"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`	同じ `input`	0 から 5

MLOpSupportLimits はclamp()について次のメンバーを持つ。

clamp, 型はMLSingleInputSupportLimits: clamp()演算子のサポート制限。

clamp(input, options)メソッドの手順は次のとおりである。

thisがbuildできない場合、"InvalidStateError" DOMExceptionを throwする。
オペランドを検証することをthisおよびinputとともに行った結果がfalseを返す場合、TypeErrorを throwする。
minValueを、与えられていればoptions.minValue、そうでなければInfinityとする。
options.minValue を、minValueをinputのdataTypeへキャストする結果に設定する。
maxValueを、与えられていればoptions.maxValue、そうでなければ-Infinityとする。
options.maxValue を、maxValueをinputのdataTypeへキャストする結果に設定する。
options.minValue がoptions.maxValueより大きい場合、 TypeErrorをthrowする。
グラフ接続を作成する:
1. outputを、inputが与えられてMLOperandをコピーする結果とする。
2. operatorを、optionsが与えられた、"clamp"演算用の演算子とする。
3. output.[[operator]] をoperatorに設定する。
4. operatorのinputをinputに設定する。
5. operatorのoutputをoutputに設定する。
outputを返す。

この演算の動作は、次のように他の演算の使用から汎用的にエミュレートできる。ただし、ユーザーエージェントは通常、より効率的な実装を持つ。基盤プラットフォームがある演算を直接サポートしない場合、この分解は実装を導くテンプレートとして使用できる。

function clamp(builder, input, options) {
  if (options.minValue === undefined) {
    if (options.maxValue === undefined) {
      return input;
    } else {
      return builder.min(
        input, builder.constant(input.dataType, options.maxValue));
    }
  } else {
    if (options.maxValue === undefined) {
      return builder.max(
        input, builder.constant(input.dataType, options.minValue));
    } else {
      return builder.min(
        builder.max(input, builder.constant(input.dataType, options.minValue)),
        builder.constant(input.dataType, options.maxValue));
    }
  }
}

8.9.9. concat

指定された軸に沿って入力テンソルを連結する。

partial interface MLGraphBuilder {
  MLOperand concat(sequence<MLOperand> inputs,
                   [EnforceRange] unsigned long axis,
                   optional MLOperatorOptions options = {});
};

dictionary MLConcatSupportLimits {
  MLTensorLimits inputs;
  MLTensorLimits output;
};

partial dictionary MLOpSupportLimits {
  MLConcatSupportLimits concat;
};

引数:

inputs: sequence<MLOperand>。すべての入力テンソルは、連結する次元のサイズを除き、同じ形状でなければならない。
axis: unsigned long スカラー。入力を連結する軸。その値は、Nが入力テンソルのrankであるとき、[0, N-1]の範囲内でなければならない。
options: MLOperatorOptions。演算の任意パラメーターを指定する。

戻り値: MLOperand。すべての入力をaxisに沿って連結したテンソル。出力テンソルは、すべての入力が連結された次元を除き同じ形状を持つ。その次元のサイズは、同じ次元におけるすべての入力サイズの合計として計算される。

`concat()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`inputs`の items	任意	`"float32"`, `"float16"`, `"int32"`	1 から N	1 から 5
出力	同じ `inputs`の items	`"float32"`, `"float16"`, `"int32"`	同じ `inputs`の items	1 から 5

MLConcatSupportLimits は次のメンバーを持つ。

inputs, 型はMLTensorLimits: すべての入力オペランド用のMLTensorLimits。
output, 型はMLTensorLimits: 出力オペランド用のMLTensorLimits。

MLOpSupportLimits はconcat()について次のメンバーを持つ。

concat, 型はMLConcatSupportLimits: concat()演算子のサポート制限。

concat(inputs, axis, options) メソッドの手順は次のとおりである。

thisがbuildできない場合、"InvalidStateError" DOMExceptionを throwする。
オペランドを検証することをthisおよびinputs内の任意の項目とともに行った結果がfalseを返す場合、 TypeErrorを throwする。
inputsのサイズが妥当なテンソル数でない場合、TypeErrorを throwする。
firstをinputs[0]とする。
axisがfirstのrank以上である場合、TypeErrorを throwする。
descを、firstのdataTypeおよび firstのshapeが与えられてMLOperandDescriptorを作成する結果とする。
desc.shape[axis] をfirstのshape[axis]に設定する。
各indexについて、1以上 inputsのサイズ未満の範囲内で:
1. inputをinputs[index]とする。
2. inputのdataTypeがfirstのdataTypeと等しくない場合、TypeErrorを throwする。
3. inputのrankがfirstのrankと等しくない場合、 TypeErrorを throwする。
4. inputのrank未満の0以上の範囲内の各dimについて反復する。
  
  axisにより与えられる次元のものを除き、オペランドの対応する各次元の形状および型が同じでない場合、失敗する。
  1. dimがaxisと等しくなく、かつinputのshape[dim]が firstのshape[dim]と等しくない場合、TypeErrorを throwする。
  2. dimがaxisと等しい場合:
    1. sizeを、desc.shape[axis] とinputのshape[dim]の合計とする。
    2. sizeが妥当な次元でない場合、TypeErrorを throwする。
    3. desc.shape[axis] をsizeに設定する。
グラフ接続を作成する:
1. outputを、thisおよび descが与えられてMLOperandを作成する結果とする。
2. operatorを、inputs、axis、およびoptionsが与えられた "concat"演算用の演算子とする。
3. output.[[operator]] をoperatorに設定する。
4. operatorの入力をinputsに設定する。
5. operatorのoutputをoutputに設定する。
outputを返す。

8.9.10. conv2d

4次元の入力テンソルおよびフィルターテンソルが与えられたとき、2次元畳み込みを計算する

enum MLConv2dFilterOperandLayout {
  "oihw",
  "hwio",
  "ohwi",
  "ihwo"
};

dictionary MLConv2dOptions : MLOperatorOptions {
  sequence<[EnforceRange] unsigned long> padding;
  sequence<[EnforceRange] unsigned long> strides;
  sequence<[EnforceRange] unsigned long> dilations;
  [EnforceRange] unsigned long groups = 1;
  MLInputOperandLayout inputLayout = "nchw";
  MLConv2dFilterOperandLayout filterLayout = "oihw";
  MLOperand bias;
};

partial interface MLGraphBuilder {
  MLOperand conv2d(MLOperand input,
                   MLOperand filter,
                   optional MLConv2dOptions options = {});
};

dictionary MLConv2dSupportLimits {
  MLTensorLimits input;
  MLTensorLimits filter;
  MLTensorLimits bias;
  MLTensorLimits output;
};

partial dictionary MLOpSupportLimits {
  MLConv2dSupportLimits conv2d;
};

MLConv2dOptions は次のメンバーを持つ。

padding, 型はsequence<[EnforceRange] unsigned long>

長さ4のリスト: [beginningHeight, endingHeight, beginningWidth, endingWidth]。畳み込み入力の各空間次元の先頭および末尾に追加される行および列を指定する。デフォルト値は[0, 0, 0, 0]である。

strides, 型はsequence<[EnforceRange] unsigned long>

長さ2のリスト: [strideHeight, strideWidth]。畳み込み入力の各空間次元に対するスライディングウィンドウのストライドを指定する。デフォルト値は[1, 1]である。

dilations, 型はsequence<[EnforceRange] unsigned long>

長さ2のリスト: [dilationHeight, dilationWidth]。畳み込みフィルター（カーネル）に適用される各空間次元の膨張係数を指定する。デフォルト値は[1, 1]である。

groups, 型はunsigned long、デフォルトは1

入力チャンネルおよび出力チャンネルが分割されるグループ数。

inputLayout, 型はMLInputOperandLayout、デフォルトは "nchw"

入力テンソルおよび出力テンソルのレイアウト形式を次のように指定する。

"nchw"
- 入力テンソル: [batches, inputChannels, height, width]
- 出力テンソル: [batches, outputChannels, height, width]
"nhwc":
- 入力テンソル: [batches, height, width, inputChannels]
- 出力テンソル: [batches, height, width, outputChannels]

filterLayout, 型はMLConv2dFilterOperandLayout、デフォルトは "oihw"

フィルターテンソルのレイアウト形式を次のように指定する。

"oihw": [outputChannels, inputChannels/groups, height, width]
"hwio": [height, width, inputChannels/groups, outputChannels]
"ohwi": [outputChannels, height, width, inputChannels/groups]
"ihwo": [inputChannels/groups, height, width, outputChannels]

bias, 型はMLOperand

畳み込み結果に値が加算される、[outputChannels]の形状を持つ追加の1次元テンソル。

引数:

input: MLOperand。入力4次元テンソル。論理形状は、inputLayoutの値に従って解釈される。
filter: MLOperand。フィルター4次元テンソル。論理形状は、filterLayout およびgroupsの値に従って解釈される。
options: MLConv2dOptions。演算の任意パラメーター。

戻り値: MLOperand。畳み込み結果を含む出力4次元テンソル。出力形状は inputLayoutに従って解釈される。より具体的には、"nchw" 入力レイアウトの場合、出力テンソルの空間次元または末尾2次元のサイズは次のように計算できる。

outputSize = 1 + (inputSize - (filterSize - 1) * dilation - 1 + beginningPadding + endingPadding) / stride

`conv2d()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	4	4
`filter`	同じ `input`	`"float32"`, `"float16"`	4	4
`bias`	同じ `input`	`"float32"`, `"float16"`	1	1
出力	同じ `input`	`"float32"`, `"float16"`	4	4

MLConv2dSupportLimits は次のメンバーを持つ。

input, 型はMLTensorLimits: 入力オペランド用のMLTensorLimits。
filter, 型はMLTensorLimits: filterオペランド用のMLTensorLimits。
bias, 型はMLTensorLimits: biasオペランド用のMLTensorLimits。
output, 型はMLTensorLimits: 出力オペランド用のMLTensorLimits。

MLOpSupportLimits はconv2d()について次のメンバーを持つ。

conv2d, 型はMLConv2dSupportLimits: conv2d()演算子のサポート制限。

depthwise conv2d演算は、MobileNetのようなモデルで使用されるグループ化畳み込みの一種であり、 groups = inputChannels = outputChannelsで、フィルターテンソルの形状は、"oihw" レイアウトでは[options.groups, 1, height, width]、"hwio" レイアウトでは[height, width, 1, options.groups]、"ohwi" レイアウトでは[options.groups, height, width, 1]、"ihwo" レイアウトでは[1, height, width, options.groups]である。

unsigned整数inputSize、filterSize、beginningPadding、 endingPadding、strideおよびdilationが与えられたとき、 conv出力サイズを計算するには、次の手順を実行する。これらは数値を返す。

effectiveFilterSizeを ( filterSize - 1 ) * dilation + 1 とする。
outputSizeを ( inputSize - effectiveFilterSize + beginningPadding + endingPadding ) / stride + 1 とする。
outputSizeを返す。

unsigned整数inputHeight、inputWidth、filterHeightおよび filterWidth、4個のunsigned整数のリストpadding、2個のunsigned整数のリスト strides、および2個のunsigned整数のリストdilationsが与えられたとき、 conv2d出力サイズを計算するには、次の手順を実行する。これらは2個の数値のリストを返す。

outputHeightを、inputHeight、filterHeight、padding[0]、 padding[1]、strides[0]およびdilations[0]が与えられて conv出力サイズを計算する結果とする。
outputWidthを、inputWidth、filterWidth、padding[2]、 padding[3]、strides[1]およびdilations[1]が与えられて conv出力サイズを計算する結果とする。
« outputHeight, outputWidth »を返す。

conv2d(input, filter, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input、filter、および options.bias （それが存在する場合）のいずれかを用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
inputのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
inputのrankが、その許可される rankでない場合、TypeErrorを投げる。
filterのrankが、その許可される rankでない場合、TypeErrorを投げる。
filterのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
options.padding が存在しない場合、それをリスト « 0, 0, 0, 0 » に設定する。
そうでなく、options.paddingのサイズが 4 でない場合、TypeErrorを投げる。
options.strides が存在しない場合、それをリスト « 1, 1 » に設定する。
そうでなく、options.stridesのサイズが 2 でない場合、TypeErrorを投げる。
options.strides 内のいずれかの項目が 0 に等しい場合、TypeErrorを投げる。
options.dilations が存在しない場合、それをリスト « 1, 1 » に設定する。
そうでなく、options.dilationsのサイズが 2 でない場合、TypeErrorを投げる。
options.dilations 内のいずれかの項目が 0 に等しい場合、TypeErrorを投げる。
options.groups が 0 の場合、TypeErrorを投げる。
出力 shape を計算する:
1. inputShapeを、inputのshapeとする。
2. options.inputLayoutに基づいて分岐する:
  
  "nchw"
  
  « batches, inputChannels, inputHeight, inputWidth » を inputShape とする。
  
  "nhwc"
  
  « batches, inputHeight, inputWidth, inputChannels » を inputShape とする。
3. filterShapeを、filterのshapeとする。
4. options.filterLayoutに基づいて分岐する:
  
  "hwio"
  
  « filterHeight, filterWidth, filterInputChannels, outputChannels » を filterShape とする。
  
  "ohwi"
  
  « outputChannels, filterHeight, filterWidth, filterInputChannels » を filterShape とする。
  
  "ihwo"
  
  « filterInputChannels, filterHeight, filterWidth, outputChannels » を filterShape とする。
  
  "oihw"
  
  « outputChannels, filterInputChannels, filterHeight, filterWidth » を filterShape とする。
5. inputChannels % options.groups が 0 でない場合、TypeErrorを投げる。
6. そうでなく、inputChannels / options.groups が filterInputChannels に等しくない場合、TypeErrorを投げる。
7. outputChannels % options.groups が 0 でない場合、TypeErrorを投げる。
8. options.bias が存在する場合:
  1. そのshapeが « outputChannels » に等しくない場合、TypeErrorを投げる。
  2. そのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
9. « outputHeight, outputWidth » を、 inputHeight、inputWidth、 filterHeight、filterWidth、options.padding、 options.strides、および options.dilations が与えられてconv2d 出力サイズを計算する結果とする。
10. outputHeightを floor( outputHeight ) に設定する。
11. outputWidthを floor( outputWidth ) に設定する。
12. outputHeightまたは outputWidth のいずれかが有効な次元でない場合、TypeErrorを投げる。
13. options.inputLayoutに基づいて分岐する:
  
  "nchw"
  
  outputShapeを « batches, outputChannels, outputHeight, outputWidth » とする。
  
  "nhwc"
  
  outputShapeを « batches, outputHeight, outputWidth, outputChannels » とする。
14. descを、inputのdataTypeと outputShape が与えられてMLOperandDescriptor を作成する結果とする。
グラフ接続を作成する:
1. outputを、this と desc が与えられてMLOperand を作成する結果とする。
2. operatorを、options と filter が与えられた、 "conv2d" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputsを input と filter に設定する。
5. options.bias が存在する場合、それを operatorのinputsに追加する。
6. operatorのoutputを output に設定する。
outputを返す。

8.9.11. convTranspose2d

4次元の入力テンソルおよびフィルターテンソルが与えられたとき、2次元転置畳み込みを計算する

enum MLConvTranspose2dFilterOperandLayout {
  "iohw",
  "hwoi",
  "ohwi"
};

dictionary MLConvTranspose2dOptions : MLOperatorOptions {
  sequence<[EnforceRange] unsigned long> padding;
  sequence<[EnforceRange] unsigned long> strides;
  sequence<[EnforceRange] unsigned long> dilations;
  sequence<[EnforceRange] unsigned long> outputPadding;
  sequence<[EnforceRange] unsigned long> outputSizes;
  [EnforceRange] unsigned long groups = 1;
  MLInputOperandLayout inputLayout = "nchw";
  MLConvTranspose2dFilterOperandLayout filterLayout = "iohw";
  MLOperand bias;
};

partial interface MLGraphBuilder {
  MLOperand convTranspose2d(MLOperand input, MLOperand filter,
                            optional MLConvTranspose2dOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLConv2dSupportLimits convTranspose2d;
};

MLConvTranspose2dOptions は次のメンバーを持つ。

padding, 型はsequence<[EnforceRange] unsigned long>

strides, 型はsequence<[EnforceRange] unsigned long>

dilations, 型はsequence<[EnforceRange] unsigned long>

outputPadding, 型は sequence<[EnforceRange] unsigned long>

長さ2のリスト。出力テンソルの各空間次元に適用されるパディング値を指定する。strides の値が1より大きい場合、転置畳み込みに対する出力テンソル形状の曖昧さを解消するために、明示的なパディング値が必要である。

これらの値は、必要なときに出力形状の曖昧さを解消するためにのみ使用されることに注意されたい。必ずしも何らかのパディング値が出力テンソルへ書き込まれることを引き起こすわけではない。

デフォルト値は[0, 0]である。

outputSizes, 型はsequence<[EnforceRange] unsigned long>

長さ2のリスト。出力テンソルの末尾2次元のサイズを指定する。出力サイズが明示的に指定された場合、outputPadding 内の出力パディング値は無視される。

指定されない場合、出力サイズは自動的に計算される。

groups, 型はunsigned long、デフォルトは1

入力チャンネルおよび出力チャンネルが分割されるグループ数。

inputLayout, 型はMLInputOperandLayout、デフォルトは "nchw"

入力テンソルおよび出力テンソルのレイアウト形式を次のように指定する。

"nchw"
- 入力テンソル: [batches, inputChannels, height, width]
- 出力テンソル: [batches, outputChannels, height, width]
"nhwc":
- 入力テンソル: [batches, height, width, inputChannels]
- 出力テンソル: [batches, height, width, outputChannels]

filterLayout, 型はMLConvTranspose2dFilterOperandLayout、デフォルトは"iohw"

フィルターテンソルのレイアウト形式を次のように指定する。

"iohw": [inputChannels, outputChannels/groups, height, width]
"hwoi": [height, width, outputChannels/groups, inputChannels]
"ohwi": [outputChannels/groups, height, width, inputChannels]

bias, 型はMLOperand

畳み込み結果に値が加算される、[outputChannels]の形状を持つ追加の1次元テンソル。

引数:

input: MLOperand。入力4次元テンソル。論理形状は、inputLayoutの値に従って解釈される。
filter: MLOperand。フィルター4次元テンソル。論理形状は、filterLayout およびgroupsの値に従って解釈される。
options: 任意のMLConvTranspose2dOptions。

戻り値: MLOperand。転置畳み込み結果を含む出力4次元テンソル。出力形状は inputLayoutに従って解釈される。より具体的には、outputSizes が明示的に指定されない限り、次のように出力テンソルの空間次元値を計算するためにoutputPadding が必要である。

outputSize = (inputSize - 1) * stride + (filterSize - 1) * dilation + 1 - beginningPadding - endingPadding + outputPadding

`convTranspose2d()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	4	4
`filter`	同じ `input`	`"float32"`, `"float16"`	4	4
`bias`	同じ `input`	`"float32"`, `"float16"`	1	1
output	同じ `input`	`"float32"`, `"float16"`	4	4

MLOpSupportLimits はconvTranspose2d()について次のメンバーを持つ。

convTranspose2d, 型は MLConv2dSupportLimits: convTranspose2d()演算子のサポート制限。

unsigned整数inputSize、filterSize、beginningPadding、 endingPadding、stride、およびdilationが与えられたとき、 convtranspose出力サイズを計算するには、次の手順を実行する。これらは数値を返す。

effectiveFilterSizeを ( filterSize - 1 ) * dilation + 1 とする。
outputSizeを ( inputSize - 1 ) * stride + effectiveFilterSize - beginningPadding - endingPadding とする。
outputSizeを返す。

convTranspose2d(input, filter, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input、filter、および options.bias （それが存在する場合）のいずれかを用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
inputのrankが、その許可される rankでない場合、TypeErrorを投げる。
inputのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
filterのrankが、その許可される rankでない場合、TypeErrorを投げる。
filterのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
options.padding が存在しない場合、それをリスト « 0, 0, 0, 0 » に設定する。
そうでなく、options.paddingのサイズが 4 でない場合、TypeErrorを投げる。
options.strides が存在しない場合、それをリスト « 1, 1 » に設定する。
そうでなく、options.stridesのサイズが 2 でない場合、TypeErrorを投げる。
options.strides 内のいずれかの項目が 0 に等しい場合、TypeErrorを投げる。
options.dilations が存在しない場合、それをリスト « 1, 1 » に設定する。
そうでなく、options.dilationsのサイズが 2 でない場合、TypeErrorを投げる。
options.dilations 内のいずれかの項目が 0 に等しい場合、TypeErrorを投げる。
options.outputPadding が存在しない場合、それをリスト « 0, 0 » に設定する。
そうでなく、options.outputPaddingのサイズが 2 でない場合、TypeErrorを投げる。
options.outputSizes が存在する場合:
1. そのサイズが 2 でない場合、TypeErrorを投げる。
そうでない場合:
1. options.outputPadding[0] が options.strides[0] 以上である、または options.outputPadding[1] が options.strides[1] 以上である場合、 TypeErrorを投げる。
options.groups が 0 の場合、TypeErrorを投げる。
出力 shape を計算する:
1. inputShapeを、inputのshapeとする。
2. options.inputLayoutに基づいて分岐する:
  
  "nchw"
  
  « batches, inputChannels, inputHeight, inputWidth » を inputShape とする。
  
  "nhwc"
  
  « batches, inputHeight, inputWidth, inputChannels » を inputShape とする。
3. filterShapeを、filterのshapeとする。
4. options.filterLayoutに基づいて分岐する:
  
  "iohw"
  
  « filterInputChannels, filterOutputChannels, filterHeight, filterWidth » を filterShape とする。
  
  "hwoi"
  
  « filterHeight, filterWidth, filterOutputChannels, filterInputChannels » を filterShape とする。
  
  "ohwi"
  
  « filterOutputChannels, filterHeight, filterWidth, filterInputChannels » を filterShape とする。
5. inputChannelsが filterInputChannels に等しくない場合、TypeErrorを投げる。
6. outputChannelsを filterOutputChannels * options.groups とする。
7. outputChannelsが有効な次元でない場合、TypeErrorを投げる。
8. options.bias が存在する場合:
  1. そのshapeが « outputChannels » に等しくない場合、TypeErrorを投げる。
  2. そのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
9. calculatedOutputHeightを、inputHeight、filterHeight、 padding[0]、padding[1]、strides[0] および dilations[0] が与えられてconvtranspose 出力サイズを計算する結果とする。
10. calculatedOutputWidthを、inputWidth、filterWidth、 padding[2]、padding[3]、strides[1] および dilations[1] が与えられてconvtranspose 出力サイズを計算する結果とする。
11. options.outputSizes が存在する場合:
  1. « outputHeight, outputWidth » を options.outputSizesとする。
  2. outputHeightが calculatedOutputHeight より小さい、または outputHeightが calculatedOutputHeight + strides[0] 以上である場合、TypeErrorを投げる。
  3. outputWidthが calculatedOutputWidth より小さい、または outputWidthが calculatedOutputWidth + strides[1] 以上である場合、TypeErrorを投げる。
12. そうでない場合:
  1. outputHeightを calculatedOutputHeight + options.outputPadding[0] とする。
  2. outputWidthを calculatedOutputWidth + options.outputPadding[1] とする。
13. outputHeightまたは outputWidth のいずれかが有効な次元でない場合、TypeErrorを投げる。
14. options.inputLayoutに基づいて分岐する:
  
  "nchw"
  
  outputShapeを « batches, outputChannels, floor( outputHeight ), floor( outputWidth ) » とする。
  
  "nhwc"
  
  outputShapeを « batches, floor( outputHeight ), floor( outputWidth ), outputChannels » とする。
15. descを、inputのdataTypeと outputShape が与えられてMLOperandDescriptor を作成する結果とする。
グラフ接続を作成する:
1. outputを、this と desc が与えられてMLOperand を作成する結果とする。
2. operatorを、options と filter が与えられた、 "convTranspose2d" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputsを input と filter に設定する。
5. options.bias が存在する場合、それを operatorのinputsに追加する。
6. operatorのoutputを output に設定する。
outputを返す。

8.9.12. cumulativeSum

与えられた軸に沿って、一連の値の累積和を、現在の値を含めるか除外するかのいずれかで計算する。

dictionary MLCumulativeSumOptions : MLOperatorOptions {
  boolean exclusive = false;
  boolean reversed = false;
};

partial interface MLGraphBuilder {
  MLOperand cumulativeSum(MLOperand input,
                          unsigned long axis,
                          optional MLCumulativeSumOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits cumulativeSum;
};

`cumulativeSum()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`, `"int32"`, `"uint32"`, `"int64"`, `"uint64"`	`"float32"`, `"float16"`, `"int32"`	1 から N	1 から 5
出力	同じ `input`	`"float32"`, `"float16"`, `"int32"`	同じ `input`	1 から 5

MLCumulativeSumOptions は次のメンバーを持つ。

exclusive, 型はboolean、デフォルトはfalse: 出力に現在の値を含めるか除外するか、すなわち包含的prefix sumか排他的prefix sumかを示す [Prefix-sum]。入力[1,2,3,4]が与えられた場合、包含的な合計は [1,3,6,10]の出力を生成する一方、排他的な合計は[0,1,3,6]を生成する。デフォルトは包含的である。
reversed, 型はboolean、デフォルトはfalse: 有効な軸に沿った合計方向を逆にし、代わりに高い座標から低い座標へ開始するかどうかを示す。入力[1,2,3,4]が与えられた場合、包含的な前方向の合計は[1,3,6,10]の出力を生成する一方、包含的な後方向の合計は[10,9,7,4]を生成する。デフォルトは前方向である。

引数:

input: MLOperand。入力テンソル。
axis: unsigned long スカラー。合計が行われる軸。その値は、Nがinputの rankであるとき、[0, N-1]の範囲内でなければならない。
options: MLCumulativeSumOptions。演算の任意パラメーターを指定する。

戻り値:

MLOperand。 inputと同じ形状の出力テンソル。

MLOpSupportLimits はcumulativeSum()について次のメンバーを持つ。

cumulativeSum, 型は MLSingleInputSupportLimits: cumulativeSum()演算子のサポート制限。

cumulativeSum(input, axis, options) メソッドの手順は次のとおりである。

thisがbuildできない場合、"InvalidStateError" DOMExceptionを throwする。
オペランドを検証することをthisおよびinputとともに行った結果がfalseを返す場合、TypeErrorを throwする。
inputのdataTypeが（この表に従う）その許可されるデータ型のいずれでもない場合、TypeErrorを throwする。
axisがinputのrank以上である場合、TypeErrorをthrowする。
グラフ接続を作成する:
1. outputを、inputが与えられてMLOperandをコピーする結果とする。
2. operatorを、"cumulativeSum"演算およびoptions用の演算子とする。
3. output.[[operator]] をoperatorに設定する。
4. operatorのinputをinputに設定する。
5. operatorのoutputをoutputに設定する。
outputを返す。

8.9.13. 要素ごとの二項演算

2つの入力テンソルの要素ごとの二項加算、減算、乗算、除算、累乗、最大値、および最小値を計算する。

演算は、[numpy-broadcasting-rule]に従ってブロードキャストされる。入力テンソルは双方向ブロードキャスト可能でなければならない。出力テンソルのrankは、入力テンソルの最大rankである。出力テンソルの各次元について、そのサイズは、入力テンソルのその次元に沿った最大サイズである。

partial interface MLGraphBuilder {
  MLOperand add(MLOperand a, MLOperand b, optional MLOperatorOptions options = {});
  MLOperand sub(MLOperand a, MLOperand b, optional MLOperatorOptions options = {});
  MLOperand mul(MLOperand a, MLOperand b, optional MLOperatorOptions options = {});
  MLOperand div(MLOperand a, MLOperand b, optional MLOperatorOptions options = {});
  MLOperand max(MLOperand a, MLOperand b, optional MLOperatorOptions options = {});
  MLOperand min(MLOperand a, MLOperand b, optional MLOperatorOptions options = {});
  MLOperand pow(MLOperand a, MLOperand b, optional MLOperatorOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLBinarySupportLimits add;
  MLBinarySupportLimits sub;
  MLBinarySupportLimits mul;
  MLBinarySupportLimits div;
  MLBinarySupportLimits max;
  MLBinarySupportLimits min;
  MLBinarySupportLimits pow;
};

引数:

a: MLOperand。第1入力テンソル。
b: MLOperand。第2入力テンソル。
options: MLOperatorOptions。演算の任意パラメーターを指定する。

戻り値: MLOperand。 2つの入力テンソルの要素ごとの二項演算の結果を含む出力テンソル。

演算の種類:

add: 2つの入力テンソルの値を要素ごとに加算する。
sub: 第2入力テンソルの値を、第1入力テンソルの値から要素ごとに減算する。
mul: 2つの入力テンソルの値を要素ごとに乗算する。
div: 第1入力テンソルの値を、第2テンソルの値で要素ごとに除算する。整数型はゼロ方向へ切り捨てられる。
max: 2つの入力テンソルの大きい方の値を要素ごとに選択する。
min: 2つの入力テンソルの小さい方の値を要素ごとに選択する。
pow: 第1入力テンソルの値を第2入力テンソルの値でべき乗した値を、要素ごとに計算する。

要素ごとの二項演算のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`a`	任意	`"float32"`, `"float16"`, `"int32"`	N	0 から 5
`b`	同じ `a`	`"float32"`, `"float16"`, `"int32"`	N	0 から 5
出力	同じ `a`	`"float32"`, `"float16"`, `"int32"`	N	0 から 5

MLOpSupportLimits は要素ごとの二項演算について次のメンバーを持つ。

add, 型はMLBinarySupportLimits: add()演算子のサポート制限。
sub, 型はMLBinarySupportLimits: sub()演算子のサポート制限。
mul, 型はMLBinarySupportLimits: mul()演算子のサポート制限。
div, 型はMLBinarySupportLimits: div()演算子のサポート制限。
max, 型はMLBinarySupportLimits: max()演算子のサポート制限。
min, 型はMLBinarySupportLimits: min()演算子のサポート制限。
pow, 型はMLBinarySupportLimits: pow()演算子のサポート制限。

文字列op、MLOperand a、MLOperand b、およびMLOperatorOptions optionsが与えられたとき、要素ごとの二項演算を作成するには、次の手順を実行する。

Assert: opは"add"、"sub"、"mul"、"div"、"max"、 "min"、"pow"のいずれかである。
thisがbuildできない場合、"InvalidStateError" DOMExceptionを throwする。
オペランドを検証することをthisおよびaとbのいずれかとともに行った結果がfalseを返す場合、 TypeErrorをthrowする。
aのdataTypeがbのdataTypeと等しくない場合、TypeErrorをthrowする。
outputShapeを、aのshapeと bのshapeを双方向ブロードキャストする結果とする。
1. それが失敗を返す場合、TypeErrorをthrowする。
descriptorを、aのdataTypeおよび outputShapeが与えられてMLOperandDescriptorを作成する結果とする。
グラフ接続を作成する:
1. outputを、thisおよび descriptorが与えられてMLOperandを作成する結果とする。
2. operatorを、a、b、およびoptionsが与えられた op演算用の演算子とする。
3. output.[[operator]] をoperatorに設定する。
4. operatorの入力をaおよびbに設定する。
5. operatorのoutputをoutputに設定する。
outputを返す。

要素ごとの二項演算アルゴリズムは、次のように要素ごとの二項演算を作成する手順を呼び出す。

add(a, b, options) メソッドの手順は次のとおりである。

outputを、"add"、a、b、およびoptionsが与えられて要素ごとの二項演算を作成する結果とする。
1. それがエラーをthrowする場合、そのエラーを再throwする。
outputを返す。

sub(a, b, options) メソッドの手順は次のとおりである。

outputを、"sub"、a、b、およびoptionsが与えられて要素ごとの二項演算を作成する結果とする。
1. それがエラーをthrowする場合、そのエラーを再throwする。
outputを返す。

mul(a, b, options) メソッドの手順は次のとおりである。

outputを、"mul"、a、b、およびoptionsが与えられて要素ごとの二項演算を作成する結果とする。
1. それがエラーをthrowする場合、そのエラーを再throwする。
outputを返す。

div(a, b, options) メソッドの手順は次のとおりである。

outputを、"div"、a、b、およびoptionsが与えられて要素ごとの二項演算を作成する結果とする。
1. それがエラーをthrowする場合、そのエラーを再throwする。
outputを返す。

max(a, b, options) メソッドの手順は次のとおりである。

outputを、"max"、a、b、およびoptionsが与えられて要素ごとの二項演算を作成する結果とする。
1. それがエラーをthrowする場合、そのエラーを再throwする。
outputを返す。

min(a, b, options) メソッドの手順は次のとおりである。

outputを、"min"、a、b、およびoptionsが与えられて要素ごとの二項演算を作成する結果とする。
1. それがエラーをthrowする場合、そのエラーを再throwする。
outputを返す。

pow(a, b, options) メソッドの手順は次のとおりである。

outputを、"pow"、a、b、およびoptionsが与えられて要素ごとの二項演算を作成する結果とする。
1. それがエラーをthrowする場合、そのエラーを再throwする。
outputを返す。

8.9.14. 要素ごとの論理演算

入力テンソルを要素ごとに比較し、その比較について値0（false）または1（true）の"uint8" テンソルを返す。単一オペランド演算については、その演算の論理結果を返す。

複数オペランド演算については、演算は[numpy-broadcasting-rule]に従ってブロードキャストされる。入力テンソルは双方向ブロードキャスト可能でなければならない。出力テンソルのrankは、入力テンソルの最大rankである。出力テンソルの各次元について、そのサイズは、入力テンソルのその次元に沿った最大サイズである。

partial interface MLGraphBuilder {
  MLOperand equal(MLOperand a,
                  MLOperand b,
                  optional MLOperatorOptions options = {});
  MLOperand notEqual(MLOperand a,
                     MLOperand b,
                     optional MLOperatorOptions options = {});
  MLOperand greater(MLOperand a,
                    MLOperand b,
                    optional MLOperatorOptions options = {});
  MLOperand greaterOrEqual(MLOperand a,
                           MLOperand b,
                           optional MLOperatorOptions options = {});
  MLOperand lesser(MLOperand a,
                   MLOperand b,
                   optional MLOperatorOptions options = {});
  MLOperand lesserOrEqual(MLOperand a,
                          MLOperand b,
                          optional MLOperatorOptions options = {});
  MLOperand logicalNot(MLOperand a, optional MLOperatorOptions options = {});
  MLOperand logicalAnd(MLOperand a,
                       MLOperand b,
                       optional MLOperatorOptions options = {});
  MLOperand logicalOr(MLOperand a,
                      MLOperand b,
                      optional MLOperatorOptions options = {});
  MLOperand logicalXor(MLOperand a,
                       MLOperand b,
                       optional MLOperatorOptions options = {});
  MLOperand isNaN(MLOperand a, optional MLOperatorOptions options = {});
  MLOperand isInfinite(MLOperand a, optional MLOperatorOptions options = {});
};

dictionary MLLogicalNotSupportLimits {
  MLTensorLimits a;
  MLTensorLimits output;
};

partial dictionary MLOpSupportLimits {
  MLBinarySupportLimits equal;
  MLBinarySupportLimits notEqual;
  MLBinarySupportLimits greater;
  MLBinarySupportLimits greaterOrEqual;
  MLBinarySupportLimits lesser;
  MLBinarySupportLimits lesserOrEqual;
  MLLogicalNotSupportLimits logicalNot;
  MLBinarySupportLimits logicalAnd;
  MLBinarySupportLimits logicalOr;
  MLBinarySupportLimits logicalXor;
  MLLogicalNotSupportLimits isNaN;
  MLLogicalNotSupportLimits isInfinite;
};

引数:

a: MLOperand。第1入力テンソル。
b: MLOperand。指定された場合の第2入力テンソル。
options: MLOperatorOptions。演算の任意パラメーターを指定する。

戻り値: MLOperand。 2つの入力テンソルの要素ごとの比較結果を含む出力テンソル。

`equal()`/`notEqual()`/`greater()`/`greaterOrEqual()`/`lesser()`/`lesserOrEqual()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`a`	任意	`"float32"`, `"float16"`, `"int32"`	N	0 から 5
`b`	同じ `a`	`"float32"`, `"float16"`, `"int32"`	N	0 から 5
出力	`"uint8"`	`"uint8"`	N	0 から 5

`logicalNot()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`a`	`"uint8"`	`"uint8"`	N	0 から 5
出力	`"uint8"`	`"uint8"`	N	0 から 5

`logicalAnd()`/`logicalOr()`/`logicalXor()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`a`	`"uint8"`	`"uint8"`	N	0 から 5
`b`	同じ `a`	`"uint8"`	N	0 から 5
出力	`"uint8"`	`"uint8"`	N	0 から 5

`isNaN()`/`isInfinite()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`a`	任意	`"float32"`, `"float16"`	N	0 から 5
出力	`"uint8"`	`"uint8"`	N	0 から 5

MLLogicalNotSupportLimits は次のメンバーを持つ:

a, 型は MLTensorLimits: a オペランド用のMLTensorLimits。
output, 型は MLTensorLimits: 出力オペランド用のMLTensorLimits。

MLOpSupportLimits は、要素ごとの論理演算について次のメンバーを持つ:

equal, 型は MLBinarySupportLimits: equal()演算子のサポート制限。
notEqual, 型は MLBinarySupportLimits: notEqual()演算子のサポート制限。
greater, 型は MLBinarySupportLimits: greater()演算子のサポート制限。
greaterOrEqual, 型は MLBinarySupportLimits: greaterOrEqual()演算子のサポート制限。
lesser, 型は MLBinarySupportLimits: lesser()演算子のサポート制限。
lesserOrEqual, 型は MLBinarySupportLimits: lesserOrEqual()演算子のサポート制限。
logicalNot, 型は MLLogicalNotSupportLimits: logicalNot()演算子のサポート制限。
logicalAnd, 型は MLBinarySupportLimits: logicalAnd()演算子のサポート制限。
logicalOr, 型は MLBinarySupportLimits: logicalOr()演算子のサポート制限。
logicalXor, 型は MLBinarySupportLimits: logicalXor()演算子のサポート制限。
isNaN, 型は MLLogicalNotSupportLimits: isNaN()演算子のサポート制限。
isInfinite, 型は MLLogicalNotSupportLimits: isInfinite()演算子のサポート制限。

演算の種類:

equal: 2 つの入力テンソルの値が等しいかどうかを、要素ごとに比較する。
notEqual: 2 つの入力テンソルの値が等しくないかどうかを、要素ごとに比較する。
greater: 1 つ目の入力テンソルの値がより大きいかどうかを、要素ごとに比較する。
greaterOrEqual: 1 つ目の入力テンソルの値が以上であるかどうかを、要素ごとに比較する。
lesser: 1 つ目の入力テンソルの値がより小さいかどうかを、要素ごとに比較する。
lesserOrEqual: 1 つ目の入力テンソルの値が以下であるかどうかを、要素ごとに比較する。
logicalNot: 入力テンソルの値を、要素ごとに 0 または 1 の値へ反転する。具体的には、入力値が 0 でない場合は 0 に反転する。逆に、入力値が 0 の場合は 1 に反転する。
logicalAnd: 2 つの入力テンソルの論理 and を要素ごとに計算し、 0 でない任意の値を true として扱い、0 または 1 の要素を返す。
logicalOr: 2 つの入力テンソルの論理 or を要素ごとに計算し、 0 でない任意の値を true として扱い、0 または 1 の要素を返す。
logicalXor: 2 つの入力テンソルの論理 xor を要素ごとに計算し、 0 でない任意の値を true として扱い、0 または 1 の要素を返す。
isNaN: 入力テンソルの値が無効な数値表現（NaN）であるかどうかを、要素ごとに確認し、 NaN の場合は 1、それ以外の場合は 0 を返す。
isInfinite: 入力テンソルの値が無限大であるかどうかを、要素ごとに確認し、正または負の無限大の場合は 1、それ以外の場合は 0 を返す。

greaterOrEqual() および lesserOrEqual() 演算は、それぞれ logicalNot()、 lesser()、および greater() 演算を用いて実装できるが（言い換えると builder.greaterOrEqual(a, b) は builder.logicalNot(builder.lesser(a, b)) である）、NaN の場合を扱うため、また二重比較を避けるという性能上の理由から、これらは明示的に定義されている。

文字列 op、MLOperand a、任意の MLOperand b、および MLOperatorOptions options が与えられたとき、要素ごとの論理演算を作成するには、次の手順を実行する:

Assert: op は "equal", "notEqual", "greater", "greaterOrEqual", "lesser", "lesserOrEqual", "logicalNot", "logicalAnd", "logicalOr", "logicalXor", "isNaN", "isInfinite" のいずれかである。
this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と a を用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
op が "logicalNot", "logicalAnd", "logicalOr", "logicalXor" のいずれかである場合:
1. aのdataTypeが "uint8" でない場合、TypeErrorを投げる。
op が "isNaN", "isInfinite" のいずれかである場合:
1. aのdataTypeが « "float32", "float16" » のいずれでもない場合、TypeErrorを投げる。
b が渡されている場合:
1. this と b を用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
2. aのdataTypeが bのdataTypeに等しくない場合、TypeErrorを投げる。
3. outputShapeを、aのshapeと bのshapeを双方向にブロードキャストした結果とする。それが failure を返す場合、TypeErrorを投げる。
そうでない場合:
1. outputShapeを、aのshapeの複製とする。
descriptorを、"uint8" と outputShape が与えられてMLOperandDescriptor を作成する結果とする。
グラフ接続を作成する:
1. outputを、this と descriptor が与えられてMLOperand を作成する結果とする。
2. operatorを、a、（b が渡されている場合）b、および options が与えられた、op 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputsを a および（b が渡されている場合）b に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

要素ごとの論理演算アルゴリズムは、次のように要素ごとの論理演算を作成する手順を呼び出す。

equal(a, b, options) メソッドの手順は次のとおりである。

outputを、"equal"、a、b、およびoptionsが与えられて要素ごとの論理演算を作成する結果とする。
1. それがエラーをthrowする場合、そのエラーを再throwする。
outputを返す。

notEqual(a, b, options) メソッドの手順は次のとおりである。

outputを、"notEqual"、a、b、およびoptionsが与えられて要素ごとの論理演算を作成する結果とする。
1. それがエラーをthrowする場合、そのエラーを再throwする。
outputを返す。

greater(a, b, options) メソッドの手順は次のとおりである。

outputを、"greater"、a、b、およびoptionsが与えられて要素ごとの論理演算を作成する結果とする。
1. それがエラーをthrowする場合、そのエラーを再throwする。
outputを返す。

greaterOrEqual(a, b, options) メソッドの手順は次のとおりである。

outputを、"greaterOrEqual"、a、b、および optionsが与えられて要素ごとの論理演算を作成する結果とする。
1. それがエラーをthrowする場合、そのエラーを再throwする。
outputを返す。

lesser(a, b, options) メソッドの手順は次のとおりである。

outputを、"lesser"、a、b、およびoptionsが与えられて要素ごとの論理演算を作成する結果とする。
1. それがエラーをthrowする場合、そのエラーを再throwする。
outputを返す。

lesserOrEqual(a, b, options) メソッドの手順は次のとおりである。

outputを、"lesserOrEqual"、a、b、およびoptionsが与えられて要素ごとの論理演算を作成する結果とする。
1. それがエラーをthrowする場合、そのエラーを再throwする。
outputを返す。

logicalNot(a, options) メソッドの手順は次のとおりである。

outputを、"logicalNot"、a、およびoptionsが与えられて要素ごとの論理演算を作成する結果とする。
1. それがエラーをthrowする場合、そのエラーを再throwする。
outputを返す。

logicalAnd(a, b, options) メソッドの手順は次のとおりである。

outputを、"logicalAnd"、a、b、およびoptionsが与えられて要素ごとの論理演算を作成する結果とする。
1. それがエラーをthrowする場合、そのエラーを再throwする。
outputを返す。

logicalOr(a, b, options) メソッドの手順は次のとおりである。

outputを、"logicalOr"、a、b、およびoptionsが与えられて要素ごとの論理演算を作成する結果とする。
1. それがエラーをthrowする場合、そのエラーを再throwする。
outputを返す。

logicalXor(a, b, options) メソッドの手順は次のとおりである。

outputを、"logicalXor"、a、b、およびoptionsが与えられて要素ごとの論理演算を作成する結果とする。
1. それがエラーをthrowする場合、そのエラーを再throwする。
outputを返す。

isNaN(a, options)メソッドの手順は次のとおりである。

outputを、"isNaN"、a、およびoptionsが与えられて要素ごとの論理演算を作成する結果とする。
1. それがエラーをthrowする場合、そのエラーを再throwする。
outputを返す。

isInfinite(a, options) メソッドの手順は次のとおりである。

outputを、"isInfinite"、a、およびoptionsが与えられて要素ごとの論理演算を作成する結果とする。
1. それがエラーをthrowする場合、そのエラーを再throwする。
outputを返す。

8.9.15. 要素ごとの単項演算

入力テンソルに対して要素ごとの単項演算を計算する。

partial interface MLGraphBuilder {
  MLOperand abs(MLOperand input, optional MLOperatorOptions options = {});
  MLOperand ceil(MLOperand input, optional MLOperatorOptions options = {});
  MLOperand cos(MLOperand input, optional MLOperatorOptions options = {});
  MLOperand erf(MLOperand input, optional MLOperatorOptions options = {});
  MLOperand exp(MLOperand input, optional MLOperatorOptions options = {});
  MLOperand floor(MLOperand input, optional MLOperatorOptions options = {});
  MLOperand identity(MLOperand input, optional MLOperatorOptions options = {});
  MLOperand log(MLOperand input, optional MLOperatorOptions options = {});
  MLOperand neg(MLOperand input, optional MLOperatorOptions options = {});
  MLOperand reciprocal(MLOperand input, optional MLOperatorOptions options = {});
  MLOperand roundEven(MLOperand input, optional MLOperatorOptions options = {});
  MLOperand sin(MLOperand input, optional MLOperatorOptions options = {});
  MLOperand sign(MLOperand input, optional MLOperatorOptions options = {});
  MLOperand sqrt(MLOperand input, optional MLOperatorOptions options = {});
  MLOperand tan(MLOperand input, optional MLOperatorOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits abs;
  MLSingleInputSupportLimits ceil;
  MLSingleInputSupportLimits cos;
  MLSingleInputSupportLimits erf;
  MLSingleInputSupportLimits exp;
  MLSingleInputSupportLimits floor;
  MLSingleInputSupportLimits identity;
  MLSingleInputSupportLimits log;
  MLSingleInputSupportLimits neg;
  MLSingleInputSupportLimits reciprocal;
  MLSingleInputSupportLimits roundEven;
  MLSingleInputSupportLimits sin;
  MLSingleInputSupportLimits sign;
  MLSingleInputSupportLimits sqrt;
  MLSingleInputSupportLimits tan;
};

引数:

input: MLOperand。入力テンソル。
options: MLOperatorOptions。演算の任意パラメーターを指定する。

戻り値: MLOperand。入力テンソルの要素ごとの単項演算の結果を含む出力テンソル。出力テンソルの形状は、入力テンソルの形状と同じである。

`abs()`/`neg()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`, `"int64"`, `"int32"`, `"int8"`	`"float32"`, `"float16"`, `"int32"`	N	0 から 5
出力	同じ `input`	`"float32"`, `"float16"`, `"int32"`	同じ `input`	0 から 5

`ceil()`/`cos()`/`erf()`/`exp()`/`floor()`/`log()`/`reciprocal()`/`roundEven()`/`sin()`/`sqrt()`/`tan()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	N	0 から 5
出力	同じ `input`	`"float32"`, `"float16"`	同じ `input`	0 から 5

`identity()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	任意	`"float32"`, `"float16"`, `"int32"`	N	0 から 5
出力	同じ `input`	`"float32"`, `"float16"`, `"int32"`	同じ `input`	0 から 5

`sign()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`, `"int64"`, `"int32"`, `"int8"`	`"float32"`, `"float16"`, `"int32"`	N	0 から 5
出力	同じ `input`	`"float32"`, `"float16"`, `"int32"`	同じ `input`	0 から 5

MLOpSupportLimits は、要素ごとの単項演算について次のメンバーを持つ:

abs, 型は MLSingleInputSupportLimits: abs()演算子のサポート制限。
ceil, 型は MLSingleInputSupportLimits: ceil()演算子のサポート制限。
cos, 型は MLSingleInputSupportLimits: cos()演算子のサポート制限。
erf, 型は MLSingleInputSupportLimits: erf()演算子のサポート制限。
exp, 型は MLSingleInputSupportLimits: exp()演算子のサポート制限。
floor, 型は MLSingleInputSupportLimits: floor()演算子のサポート制限。
identity, 型は MLSingleInputSupportLimits: identity()演算子のサポート制限。
log, 型は MLSingleInputSupportLimits: log()演算子のサポート制限。
neg, 型は MLSingleInputSupportLimits: neg()演算子のサポート制限。
reciprocal, 型は MLSingleInputSupportLimits: reciprocal()演算子のサポート制限。
roundEven, 型は MLSingleInputSupportLimits: roundEven()演算子のサポート制限。
sin, 型は MLSingleInputSupportLimits: sin()演算子のサポート制限。
sign, 型は MLSingleInputSupportLimits: sign()演算子のサポート制限。
sqrt, 型は MLSingleInputSupportLimits: sqrt()演算子のサポート制限。
tan, 型は MLSingleInputSupportLimits: tan()演算子のサポート制限。

演算の種類:

abs: 入力テンソルの絶対値を、要素ごとに計算する。
ceil: 入力テンソルの天井値を、要素ごとに計算する。
cos: 入力テンソルの余弦を、要素ごとに計算する。
erf: 入力テンソルの誤差関数 [Error-Function] を、要素ごとに計算する。
exp: 入力テンソルの指数関数を、要素ごとに計算する。
floor: 入力テンソルの床値を、要素ごとに計算する。
identity: 入力テンソルの値を出力テンソルに、要素ごとにコピーする。
log: 入力テンソルの自然対数を、要素ごとに計算する。
neg: 入力テンソルの数値的な負の値を、要素ごとに計算する。
reciprocal: 入力テンソルの逆数を、要素ごとに計算する。
roundEven: 入力テンソルを、半端値について最も近い偶数値へ、要素ごとに丸める（例: [0.1, 0.9, 1.1, 1.9, -3.5, -2.5, -1.5, 1.5, 2.5, 3.5] は [0.0, 1.0, 1.0, 2.0, -4.0, -2.0, -2.0, 2.0, 2.0, 4.0] を生成する）。
sin: 入力テンソルの正弦を、要素ごとに計算する。
sign: 入力テンソルの符号（-1, 0, 1）を、要素ごとに計算し、> 0 の場合は 1、 < 0 の場合は -1、それ以外の場合は 0 を返す。
sqrt: 入力テンソルの平方根を、要素ごとに計算する。
tan: 入力テンソルの正接を、要素ごとに計算する。

文字列 op、MLOperand input、任意のリスト allowedDataTypes、および options が与えられたとき、要素ごとの単項演算を作成するには、次の手順を実行する:

Assert: op は "abs", "ceil", "cos", "erf", "exp", "floor", "identity", "log", "neg", "reciprocal", "roundEven", "sin", "sign", "sqrt", "tan" のいずれかである。
this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input を用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
allowedDataTypes が与えられており、それが inputのdataTypeを含まない場合、TypeErrorを投げる。
グラフ接続を作成する:
1. outputを、input が与えられてMLOperand をコピーする結果とする。
2. operatorを、options が与えられた op 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputを input に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

要素ごとの単項演算アルゴリズムは、要素ごとの単項演算を作成する手順を次のように呼び出す。

abs(input, options)メソッドの手順は次のとおりである:

outputを、"abs"、input、« "float32", "float16", "int64", "int32", "int8" »、および options が与えられて要素ごとの単項演算を作成する結果とする。
1. それがエラーを投げる場合、そのエラーを再度投げる。
outputを返す。

ceil(input, options)メソッドの手順は次のとおりである:

outputを、"ceil"、input、« "float32", "float16" »、および options が与えられて要素ごとの単項演算を作成する結果とする。
1. それがエラーを投げる場合、そのエラーを再度投げる。
outputを返す。

cos(input, options)メソッドの手順は次のとおりである:

outputを、"cos"、input、« "float32", "float16" »、および options が与えられて要素ごとの単項演算を作成する結果とする。
1. それがエラーを投げる場合、そのエラーを再度投げる。
outputを返す。

erf(input, options)メソッドの手順は次のとおりである:

outputを、"erf"、input、« "float32", "float16" »、および options が与えられて要素ごとの単項演算を作成する結果とする。
1. それがエラーを投げる場合、そのエラーを再度投げる。
outputを返す。

exp(input, options)メソッドの手順は次のとおりである:

outputを、"exp"、input、« "float32", "float16" »、および options が与えられて要素ごとの単項演算を作成する結果とする。
1. それがエラーを投げる場合、そのエラーを再度投げる。
outputを返す。

floor(input, options)メソッドの手順は次のとおりである:

outputを、"floor"、input、« "float32", "float16" »、および options が与えられて要素ごとの単項演算を作成する結果とする。
1. それがエラーを投げる場合、そのエラーを再度投げる。
outputを返す。

identity(input, options) メソッドの手順は次のとおりである:

outputを、"identity"、input、および options が与えられて要素ごとの単項演算を作成する結果とする。
1. それがエラーを投げる場合、そのエラーを再度投げる。
outputを返す。

log(input, options)メソッドの手順は次のとおりである:

outputを、"log"、input、« "float32", "float16" »、および options が与えられて要素ごとの単項演算を作成する結果とする。
1. それがエラーを投げる場合、そのエラーを再度投げる。
outputを返す。

neg(input, options)メソッドの手順は次のとおりである:

outputを、"neg"、input、« "float32", "float16", "int64", "int32", "int8" »、および options が与えられて要素ごとの単項演算を作成する結果とする。
1. それがエラーを投げる場合、そのエラーを再度投げる。
outputを返す。

reciprocal(input, options) メソッドの手順は次のとおりである:

outputを、"reciprocal"、input、« "float32", "float16" »、および options が与えられて要素ごとの単項演算を作成する結果とする。
1. それがエラーを投げる場合、そのエラーを再度投げる。
outputを返す。

roundEven(input, options) メソッドの手順は次のとおりである:

outputを、"roundEven"、input、« "float32", "float16" »、および options が与えられて要素ごとの単項演算を作成する結果とする。
1. それがエラーを投げる場合、そのエラーを再度投げる。
outputを返す。

sin(input, options)メソッドの手順は次のとおりである:

outputを、"sin"、input、« "float32", "float16" »、および options が与えられて要素ごとの単項演算を作成する結果とする。
1. それがエラーを投げる場合、そのエラーを再度投げる。
outputを返す。

sign(input, options)メソッドの手順は次のとおりである:

outputを、"sign"、input、« "float32", "float16", "int64", "int32", "int8" »、および options が与えられて要素ごとの単項演算を作成する結果とする。
1. それがエラーを投げる場合、そのエラーを再度投げる。
outputを返す。

sqrt(input, options)メソッドの手順は次のとおりである:

outputを、"sqrt"、input、« "float32", "float16" »、および options が与えられて要素ごとの単項演算を作成する結果とする。
1. それがエラーを投げる場合、そのエラーを再度投げる。
outputを返す。

tan(input, options)メソッドの手順は次のとおりである:

outputを、"tan"、input、« "float32", "float16" »、および options が与えられて要素ごとの単項演算を作成する結果とする。
1. それがエラーを投げる場合、そのエラーを再度投げる。
outputを返す。

sign() 操作の振る舞いは、ユーザーエージェントが通常はより効率的な実装を持つものの、次のように他の操作の利用から汎用的にエミュレートできる。基盤プラットフォームが操作を直接サポートしない場合、この分解は実装を導くテンプレートとして使用できる。

function sign(builder, input, options) {
  const zero = builder.constant(input.dataType, 0);
  const positiveOne = builder.constant(input.dataType, 1);
  const negativeOne = builder.constant(input.dataType, -1);

  return builder.where(
    builder.greater(input, zero),
    positiveOne,
    builder.where(builder.lesser(input, zero), negativeOne, zero));
}

8.9.16. dequantizeLinear

使用 scale 和 zero-point 偏置将整数张量反量化为浮点张量，其中 output = (input - zeroPoint) * scale。scale 和 zeroPoint 张量可以小于 input 张量，因为它们是按块可广播的。

partial interface MLGraphBuilder {
  MLOperand dequantizeLinear(MLOperand input,
                             MLOperand scale,
                             MLOperand zeroPoint,
                             optional MLOperatorOptions options = {});
};

dictionary MLQuantizeDequantizeLinearSupportLimits {
  MLTensorLimits input;
  MLTensorLimits scale;
  MLTensorLimits zeroPoint;
  MLTensorLimits output;
};

partial dictionary MLOpSupportLimits {
  MLQuantizeDequantizeLinearSupportLimits dequantizeLinear;
};

参数：

input：一个 MLOperand。输入张量。
scale：一个 MLOperand。在按 zero point 调整后，要与每个 input 值相乘的 scale 张量。它必须与输入按块可广播。值必须为正且非零，否则行为是实现定义的（例如正确结果、错误结果，或编译失败）。
zeroPoint：一个 MLOperand。要从每个 input 值中减去的 zero point 张量。它与 scale 具有相同的 shape。
options：一个 MLOperatorOptions。指定该操作的可选参数。

返回：一个 MLOperand。包含反量化值的输出张量。

`dequantizeLinear()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"uint8"`, `"int8"`, `"uint32"`, `"int32"`	`"uint8"`, `"int8"`	N	0 から 5
`scale`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	同じ `input`	0 から 5
`zeroPoint`	同じ `input`	`"uint8"`, `"int8"`, `"int32"`	同じ `input`	0 から 5
出力	同じ `scale`	`"float32"`, `"float16"`	同じ `input`	0 から 5

MLQuantizeDequantizeLinearSupportLimits 具有以下成员：

input，类型为 MLTensorLimits: 用于 input 操作数的 MLTensorLimits。
scale，类型为 MLTensorLimits: 用于 scale 操作数的 MLTensorLimits。
zeroPoint，类型为 MLTensorLimits: 用于 zeroPoint 操作数的 MLTensorLimits。
output，类型为 MLTensorLimits: 用于 output 操作数的 MLTensorLimits。

MLOpSupportLimits 具有以下用于 dequantizeLinear() 的成员：

dequantizeLinear，类型为 MLQuantizeDequantizeLinearSupportLimits: 运算符 dequantizeLinear() 的支持限制。

dequantizeLinear(input, scale, zeroPoint, options) メソッドの手順は次のとおりである:

this.[[hasBuilt]] が true の場合、"InvalidStateError" DOMExceptionを投げる。
this と input、scale、および zeroPoint のいずれかを用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
inputのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
scaleのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
zeroPointのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
zeroPointのdataTypeが inputのdataTypeに等しくない場合、TypeErrorを投げる。
scaleのrankまたは zeroPointのrankが inputのrankに等しくない場合、TypeErrorを投げる。
scaleのshapeが zeroPointのshapeに等しくない場合、TypeErrorを投げる。
scaleのshapeと inputのshapeをブロック単位でブロードキャストして false を返す場合、TypeErrorを投げる。
zeroPointのshapeと inputのshapeをブロック単位でブロードキャストして false を返す場合、TypeErrorを投げる。
outputDescriptorを、scaleのdataTypeと inputのshapeが与えられてMLOperandDescriptor を作成する結果とする。
グラフ接続を作成する:
1. outputを、this と outputDescriptor が与えられてMLOperand を作成する結果とする。
2. operatorを、input、scale、zeroPoint、および options が与えられた、"dequantizeLinear" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputを input に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

该操作的行为可以按如下方式由其他操作的使用进行通用仿真，尽管用户代理通常具有更高效的实现。在底层平台不直接支持某个操作的情况下，这种分解可用作指导实现的模板。

function dequantizeLinear(builder, input, scale, zeroPoint, options) {
  // output = (input - zeroPoint) * scale
  const floatInput = builder.cast(input, scale.dataType);
  const floatZeroPoint = builder.cast(zeroPoint, scale.dataType);
  const upsampledScale = blockwiseExpand(builder, scale, input.shape);
  const upsampledZeroPoint =
    blockwiseExpand(builder, floatZeroPoint, input.shape);
  return builder.mul(
    builder.sub(floatInput, upsampledZeroPoint), upsampledScale);
}

function blockwiseExpand(builder, input, outputShape) {
  // Given the original input and a desired output shape, this expands each axis
  // by repeating the block the number of times per that axis. Though, backend
  // implementations might have much more efficient upsampling operators that
  // can accept multiple dimensions to upsample all dimensions at once by
  // integer multiples (like tile) using nearest neighbor resampling:
  // output = resample(scale, {sizes: input.shape})

  let output = input;

  for (let axis = 0; axis < input.shape.length; ++axis) {
    const oldShape = output.shape;
    const oldDimensionLength = oldShape[axis];
    const newDimensionLength = outputShape[axis];

    if (newDimensionLength != oldDimensionLength) {
      // Since tile/expand can only accept repetitions of entire dimension
      // slices (not repeating individual elements along an axis), temporarily
      // reshape the tensor to enable them to broadcast the elements up to the
      // full block size, utilizing an inserted dimension of size 1.
      const elementRepeatCount = newDimensionLength / oldDimensionLength;
      const flattenedShape = getFlattenedShapeAroundAxis(oldShape, axis);
      const unexpandedShape =
        [flattenedShape[0], flattenedShape[1], 1, flattenedShape[2]];
      const expandedShape = [
        flattenedShape[0],
        flattenedShape[1],
        elementRepeatCount,
        flattenedShape[2]
      ];
      const reshapedInput = builder.reshape(output, unexpandedShape);
      output = builder.expand(reshapedInput, expandedShape);

      let newShape = [...oldShape];
      newShape[axis] = newDimensionLength;
      output = builder.reshape(output, newShape);
    }
  }

  return output;
}

// Compute the flattened shape before and after the given axis, yielding a
// 3-element list: e.g.
// - inputShape = [2,3,4,5,6] with axis = 2 yields shape [6,4,30].
// - inputShape = [4] with axis = 0 yields shape [1,4,1].
function getFlattenedShapeAroundAxis(inputShape, axis) {
  axis = Math.max(Math.min(axis, inputShape.length - 1), 0);
  const shapeBefore = inputShape.slice(0, axis);
  const shapeAfter = inputShape.slice(axis + 1, inputShape.length);
  const countBefore = shapeBefore.reduce((a, b) => a * b, 1);
  const countAfter = shapeAfter.reduce((a, b) => a * b, 1);
  return [countBefore, inputShape[axis], countAfter];
}

8.9.17. quantizeLinear

使用 scale 和 zero-point 偏置将浮点张量量化为整数张量（例如，对于 "uint8"， output = clamp(roundEven(input / scale) + zeroPoint, 0, 255)）。scale 和 zeroPoint 张量可以小于 input 张量，因为它们会被按块广播。

partial interface MLGraphBuilder {
  MLOperand quantizeLinear(MLOperand input,
                           MLOperand scale,
                           MLOperand zeroPoint,
                           optional MLOperatorOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLQuantizeDequantizeLinearSupportLimits quantizeLinear;
};

参数：

input：一个 MLOperand。输入张量。
scale：一个 MLOperand。在按 zero point 调整前，要用来除每个 input 值的 scale 张量。它必须与输入按块可广播。值必须为正且非零，否则行为依赖于实现（例如正确结果、错误结果，或编译失败）。
zeroPoint：一个 MLOperand。要加到每个重新缩放后的 input 值上的 zero point 张量。它与 scale 具有相同的 shape。
options：一个 MLOperatorOptions。指定该操作的可选参数。

返回：一个 MLOperand。包含量化值的输出张量。

`quantizeLinear()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	N	0 から 5
`scale`	同じ `input`	`"float32"`, `"float16"`	同じ `input`	0 から 5
`zeroPoint`	`"uint8"`, `"int8"`, `"uint32"`, `"int32"`	`"uint8"`, `"int8"`	同じ `input`	0 から 5
output	同じ `zeroPoint`	`"uint8"`, `"int8"`	同じ `input`	0 から 5

MLOpSupportLimits は quantizeLinear() について次のメンバーを持つ:

quantizeLinear, 型は MLQuantizeDequantizeLinearSupportLimits: quantizeLinear()演算子のサポート制限。

quantizeLinear(input, scale, zeroPoint, options) メソッドの手順は次のとおりである:

this.[[hasBuilt]] が true の場合、"InvalidStateError" DOMExceptionを投げる。
this と input、scale、および zeroPoint のいずれかを用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
inputのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
scaleのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
scaleのdataTypeが inputのdataTypeに等しくない場合、TypeErrorを投げる。
zeroPointのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
scaleのrankまたは zeroPointのrankが inputのrankに等しくない場合、TypeErrorを投げる。
scaleのshapeが zeroPointのshapeに等しくない場合、TypeErrorを投げる。
scaleのshapeと inputのshapeをブロック単位でブロードキャストして false を返す場合、TypeErrorを投げる。
zeroPointのshapeと inputのshapeをブロック単位でブロードキャストして false を返す場合、TypeErrorを投げる。
outputDescriptorを、zeroPointのdataTypeと inputのshapeが与えられてMLOperandDescriptor を作成する結果とする。
グラフ接続を作成する:
1. outputを、this と outputDescriptor が与えられてMLOperand を作成する結果とする。
2. operatorを、input、scale、zeroPoint、および options が与えられた、"quantizeLinear" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputを input に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

function quantizeLinear(builder, input, scale, zeroPoint, options) {
  // output = clamp(roundEven(input / scale) + zeroPoint, 0, 255)
  // Note blockwiseExpand is defined in dequantizeLinear.

  const floatZeroPoint = builder.cast(zeroPoint, scale.dataType);
  const upsampledScale = blockwiseExpand(builder, scale, input.shape);
  const upsampledZeroPoint =
    blockwiseExpand(builder, floatZeroPoint, input.shape);
  const quantizedInput = builder.roundEven(builder.div(input, upsampledScale));
  const zeroPointAdjustedInput =
    builder.add(quantizedInput, upsampledZeroPoint);
  const clampedInput =
    builder.clamp(zeroPointAdjustedInput, {'minValue': 0, 'maxValue': 255});
  return builder.cast(clampedInput, zeroPoint.dataType);
}

8.9.18. elu

对输入张量逐元素计算指数线性单元函数（ELU）。计算遵循表达式 max(0, x) + alpha * (exp(min(0, x)) - 1)。

dictionary MLEluOptions : MLOperatorOptions {
  double alpha = 1;
};

partial interface MLGraphBuilder {
  MLOperand elu(MLOperand input, optional MLEluOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits elu;
};

MLEluOptions 具有以下成员：

alpha，类型为 double，默认为 1: 标量乘数。

参数：

input：一个 MLOperand。输入张量。
options：一个可选的 MLEluOptions。该操作的可选参数。

返回：

一个 MLOperand。与 input 形状相同的输出张量。

`elu()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`	同じ `input`	0 から 5

MLOpSupportLimits は elu() について次のメンバーを持つ:

elu, 型は MLSingleInputSupportLimits: elu()演算子のサポート制限。

elu(input, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input を用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
inputのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
options.alpha を、options.alpha を inputのdataTypeへキャストする結果に設定する。
グラフ接続を作成する:
1. outputを、input が与えられてMLOperand をコピーする結果とする。
2. operatorを、options が与えられた "elu" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputを input に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

function elu(builder, input, options) {
  return builder.add(
    builder.max(builder.constant(input.dataType, 0), input),
    builder.mul(
      builder.constant(input.dataType, options.alpha),
      builder.sub(
        builder.exp(builder.min(builder.constant(input.dataType, 0), input)),
        builder.constant(input.dataType, 1))));
}

8.9.19. expand

新しい形状に従って、入力テンソルのサイズ1の任意の次元をより大きなサイズへ拡張する。この拡張は [numpy-broadcasting-rule]と一貫している。入力テンソルは新しい形状へ単方向ブロードキャスト可能でなければならない。各次元はサイズ1であるか、新しい形状に従って対応する出力次元のサイズと一致しなければならない。

partial interface MLGraphBuilder {
  MLOperand expand(MLOperand input,
                   sequence<[EnforceRange] unsigned long> newShape,
                   optional MLOperatorOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits expand;
};

引数:

input: MLOperand。入力テンソル。
newShape: sequence<unsigned long>。入力テンソルが拡張される新しい形状。
options: MLOperatorOptions。演算の任意パラメーターを指定する。

戻り値: MLOperand。サイズ形状が拡張されたテンソル。

`expand()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	任意	`"float32"`, `"float16"`, `"int32"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`, `"int32"`	N	0 から 5

MLOpSupportLimits はexpand()について次のメンバーを持つ。

expand, 型はMLSingleInputSupportLimits: expand()演算子のサポート制限。

expand(input, newShape, options) メソッドの手順は次のとおりである。

thisがbuildできない場合、"InvalidStateError" DOMExceptionをthrowする。
オペランドを検証することをthisおよびinputとともに行った結果がfalseを返す場合、TypeErrorをthrowする。
outputShapeを、inputのshapeおよびnewShapeを単方向ブロードキャストする結果とする。
1. それがfailureを返す場合、TypeErrorをthrowする。
outputShapeのsizeが（この表に従う）出力テンソルの許可されるランクでない場合、TypeErrorをthrowする。
outputDescriptorを、inputのdataTypeおよび outputShapeが与えられてMLOperandDescriptorを作成する結果とする。
グラフ接続を作成する:
1. outputを、thisおよび outputDescriptorが与えられてMLOperandを作成する結果とする。
2. operatorを、input、newShape、およびoptionsが与えられた"expand"演算用の演算子とする。
3. output.[[operator]] をoperatorに設定する。
4. operatorのinputをinputに設定する。
5. operatorのoutputをoutputに設定する。
outputを返す。

8.9.20. gather

indicesに従って、軸に沿って入力テンソルの値を収集する。

dictionary MLGatherOptions : MLOperatorOptions {
  [EnforceRange] unsigned long axis = 0;
};

partial interface MLGraphBuilder {
  MLOperand gather(MLOperand input,
                   MLOperand indices,
                   optional MLGatherOptions options = {});
};

dictionary MLGatherSupportLimits {
  MLTensorLimits input;
  MLTensorLimits indices;
  MLTensorLimits output;
};

partial dictionary MLOpSupportLimits {
  MLGatherSupportLimits gather;
};

MLGatherOptions は次のメンバーを持つ。

axis, 型はunsigned long、デフォルトは0: 収集される値を取得する軸。その値は、入力テンソルのrankをNとすると、[0, N-1]の範囲内でなければならない。

引数:

input: MLOperand。値が収集される入力 N-D テンソル。
indices: MLOperand。収集する入力値のインデックス N-D テンソル。値は "int32"、 "uint32"、または "int64" 型でなければならず、axis によってインデックス付けされる入力次元のサイズを N とすると、-N（含む）から N（含まない）までの範囲内でなければならない。負のインデックスは、その次元の末尾からインデックス付けすることを意味する。
options: 任意の MLGatherOptions。この演算の任意パラメーター。

戻り値: MLOperand。 rankがinputのrank + indicesのrank - 1に等しい出力N-Dテンソル。

indices パラメーターは、グラフが構築される時点では入力が実行まで不明であるため、gather() に対して許可範囲にクランプできない。指定されたクランプ動作が基盤となるプラットフォームによって提供されない場合、実装はコンパイル済みグラフ内にclamp() を導入できる。同様に、基盤となるプラットフォームが負のインデックスをサポートしない場合、実装はコンパイル済みグラフ内に、次元の末尾からの負のインデックスを正のインデックスへ変換する演算を導入できる。

`gather()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	任意	`"float32"`, `"float16"`, `"int32"`, `"int8"`, `"uint8"`	1 から N	1 から 5
`indices`	`"int32"`, `"uint32"`, `"int64"`	`"int32"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`, `"int32"`, `"int8"`, `"uint8"`	1 から N	1 から 5

MLGatherSupportLimits は次のメンバーを持つ。

input, 型はMLTensorLimits: inputオペランド用のMLTensorLimits。
indices, 型はMLTensorLimits: indicesオペランド用のMLTensorLimits。
output, 型はMLTensorLimits: outputオペランド用のMLTensorLimits。

MLOpSupportLimits はgather()について次のメンバーを持つ。

gather, 型はMLGatherSupportLimits: gather()演算子のサポート制限。

gather(input, indices, options) メソッドの手順は次のとおりである。

thisがbuildできない場合、"InvalidStateError" DOMExceptionをthrowする。
オペランドを検証することをthisおよびinput、indicesのいずれかとともに行った結果がfalseを返す場合、TypeErrorをthrowする。
indicesのdataTypeが（この表に従う）その許可されるデータ型のいずれでもない場合、TypeErrorをthrowする。
inputShapeをinputのshapeとし、inputRankをinputのrankとする。
indicesShapeをindicesのshapeとする。
axisをoptions.axisとする。
axisがinputRank以上である場合、TypeErrorをthrowする。
dimCountをゼロとする。
outputRankをゼロとする。
outputShapeを空のリストとする。
inputShapeの各sizeについて実行する:
1. dimCountがaxisと等しい場合、breakする。
2. outputShape[dimCount]をsizeに設定する。
3. dimCountを1増やす。
outputRankをdimCountに設定する。
dimCountをゼロとする。
indicesShapeの各sizeについて実行する:
1. outputShape[outputRank + dimCount]を sizeに設定する。
2. dimCountを1増やす。
outputRankをoutputRank + dimCountに設定する。
dimCountをゼロとする。
inputShapeの各sizeについて実行する:
1. dimCountがaxis以下である場合、continueする。
2. outputShape[outputRank + dimCount - axis - 1]をsizeに設定する。
3. dimCountを1増やす。
descを、inputのdataTypeおよび outputShapeが与えられてMLOperandDescriptorを作成する結果とする。
グラフ接続を作成する:
1. outputを、descが与えられてMLOperandを作成する結果とする。
2. operatorを、input、indices、およびoptionsが与えられた"gather"演算用の演算子とする。
3. output.[[operator]] をoperatorに設定する。
4. operatorのinputsをinputおよびindicesに設定する。
5. operatorのoutputをoutputに設定する。
outputを返す。

異なるスライス方式でgatherがどのように動作するかの例。

// input of shape [4,3]:
//   [[ 0,  1,  2],
//    [10, 11, 12],
//    [20, 21, 22],
//    [30, 31, 32]]
const input = builder.constant(
  {dataType: 'float32', shape: [4, 3]},
  new Float32Array([0, 1, 2, 10, 11, 12, 20, 21, 22, 30, 31, 32]));

// axis = 0 (default)
// indices of shape [2]:
//   [3,1]
// output of shape [2,3]:
//   [[30, 31, 32],
//    [10, 11, 12]]

const indices1 =
  builder.constant({dataType: 'uint32', shape: [2]}, new Uint32Array([3, 1]));

const output1 = builder.gather(input, indices1);

// axis = 1
// indices of shape [3]:
//   [2,1,1]
// output of shape [4,3]:
//   [[ 2,  1,  1],
//    [12, 11, 11],
//    [22, 21, 21],
//    [32, 31, 31]]

const indices2 = builder.constant(
  {dataType: 'uint32', shape: [3]}, new Uint32Array([2, 1, 1]));

const output2 = builder.gather(input, indices2, {axis: 1});

// axis = 1
// indices of shape [2,2]:
//   [[0, 1],
//    [1, 2]]
// output of shape [4,2,2]:
//   [[[ 0,  1], [ 1,  2]],
//    [[10, 11], [11, 12]],
//    [[20, 21], [21, 22]],
//    [[30, 31], [31, 32]]]

const indices3 = builder.constant(
  {dataType: 'uint32', shape: [2, 2]}, new Uint32Array([0, 1, 1, 2]));

const output3 = builder.gather(input, indices3, {axis: 1});

8.9.21. gatherElements

indicesに従って、軸に沿って入力テンソルの値を収集する。

partial interface MLGraphBuilder {
  MLOperand gatherElements(MLOperand input,
                           MLOperand indices,
                           optional MLGatherOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLGatherSupportLimits gatherElements;
};

引数:

input: MLOperand。値が収集される入力 N-D テンソル。
indices: MLOperand。収集する入力値のインデックス N-D テンソル。値は "int32"、 "uint32"、または "int64" 型でなければならず、options.axis によってインデックス付けされる入力次元のサイズを N とすると、 -N（含む）から N（含まない）までの範囲内でなければならない。負のインデックスは、その次元の末尾からインデックス付けすることを意味する。
options: 任意の MLGatherOptions。この演算の任意パラメーター。

戻り値: MLOperand。 inputの rankに等しいrankを持つ出力 N-D テンソル。

`gatherElements()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	任意	`"float32"`, `"float16"`, `"int32"`	1 から N	1 から 5
`indices`	`"int32"`, `"uint32"`, `"int64"`	`"int32"`	同じ `input`	1 から 5
output	同じ `input`	`"float32"`, `"float16"`, `"int32"`	同じ `input`	1 から 5

MLOpSupportLimits はgatherElements()について次のメンバーを持つ。

gatherElements, 型は MLGatherSupportLimits: gatherElements()演算子のサポート制限。

indices パラメーターは、グラフが構築される時点では入力が実行まで不明であるため、gatherElements() に対して許可範囲にクランプできない。指定されたクランプ動作が基盤となるプラットフォームによって提供されない場合、実装はコンパイル済みグラフ内にclamp() を導入できる。同様に、基盤となるプラットフォームが負のインデックスをサポートしない場合、実装はコンパイル済みグラフ内に、次元の末尾からの負のインデックスを正のインデックスへ変換する演算を導入できる。

gatherElements(input, indices, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input および indices のいずれかを用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
indicesのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
input または indices のいずれかのrankが、その許可される rankでない場合、TypeErrorを投げる。
axisを options.axis とする。
axis が inputのrank以上である場合、TypeErrorを投げる。
indicesShapeExpectedを、inputのshapeのコピーとする。
indicesShapeExpected[axis] を indicesのshape[axis] に設定する。
indicesのshapeが indicesShapeExpectedに等しくない場合、 TypeErrorを投げる。
グラフ接続を作成する:
1. outputを、input が与えられてMLOperand をコピーする結果とする。
2. operatorを、input、indices、および options が与えられた "gatherElements" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputsを input および indices に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

異なるスライス方式でgatherElementsがどのように動作するかの例。

// input of shape [4,3]:
//   [[ 0,  1,  2],
//    [10, 11, 12],
//    [20, 21, 22],
//    [30, 31, 32]]
// indices of shape [2,3]:
//   [[3, 1, 1],
//    [2, 0, 3]]
// axis = 0 (default)
// output of shape [2,3]:
//   [[30, 11, 12],
//    [20,  1, 32]]

const input1 = builder.constant(
  {dataType: 'float32', shape: [4, 3]},
  new Float32Array([0, 1, 2, 10, 11, 12, 20, 21, 22, 30, 31, 32]));

const indices1 = builder.constant(
  {dataType: 'uint32', shape: [2, 3]}, new Uint32Array([3, 1, 1, 2, 0, 3]));

const output1 = builder.gatherElements(input1, indices1);

// input of shape [4,3]:
//   [[ 0,  1,  2],
//    [10, 11, 12],
//    [20, 21, 22],
//    [30, 31, 32]]
// indices of shape [4,1]:
//   [[2],
//    [1],
//    [0],
//    [2]],
// axis = 1
// output of shape [4,1]:
//   [[ 2],
//    [11],
//    [20],
//    [32]]

const indices2 = builder.constant(
  {dataType: 'uint32', shape: [4, 1]}, new Uint32Array([2, 1, 0, 2]));

const output2 = builder.gatherElements(input1, indices2, {axis: 1});

// input of shape [4,2,2]:
//   [[[  0,   1],
//     [ 10,  11]],
//    [[100, 101],
//     [110, 111]],
//    [[200, 201],
//     [210, 211]],
//    [[300, 301],
//     [310, 311]],]
// indices of shape [1,2,2]:
//   [[[0, 2],
//     [1, 3]]],
// axis = 0
// output of shape [1,2,2]:
//   [[[  0, 201],
//     [110, 311]]]

const inputData3 = new Float32Array(
  [0, 1, 10, 11, 100, 101, 110, 111, 200, 201, 210, 211, 300, 301, 310, 311]);

const input3 =
  builder.constant({dataType: 'float32', shape: [4, 2, 2]}, inputData3);

const indices3 = builder.constant(
  {dataType: 'uint32', shape: [1, 2, 2]}, new Uint32Array([0, 2, 1, 3]));

const output3 = builder.gatherElements(input3, indices3, {axis: 0});

8.9.22. gatherND

indicesに従って、入力テンソルのスライスを収集する。

partial interface MLGraphBuilder {
  MLOperand gatherND(MLOperand input,
                     MLOperand indices,
                     optional MLOperatorOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLGatherSupportLimits gatherND;
};

引数:

input: MLOperand。値が収集される入力 N-D テンソル。
indices: MLOperand。 indices 配列は入力テンソルへの完全な座標を含み、最右の次元が座標ごとの次元数を保持する。したがって shape [10,1] の indices テンソルは 10 個の単一軸インデックスを保持し、 shape [4,3] は 3D 座標の 4 個のインデックスを保持する。値は "int32"、 "uint32"、または "int64" 型でなければならず、それぞれ、対応する入力次元のサイズを N とすると、 -N（含む）から N（含まない）までの範囲内でなければならない。負のインデックスは、対応する次元の末尾からインデックス付けすることを意味する。
options: 任意の MLOperatorOptions。この演算の任意パラメーター。

戻り値: MLOperand。 inputの rank + indicesの rank - indicesの shape[-1] - 1 に等しい rankを持つ出力 N-D テンソル。

`gatherND()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	任意	`"float32"`, `"float16"`, `"int32"`, `"int8"`, `"uint8"`	1 から N	1 から 5
`indices`	`"int32"`, `"uint32"`, `"int64"`	`"int32"`	1 から N	1 から 5
出力	同じ `input`	`"float32"`, `"float16"`, `"int32"`, `"int8"`, `"uint8"`	N	0 から 5

MLOpSupportLimits はgatherND()について次のメンバーを持つ。

gatherND, 型はMLGatherSupportLimits: gatherND()演算子のサポート制限。

indices パラメーターは、グラフが構築される時点では入力が実行まで不明であるため、gatherND() に対して許可範囲にクランプできない。指定されたクランプ動作が基盤となるプラットフォームによって提供されない場合、実装はコンパイル済みグラフ内にclamp() を導入できる。同様に、基盤となるプラットフォームが負のインデックスをサポートしない場合、実装はコンパイル済みグラフ内に、次元の末尾からの負のインデックスを正のインデックスへ変換する演算を導入できる。

gatherND(input, indices, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input および indices のいずれかを用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
indicesのdataTypeが、（この表に従って）許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
input または indices のいずれかのrankが、その許可される rankでない場合、TypeErrorを投げる。
inputShapeを inputのshapeとし、inputRankを inputのrankとする。
indicesShapeを indicesのshapeとし、indicesRank を indicesのrankとする。
input または indices のいずれかのrankが、その許可される rankでない場合、TypeErrorを投げる。
indexableSizeを indicesRank - 1 とする。
coordinateSizeを indicesShape[indexableSize] とする。
coordinateSize が inputRank より大きい場合、TypeErrorを投げる。
outputShapeを空のリストとする。
0 から indexableSize までの範囲（終端を含まない）の各 index について反復する:
1. indicesShape[index] を outputShape に付加する。
coordinateSize から inputRank までの範囲（終端を含まない）の各 index について反復する:
1. inputShape[index] を outputShape に付加する。
outputDescを、inputのdataTypeと outputShape が与えられてMLOperandDescriptor を作成する結果とする。
グラフ接続を作成する:
1. outputを、outputDesc が与えられてMLOperand を作成する結果とする。
2. operatorを、input、indices、および options が与えられた、 "gatherND" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputsを input および indices に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

異なるスライス方式でgatherNDがどのように動作するかの例。

// input of shape [2,2]:
//   [[0, 1],
//    [2, 3]]
// indices of shape [3,2]:
//   [[0, 0],
//    [1, 1],
//    [1, 0]]
// output of shape [3]:
//   [0, 3, 2]

const input1 = builder.constant(
  {dataType: 'float32', shape: [2, 2]}, new Float32Array([0, 1, 2, 3]));

const indices1 = builder.constant(
  {dataType: 'uint32', shape: [3, 2]}, new Uint32Array([0, 0, 1, 1, 1, 0]));

const output1 = builder.gatherND(input1, indices1);

// input of shape [2,2]:
//   [[0, 1],
//    [2, 3]]
// indices of shape [2,1]:
//   [[1],
//    [0]]
// output of shape [2,2]:
//   [[2, 3]    <= row [2, 3] from input coordinates [1, *]
//    [0, 1]]   <= row [0, 1] from input coordinates [0, *]

const indices2 = builder.constant(
  {dataType: 'uint32', shape: [2, 1]}, new Uint32Array([1, 0]));

const output2 = builder.gatherND(input1, indices2);

// input of shape [2,2,2]:
//   [[[0, 1],
//     [2, 3]],
//    [[4, 5],
//     [6, 7]]]
// indices of shape [2,2]:
//   [[0, 1],
//    [1, 0]]
// output of shape [2,2]:
//   [[2, 3],   <= row [2, 3] from input coordinates [0, 1, *]
//    [4, 5]]   <= row [4, 5] from input coordinates [1, 0, *]

const input2 = builder.constant(
  {dataType: 'float32', shape: [2, 2, 2]},
  new Float32Array([0, 1, 2, 3, 4, 5, 6, 7]));

const indices3 = builder.constant(
  {dataType: 'uint32', shape: [2, 2]}, new Uint32Array([0, 1, 1, 0]));

const output3 = builder.gatherND(input2, indices3);

// input of shape [2,2,2]:
//   [[[0, 1],
//     [2, 3]],
//    [[4, 5],
//     [6, 7]]]
// indices of shape [3,1]:
//   [[1],
//    [0],
//    [1]]
// output of shape [3,2,2]:
//   [[[4, 5],   <= block [[4, 5], [6, 7]] from input coordinates [1, *, *]
//     [6, 7]],
//    [[0, 1],   <= block [[0, 1], [2, 3]] from input coordinates [0, *, *]
//     [2, 3]],
//    [[4, 5],   <= block [[4, 5], [6, 7]] from input coordinates [1, *, *]
//     [6, 7]]]

const indices4 = builder.constant(
  {dataType: 'uint32', shape: [3, 1]}, new Uint32Array([1, 0, 1]));

const output4 = builder.gatherND(input2, indices4);

// input of shape [2,2,2]:
//   [[[0, 1],
//     [2, 3]],
//    [[4, 5],
//     [6, 7]]]
// indices of shape [5,3]:
//   [[0,0,1],
//    [0,1,0],
//    [1,0,0],
//    [1,1,0],
//    [1,1,1]]
// output of shape [5]:
//   [1,2,4,6,7]

const indices5 = builder.constant(
  {dataType: 'uint32', shape: [5, 3]},
  new Uint32Array([0, 0, 1, 0, 1, 0, 1, 0, 0, 1, 1, 0, 1, 1, 1]));

const output5 = builder.gatherND(input2, indices5);

8.9.23. gelu

入力テンソルのガウス誤差線形ユニット関数（GELU）を計算する。計算は式 0.5 * x * (1 + erf(x / sqrt(2)))に従う。

partial interface MLGraphBuilder {
  MLOperand gelu(MLOperand input, optional MLOperatorOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits gelu;
};

引数:

input: MLOperand。入力テンソル。
options: MLOperatorOptions。演算の任意パラメーターを指定する。

戻り値:

MLOperand。 inputと同じ形状の出力テンソル。

`gelu()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	N	0 から 5
出力	同じ `input`	`"float32"`, `"float16"`	同じ `input`	0 から 5

MLOpSupportLimits は gelu() について次のメンバーを持つ:

gelu, 型は MLSingleInputSupportLimits: gelu()演算子のサポート制限。

gelu(input, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input を用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
inputのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
グラフ接続を作成する:
1. outputを、input が与えられてMLOperand をコピーする結果とする。
2. operatorを、options が与えられた "gelu" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputを input に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

この演算の振る舞いは、ユーザーエージェントは通常より効率的な実装を持つものの、次のように他の演算の使用から一般的にエミュレートできる。基盤となるプラットフォームが演算を直接サポートしない場合、この分解は実装を導くためのテンプレートとして使用できる。

function gelu(builder, input) {
  return builder.mul(
    builder.mul(input, builder.constant(input.dataType, 0.5)),
    builder.add(
      builder.constant(input.dataType, 1),
      builder.erf(builder.div(
        input, builder.sqrt(builder.constant(input.dataType, 2))))));
}

8.9.24. gemm

Basic Linear Algebra Subprogramsの一般行列乗算を計算する。計算は式 alpha * A * B + beta * Cに従う。ここで、Aは形状[M, K]または [K, M]を持つ2-Dテンソル、Bは形状[K, N]または[N, K]を持つ 2-Dテンソルであり、Cは形状[M, N]へ単方向ブロードキャスト可能である。Aおよび Bは、計算の前に任意で転置できる。

dictionary MLGemmOptions : MLOperatorOptions {
  MLOperand c;
  double alpha = 1.0;
  double beta = 1.0;
  boolean aTranspose = false;
  boolean bTranspose = false;
};

partial interface MLGraphBuilder {
  MLOperand gemm(MLOperand a, MLOperand b, optional MLGemmOptions options = {});
};

dictionary MLGemmSupportLimits {
  MLTensorLimits a;
  MLTensorLimits b;
  MLTensorLimits c;
  MLTensorLimits output;
};

partial dictionary MLOpSupportLimits {
  MLGemmSupportLimits gemm;
};

MLGemmOptions は次のメンバーを持つ:

c, 型はMLOperand: 第3入力テンソル。これはスカラー、または形状[M, N]へ単方向ブロードキャスト可能な形状である。指定されない場合、計算はc がスカラー0.0であるかのように行われる。
alpha, 型はdouble、デフォルトは1.0: 第1入力用の乗数。
beta, 型はdouble、デフォルトは1.0: 第3入力c用の乗数。
aTranspose, 型はboolean、デフォルトはfalse: 出力を計算する前に第1入力を転置するかを示す。
bTranspose, 型はboolean、デフォルトはfalse: 出力を計算する前に第2入力を転置するかを示す。

引数:

a: MLOperand。 aTranspose がfalseの場合は形状[M, K]、aTranspose がtrueの場合は[K, M]の、第1入力2-Dテンソル。
b: MLOperand。 bTranspose がfalseの場合は形状[K, N]、bTranspose がtrueの場合は[N, K]の、第2入力2-Dテンソル。
options: 任意の MLGemmOptions。演算の任意パラメーター。

戻り値: MLOperand。すべての入力の計算された積を含む、形状[M, N]の出力2-Dテンソル。

`gemm()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`a`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	2	2
`b`	同じ `a`	`"float32"`, `"float16"`	2	2
`c`	同じ `a`	`"float32"`, `"float16"`	0 から 2	0 から 2
出力	同じ `a`	`"float32"`, `"float16"`	2	2

MLGemmSupportLimits は次のメンバーを持つ:

a, 型は MLTensorLimits: a オペランド用のMLTensorLimits。
b, 型は MLTensorLimits: b オペランド用のMLTensorLimits。
c, 型は MLTensorLimits: c オペランド用のMLTensorLimits。
output, 型は MLTensorLimits: 出力オペランド用のMLTensorLimits。

MLOpSupportLimits は gemm() について次のメンバーを持つ:

gemm, 型は MLGemmSupportLimits: gemm()演算子のサポート制限。

gemm(a, b, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と a および b のいずれかを用いたオペランドの検証が false を返す場合、 TypeErrorを投げる。
a または b のいずれかのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
a または b のいずれかのrankが、その許可される rankでない場合、TypeErrorを投げる。
options.alpha を、options.alpha を aのdataTypeへキャストする結果に設定する。
options.beta を、options.beta を aのdataTypeへキャストする結果に設定する。
shapeAを、 aのshapeの複製とする。
shapeBを、 bのshapeの複製とする。
options.aTranspose が true の場合、shapeA 内の項目の順序を反転する。
options.bTranspose が true の場合、shapeB 内の項目の順序を反転する。
shapeA[1] が shapeB[0] に等しくない場合、TypeErrorを投げる。
options.c が存在する場合:
1. それが shape « shapeA[0], shapeB[1] » に単方向にブロードキャスト可能でない場合、TypeErrorを投げる。
2. そのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
descを、aのdataTypeと « shapeA[0], shapeB[1] » が与えられてMLOperandDescriptor を作成する結果とする。
グラフ接続を作成する:
1. outputを、this と desc が与えられてMLOperand を作成する結果とする。
2. operatorを、options が与えられた "gemm" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputsを a および b に設定する。
5. options.c が存在する場合、それを operatorのinputsに追加する。
6. operatorのoutputを output に設定する。
outputを返す。

function gemm(builder, a, b, options) {
  if (options.aTranspose)
    a = builder.transpose(a);

  if (options.bTranspose)
    b = builder.transpose(b);

  let ab = builder.matmul(
    builder.mul(builder.constant(a.dataType, options.alpha), a), b);
  return (
    options.c ?
      builder.add(
        ab,
        builder.mul(builder.constant(a.dataType, options.beta), options.c)) :
      ab);
}

8.9.25. gru

Gated Recurrent Unit [GRU] recurrent networkは、update、reset、およびnew gateを使用して、ネットワークの時間的シーケンスにわたって出力へ繰り込まれる出力状態を計算する。

enum MLGruWeightLayout {
  "zrn",  // update-reset-new gate ordering
  "rzn"   // reset-update-new gate ordering
};

enum MLRecurrentNetworkActivation {
  "relu",
  "sigmoid",
  "tanh"
};

enum MLRecurrentNetworkDirection {
  "forward",
  "backward",
  "both"
};

dictionary MLGruOptions : MLOperatorOptions {
  MLOperand bias;
  MLOperand recurrentBias;
  MLOperand initialHiddenState;
  boolean resetAfter = true;
  boolean returnSequence = false;
  MLRecurrentNetworkDirection direction = "forward";
  MLGruWeightLayout layout = "zrn";
  sequence<MLRecurrentNetworkActivation> activations;
};

partial interface MLGraphBuilder {
  sequence<MLOperand> gru(MLOperand input,
                          MLOperand weight,
                          MLOperand recurrentWeight,
                          [EnforceRange] unsigned long steps,
                          [EnforceRange] unsigned long hiddenSize,
                          optional MLGruOptions options = {});
};

dictionary MLGruSupportLimits {
  MLTensorLimits input;
  MLTensorLimits weight;
  MLTensorLimits recurrentWeight;
  MLTensorLimits bias;
  MLTensorLimits recurrentBias;
  MLTensorLimits initialHiddenState;
  MLTensorLimits output0;
  MLTensorLimits output1;
};

partial dictionary MLOpSupportLimits {
  MLGruSupportLimits gru;
};

MLGruOptions は次のメンバーを持つ:

bias, 型はMLOperand: 形状[numDirections, 3 * hiddenSize]の2-D入力バイアステンソル。テンソル形状の第2次元におけるバイアスベクトルの順序は、layoutに従って指定される。
recurrentBias, 型はMLOperand: 形状[numDirections, 3 * hiddenSize]の2-D再帰バイアステンソル。テンソル形状の第2次元におけるバイアスベクトルの順序は、layoutに従って指定される。
initialHiddenState, 型はMLOperand: 形状[numDirections, batchSize, hiddenSize]の3-D初期隠れ状態テンソル。指定されない場合、実装はゼロで埋められたテンソルを使用しなければならない。
resetAfter, 型はboolean、デフォルトはtrue: 行列乗算の後または前にreset gateを適用するかを示す。
returnSequence, 型は boolean、デフォルトはfalse: 最後のtime stepの出力に加えて、各time stepからのすべての出力を含むシーケンス全体も返すかを示す。
direction, 型はMLRecurrentNetworkDirection、デフォルトは "forward": 入力シーケンスの処理方向。"both"に設定された場合、weightおよびbiasテンソル形状の第1次元のサイズは2でなければならず、入力は両方向で処理される。
layout, 型はMLGruWeightLayout、デフォルトは "zrn": GRUの内部gate、具体的にはupdate (z)、reset (r)、および new (n) gateに対するweightおよびbiasベクトルの順序。weightおよびbiasテンソル形状の第2次元で示される。
activations, 型は sequence<MLRecurrentNetworkActivation>: 活性化関数のペアを指定する。第1の関数はupdateおよびreset gateに使用され、第2の関数はnew gateに使用される。指定されない場合、それぞれ"sigmoid" および"tanh" 関数がデフォルトとなる。

引数:

input: MLOperand。形状[steps, batchSize, inputSize]の入力3-Dテンソル。
weight: MLOperand。形状[numDirections, 3 * hiddenSize, inputSize]の3-D入力weightテンソル。テンソル形状の第2次元におけるweightベクトルの順序は、layoutに従って指定される。
recurrentWeight: MLOperand。形状[numDirections, 3 * hiddenSize, hiddenSize]の3-D再帰weightテンソル。テンソル形状の第2次元におけるweightベクトルの順序は、layoutに従って指定される。
steps: unsigned long スカラー。recurrent network内のtime step数。この値は0より大きくなければならない。
hiddenSize: unsigned long スカラー。cell出力テンソル形状の第3次元の値。隠れ状態における特徴量の数を示す。
options: 任意のMLGruOptions。演算の任意パラメーター。

戻り値: sequence<MLOperand>。第1要素は、形状[numDirections, batchSize, hiddenSize]の3-Dテンソルであり、ネットワークの最後のtime stepからのcell出力である。さらに、returnSequence がtrueに設定されている場合、第2要素は時間的シーケンス内の各time stepからのすべてのcell出力を含む形状[steps, numDirections, batchSize, hiddenSize]の4-D出力テンソルである。

`gru()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	3	3
`weight`	同じ `input`	`"float32"`, `"float16"`	3	3
`recurrentWeight`	同じ `input`	`"float32"`, `"float16"`	3	3
`bias`	同じ `input`	`"float32"`, `"float16"`	2	2
`recurrentBias`	同じ `input`	`"float32"`, `"float16"`	2	2
`initialHiddenState`	同じ `input`	`"float32"`, `"float16"`	3	3
outputs[0]	同じ `input`	`"float32"`, `"float16"`	3	3
outputs[1]（`returnSequence` が true の場合）	同じ `input`	`"float32"`, `"float16"`	4	4

MLGruSupportLimits は次のメンバーを持つ:

input, 型はMLTensorLimits: inputオペランド用のMLTensorLimits。
weight, 型はMLTensorLimits: weightオペランド用のMLTensorLimits。
recurrentWeight, 型はMLTensorLimits: recurrentWeightオペランド用のMLTensorLimits。
bias, 型はMLTensorLimits: biasオペランド用のMLTensorLimits。
recurrentBias, 型はMLTensorLimits: recurrentBiasオペランド用のMLTensorLimits。
initialHiddenState, 型はMLTensorLimits: initialHiddenStateオペランド用のMLTensorLimits。
output0, 型はMLTensorLimits: すべての出力オペランド[0]用のMLTensorLimits。
output1, 型はMLTensorLimits: すべての出力オペランド[1]用のMLTensorLimits。

MLOpSupportLimits はgru()について次のメンバーを持つ:

gru, 型はMLGruSupportLimits: gru()演算子のサポート制限。

gru(input, weight, recurrentWeight, steps, hiddenSize, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input、weight、 recurrentWeight、options.bias （それが存在する場合）、options.recurrentBias （それが存在する場合）、および options.initialHiddenState （それが存在する場合）のいずれかを用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
input、weight または recurrentWeight のいずれかのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
input、weight または recurrentWeight のいずれかのrankが、その許可される rankでない場合、 TypeErrorを投げる。
inputのshape[0] が steps に等しくない場合、TypeErrorを投げる。
batchSizeを inputのshape[1] とする。
inputSizeを inputのshape[2] とする。
options.direction が "both" の場合は numDirections を 2 とし、それ以外の場合は 1 とする。
weightのshapeが « numDirections, 3 * hiddenSize, inputSize » に等しくない場合、TypeErrorを投げる。
recurrentWeightのshapeが « numDirections, 3 * hiddenSize, hiddenSize » に等しくない場合、TypeErrorを投げる。
hiddenSize * 6 が有効な次元でない場合、TypeErrorを投げる。

なぜ hiddenSize * 6 なのか?
一部の基盤プラットフォームは、bias と recurrentBias を連結した単一の bias テンソルを扱う。したがって、3 * hiddenSize + 3 * hiddenSize も有効な次元である必要がある。
options.bias が存在する場合:
1. そのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
2. そのshapeが « numDirections, 3 * hiddenSize » に等しくない場合、TypeErrorを投げる。
options.recurrentBias が存在する場合:
1. そのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
2. そのshapeが « numDirections, 3 * hiddenSize » に等しくない場合、TypeErrorを投げる。
options.initialHiddenState が存在する場合:
1. そのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
2. そのshapeが « numDirections, batchSize, hiddenSize » に等しくない場合、TypeErrorを投げる。
options.activations が存在する場合:
1. そのsizeが 2 でない場合、TypeErrorを投げる。
2. activationsを options.activationsの複製とする。
そうでない場合:
1. activationsを « "sigmoid", "tanh" » とする。
出力 shape を計算する:
1. desc0を、inputのdataTypeと « numDirections, batchSize, hiddenSize » が与えられてMLOperandDescriptor を作成する結果とする。
2. options.returnSequence が true の場合:
  1. desc1を、inputのdataType と « steps, numDirections, batchSize, hiddenSize » が与えられてMLOperandDescriptor を作成する結果とする。
グラフ接続を作成する:
1. operatorを、weight、recurrentWeight、steps、hiddenSize および options が与えられた "gru" 操作のoperatorとする。
2. output0を、this と desc0 が与えられてMLOperand を作成する結果とする。
3. options.returnSequence が true の場合:
  1. output1を、this と desc1 が与えられてMLOperand を作成する結果とする。
  2. outputをリスト « output0, output1 » とする。
  3. output0.[[operator]] および output1.[[operator]] を operator に設定する。
4. そうでない場合:
  1. outputをリスト « output0 » とする。
  2. output0.[[operator]] を operator に設定する。
5. operatorのinputsを input、weight、および recurrentWeight に設定する。
6. options.bias が存在する場合、それを operatorのinputsに追加する。
7. options.recurrentBias が存在する場合、それを operatorのinputsに追加する。
8. options.initialHiddenState が存在する場合、それを operatorのinputsに追加する。
9. operatorのactivation functionsを activationsの複製に設定する。
10. operatorのoutputを output に設定する。
outputを返す。

squeeze()ヘルパーを使用して、この演算の振る舞いは、ユーザーエージェントは通常より効率的な実装を持つものの、次のように他の演算の使用から一般的にエミュレートできる。基盤となるプラットフォームが演算を直接サポートしない場合、この分解は実装を導くためのテンプレートとして使用できる。

function gru(
  builder, input, weight, recurrentWeight, steps, hiddenSize, options) {
  const batchSize = input.shape[1];
  const inputSize = input.shape[2];
  const direction = options.direction || 'forward';
  const numDirections = (direction == 'both' ? 2 : 1);
  let hiddenState = options.initialHiddenState;

  if (!hiddenState) {
    const desc = {
      dataType: 'float32',
      shape: [numDirections, batchSize, hiddenSize]
    };
    const totalSize = numDirections * batchSize * hiddenSize;
    hiddenState = builder.constant(desc, new Float32Array(totalSize).fill(0));
  }

  let currentWeight = [];
  let currentRecurrentWeight = [];
  let currentBias = [];
  let currentRecurrentBias = [];
  let forwardSequence = null;
  let backwardSequence = null;
  let outputHidden = null;

  for (let dir = 0; dir < numDirections; ++dir) {
    currentWeight.push(squeeze(
      builder,
      builder.slice(weight, [dir, 0, 0], [1, 3 * hiddenSize, inputSize])));
    currentRecurrentWeight.push(squeeze(
      builder,
      builder.slice(
        recurrentWeight, [dir, 0, 0], [1, 3 * hiddenSize, hiddenSize])));
    currentBias.push(
      options.bias ?
        (squeeze(
          builder,
          builder.slice(options.bias, [dir, 0], [1, 3 * hiddenSize]))) :
        null);
    currentRecurrentBias.push(
      options.recurrentBias ?
        (squeeze(
          builder,
          builder.slice(
            options.recurrentBias, [dir, 0], [1, 3 * hiddenSize]))) :
        null);
    let currentHidden = squeeze(
      builder,
      builder.slice(hiddenState, [dir, 0, 0], [1, batchSize, hiddenSize]), [0]);

    for (let step = 0; step < steps; ++step) {
      const slice =
        (dir == 1 || direction == 'backward' ? steps - step - 1 : step);
      const currentInput = squeeze(
        builder,
        builder.slice(input, [slice, 0, 0], [1, batchSize, inputSize]), [0]);

      currentHidden = builder.gruCell(
        currentInput,
        currentWeight[dir],
        currentRecurrentWeight[dir],
        currentHidden,
        hiddenSize,
        {
          bias: currentBias[dir],
          recurrentBias: currentRecurrentBias[dir],
          resetAfter: options.resetAfter,
          layout: options.layout,
          activations: options.activations
        });

      if (options.returnSequence) {
        // Expand currentHidden of 2D([batchSize, hiddenSize])
        // to 4D([steps, numDirections, batchSize, hiddenSize])
        const expandedHiddenAs4D =
          builder.reshape(currentHidden, [1, 1, batchSize, hiddenSize]);

        if (direction == 'forward' || (dir == 0 && direction == 'both')) {
          forwardSequence = forwardSequence ?
            builder.concat([forwardSequence, expandedHiddenAs4D], 0) :
            expandedHiddenAs4D;
        } else if (
          direction == 'backward' || (dir == 1 && direction == 'both')) {
          backwardSequence = backwardSequence ?
            builder.concat([expandedHiddenAs4D, backwardSequence], 0) :
            expandedHiddenAs4D;
        }
      }
    }

    // Expand currentHidden of 2D([batchSize, hiddenSize])
    // to 3D([numDirections, batchSize, hiddenSize])
    const expandedHiddenAs3D =
      builder.reshape(currentHidden, [1, batchSize, hiddenSize]);
    outputHidden = outputHidden ?
      builder.concat([outputHidden, expandedHiddenAs3D], 0) :
      expandedHiddenAs3D;
  }

  if (options.returnSequence) {
    let outputSequence = null;

    if (direction == 'forward') {
      outputSequence = forwardSequence;
    } else if (direction == 'backward') {
      outputSequence = backwardSequence;
    } else if (direction == 'both') {
      // Concat along axis 1 (numDirections dimension)
      outputSequence = builder.concat([forwardSequence, backwardSequence], 1);
    }

    return [outputHidden, outputSequence];
  } else {
    return [outputHidden];
  }
}

8.9.26. gruCell

Gated Recurrent Unit [GRU] recurrent networkの単一time stepであり、update gateおよびreset gateを使用して、recurrent networkの時間的シーケンスにわたって出力へ繰り込まれる隠れ状態を計算する。

dictionary MLGruCellOptions : MLOperatorOptions {
  MLOperand bias;
  MLOperand recurrentBias;
  boolean resetAfter = true;
  MLGruWeightLayout layout = "zrn";
  sequence<MLRecurrentNetworkActivation> activations;
};

partial interface MLGraphBuilder {
  MLOperand gruCell(MLOperand input,
                    MLOperand weight,
                    MLOperand recurrentWeight,
                    MLOperand hiddenState,
                    [EnforceRange] unsigned long hiddenSize,
                    optional MLGruCellOptions options = {});
};

dictionary MLGruCellSupportLimits {
  MLTensorLimits input;
  MLTensorLimits weight;
  MLTensorLimits recurrentWeight;
  MLTensorLimits hiddenState;
  MLTensorLimits bias;
  MLTensorLimits recurrentBias;
  MLTensorLimits output;
};

partial dictionary MLOpSupportLimits {
  MLGruCellSupportLimits gruCell;
};

MLGruCellOptions は次のメンバーを持つ:

bias, 型はMLOperand: 形状[3 * hiddenSize]の1-D入力バイアステンソル。テンソル形状の第2次元におけるバイアスベクトルの順序は、layoutに従って指定される。
recurrentBias, 型は MLOperand: 形状[3 * hiddenSize]の1-D再帰バイアステンソル。テンソル形状の第2次元におけるバイアスベクトルの順序は、layoutに従って指定される。
resetAfter, 型はboolean、デフォルトはtrue: 行列乗算の後または前にreset gateを適用するかを示す。
layout, 型はMLGruWeightLayout、デフォルトは "zrn": GRUの内部gate、具体的にはupdate (z)、reset (r)、およびnew (n) gateに対するweightおよびbiasベクトルの順序。weightおよびbiasテンソル形状の第2次元で示される。
activations, 型は sequence<MLRecurrentNetworkActivation>: 活性化関数のペアを指定する。第1の関数はupdateおよびreset gateに使用され、第2の関数はnew gateに使用される。指定されない場合、それぞれ"sigmoid" および"tanh" 関数がデフォルトとなる。

引数:

input: MLOperand。形状[batchSize, inputSize]の入力2-Dテンソル。
weight: MLOperand。形状[3 * hiddenSize, inputSize]の2-D入力weightテンソル。テンソル形状の第1次元における weightベクトルの順序は、layoutに従って指定される。
recurrentWeight: MLOperand。形状[3 * hiddenSize, hiddenSize]の2-D再帰weightテンソル。テンソル形状の第1次元における weightベクトルの順序は、layoutに従って指定される。
hiddenState: MLOperand。形状[batchSize, hiddenSize]の入力2-D隠れ状態テンソル。
hiddenSize: unsigned long スカラー。出力テンソル形状の第2次元の値。隠れ状態における特徴量の数を示す。
options: 任意のMLGruCellOptions。演算の任意パラメーター。

戻り値: MLOperand。形状[batchSize, hiddenSize]の2-Dテンソルであり、recurrent networkの単一time stepの cell出力隠れ状態。

`gruCell()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	2	2
`weight`	同じ `input`	`"float32"`, `"float16"`	2	2
`recurrentWeight`	同じ `input`	`"float32"`, `"float16"`	2	2
`hiddenState`	同じ `input`	`"float32"`, `"float16"`	2	2
`bias`	同じ `input`	`"float32"`, `"float16"`	1	1
`recurrentBias`	同じ `input`	`"float32"`, `"float16"`	1	1
output	同じ `input`	`"float32"`, `"float16"`	2	2

MLGruCellSupportLimits は次のメンバーを持つ;

input, 型はMLTensorLimits: inputオペランド用のMLTensorLimits。
weight, 型はMLTensorLimits: weightオペランド用のMLTensorLimits。
recurrentWeight, 型は MLTensorLimits: recurrentWeightオペランド用のMLTensorLimits。
hiddenState, 型はMLTensorLimits: hiddenStateオペランド用のMLTensorLimits。
bias, 型はMLTensorLimits: biasオペランド用のMLTensorLimits。
recurrentBias, 型はMLTensorLimits: recurrentBiasオペランド用のMLTensorLimits。
output, 型はMLTensorLimits: outputオペランド用のMLTensorLimits。

MLOpSupportLimits はgruCell()について次のメンバーを持つ:

gruCell, 型はMLGruCellSupportLimits: gruCell()演算子のサポート制限。

gruCell(input, weight, recurrentWeight, hiddenState, hiddenSize, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input、weight、 recurrentWeight、hiddenState、options.bias （それが存在する場合）、および options.recurrentBias （それが存在する場合）のいずれかを用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
input、 weight、recurrentWeight、または hiddenState のいずれかのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
input、weight、recurrentWeight または hiddenState のいずれかのrank が、（この表に従って）その許可される rankでない場合、TypeErrorを投げる。
batchSizeを inputのshape[0] とする。
inputSizeを inputのshape[1] とする。
weightのshapeが « 3 * hiddenSize, inputSize » に等しくない場合、TypeErrorを投げる。
recurrentWeightのshapeが « 3 * hiddenSize, hiddenSize » に等しくない場合、TypeErrorを投げる。
hiddenStateのshapeが « batchSize, hiddenSize » に等しくない場合、TypeErrorを投げる。
hiddenSize * 6 が有効な次元でない場合、TypeErrorを投げる。

なぜ hiddenSize * 6 なのか?
一部の基盤プラットフォームは、bias と recurrentBias を連結した単一の bias テンソルを扱う。したがって、3 * hiddenSize + 3 * hiddenSize も有効な次元である必要がある。
options.bias が存在する場合:
1. そのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
2. そのshapeが « 3 * hiddenSize » に等しくない場合、TypeErrorを投げる。
options.recurrentBias が存在する場合:
1. そのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
2. そのshapeが « 3 * hiddenSize » に等しくない場合、TypeErrorを投げる。
options.activations が存在する場合:
1. そのsizeが 2 でない場合、TypeErrorを投げる。
2. activationsを options.activationsの複製とする。
そうでない場合:
1. activationsを « "sigmoid", "tanh" » とする。
descを、inputのdataTypeと « batchSize, hiddenSize » が与えられてMLOperandDescriptor を作成する結果とする。
グラフ接続を作成する:
1. outputを、this と desc が与えられてMLOperand を作成する結果とする。
2. operatorを、weight、recurrentWeight、hiddenState、 hiddenSize および options が与えられた "gruCell" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputsを input、weight、 recurrentWeight、および hiddenState に設定する。
5. options.bias が存在する場合、それを operatorのinputsに追加する。
6. options.recurrentBias が存在する場合、それを operatorのinputsに追加する。
7. operatorのactivation functionsを activationsの複製に設定する。
8. operatorのoutputを output に設定する。
outputを返す。

weight layoutがデフォルトの"zrn" layoutであり、update/reset gateおよびnew gateの活性化関数がそれぞれsigmoid() およびtanh() である場合のこの演算の振る舞いは、ユーザーエージェントは通常より効率的な実装を持つものの、次のように他の演算の使用から一般的にエミュレートできる。基盤となるプラットフォームが演算を直接サポートしない場合、この分解は実装を導くためのテンプレートとして使用できる。

function gruCell(
  builder, input, weight, recurrentWeight, hiddenState, hiddenSize, options) {
  const one = builder.constant(input.dataType, 1);
  const zero = builder.constant(input.dataType, 0);

  const inputSize = input.shape[1];

  // update gate (z)
  let z = builder.sigmoid(builder.add(
    builder.add(
      (options.bias ? builder.slice(options.bias, [0], [hiddenSize]) : zero),
      (options.recurrentBias ?
         builder.slice(options.recurrentBias, [0], [hiddenSize]) :
         zero)),
    builder.add(
      builder.matmul(
        input,
        builder.transpose(
          builder.slice(weight, [0, 0], [hiddenSize, inputSize]))),
      builder.matmul(
        hiddenState,
        builder.transpose(
          builder.slice(recurrentWeight, [0, 0], [hiddenSize, hiddenSize]))))));

  // reset gate (r)
  let r = builder.sigmoid(builder.add(
    builder.add(
      (options.bias ? builder.slice(options.bias, [hiddenSize], [hiddenSize]) :
                      zero),
      (options.recurrentBias ?
         builder.slice(options.recurrentBias, [hiddenSize], [hiddenSize]) :
         zero)),
    builder.add(
      builder.matmul(
        input,
        builder.transpose(
          builder.slice(weight, [hiddenSize, 0], [hiddenSize, inputSize]))),
      builder.matmul(
        hiddenState,
        builder.transpose(builder.slice(
          recurrentWeight, [hiddenSize, 0], [hiddenSize, hiddenSize]))))));

  // new gate (n)
  let n;
  if (options.resetAfter) {
    n = builder.tanh(builder.add(
      (options.bias ?
         builder.slice(options.bias, [2 * hiddenSize], [hiddenSize]) :
         zero),
      builder.add(
        builder.matmul(
          input,
          builder.transpose(builder.slice(
            weight, [2 * hiddenSize, 0], [hiddenSize, inputSize]))),
        builder.mul(
          r,
          builder.add(
            (options.recurrentBias ?
               builder.slice(
                 options.recurrentBias, [2 * hiddenSize], [hiddenSize]) :
               zero),
            builder.matmul(
              hiddenState,
              builder.transpose(builder.slice(
                recurrentWeight,
                [2 * hiddenSize, 0],
                [hiddenSize, hiddenSize]))))))));
  } else {
    n = builder.tanh(builder.add(
      builder.add(
        (options.bias ?
           builder.slice(options.bias, [2 * hiddenSize], [hiddenSize]) :
           zero),
        (options.recurrentBias ?
           builder.slice(
             options.recurrentBias, [2 * hiddenSize], [hiddenSize]) :
           zero)),
      builder.add(
        builder.matmul(
          input,
          builder.transpose(builder.slice(
            weight, [2 * hiddenSize, 0], [hiddenSize, inputSize]))),
        builder.matmul(
          builder.mul(r, hiddenState),
          builder.transpose(builder.slice(
            recurrentWeight,
            [2 * hiddenSize, 0],
            [hiddenSize, hiddenSize]))))));
  }

  // compute the new hidden state
  return builder.add(
    builder.mul(z, hiddenState), builder.mul(n, builder.sub(one, z)));
}

8.9.27. hardSigmoid

入力テンソルに対して非平滑なhard sigmoid関数を計算する。これは、より高速な計算のためにsigmoid関数の代わりに使用される。

dictionary MLHardSigmoidOptions : MLOperatorOptions {
  double alpha = 0.2;
  double beta = 0.5;
};

partial interface MLGraphBuilder {
  MLOperand hardSigmoid(MLOperand input, optional MLHardSigmoidOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits hardSigmoid;
};

MLHardSigmoidOptions は次のメンバーを持つ:

alpha, 型はdouble、デフォルトは0.2: スカラー乗数。
beta, 型はdouble、デフォルトは0.5: スカラー加算値。

引数:

input: MLOperand。入力テンソル。
options: 任意の MLHardSigmoidOptions。演算の任意パラメーター。

戻り値:

MLOperand。 inputと同じ形状の出力テンソル。

`hardSigmoid()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`	同じ `input`	0 から 5

MLOpSupportLimits は hardSigmoid() について次のメンバーを持つ:

hardSigmoid, 型は MLSingleInputSupportLimits: hardSigmoid()演算子のサポート制限。

hardSigmoid(input, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input を用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
inputのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
options.alpha を、options.alpha を inputのdataTypeへキャストする結果に設定する。
options.beta を、options.beta を inputのdataTypeへキャストする結果に設定する。
グラフ接続を作成する:
1. outputを、input が与えられてMLOperand をコピーする結果とする。
2. operatorを、options が与えられた "hardSigmoid" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputを input に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

function hardSigmoid(builder, input, options) {
  return builder.max(
    builder.min(
      builder.add(
        builder.mul(builder.constant(input.dataType, options.alpha), input),
        builder.constant(input.dataType, options.beta)),
      builder.constant(input.dataType, 1)),
    builder.constant(input.dataType, 0));
}

8.9.28. hardSwish

入力テンソルに対して要素ごとに、[MobileNetV3]で導入された非線形関数y = x * max(0, min(6, (x + 3))) / 6を計算する。

partial interface MLGraphBuilder {
  MLOperand hardSwish(MLOperand input, optional MLOperatorOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits hardSwish;
};

引数:

input: MLOperand。入力テンソル。
options: MLOperatorOptions。演算の任意パラメーターを指定する。

戻り値:

MLOperand。 inputと同じ形状の出力テンソル。

`hardSwish()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`	同じ `input`	0 から 5

MLOpSupportLimits は hardSwish() について次のメンバーを持つ:

hardSwish, 型は MLSingleInputSupportLimits: hardSwish()演算子のサポート制限。

hardSwish(input, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input を用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
inputのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
グラフ接続を作成する:
1. outputを、input が与えられてMLOperand をコピーする結果とする。
2. operatorを、options が与えられた "hardSwish" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputを input に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

function hardSwish(builder, input, options) {
  return builder.div(
    builder.mul(
      input,
      builder.max(
        builder.constant(input.dataType, 0),
        builder.min(
          builder.constant(input.dataType, 6),
          builder.add(input, builder.constant(input.dataType, 3))))),
    builder.constant(input.dataType, 6));
}

8.9.29. instanceNormalization

[Instance-Normalization]を使用して入力を正規化する。batchNormalization()では、正規化に使用される平均値および分散値はモデルの学習中にbatch次元内のすべてのサンプルにわたって計算されるが、 instance normalizationで使用される平均値および分散値は、batch内の各個別サンプルの各入力特徴量に対してその場で計算される。

dictionary MLInstanceNormalizationOptions : MLOperatorOptions {
  MLOperand scale;
  MLOperand bias;
  double epsilon = 1e-5;
  MLInputOperandLayout layout = "nchw";
};

partial interface MLGraphBuilder {
  MLOperand instanceNormalization(
    MLOperand input,
    optional MLInstanceNormalizationOptions options = {});
};

dictionary MLNormalizationSupportLimits {
  MLTensorLimits input;
  MLTensorLimits scale;
  MLTensorLimits bias;
  MLTensorLimits output;
};

partial dictionary MLOpSupportLimits {
  MLNormalizationSupportLimits instanceNormalization;
};

MLInstanceNormalizationOptions は次のメンバーを持つ:

scale, 型はMLOperand: スケーリング値の1-Dテンソル。そのsizeは channel数、すなわち入力のfeature次元のサイズに等しい。たとえば、input テンソルのlayoutが"nchw" である場合、sizeはinputの shape[1]に等しい。
bias, 型はMLOperand: バイアス値の1-Dテンソル。そのsizeは入力のfeature次元のサイズに等しい。たとえば、input テンソルのlayoutが"nchw" である場合、sizeはinputの shape[1]に等しい。
epsilon, 型はdouble、デフォルトは1e-5: ゼロ除算による計算エラーを防ぐための小さい値。
layout, 型はMLInputOperandLayout、デフォルトは "nchw": 入力のlayout形式。

引数:

input: MLOperand。入力4-Dテンソル。
options: 任意のMLInstanceNormalizationOptions。演算の任意パラメーター。

戻り値: MLOperand。 inputと同じ形状の、instance-normalizedされた4-Dテンソル。

`instanceNormalization()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	4	4
`scale`	同じ `input`	`"float32"`, `"float16"`	1	1
`bias`	同じ `input`	`"float32"`, `"float16"`	1	1
output	同じ `input`	`"float32"`, `"float16"`	4	4

MLNormalizationSupportLimits は次のメンバーを持つ:

input, 型は MLTensorLimits: input オペランド用のMLTensorLimits。
scale, 型は MLTensorLimits: scale オペランド用のMLTensorLimits。
bias, 型は MLTensorLimits: bias オペランド用のMLTensorLimits。
output, 型は MLTensorLimits: output オペランド用のMLTensorLimits。

MLOpSupportLimits は instanceNormalization() について次のメンバーを持つ:

instanceNormalization, 型は MLNormalizationSupportLimits: instanceNormalization()演算子のサポート制限。

instanceNormalization(input, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input、options.scale （それが存在する場合）、および options.bias （それが存在する場合）のいずれかを用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
inputのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
inputのrankが、その許可される rankでない場合、TypeErrorを投げる。
options.epsilon を、options.epsilon を inputのdataTypeへキャストする結果に設定する。
options.layout が "nchw" の場合は axis を 1 とし、それ以外の場合は 3 とする。
options.scale が存在する場合:
1. そのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
2. そのshapeが « inputのshape[axis] » に等しくない場合、TypeErrorを投げる。
options.bias が存在する場合:
1. そのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
2. そのshapeが « inputのshape[axis] » に等しくない場合、TypeErrorを投げる。
グラフ接続を作成する:
1. outputを、input が与えられてMLOperand をコピーする結果とする。
2. operatorを、options が与えられた "instanceNormalization" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputを input に設定する。
5. options.scale が存在する場合、それを operatorのinputsに追加する。
6. options.bias が存在する場合、それを operatorのinputsに追加する。
7. operatorのoutputを output に設定する。
outputを返す。

入力テンソルが"nchw" layoutの4-Dである場合のこの演算の振る舞いは、ユーザーエージェントは通常より効率的な実装を持つものの、次のように他の演算の使用から一般的にエミュレートできる。基盤となるプラットフォームが演算を直接サポートしない場合、この分解は実装を導くためのテンプレートとして使用できる。

function instanceNormalization(builder, input, options) {
  // The reduction of the mean and variance values happens over the spatial
  // dimensions of the input e.g. axis 2 and 3 of the input tensor.
  const reduceOptions = {axes: [2, 3], keepDimensions: true};
  const mean = builder.reduceMean(input, reduceOptions);
  const variance = builder.reduceMean(
    builder.pow(builder.sub(input, mean), builder.constant(input.dataType, 2)),
    reduceOptions);

  // The scale and bias values are applied per input feature
  // e.g. axis 1 of the input tensor.
  const shape = [1, input.shape[1], 1, 1];
  return builder.add(
    builder.mul(
      builder.reshape(options.scale, shape),
      builder.div(
        builder.sub(input, mean),
        builder.sqrt(builder.add(variance, options.epsilon)))),
    builder.reshape(options.bias, shape));
}

8.9.30. layerNormalization

[Layer-Normalization]を使用して入力を正規化する。batchNormalization()では、平均値および分散値は、モデルの学習中にbatch次元内のすべてのサンプルにわたって計算され、instanceNormalization()では、平均値および分散値はbatch内の各個別サンプルの各入力特徴量に対してその場で計算されるのに対し、 layer normalizationの平均値および分散値は、batch内の各個別サンプルのすべての入力特徴量にわたってその場で計算される。

dictionary MLLayerNormalizationOptions : MLOperatorOptions {
  MLOperand scale;
  MLOperand bias;
  sequence<[EnforceRange] unsigned long> axes;
  double epsilon = 1e-5;
};

partial interface MLGraphBuilder {
  MLOperand layerNormalization(MLOperand input,
                               optional MLLayerNormalizationOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLNormalizationSupportLimits layerNormalization;
};

MLLayerNormalizationOptions は次のメンバーを持つ:

scale, 型はMLOperand: スケーリング値のN-Dテンソル。そのshapeはaxes メンバーによって決定され、axes 内の各値は、スケーリング値を持つ入力テンソルの次元を示す。たとえば、axes の値が[1,2,3]の場合、このテンソルのshapeは、入力次元1、2および3の対応するサイズのリストである。このメンバーが存在しない場合、スケーリング値は1とみなされる。
bias, 型はMLOperand: バイアス値のN-Dテンソル。そのshapeはaxes メンバーによって決定され、axes 内の各値は、バイアス値を持つ入力テンソルの次元を示す。たとえば、axes の値が[1,2,3]の場合、このテンソルのshapeは、入力次元1、2および3の対応するサイズのリストである。このメンバーが存在しない場合、バイアス値は0とみなされる。
axes, 型はsequence<[EnforceRange] unsigned long>: reduceする入力次元へのインデックス。このメンバーが存在しない場合、最初を除くすべての次元が与えられたものとして扱われる（例: 4-D入力テンソルでは、axes = [1,2,3]）。すなわち、平均値および分散値のreductionは、各独立したbatchについてすべての入力特徴量にわたって計算される。空の場合、どの次元もreduceされない。
epsilon, 型はdouble、デフォルトは1e-5: ゼロ除算による計算エラーを防ぐための小さい値。

引数:

input: MLOperand。入力N-Dテンソル。
options: 任意のMLLayerNormalizationOptions。演算の任意パラメーター。

戻り値: MLOperand。 inputと同じ形状の、layer-normalizedされたN-Dテンソル。

`layerNormalization()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	N	0 から 5
`scale`	同じ `input`	`"float32"`, `"float16"`	N	0 から 5
`bias`	同じ `input`	`"float32"`, `"float16"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`	同じ `input`	0 から 5

MLOpSupportLimits は layerNormalization() について次のメンバーを持つ:

layerNormalization, 型は MLNormalizationSupportLimits: layerNormalization()演算子のサポート制限。

layerNormalization(input, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input、options.scale （それが存在する場合）、および options.bias （それが存在する場合）のいずれかを用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
inputのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
options.axes が存在しない場合、options.axes を新しいリストに設定する。すなわち、inputのrankが 1 より大きい場合は 1 から inputのrankまでの範囲（終端を含まない）、それ以外の場合は空のリストである。
そうでなく、options.axes が重複値を含む場合、またはその項目のいずれかが 0 から inputのrankまでの範囲（終端を含まない）に含まれない場合、TypeErrorを投げる。
options.epsilon を、options.epsilon を inputのdataTypeへキャストする結果に設定する。
options.scale が存在する場合:
1. そのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
2. そのrankが options.axesの sizeに等しくない場合、TypeErrorを投げる。
options.bias が存在する場合:
1. そのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
2. そのrankが options.axesの sizeに等しくない場合、TypeErrorを投げる。
0 から options.axesの size までの範囲（終端を含まない）の各 index について反復する:
1. axisを options.axes[index] とする。
2. axis が inputのrank以上である場合、TypeErrorを投げる。
3. sizeを inputのshape[axis] とする。
4. options.scale が存在する場合:
  1. そのshape[index] が size に等しくない場合、TypeErrorを投げる。
5. options.bias が存在する場合:
  1. そのshape[index] が size に等しくない場合、TypeErrorを投げる。
グラフ接続を作成する:
1. outputを、input が与えられてMLOperand をコピーする結果とする。
2. operatorを、options が与えられた "layerNormalization" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputを input に設定する。
5. options.scale が存在する場合、それを operatorのinputsに追加する。
6. options.bias が存在する場合、それを operatorのinputsに追加する。
7. operatorのoutputを output に設定する。
outputを返す。

axesパラメーターが[1,2,3]に設定されている場合のこの演算の振る舞いは、ユーザーエージェントは通常より効率的な実装を持つものの、次のように他の演算の使用から一般的にエミュレートできる。基盤となるプラットフォームが演算を直接サポートしない場合、この分解は実装を導くためのテンプレートとして使用できる。

function layerNormalization(builder, input, options) {
  // The reduction of the mean and variance values happens over the spatial
  // dimensions across all the input features (i.e. all channels) of the input
  // tensor.
  const reduceOptions = {axes: [1, 2, 3], keepDimensions: true};
  const mean = builder.reduceMean(input, reduceOptions);
  const variance = builder.reduceMean(
    builder.pow(builder.sub(input, mean), builder.constant(input.dataType, 2)),
    reduceOptions);

  // The scale and bias tensors are of the shape of the input
  // specified by the values in the axes parameter (i.e. [1,2,3]).
  return builder.add(
    builder.mul(
      options.scale,
      builder.div(
        builder.sub(input, mean),
        builder.sqrt(builder.add(variance, options.epsilon)))),
    options.bias);
}

8.9.31. leakyRelu

入力テンソルに対して要素ごとに rectified linear functionの leaky版を計算する。計算は式 max(0, x) + alpha * min(0, x)に従う。

dictionary MLLeakyReluOptions : MLOperatorOptions {
  double alpha = 0.01;
};

partial interface MLGraphBuilder {
  MLOperand leakyRelu(MLOperand input, optional MLLeakyReluOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits leakyRelu;
};

MLLeakyReluOptions は次のメンバーを持つ:

alpha, 型はdouble、デフォルトは0.01: スカラー乗数。

引数:

input: MLOperand。入力テンソル。
options: 任意の MLLeakyReluOptions。演算の任意パラメーター。

戻り値:

MLOperand。 inputと同じ形状の出力テンソル。

`leakyRelu()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`	同じ `input`	0 から 5

MLOpSupportLimits は leakyRelu() について次のメンバーを持つ:

leakyRelu, 型は MLSingleInputSupportLimits: leakyRelu()演算子のサポート制限。

leakyRelu(input, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input を用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
inputのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
options.alpha を、options.alpha を inputのdataTypeへキャストする結果に設定する。
グラフ接続を作成する:
1. outputを、input が与えられてMLOperand をコピーする結果とする。
2. operatorを、options が与えられた "leakyRelu" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputを input に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

function leakyRelu(builder, input, options) {
  return builder.add(
    builder.max(builder.constant(input.dataType, 0), input),
    builder.mul(
      builder.constant(input.dataType, options.alpha),
      builder.min(builder.constant(input.dataType, 0), input)));
}

8.9.32. linear

入力テンソルに対して線形関数y = alpha * x + betaを計算する。

dictionary MLLinearOptions : MLOperatorOptions {
  double alpha = 1;
  double beta = 0;
};

partial interface MLGraphBuilder {
  MLOperand linear(MLOperand input, optional MLLinearOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits linear;
};

MLLinearOptions は次のメンバーを持つ:

alpha, 型はdouble、デフォルトは1: スカラー乗数。
beta, 型はdouble、デフォルトは0: スカラー加算値。

引数:

input: MLOperand。入力テンソル。
options: 任意の MLLinearOptions。演算の任意パラメーター。

戻り値:

MLOperand。 inputと同じ形状の出力テンソル。

`linear()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`	同じ `input`	0 から 5

MLOpSupportLimits は linear() について次のメンバーを持つ:

linear, 型は MLSingleInputSupportLimits: linear()演算子のサポート制限。

linear(input, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input を用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
inputのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
options.alpha を、options.alpha を inputのdataTypeへキャストする結果に設定する。
options.beta を、options.beta を inputのdataTypeへキャストする結果に設定する。
グラフ接続を作成する:
1. outputを、input が与えられてMLOperand をコピーする結果とする。
2. operatorを、options が与えられた "linear" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputを input に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

function linear(builder, input, options) {
  return builder.add(
    builder.mul(input, builder.constant(input.dataType, options.alpha)),
    builder.constant(input.dataType, options.beta));
}

8.9.33. lstm

Long Short-Term Memory [LSTM] recurrent networkは、input、output、forget、およびcell gateを使用して、networkの時間的シーケンスにわたって出力へ繰り込まれる出力状態を計算する。

enum MLLstmWeightLayout {
  "iofg", // input-output-forget-cell gate ordering
  "ifgo"  // input-forget-cell-output gate ordering
};

dictionary MLLstmOptions : MLOperatorOptions {
  MLOperand bias;
  MLOperand recurrentBias;
  MLOperand peepholeWeight;
  MLOperand initialHiddenState;
  MLOperand initialCellState;
  boolean returnSequence = false;
  MLRecurrentNetworkDirection direction = "forward";
  MLLstmWeightLayout layout = "iofg";
  sequence<MLRecurrentNetworkActivation> activations;
};

partial interface MLGraphBuilder {
  sequence<MLOperand> lstm(MLOperand input,
                           MLOperand weight,
                           MLOperand recurrentWeight,
                           [EnforceRange] unsigned long steps,
                           [EnforceRange] unsigned long hiddenSize,
                           optional MLLstmOptions options = {});
};

dictionary MLLstmSupportLimits {
  MLTensorLimits input;
  MLTensorLimits weight;
  MLTensorLimits recurrentWeight;
  MLTensorLimits bias;
  MLTensorLimits recurrentBias;
  MLTensorLimits peepholeWeight;
  MLTensorLimits initialHiddenState;
  MLTensorLimits initialCellState;
  MLTensorLimits output0;
  MLTensorLimits output1;
  MLTensorLimits output2;
};

partial dictionary MLOpSupportLimits {
  MLLstmSupportLimits lstm;
};

MLLstmOptions は次のメンバーを持つ:

bias, 型はMLOperand: 形状[numDirections, 4 * hiddenSize]の2-D入力バイアステンソル。テンソル形状の第2次元におけるバイアスベクトルの順序は、layoutに従って指定される。
recurrentBias, 型はMLOperand: 形状[numDirections, 4 * hiddenSize]の2-D再帰バイアステンソル。テンソル形状の第1次元におけるバイアスベクトルの順序は、layoutに従って指定される。
peepholeWeight, 型は MLOperand: 形状[numDirections, 3 * hiddenSize]のpeephole用2-D weightテンソル。weightベクトルのパック順序は、それぞれinput (i)、output (o)、および forget (f) gate用である。
initialHiddenState, 型はMLOperand: 形状[numDirections, batchSize, hiddenSize]の3-D初期隠れ状態テンソル。指定されない場合、実装はゼロで満たされたテンソルを使用しなければならない。
initialCellState, 型は MLOperand: 形状[numDirections, batchSize, hiddenSize]の3-D初期隠れ状態テンソル。指定されない場合、実装はゼロで満たされたテンソルを使用しなければならない。
returnSequence, 型は boolean、デフォルトはfalse: 最後のtime stepの出力に加えて、各time stepからのすべての出力を含むシーケンス全体も返すかを示す。
direction, 型はMLRecurrentNetworkDirection、デフォルトは "forward": 入力シーケンスの処理方向。"both"に設定される場合、 weightおよびbiasテンソル形状の第1次元のサイズは2でなければならず、入力は両方向に処理される。
layout, 型はMLLstmWeightLayout、デフォルトは "iofg": LSTMの内部gate、具体的にはinput (i)、output (o)、forget (f)、およびcell (g) gateに対するweightおよびbiasベクトルの順序。weightおよびbiasテンソル形状の第1次元で示される。
activations, 型は sequence<MLRecurrentNetworkActivation>: 3つの活性化関数のリスト。1つ目はinput (i)、 forget (f)、およびoutput (o) gateに使用され、2つ目は cell (g) gateに使用され、最後は出力cell stateを出力gateの結果と結合して出力隠れ状態を形成する前にフィルタリングするために使用される。指定されない場合、それぞれ"sigmoid"、 "tanh"、および"tanh" 関数のシーケンスがデフォルトとなる。

引数:

input: MLOperand。形状[steps, batchSize, inputSize]の入力3-Dテンソル。
weight: MLOperand。形状[numDirections, 4 * hiddenSize, inputSize]の3-D入力weightテンソル。テンソル形状の第2次元におけるweightベクトルの順序は、layoutに従って指定される。
recurrentWeight: MLOperand。形状[numDirections, 4 * hiddenSize, hiddenSize]の3-D再帰weightテンソル。テンソル形状の第2次元におけるweightベクトルの順序は、layoutに従って指定される。
steps: unsigned long スカラー。recurrent network内のtime step数。この値は0より大きくなければならない。
hiddenSize: unsigned long スカラー。cell出力テンソル形状の第3次元の値。隠れ状態における特徴量の数を示す。
options: 任意のMLLstmOptions。演算の任意パラメーター。

戻り値: sequence<MLOperand>。第1要素は形状[numDirections, batchSize, hiddenSize]の3-Dテンソルであり、networkの最後のtime stepからの出力隠れ状態である。第2要素は形状[numDirections, batchSize, hiddenSize]の3-Dテンソルであり、 networkの最後のtime stepからの出力cell stateである。さらに、returnSequence がtrueに設定されている場合、第3要素は形状[steps, numDirections, batchSize, hiddenSize]の4-D出力テンソルであり、時間的シーケンス内の各time stepからのすべての出力を含む。

`lstm()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	3	3
`weight`	同じ `input`	`"float32"`, `"float16"`	3	3
`recurrentWeight`	同じ `input`	`"float32"`, `"float16"`	3	3
`bias`	同じ `input`	`"float32"`, `"float16"`	2	2
`recurrentBias`	同じ `input`	`"float32"`, `"float16"`	2	2
`peepholeWeight`	同じ `input`	`"float32"`, `"float16"`	2	2
`initialHiddenState`	同じ `input`	`"float32"`, `"float16"`	3	3
`initialCellState`	同じ `input`	`"float32"`, `"float16"`	3	3
outputs[0]	同じ `input`	`"float32"`, `"float16"`	3	3
outputs[1]	同じ `input`	`"float32"`, `"float16"`	3	3
outputs[2]（`returnSequence` が true の場合）	同じ `input`	`"float32"`, `"float16"`	4	4

MLLstmSupportLimits は次のメンバーを持つ:

input, 型は MLTensorLimits: input オペランド用のMLTensorLimits。
weight, 型は MLTensorLimits: weight オペランド用のMLTensorLimits。
recurrentWeight, 型は MLTensorLimits: recurrentWeight オペランド用のMLTensorLimits。
bias, 型は MLTensorLimits: bias オペランド用のMLTensorLimits。
recurrentBias, 型は MLTensorLimits: recurrentBias オペランド用のMLTensorLimits。
peepholeWeight, 型は MLTensorLimits: peepholeWeight オペランド用のMLTensorLimits。
initialHiddenState, 型は MLTensorLimits: initialHiddenState オペランド用のMLTensorLimits。
initialCellState, 型は MLTensorLimits: initialCellState オペランド用のMLTensorLimits。
output0, 型は MLTensorLimits: すべての output オペランド[0]用のMLTensorLimits。
output1, 型は MLTensorLimits: すべての output オペランド[1]用のMLTensorLimits。
output2, 型は MLTensorLimits: すべての output オペランド[2]用のMLTensorLimits。

MLOpSupportLimits は lstm() について次のメンバーを持つ:

lstm, 型は MLLstmSupportLimits: lstm()演算子のサポート制限。

lstm(input, weight, recurrentWeight, steps, hiddenSize, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input、weight、 recurrentWeight、options.bias （それが存在する場合）、options.recurrentBias （それが存在する場合）、options.peepholeWeight （それが存在する場合）、options.initialHiddenState （それが存在する場合）、および options.initialCellState （それが存在する場合）のいずれかを用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
options.direction が "both" の場合は numDirections を 2 とし、それ以外の場合は 1 とする。
input、weight または recurrentWeight のいずれかのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
input、weight または recurrentWeight のいずれかのrankが、その許可される rankでない場合、 TypeErrorを投げる。
steps が 0 の場合、TypeErrorを投げる。
inputのshape[0] が steps に等しくない場合、TypeErrorを投げる。
batchSizeを inputのshape[1] とする。
inputSizeを inputのshape[2] とする。
weightのshapeが « numDirections, 4 * hiddenSize, inputSize » に等しくない場合、TypeErrorを投げる。
recurrentWeightのshapeが « numDirections, 4 * hiddenSize, hiddenSize » に等しくない場合、TypeErrorを投げる。
hiddenSize * 8 が有効な次元でない場合、TypeErrorを投げる。

なぜ hiddenSize * 8 なのか?
一部の基盤プラットフォームは、bias と recurrentBias を連結した単一の bias テンソルを扱う。したがって、4 * hiddenSize + 4 * hiddenSize も有効な次元である必要がある。
options.bias が存在する場合:
1. そのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
2. そのshapeが « numDirections, 4 * hiddenSize » に等しくない場合、TypeErrorを投げる。
options.recurrentBias が存在する場合:
1. そのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
2. そのshapeが « numDirections, 4 * hiddenSize » に等しくない場合、TypeErrorを投げる。
options.peepholeWeight が存在する場合:
1. そのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
2. そのshapeが « numDirections, 3 * hiddenSize » に等しくない場合、TypeErrorを投げる。
options.initialHiddenState が存在する場合:
1. そのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
2. そのshapeが « numDirections, batchSize, hiddenSize » に等しくない場合、TypeErrorを投げる。
options.initialCellState が存在する場合:
1. そのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
2. そのshapeが « numDirections, batchSize, hiddenSize » に等しくない場合、TypeErrorを投げる。
options.activations が存在する場合:
1. そのsizeが 3 でない場合、TypeErrorを投げる。
2. activationsを options.activationsの複製とする。
そうでない場合:
1. activationsを « "sigmoid", "tanh", "tanh" » とする。
出力 shape を計算する:
1. descを、inputのdataTypeと « numDirections, batchSize, hiddenSize » が与えられてMLOperandDescriptor を作成する結果とする。
2. options.returnSequence が true の場合:
  1. desc2を、inputのdataType と « steps, numDirections, batchSize, hiddenSize » が与えられてMLOperandDescriptor を作成する結果とする。
グラフ接続を作成する:
1. operatorを、weight、recurrentWeight、steps、hiddenSize および options が与えられた "lstm" 操作のoperatorとする。
2. output0を、this と desc が与えられてMLOperand を作成する結果とする。
3. output1を、this と desc が与えられてMLOperand を作成する結果とする。
4. options.returnSequence が true の場合:
  1. output2を、this と desc2 が与えられてMLOperand を作成する結果とする。
  2. outputをリスト « output0, output1, output2 » とする。
  3. output0.[[operator]]、 output1.[[operator]] および output2.[[operator]] を operator に設定する。
5. そうでない場合:
  1. outputをリスト « output0, output1 » とする。
  2. output0.[[operator]] および output1.[[operator]] を operator に設定する。
6. operatorのinputsを input、weight、および recurrentWeight に設定する。
7. options.bias が存在する場合、それを operatorのinputsに追加する。
8. options.recurrentBias が存在する場合、それを operatorのinputsに追加する。
9. options.peepholeWeight が存在する場合、それを operatorのinputsに追加する。
10. options.initialHiddenState が存在する場合、それを operatorのinputsに追加する。
11. options.initialCellState が存在する場合、それを operatorのinputsに追加する。
12. operatorのactivation functionsを activationsの複製に設定する。
13. operatorのoutputを output に設定する。
outputを返す。

squeeze()ヘルパーを使用すると、この演算の振る舞いは、ユーザーエージェントは通常より効率的な実装を持つものの、次のように他の演算の使用から一般的にエミュレートできる。基盤となるプラットフォームが演算を直接サポートしない場合、この分解は実装を導くためのテンプレートとして使用できる。

function lstm(
  builder, input, weight, recurrentWeight, steps, hiddenSize, options) {
  const batchSize = input.shape[1];
  const inputSize = input.shape[2];
  const direction = options.direction || 'forward';
  const numDirections = (direction == 'both' ? 2 : 1);
  let hiddenState = options.initialHiddenState;
  let cellState = options.initialCellState;

  if (!hiddenState) {
    const desc = {
      dataType: 'float32',
      shape: [numDirections, batchSize, hiddenSize]
    };
    const totalSize = numDirections * batchSize * hiddenSize;
    hiddenState = builder.constant(desc, new Float32Array(totalSize).fill(0));
  }

  if (!cellState) {
    const desc = {
      dataType: 'float32',
      shape: [numDirections, batchSize, hiddenSize]
    };
    const totalSize = numDirections * batchSize * hiddenSize;
    cellState = builder.constant(desc, new Float32Array(totalSize).fill(0));
  }

  let currentWeight = [];
  let currentRecurrentWeight = [];
  let currentBias = [];
  let currentRecurrentBias = [];
  let currentPeepholeWeight = [];
  let forwardSequence = null;
  let backwardSequence = null;
  let outputHidden = null;
  let outputCell = null;

  for (let dir = 0; dir < numDirections; ++dir) {
    currentWeight.push(squeeze(
      builder,
      builder.slice(weight, [dir, 0, 0], [1, 4 * hiddenSize, inputSize])));
    currentRecurrentWeight.push(squeeze(
      builder,
      builder.slice(
        recurrentWeight, [dir, 0, 0], [1, 4 * hiddenSize, hiddenSize])));
    currentBias.push(
      options.bias ?
        (squeeze(
          builder,
          builder.slice(options.bias, [dir, 0], [1, 4 * hiddenSize]))) :
        null);
    currentRecurrentBias.push(
      options.recurrentBias ?
        (squeeze(
          builder,
          builder.slice(
            options.recurrentBias, [dir, 0], [1, 4 * hiddenSize]))) :
        null);
    currentPeepholeWeight.push(
      options.peepholeWeight ?
        (squeeze(
          builder,
          builder.slice(
            options.peepholeWeight, [dir, 0], [1, 3 * hiddenSize]))) :
        null);

    let currentHidden = squeeze(
      builder,
      builder.slice(hiddenState, [dir, 0, 0], [1, batchSize, hiddenSize]), [0]);
    let currentCell = squeeze(
      builder,
      builder.slice(cellState, [dir, 0, 0], [1, batchSize, hiddenSize]), [0]);

    for (let step = 0; step < steps; ++step) {
      const slice =
        (dir == 1 || direction == 'backward' ? steps - step - 1 : step);
      const currentInput = squeeze(
        builder,
        builder.slice(input, [slice, 0, 0], [1, batchSize, inputSize]), [0]);

      [currentHidden, currentCell] = builder.lstmCell(
        currentInput,
        currentWeight[dir],
        currentRecurrentWeight[dir],
        currentHidden,
        currentCell,
        hiddenSize,
        {
          bias: currentBias[dir],
          recurrentBias: currentRecurrentBias[dir],
          peepholeWeight: currentPeepholeWeight[dir],
          layout: options.layout,
          activations: options.activations
        });

      if (options.returnSequence) {
        // Expand currentHidden of 2D([batchSize, hiddenSize])
        // to 4D([steps, numDirections, batchSize, hiddenSize])
        const expandedHiddenAs4D =
          builder.reshape(currentHidden, [1, 1, batchSize, hiddenSize]);

        if (direction == 'forward' || (dir == 0 && direction == 'both')) {
          forwardSequence = forwardSequence ?
            builder.concat([forwardSequence, expandedHiddenAs4D], 0) :
            expandedHiddenAs4D;
        } else if (
          direction == 'backward' || (dir == 1 && direction == 'both')) {
          backwardSequence = backwardSequence ?
            builder.concat([expandedHiddenAs4D, backwardSequence], 0) :
            expandedHiddenAs4D;
        }
      }
    }

    // Expand currentHidden of 2D([batchSize, hiddenSize])
    // to 3D([numDirections, batchSize, hiddenSize])
    const expandedHiddenAs3D =
      builder.reshape(currentHidden, [1, batchSize, hiddenSize]);
    outputHidden = outputHidden ?
      builder.concat([outputHidden, expandedHiddenAs3D], 0) :
      expandedHiddenAs3D;

    // Expand currentCell of 2D([batchSize, hiddenSize])
    // to 3D([numDirections, batchSize, hiddenSize])
    const expandedCellAs3D =
      builder.reshape(currentCell, [1, batchSize, hiddenSize]);
    outputCell = outputCell ?
      builder.concat([outputCell, expandedCellAs3D], 0) :
      expandedCellAs3D;
  }

  if (options.returnSequence) {
    let outputSequence = null;

    if (direction == 'forward') {
      outputSequence = forwardSequence;
    } else if (direction == 'backward') {
      outputSequence = backwardSequence;
    } else if (direction == 'both') {
      // Concat along axis 1 (numDirections dimension)
      outputSequence = builder.concat([forwardSequence, backwardSequence], 1);
    }

    return [outputHidden, outputCell, outputSequence];
  } else {
    return [outputHidden, outputCell];
  }
}

8.9.34. lstmCell

Long Short-Term Memory [LSTM] recurrent networkの単一time step。cell state、input、output、およびforget gateを使用して、networkの時間的シーケンスにわたって出力へ繰り込まれる次のtime stepのcell stateとhidden stateを計算する。

dictionary MLLstmCellOptions : MLOperatorOptions {
  MLOperand bias;
  MLOperand recurrentBias;
  MLOperand peepholeWeight;
  MLLstmWeightLayout layout = "iofg";
  sequence<MLRecurrentNetworkActivation> activations;
};

partial interface MLGraphBuilder {
  sequence<MLOperand> lstmCell(MLOperand input,
                               MLOperand weight,
                               MLOperand recurrentWeight,
                               MLOperand hiddenState,
                               MLOperand cellState,
                               [EnforceRange] unsigned long hiddenSize,
                               optional MLLstmCellOptions options = {});
};

dictionary MLLstmCellSupportLimits {
  MLTensorLimits input;
  MLTensorLimits weight;
  MLTensorLimits recurrentWeight;
  MLTensorLimits hiddenState;
  MLTensorLimits cellState;
  MLTensorLimits bias;
  MLTensorLimits recurrentBias;
  MLTensorLimits peepholeWeight;
  MLTensorLimits output0;
  MLTensorLimits output1;
};

partial dictionary MLOpSupportLimits {
  MLLstmCellSupportLimits lstmCell;
};

MLLstmCellOptions は次のメンバーを持つ:

bias, 型はMLOperand: 形状[4 * hiddenSize]の1-D入力バイアステンソル。テンソル形状の第1次元におけるバイアスベクトルの順序は、 layoutに従って指定される。
recurrentBias, 型は MLOperand: 形状[4 * hiddenSize]の1-D再帰バイアステンソル。テンソル形状の第1次元におけるバイアスベクトルの順序は、 layoutに従って指定される。
peepholeWeight, 型は MLOperand: 形状[3 * hiddenSize]のpeephole用1-D weightテンソル。weightベクトルのパック順序は、それぞれinput (i)、output (o)、および forget (f) gate用である。
layout, 型はMLLstmWeightLayout、デフォルトは "iofg": LSTMの内部gate、具体的にはinput (i)、output (o)、forget (f)、およびcell (g) gateに対するweightおよびbiasベクトルの順序。weightおよびbiasテンソル形状の第1次元で示される。
activations, 型は sequence<MLRecurrentNetworkActivation>: 3つの活性化関数のリスト。1つ目はinput (i)、 forget (f)、およびoutput (o) gateに使用され、2つ目は cell (g) gateに使用され、最後は出力cell stateを出力gateの結果と結合して出力hidden stateを形成する前にフィルタリングするために使用される。指定されない場合、それぞれ"sigmoid"、 "tanh"、および"tanh" 関数のシーケンスがデフォルトとなる。

引数:

input: MLOperand。形状[batchSize, inputSize]の入力2-Dテンソル。
weight: MLOperand。形状[4 * hiddenSize, inputSize]の2-D入力weightテンソル。テンソル形状の第1次元におけるweightベクトルの順序は、layoutに従って指定される。
recurrentWeight: MLOperand。形状[4 * hiddenSize, hiddenSize]の2-D再帰weightテンソル。テンソル形状の第1次元における weightベクトルの順序は、layoutに従って指定される。
hiddenState: MLOperand。形状[batchSize, hiddenSize]の入力hidden state 2-Dテンソル。
cellState: MLOperand。形状[batchSize, hiddenSize]の入力cell state 2-Dテンソル。
hiddenSize: unsigned long スカラー。出力テンソル形状の第2次元の値。hidden stateにおける特徴量の数を示す。
options: 任意のMLLstmCellOptions。演算の任意パラメーター。

戻り値: sequence<MLOperand>。第1要素はrecurrent networkの現在のtime stepの出力hidden stateである。次の要素は出力cell stateである。両方の要素は形状[batchSize, hiddenSize]の2-Dテンソルである。

`lstmCell()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	2	2
`weight`	同じ `input`	`"float32"`, `"float16"`	2	2
`recurrentWeight`	同じ `input`	`"float32"`, `"float16"`	2	2
`hiddenState`	同じ `input`	`"float32"`, `"float16"`	2	2
`cellState`	同じ `input`	`"float32"`, `"float16"`	2	2
`bias`	同じ `input`	`"float32"`, `"float16"`	1	1
`recurrentBias`	同じ `input`	`"float32"`, `"float16"`	1	1
`peepholeWeight`	同じ `input`	`"float32"`, `"float16"`	1	1
outputs[0]	同じ `input`	`"float32"`, `"float16"`	2	2
outputs[1]	同じ `input`	`"float32"`, `"float16"`	2	2

MLLstmCellSupportLimits は次のメンバーを持つ:

input, 型はMLTensorLimits: inputオペランド用のMLTensorLimits。
weight, 型はMLTensorLimits: weightオペランド用のMLTensorLimits。
recurrentWeight, 型は MLTensorLimits: recurrentWeightオペランド用のMLTensorLimits。
hiddenState, 型はMLTensorLimits: hiddenStateオペランド用のMLTensorLimits。
cellState, 型はMLTensorLimits: cellStateオペランド用のMLTensorLimits。
bias, 型はMLTensorLimits: biasオペランド用のMLTensorLimits。
recurrentBias, 型はMLTensorLimits: recurrentBiasオペランド用のMLTensorLimits。
peepholeWeight, 型はMLTensorLimits: peepholeWeightオペランド用のMLTensorLimits。
output0, 型はMLTensorLimits: すべての出力オペランド[0]用のMLTensorLimits。
output1, 型はMLTensorLimits: すべての出力オペランド[1]用のMLTensorLimits。

MLOpSupportLimits はlstmCell()について次のメンバーを持つ:

lstmCell, 型はMLLstmCellSupportLimits: lstmCell()演算子のサポート制限。

lstmCell(input, weight, recurrentWeight, hiddenState, cellState, hiddenSize, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input、weight、 recurrentWeight、hiddenState、cellState、 options.bias （それが存在する場合）、options.recurrentBias （それが存在する場合）、および options.peepholeWeight （それが存在する場合）のいずれかを用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
input、 weight、recurrentWeight、hiddenState または cellState のいずれかのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
input、weight、recurrentWeight、hiddenState または cellState のいずれかのrankが、その許可される rankでない場合、TypeErrorを投げる。
batchSizeを inputのshape[0] とする。
inputSizeを inputのshape[1] とする。
weightのshapeが « 4 * hiddenSize, inputSize » に等しくない場合、TypeErrorを投げる。
recurrentWeightのshapeが « 4 * hiddenSize, hiddenSize » に等しくない場合、TypeErrorを投げる。
hiddenStateのshapeが « batchSize, hiddenSize » に等しくない場合、TypeErrorを投げる。
cellStateのshapeが « batchSize, hiddenSize » に等しくない場合、TypeErrorを投げる。
hiddenSize * 8 が有効な次元でない場合、TypeErrorを投げる。

なぜ hiddenSize * 8 なのか?
一部の基盤プラットフォームは、bias と recurrentBias を連結した単一の bias テンソルを扱う。したがって、4 * hiddenSize + 4 * hiddenSize も有効な次元である必要がある。
options.bias が存在する場合:
1. そのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
2. そのshapeが « 4 * hiddenSize » に等しくない場合、TypeErrorを投げる。
options.recurrentBias が存在する場合:
1. そのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
2. そのshapeが « 4 * hiddenSize » に等しくない場合、TypeErrorを投げる。
options.peepholeWeight が存在する場合:
1. そのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
2. そのshapeが « 3 * hiddenSize » に等しくない場合、TypeErrorを投げる。
options.activations が存在する場合:
1. そのsizeが 3 でない場合、TypeErrorを投げる。
2. activationsを options.activationsの複製とする。
そうでない場合:
1. activationsを « "sigmoid", "tanh", "tanh" » とする。
descを新しい MLOperandDescriptor とする。
desc.shape をリスト « batchSize, hiddenSize » に設定する。
desc.dataType を inputのdataTypeに設定する。
グラフ接続を作成する:
1. output0を、this と desc が与えられてMLOperand を作成する結果とする。
2. output1を、this と desc が与えられてMLOperand を作成する結果とする。
3. outputをリスト « output0, output1 » とする。
4. operatorを、weight、recurrentWeight、hiddenState、 cellState、hiddenSize および options が与えられた "lstmCell" 操作のoperatorとする。
5. output0.[[operator]] および output1.[[operator]] を operator に設定する。
6. operatorのinputsを input、weight、 recurrentWeight、hiddenState、および cellState に設定する。
7. options.bias が存在する場合、それを operatorのinputsに追加する。
8. options.recurrentBias が存在する場合、それを operatorのinputsに追加する。
9. options.peepholeWeight が存在する場合、それを operatorのinputsに追加する。
10. operatorのactivation functionsを activationsの複製に設定する。
11. operatorのoutputを output に設定する。
outputを返す。

weight layoutがデフォルトの"iofg" layoutであり、input/forget/output gateの活性化関数および cell gate/output hidden state用のcell stateフィルターがそれぞれsigmoid() およびtanh() である場合のこの演算の振る舞いは、ユーザーエージェントは通常より効率的な実装を持つものの、次のように他の演算の使用から一般的にエミュレートできる。基盤となるプラットフォームが演算を直接サポートしない場合、この分解は実装を導くためのテンプレートとして使用できる。

function lstmCell(
  builder,
  input,
  weight,
  recurrentWeight,
  hiddenState,
  cellState,
  hiddenSize,
  options) {
  const zero = builder.constant(input.dataType, 0);

  const inputSize = input.shape[1];

  // input gate (i)
  let i = builder.sigmoid(builder.add(
    builder.mul(
      cellState,
      (options.peepholeWeight ?
         builder.slice(options.peepholeWeight, [0], [hiddenSize]) :
         zero)),
    builder.add(
      builder.add(
        (options.bias ? builder.slice(options.bias, [0], [hiddenSize]) : zero),
        (options.recurrentBias ?
           builder.slice(options.recurrentBias, [0], [hiddenSize]) :
           zero)),
      builder.add(
        builder.matmul(
          input,
          builder.transpose(
            builder.slice(weight, [0, 0], [hiddenSize, inputSize]))),
        builder.matmul(
          hiddenState,
          builder.transpose(builder.slice(
            recurrentWeight, [0, 0], [hiddenSize, hiddenSize])))))));

  // forget gate (f)
  let f = builder.sigmoid(builder.add(
    builder.mul(
      cellState,
      (options.peepholeWeight ?
         builder.slice(options.peepholeWeight, [2 * hiddenSize], [hiddenSize]) :
         zero)),
    builder.add(
      builder.add(
        (options.bias ?
           builder.slice(options.bias, [2 * hiddenSize], [hiddenSize]) :
           zero),
        (options.recurrentBias ?
           builder.slice(
             options.recurrentBias, [2 * hiddenSize], [hiddenSize]) :
           zero)),
      builder.add(
        builder.matmul(
          input,
          builder.transpose(builder.slice(
            weight, [2 * hiddenSize, 0], [hiddenSize, inputSize]))),
        builder.matmul(
          hiddenState,
          builder.transpose(builder.slice(
            recurrentWeight,
            [2 * hiddenSize, 0],
            [hiddenSize, hiddenSize])))))));

  // cell gate (g)
  let g = builder.tanh(builder.add(
    builder.add(
      (options.bias ?
         builder.slice(options.bias, [3 * hiddenSize], [hiddenSize]) :
         zero),
      (options.recurrentBias ?
         builder.slice(options.recurrentBias, [3 * hiddenSize], [hiddenSize]) :
         zero)),
    builder.add(
      builder.matmul(
        input,
        builder.transpose(
          builder.slice(weight, [3 * hiddenSize, 0], [hiddenSize, inputSize]))),
      builder.matmul(
        hiddenState,
        builder.transpose(builder.slice(
          recurrentWeight, [3 * hiddenSize, 0], [hiddenSize, hiddenSize]))))));

  // output gate (o)
  let o = builder.sigmoid(builder.add(
    builder.mul(
      cellState,
      (options.peepholeWeight ?
         builder.slice(options.peepholeWeight, [hiddenSize], [hiddenSize]) :
         zero)),
    builder.add(
      builder.add(
        (options.bias ?
           builder.slice(options.bias, [hiddenSize], [hiddenSize]) :
           zero),
        (options.recurrentBias ?
           builder.slice(options.recurrentBias, [hiddenSize], [hiddenSize]) :
           zero)),
      builder.add(
        builder.matmul(
          input,
          builder.transpose(
            builder.slice(weight, [hiddenSize, 0], [hiddenSize, inputSize]))),
        builder.matmul(
          hiddenState,
          builder.transpose(builder.slice(
            recurrentWeight, [hiddenSize, 0], [hiddenSize, hiddenSize])))))));

  // output cell state (ct)
  let ct = builder.add(builder.mul(f, cellState), builder.mul(i, g));

  // output hidden state (ht)
  let ht = builder.mul(o, builder.tanh(ct));

  return [ht, ct];
}

8.9.35. matmul

2つの入力テンソルの行列積を計算する。

partial interface MLGraphBuilder {
  MLOperand matmul(MLOperand a, MLOperand b, optional MLOperatorOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLBinarySupportLimits matmul;
};

引数:

a: MLOperand。少なくとも2-Dである第1入力テンソル。
b: MLOperand。少なくとも2-Dである第2入力テンソル。
options: MLOperatorOptions。演算の任意パラメーターを指定する。

戻り値: MLOperand。 2つの入力テンソルの行列積を含む出力テンソル。

2つの入力テンソルの行列積を次のように計算する:

a とb の両方が2次元である場合、それらは従来の行列と同様に乗算され、出力として2次元テンソルを生成する。
a またはb のいずれかがN次元（N > 2）である場合、最後の2つのindexに対応する次元を持つ行列のstackとして扱われる。行列乗算はbroadcastされる。これは[numpy-broadcasting-rule]に従う。最後の2次元を除く a およびbの形状は、双方向ブロードキャスト可能でなければならない。出力は入力テンソルの最大rankをrankとするN次元テンソルである。出力テンソルの最後の2つを除く各次元について、そのサイズは入力テンソルのその次元に沿った最大サイズである。

`matmul()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`a`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	2 から N	2 から 5
`b`	同じ `a`	`"float32"`, `"float16"`	2 から N	2 から 5
output	同じ `a`	`"float32"`, `"float16"`	2 から N	2 から 5

MLOpSupportLimits は matmul() について次のメンバーを持つ:

matmul, 型は MLBinarySupportLimits: matmul()演算子のサポート制限。

matmul(a, b, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と a および b のいずれかを用いたオペランドの検証が false を返す場合、 TypeErrorを投げる。
a または b のいずれかのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
出力 shape を計算する:
1. shapeAを aのshapeの複製とする。
2. rankAを aのrankとする。
3. shapeBを bのshapeの複製とする。
4. rankBを bのrankとする。
5. rankA または rankB のいずれかが 2 未満の場合、TypeErrorを投げる。
6. colsAを shapeA[rankA - 1] とする。
7. rowsAを shapeA[rankA - 2] とする。
8. colsBを shapeB[rankB - 1] とする。
9. rowsBを shapeB[rankB - 2] とする。
10. colsA が rowsB に等しくない場合、TypeErrorを投げる。
11. batchShapeAを、空間次元（最後の 2 項目）が削除された shapeA の複製とする。
12. batchShapeBを、空間次元（最後の 2 項目）が削除された shapeB の複製とする。
13. outputShapeを batchShapeA と batchShapeB を双方向にブロードキャストする結果とする。それが failure を返す場合、TypeErrorを投げる。
14. « rowsA, colsB » を outputShape に追加する。
15. descを、aのdataTypeと outputShape が与えられてMLOperandDescriptor を作成する結果とする。
グラフ接続を作成する:
1. outputを、this と desc が与えられてMLOperand を作成する結果とする。
2. operatorを、options が与えられた "matmul" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputsを a および b に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

8.9.36. pad

テンソルの端に定数値またはmirror値を用いてテンソルを拡張する。

enum MLPaddingMode {
  "constant",
  "edge",
  "reflection"
};

dictionary MLPadOptions : MLOperatorOptions {
  MLPaddingMode mode = "constant";
  MLNumber value = 0;
};

partial interface MLGraphBuilder {
  MLOperand pad(MLOperand input,
                sequence<[EnforceRange] unsigned long> beginningPadding,
                sequence<[EnforceRange] unsigned long> endingPadding,
                optional MLPadOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits pad;
};

MLPadOptions は次のメンバーを持つ:

mode, 型はMLPaddingMode、デフォルトは "constant": テンソルをpadするための異なる方法。
value, 型はMLNumber、デフォルトは0: mode が"constant"に設定されている場合のpadding値。

引数:

input: MLOperand。入力テンソル。
beginningPadding: sequence<unsigned long>。各入力次元の先頭に追加するpadding値の数。長さはNであり、ここでNは入力テンソルのrankである。inputの各次元dについて、beginningPadding[d] は、その次元の内容の前に追加する値の数を示す。
endingPadding: sequence<unsigned long>。各入力次元の末尾に追加するpadding値の数。長さはNであり、ここでNは入力テンソルのrankである。inputの各次元dについて、endingPadding[d] は、その次元の内容の後に追加する値の数を示す。
options: 任意のMLPadOptions。演算の任意パラメーター。

戻り値: MLOperand。 padされた出力テンソル。出力テンソルの各次元は次のように計算できる:

output size = beginning padding + input size + ending padding

`pad()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	任意	`"float32"`, `"float16"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`	同じ `input`	0 から 5

MLOpSupportLimits はpad()について次のメンバーを持つ:

pad, 型はMLSingleInputSupportLimits: pad()演算子のサポート制限。

pad(input, beginningPadding, endingPadding, options) メソッドの手順は次のとおりである:

thisがbuildできない場合、"InvalidStateError" DOMExceptionをthrowする。
オペランドを検証することをthisおよびinputとともに行った結果がfalseを返す場合、TypeErrorをthrowする。
beginningPaddingのsizeと endingPaddingのsizeの両方が inputのrankと等しくない場合、TypeErrorをthrowする。
descをinput.[[descriptor]]のcopyとする。
outputShapeをinputのshapeのcopyとする。
0からoutputShapeのrankまで（含まない）の範囲内の各indexについて実行する:
1. options.modeで分岐する:
  "constant"
  
  何もしない。
  
  "edge"
  
  何もしない。
  
  "reflection"
  1. beginningPadding[index]が outputShape[index]以上である場合、TypeErrorをthrowする。
  2. endingPadding[index]が outputShape[index]以上である場合、TypeErrorをthrowする。
2. beginningPadding[index]の値を outputShape[index]に加算する。
3. endingPadding[index]の値を outputShape[index]に加算する。
outputShape内のいずれかのitemが妥当な次元でない場合、TypeErrorをthrowする。
options.value を、options.value をinputのdataTypeにcastした結果に設定する。
desc.shape をoutputShapeに設定する。
グラフ接続を作成する:
1. outputを、thisおよび descが与えられてMLOperandを作成する結果とする。
2. operatorを、beginningPadding、endingPaddingおよび optionsが与えられた"padding"演算用の演算子とする。
3. output.[[operator]] をoperatorに設定する。
4. operatorのinputをinputに設定する。
5. operatorのoutputをoutputに設定する。
outputを返す。

constant、edge、およびreflection paddingの例:

// input: [[1,2,3], [4,5,6]]
const input = builder.constant(
  {dataType: 'float32', shape: [2, 3]}, new Float32Array([1, 2, 3, 4, 5, 6]));

const beginningPadding = [1, 2];
const endingPadding = [1, 2];

// "constant" padded:
//    [[0,0,0,0,0,0,0],
//     [0,0,1,2,3,0,0],
//     [0,0,4,5,6,0,0],
//     [0,0,0,0,0,0,0]]
builder.pad(input, beginningPadding, endingPadding);

// "edge" padded:
//    [[1,1,1,2,3,3,3],
//     [1,1,1,2,3,3,3],
//     [4,4,4,5,6,6,6],
//     [4,4,4,5,6,6,6]]
builder.pad(input, beginningPadding, endingPadding, {mode: 'edge'});

// "reflection" padded:
//    [[6,5,4,5,6,5,4],
//     [3,2,1,2,3,2,1],
//     [6,5,4,5,6,5,4],
//     [3,2,1,2,3,2,1]]
builder.pad(input, beginningPadding, endingPadding, {mode: 'reflection'});

8.9.37. Pooling operations

入力テンソル上を移動するwindow内のすべての要素にわたってpooling演算を計算する。

enum MLRoundingType {
  "floor",
  "ceil"
};

dictionary MLPool2dOptions : MLOperatorOptions {
  sequence<[EnforceRange] unsigned long> windowDimensions;
  sequence<[EnforceRange] unsigned long> padding;
  sequence<[EnforceRange] unsigned long> strides;
  sequence<[EnforceRange] unsigned long> dilations;
  MLInputOperandLayout layout = "nchw";
  MLRoundingType outputShapeRounding = "floor";
  sequence<[EnforceRange] unsigned long> outputSizes;
};

partial interface MLGraphBuilder {
  MLOperand averagePool2d(MLOperand input, optional MLPool2dOptions options = {});
  MLOperand l2Pool2d(MLOperand input, optional MLPool2dOptions options = {});
  MLOperand maxPool2d(MLOperand input, optional MLPool2dOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits averagePool2d;
  MLSingleInputSupportLimits l2Pool2d;
  MLSingleInputSupportLimits maxPool2d;
};

MLPool2dOptions は次のメンバーを持つ:

windowDimensions, 型は sequence<[EnforceRange] unsigned long>

長さ2のリスト: [windowHeight, windowWidth]。 sliding windowの寸法を指定する。 window寸法のデフォルト値は、入力形状のheightおよびwidth寸法である。

padding, 型はsequence<[EnforceRange] unsigned long>

長さ4のリスト: [beginningHeight, endingHeight, beginningWidth, endingWidth]。 convolution入力の各spatial dimensionの先頭および末尾に追加される行および列を指定する。デフォルト値は[0,0,0,0]である。

strides, 型はsequence<[EnforceRange] unsigned long>

長さ2のリスト: [strideHeight, strideWidth]。 convolution入力の各spatial dimensionについてsliding windowのstrideを指定する。デフォルト値は[1,1]である。

dilations, 型はsequence<[EnforceRange] unsigned long>

長さ2のリスト: [dilationHeight, dilationWidth]。convolution filter（kernel）に適用される各 spatial dimensionのdilation係数を指定する。デフォルト値は[1,1]である。

layout, 型はMLInputOperandLayout、デフォルトは "nchw"

入力および出力テンソルのlayout形式を次のように指定する:

"nchw"
- input tensor: [batches, inputChannels, height, width]
- output tensor: [batches, outputChannels, height, width]
"nhwc":
- input tensor: [batches, height, width, inputChannels]
- output tensor: [batches, height, width, outputChannels]

outputShapeRounding, 型はMLRoundingType、デフォルトは "floor"

完全なwindow結果または部分的なwindow結果のどちらが望まれるかに応じて、出力形状を計算するために使用される丸め関数。

outputSizes, 型は sequence<[EnforceRange] unsigned long>

長さ2のリスト: [outputHeight, outputWidth] 出力テンソルの2つのspatial dimensionのサイズを指定する。出力サイズが明示的に指定されている場合、outputShapeRounding は無視される。指定されていない場合、出力サイズは自動的に計算される。

引数:

input: MLOperand。入力4-Dテンソル。論理形状はlayoutの値に従って解釈される。
options: 任意の MLPool2dOptions。演算の任意パラメーター。

戻り値: MLOperand。 reductionの結果を含む出力4-Dテンソル。論理形状はlayoutの値に従って解釈される。より具体的には、outputShapeRounding が"floor"の場合、出力テンソルの単一次元についてのspatial dimensionは次のように計算できる:

output size = floor(1 + (input size - filter size + beginning padding + ending padding) / stride)

または、outputShapeRounding が"ceil"の場合:

output size = ceil(1 + (input size - filter size + beginning padding + ending padding) / stride)

`averagePool2d()`/`l2Pool2d()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	4	4
output	同じ `input`	`"float32"`, `"float16"`	4	4

`maxPool2d()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	任意	`"float32"`, `"float16"`	4	4
output	同じ `input`	`"float32"`, `"float16"`	4	4

MLOpSupportLimits は pooling 操作について次のメンバーを持つ:

averagePool2d, 型は MLSingleInputSupportLimits: averagePool2d()演算子のサポート制限。
l2Pool2d, 型は MLSingleInputSupportLimits: l2Pool2d()演算子のサポート制限。
maxPool2d, 型は MLSingleInputSupportLimits: maxPool2d()演算子のサポート制限。

max pooling 操作のためのものなどの global pooling 操作は、window dimensions が input shape の空間次元（最後の 2 次元）である pooling の変種であり、次のようになる。

// 'global' max pooling
builder.maxPool2d(input);

string op、MLOperand input、MLPool2dOptions options、および任意のリスト allowedDataTypes が与えられて、pooling 操作を作成するには、次の手順を実行する:

Assert: op は "averagePool2d"、"l2Pool2d"、 "maxPool2d" のいずれかである。
this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input を用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
allowedDataTypes が与えられており、それが inputのdataTypeを含まない場合、TypeErrorを投げる。
inputのrankが 4 でない場合、TypeErrorを投げる。
options.layout に応じて分岐する:

"nchw"

« batches, channels, inputHeight, inputWidth » を inputのshapeとする。

"nhwc"

« batches, inputHeight, inputWidth, channels » を inputのshapeとする。
options.windowDimensions が存在しない場合、options.windowDimensions を « inputHeight, inputWidth » に設定する。
options.windowDimensionsの sizeが 2 でない場合、TypeErrorを投げる。
options.windowDimensions 内のいずれかの項目が 0 に等しい場合、TypeErrorを投げる。
options.outputSizes が存在する場合、または options.padding が存在しない場合、options.padding をリスト « 0, 0, 0, 0 » に設定する。
options.paddingの sizeが 4 でない場合、TypeErrorを投げる。
options.strides が存在しない場合、options.strides をリスト « 1, 1 » に設定する。
options.stridesの sizeが 2 でない場合、TypeErrorを投げる。
options.strides 内のいずれかの項目が 0 である場合、TypeErrorを投げる。
options.outputSizes が存在する場合:
1. そのsizeが 2 でない場合、TypeErrorを投げる。
2. その項目が、options.stridesについて同じ次元（index）の項目より小さくない場合、TypeErrorを投げる。
options.dilations が存在しない場合、options.dilations をリスト « 1, 1 » に設定する。
options.dilationsの sizeが 2 でない場合、TypeErrorを投げる。
options.dilations 内のいずれかの項目が 0 である場合、TypeErrorを投げる。
descを input.[[descriptor]]のコピーとする。
出力 shape を計算する:
1. « windowHeight, windowWidth » を options.windowDimensions とする。
2. « calculatedOutputHeight, calculatedOutputWidth » を、 inputHeight、inputWidth、 windowHeight、windowWidth、options.padding、 options.strides、および options.dilations が与えられてconv2d 出力サイズを計算する結果とする。
3. options.outputSizes が存在する場合:
  1. « outputHeight, outputWidth » を options.outputSizes とする。
  2. outputHeight が floor( calculatedOutputHeight ) に等しく、かつ outputWidth が floor( calculatedOutputWidth ) に等しい、または outputHeight が ceil( calculatedOutputHeight ) に等しく、かつ outputWidth が ceil( calculatedOutputWidth ) に等しい、のいずれでもない場合、TypeErrorを投げる。
4. そうでない場合:
  1. « outputHeight, outputWidth » を « calculatedOutputHeight, calculatedOutputWidth » とする。
  2. options.outputShapeRounding に応じて分岐する:
    "floor"
    outputWidth を floor(outputWidth) に設定する。
    
    outputHeight を floor(outputHeight) に設定する。
    "ceil"
    outputWidth を ceiling(outputWidth) に設定する。
    
    outputHeight を ceiling(outputHeight) に設定する。
5. outputHeight または outputWidth のいずれかが有効な次元でない場合、TypeErrorを投げる。
6. options.layout に応じて分岐する:
  
  "nchw"
  
  outputShapeを « batches, channels, outputHeight, outputWidth » とする。
  
  "nhwc"
  
  outputShapeを « batches, outputHeight, outputWidth, channels » とする。
7. desc.shape を outputShape に設定する。
グラフ接続を作成する:
1. outputを、this と desc が与えられてMLOperand を作成する結果とする。
2. operatorを、options が与えられた op 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputを input に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

次の pooling アルゴリズムがサポートされる。

averagePool2d(input, options) メソッドの手順は次のとおりである:

outputを、"averagePool2d"、input、options、および « "float32", "float16" » が与えられてpooling 操作を作成する結果とする。
1. それが error を投げる場合、その error を再投げる。
outputを返す。

l2Pool2d(input, options) メソッドの手順は次のとおりである:

outputを、"l2Pool2d"、input、options、および « "float32", "float16" » が与えられてpooling 操作を作成する結果とする。
1. それが error を投げる場合、その error を再投げる。
outputを返す。

maxPool2d(input, options) メソッドの手順は次のとおりである:

outputを、"maxPool2d"、input および options が与えられてpooling 操作を作成する結果とする。
1. それが error を投げる場合、その error を再投げる。
outputを返す。

8.9.37.1. averagePool2d

feature mapのpatchについて平均値を計算し、それを使用してpooled feature mapを作成する。詳細は§ 8.9.37 Pooling operationsを参照。

8.9.37.2. l2Pool2d

L2 norm関数を入力feature mapの領域に適用する。L2 normは、その要素の二乗和の平方根である。詳細は§ 8.9.37 Pooling operationsを参照。

8.9.37.3. maxPool2d

feature mapのpatchについて最大値を計算し、それを使用してpooled feature mapを作成する。詳細は§ 8.9.37 Pooling operationsを参照。

8.9.38. prelu

入力テンソルに対してrectified linear functionのparametric版（Parametric ReLU）を要素単位で計算する。Parametric ReLUは leaky ReLUの一種であり、0.01のようなスカラーslopeを持つ代わりに、slope（leakage係数）をこの演算のmodel training段階で学習されるパラメーターにする。計算は式max(0, x) + slope * min(0, x)に従う。

演算はbroadcastされる。これは[numpy-broadcasting-rule]に従う。入力テンソルは双方向ブロードキャスト可能でなければならない。出力テンソルのrankは、入力テンソルの最大rankである。出力テンソルの各次元について、そのサイズは入力テンソルのその次元に沿った最大サイズである。

partial interface MLGraphBuilder {
  MLOperand prelu(MLOperand input,
                  MLOperand slope,
                  optional MLOperatorOptions options = {});
};

dictionary MLPreluSupportLimits {
  MLTensorLimits input;
  MLTensorLimits slope;
  MLTensorLimits output;
};

partial dictionary MLOpSupportLimits {
  MLPreluSupportLimits prelu;
};

引数:

input: MLOperand。入力テンソル。
slope: MLOperand。 slopeテンソル。その形状はinputの形状に対して双方向ブロードキャスト可能でなければならない。
options: MLOperatorOptions。演算の任意パラメーターを指定する。

戻り値:

MLOperand。 inputのrankと slopeの rankの最大値に等しいrankの出力N-Dテンソル。

`prelu()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`, `"int64"`, `"int32"`, `"int8"`	`"float32"`, `"float16"`	N	0 から 5
`slope`	同じ `input`	`"float32"`, `"float16"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`	N	0 から 5

MLPreluSupportLimits は次のメンバーを持つ:

input, 型は MLTensorLimits: input オペランド用のMLTensorLimits。
slope, 型は MLTensorLimits: slope オペランド用のMLTensorLimits。
output, 型は MLTensorLimits: output オペランド用のMLTensorLimits。

MLOpSupportLimits は prelu() について次のメンバーを持つ:

prelu, 型は MLPreluSupportLimits: prelu()演算子のサポート制限。

prelu(input, slope, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input および slope のいずれかを用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
input または slope のいずれかのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
outputShapeを、 slopeのshapeと inputのshapeを双方向にブロードキャストする結果とする。
1. それが failure を返す場合、TypeErrorを投げる。
descriptorを、inputのdataTypeと outputShape が与えられてMLOperandDescriptor を作成する結果とする。
グラフ接続を作成する:
1. outputを、this と descriptor が与えられてMLOperand を作成する結果とする。
2. operatorを、slope と options が与えられた "prelu" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputsを input および slope に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

function prelu(builder, input, slope) {
  return builder.add(
    builder.max(builder.constant(input.dataType, 0), input),
    builder.mul(
      slope, builder.min(builder.constant(input.dataType, 0), input)));
}

8.9.39. Reduction operations

入力テンソルをすべての次元に沿って、またはaxes 配列パラメーターで指定された軸に沿ってreduceする。指定された各軸について、そのindexを持つ次元がreduceされる。すなわち、結果のテンソルは、 keepDimensions が指定されていない限り、それを含まない。結果のテンソルの値は、reduceされた次元にわたるすべての入力値をパラメーターとして取る、指定されたreduction関数を使用して計算される。

dictionary MLReduceOptions : MLOperatorOptions {
  sequence<[EnforceRange] unsigned long> axes;
  boolean keepDimensions = false;
};

partial interface MLGraphBuilder {
  MLOperand reduceL1(MLOperand input, optional MLReduceOptions options = {});
  MLOperand reduceL2(MLOperand input, optional MLReduceOptions options = {});
  MLOperand reduceLogSum(MLOperand input, optional MLReduceOptions options = {});
  MLOperand reduceLogSumExp(MLOperand input, optional MLReduceOptions options = {});
  MLOperand reduceMax(MLOperand input, optional MLReduceOptions options = {});
  MLOperand reduceMean(MLOperand input, optional MLReduceOptions options = {});
  MLOperand reduceMin(MLOperand input, optional MLReduceOptions options = {});
  MLOperand reduceProduct(MLOperand input, optional MLReduceOptions options = {});
  MLOperand reduceSum(MLOperand input, optional MLReduceOptions options = {});
  MLOperand reduceSumSquare(MLOperand input, optional MLReduceOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits reduceL1;
  MLSingleInputSupportLimits reduceL2;
  MLSingleInputSupportLimits reduceLogSum;
  MLSingleInputSupportLimits reduceLogSumExp;
  MLSingleInputSupportLimits reduceMax;
  MLSingleInputSupportLimits reduceMean;
  MLSingleInputSupportLimits reduceMin;
  MLSingleInputSupportLimits reduceProduct;
  MLSingleInputSupportLimits reduceSum;
  MLSingleInputSupportLimits reduceSumSquare;
};

MLReduceOptions は次のメンバーを持つ:

axes, 型はsequence<[EnforceRange] unsigned long>

reduceする次元。これにより、入力テンソル内のどの値がreduction関数で使用されるかも指定される。リスト内のaxesは、入力テンソルのrankをNとすると、 [0, N-1]の範囲内でなければならない。

存在しない場合、すべての次元がreduceされる。reduction関数への入力値は、入力テンソル内のすべての値である。

存在し、空でない場合、reduction関数への入力値は、入力テンソルの指定された次元についてのすべての値である。

存在し、空である場合、どの次元もreduceされず、出力テンソルの形状は入力テンソルの形状と同じになる。reduction関数は、テンソル内の各値に個別に適用される。

keepDimensions, 型は boolean、デフォルトはfalse

trueの場合、出力は入力と同じrankを持ち、reduceされたすべての次元をサイズ1に設定する。

引数:

input: MLOperand。入力テンソル。
options: 任意の MLReduceOptions。演算の任意パラメーター。

戻り値: MLOperand。 inputのrankを含む、 0からそのrankまでの範囲にあるrankの出力N-Dテンソルであり、 axes およびkeepDimensions に依存する。入力オペランドがスカラーである場合、reduction関数はそのスカラー値に適用され、出力もスカラーである。

`reduceL1()`/`reduceSum()`/`reduceSumSquare()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`, `"int32"`, `"uint32"`, `"int64"`, `"uint64"`	`"float32"`, `"float16"`, `"int32"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`, `"int32"`	N	0 から 5

`reduceL2()`/`reduceLogSum()`/`reduceLogSumExp()`/`reduceMean()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`	N	0 から 5

`reduceMax()`/`reduceMin()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	任意	`"float32"`, `"float16"`, `"int32"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`, `"int32"`	N	0 から 5

`reduceProduct()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`, `"int32"`, `"uint32"`, `"int64"`, `"uint64"`	`"float32"`, `"float16"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`	N	0 から 5

MLOpSupportLimits は reduction 操作について次のメンバーを持つ:

reduceL1, 型は MLSingleInputSupportLimits: reduceL1()演算子のサポート制限。
reduceL2, 型は MLSingleInputSupportLimits: reduceL2()演算子のサポート制限。
reduceLogSum, 型は MLSingleInputSupportLimits: reduceLogSum()演算子のサポート制限。
reduceLogSumExp, 型は MLSingleInputSupportLimits: reduceLogSumExp()演算子のサポート制限。
reduceMax, 型は MLSingleInputSupportLimits: reduceMax()演算子のサポート制限。
reduceMean, 型は MLSingleInputSupportLimits: reduceMean()演算子のサポート制限。
reduceMin, 型は MLSingleInputSupportLimits: reduceMin()演算子のサポート制限。
reduceProduct, 型は MLSingleInputSupportLimits: reduceProduct()演算子のサポート制限。
reduceSum, 型は MLSingleInputSupportLimits: reduceSum()演算子のサポート制限。
reduceSumSquare, 型は MLSingleInputSupportLimits: reduceSumSquare()演算子のサポート制限。

Reduction types:

L1: L1 norm、すなわち入力値の絶対値の和を計算する。
L2: L2 norm、すなわち入力値の二乗和の平方根を計算する。
LogSum: 入力値の和のlog値を計算する。
LogSumExp: 入力値の指数の和のlog値を計算する。
Max: 入力値の最大値を計算する。
Mean: 入力値の平均値を計算する。
Min: 入力値の最小値を計算する。
Product: 入力値の積を計算する。
Sum: 入力値の和を計算する。
SumSquare: 入力値の二乗和を計算する。

unsigned integersのlist inputShape、任意のunsigned integersのlist axes、およびboolean keepDimensionsが与えられて、reduction output sizesを計算するには、次の手順を実行する。それらはunsigned integersの新しいlist、またはfailureを返す。

inputRankをinputShapeのsizeとする。
axesが与えられていない場合、axesを0から inputRankまで（含まない）の範囲とする。
そうでなく、axesが重複値を含む場合、またはそのitemsのいずれかが 0からinputRankまで（含まない）の範囲内にない場合、failureを返す。
keepDimensionsがtrueの場合:
1. outputShapeをinputShapeのclone とする。
2. axesの各axisについて実行する:
  1. outputShape[axis]を1に設定する。
そうでない場合:
1. outputShapeを空のlistとする。
2. 0からinputRankまで（含まない）の範囲内の各indexについて実行する:
  1. axesがindexを含まない場合、inputShape[index] をoutputShapeにappendする。
outputShapeを返す。

string op、MLOperand input、MLReduceOptions options、および任意のlist allowedDataTypesが与えられて、reduction operationを作成するには、次の手順を実行する:

Assert: opは"reduceL1"、"reduceL2"、 "reduceLogSum"、"reduceLogSumExp"、"reduceMax"、"reduceMean"、"reduceMin"、"reduceProduct"、 "reduceSum"、"reduceSumSquare"のいずれかである。
thisがbuildできない場合、"InvalidStateError" DOMExceptionをthrowする。
オペランドを検証することをthisおよびinputとともに行った結果がfalseを返す場合、TypeErrorをthrowする。
allowedDataTypesが与えられており、それが inputのdataTypeを含まない場合、TypeErrorをthrowする。
outputShapeを、inputのshape、options.axes （それが存在する場合）、およびoptions.keepDimensionsが与えられてreduction output sizesを計算する結果とする。それがfailureを返す場合、TypeErrorをthrowする。
descを、inputのdataTypeおよび outputShapeが与えられてMLOperandDescriptorを作成する結果とする。
グラフ接続を作成する:
1. outputを、thisおよび descが与えられてMLOperandを作成する結果とする。
2. operatorを、optionsが与えられたop演算用の演算子とする。
3. output.[[operator]] をoperatorに設定する。
4. operatorのinputをinputに設定する。
5. operatorのoutputをoutputに設定する。
outputを返す。

次のreductionアルゴリズムがサポートされる。

reduceL1(input, options) メソッドの手順は次のとおりである:

outputを、"reduceL1"、input、options、および « "float32", "float16", "int32", "uint32", "int64", "uint64" » が与えられてreduction 操作を作成する結果とする。
1. それが error を投げる場合、その error を再投げる。
outputを返す。

reduceL2(input, options) メソッドの手順は次のとおりである:

outputを、"reduceL2"、input、options、および « "float32", "float16" » が与えられてreduction 操作を作成する結果とする。
1. それが error を投げる場合、その error を再投げる。
outputを返す。

reduceLogSum(input, options) メソッドの手順は次のとおりである:

outputを、"reduceLogSum"、input、options、および « "float32", "float16" » が与えられてreduction 操作を作成する結果とする。
1. それが error を投げる場合、その error を再投げる。
outputを返す。

reduceLogSumExp(input, options) メソッドの手順は次のとおりである:

outputを、"reduceLogSumExp"、input、options、および « "float32", "float16" » が与えられてreduction 操作を作成する結果とする。
1. それが error を投げる場合、その error を再投げる。
outputを返す。

reduceMax(input, options) メソッドの手順は次のとおりである:

outputを、"reduceMax"、input および options が与えられてreduction 操作を作成する結果とする。
1. それが error を投げる場合、その error を再投げる。
outputを返す。

reduceMean(input, options) メソッドの手順は次のとおりである:

outputを、"reduceMean"、input、options、および « "float32", "float16" » が与えられてreduction 操作を作成する結果とする。
1. それが error を投げる場合、その error を再投げる。
outputを返す。

reduceMin(input, options) メソッドの手順は次のとおりである:

outputを、"reduceMin"、input および options が与えられてreduction 操作を作成する結果とする。
1. それが error を投げる場合、その error を再投げる。
outputを返す。

reduceProduct(input, options) メソッドの手順は次のとおりである:

outputを、"reduceProduct"、input、options、および « "float32", "float16", "int32", "uint32", "int64", "uint64" » が与えられてreduction 操作を作成する結果とする。
1. それが error を投げる場合、その error を再投げる。
outputを返す。

reduceSum(input, options) メソッドの手順は次のとおりである:

outputを、"reduceSum"、input、options、および « "float32", "float16", "int32", "uint32", "int64", "uint64" » が与えられてreduction 操作を作成する結果とする。
1. それが error を投げる場合、その error を再投げる。
outputを返す。

reduceSumSquare(input, options) メソッドの手順は次のとおりである:

outputを、"reduceSumSquare"、input、options、および « "float32", "float16", "int32", "uint32", "int64", "uint64" » が与えられてreduction 操作を作成する結果とする。
1. それが error を投げる場合、その error を再投げる。
outputを返す。

いくつかのreduction演算の振る舞いは、ユーザーエージェントは通常より効率的な実装を持つものの、次のように他の演算の使用から一般的にエミュレートできる。基盤となるプラットフォームが演算を直接サポートしない場合、この分解は実装を導くためのテンプレートとして使用できる。

function reduceLogSum(builder, input, options) {
  return builder.log(builder.reduceSum(input, options));
}

function reduceLogSumExp(builder, input, options) {
  return builder.log(builder.reduceSum(builder.exp(input), options));
}

function reduceSumSquare(builder, input, options) {
  return builder.reduceSum(builder.pow(input, 2), options);
}

一部の基盤となるプラットフォームは、keepDimensions のようなoptionを直接サポートしない。これは基盤となるテンソルデータには影響せず、形状にのみ影響する。たとえば、入力形状が [2, 3, 4]、axisが1、かつkeepDimensions がtrueの場合、期待される出力形状は[2, 1 ,4]である。基盤となるプラットフォームがreduceされた次元を保持しない場合、それは[2, 4]の出力形状を生成する。実装は[2, 1, 4]へのno-op reshapeを導入できる。keepDimensions がfalseであっても、基盤となるプラットフォームが常にreduceされた次元を保持する場合、同様のno-op reshapeを導入できる。

8.9.40. relu

入力テンソルのrectified linear functionを計算する。

partial interface MLGraphBuilder {
  MLOperand relu(MLOperand input, optional MLOperatorOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits relu;
};

引数:

input: MLOperand。入力テンソル。
options: MLOperatorOptions。演算の任意パラメーターを指定する。

戻り値:

MLOperand。 inputと同じ形状の出力テンソル。

`relu()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`, `"int64"`, `"int32"`, `"int8"`	`"float32"`, `"float16"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`	同じ `input`	0 から 5

MLOpSupportLimits は relu() について次のメンバーを持つ:

relu, 型は MLSingleInputSupportLimits: relu()演算子のサポート制限。

relu(input, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input を用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
inputのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
グラフ接続を作成する:
1. outputを、input が与えられてMLOperand をコピーする結果とする。
2. operatorを、options が与えられた "relu" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputを input に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

function relu(builder, input) {
  return builder.max(builder.constant(input.dataType, 0), input);
}

8.9.41. resample2d

軸およびスケーリング係数に従って、テンソル値をソース次元から宛先次元へresampleする。

enum MLInterpolationMode {
  "nearest-neighbor",
  "linear"
};

dictionary MLResample2dOptions : MLOperatorOptions {
  MLInterpolationMode mode = "nearest-neighbor";
  sequence<float> scales;
  sequence<[EnforceRange] unsigned long> sizes;
  sequence<[EnforceRange] unsigned long> axes;
};

partial interface MLGraphBuilder {
  MLOperand resample2d(MLOperand input, optional MLResample2dOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits resample2d;
};

引数:

input: MLOperand。入力4-Dテンソル。
options: 任意の MLResample2dOptions。演算の任意パラメーター。

戻り値: MLOperand。出力4-Dテンソル。

MLResample2dOptions は次のメンバーを持つ:

mode, 型はMLInterpolationMode、デフォルトは "nearest-neighbor"

出力テンソル値を埋めるために使用される補間アルゴリズム。

どちらのアルゴリズムも、各spatial axis（axesに基づく）について計算された、次の入力から開始する。ここで、inputSizeはinput テンソルのshapeで与えられ、 outputSizeはsizes またはscalesで与えられ、 outputCoordinateは、計算対象の出力テンソル内の要素を識別する。

scale = outputSize / inputSize
unclampedCoordinate = (outputCoordinate + 0.5) / scale - 0.5
inputCoordinate = clamp(unclampedCoordinate, 0, inputSize - 1)

出力テンソル内の所与のoutputCoordinate.xおよびoutputCoordinate.yの位置について、上の式は有理数のinputCoordinate.xおよび inputCoordinate.yを与える。

nearest-neighbor

上で計算されたinputCoordinate.xおよびinputCoordinate.yは、次のように出力テンソル値を計算するため、nearest-neighbor samplingアルゴリズムへの入力として使用される:

x = ceil(inputCoordinate.x - 0.5)
y = ceil(inputCoordinate.y - 0.5)
output tensor value = input tensor value at (x, y)

linear

上で計算されたinputCoordinate.xおよびinputCoordinate.yは、次のように出力テンソル値を計算するため、bilinear samplingアルゴリズムへの入力として使用される:

x0 = floor(inputCoordinate.x)
x1 = ceil(inputCoordinate.x)
y0 = floor(inputCoordinate.y)
y1 = ceil(inputCoordinate.y)
vx0y0 = input tensor value at (x0, y0)
vx1y0 = input tensor value at (x1, y0)
vx0y1 = input tensor value at (x0, y1)
vx1y1 = input tensor value at (x1, y1)
tx = inputCoordinate.x - x0
ty = inputCoordinate.y - y0

vy0 = vx0y0 * (1 - tx) + vx1y0 * tx
vy1 = vx0y1 * (1 - tx) + vx1y1 * tx
output tensor value = vy0 * (1 - ty) + vy1 * ty

scales, 型は sequence<float>

長さ2のリスト。 axesからの各入力次元についてのスケーリング係数を指定する: [scaleForFirstAxis, scaleForSecondAxis]。デフォルト値は[1.0, 1.0]である。

sizes, 型はsequence<[EnforceRange] unsigned long>

長さ2のリスト。 axesからの各入力次元についてのターゲットサイズを指定する: [sizeForFirstAxis, sizeForSecondAxis]。sizes が指定されている場合、スケーリング係数値は入力のターゲットサイズから導出されるため、scales は無視される。

axes, 型はsequence<[EnforceRange] unsigned long>

長さ2のリスト。補間アルゴリズムが適用される入力テンソルの2つの次元を指定する。デフォルト値は[2, 3]である。

`resample2d()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`, `"uint8"`, `"int8"`	`"float32"`, `"float16"`	4	4
output	同じ `input`	`"float32"`, `"float16"`	4	4

MLOpSupportLimits は resample2d() について次のメンバーを持つ:

resample2d, 型は MLSingleInputSupportLimits: resample2d()演算子のサポート制限。

resample2d(input, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input を用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
inputのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
inputのrankがその許可される rankでない場合、TypeErrorを投げる。
options.scales が存在しない場合、それをリスト « 1.0, 1.0 » に設定する。
そうでない場合で、その項目のいずれかが 0 以下である、またはそのsizeが 2 でない場合、TypeErrorを投げる。
options.sizes が存在し、かつそのsizeが 2 でない、またはその項目のいずれかが 0 である場合、TypeErrorを投げる。
options.axes が存在しない場合、それをリスト « 2, 3 » に設定する。
そうでない場合で、options.axes が重複値を含む、またはその項目のいずれかが 0 から inputのrank までの範囲（終端を含まない）内にない場合、TypeErrorを投げる。
出力 shape を計算する:
1. inputDescriptorを input.[[descriptor]] とする。
2. outputShapeを inputDescriptor.shapeの複製とする。
3. 各 index について、0 から options.axesの size までの範囲（終端を含まない）で反復する:
  1. options.sizes が存在する場合、size を options.sizes[index] とする。
  2. そうでない場合、size を floor(inputのshape[options.axes[index]] * options.scales[index]) とする。
  3. size が有効な次元でない場合、TypeErrorを投げる。
  4. outputShape[options.axes[index]] を size に設定する。
4. descを、inputDescriptor.dataType と outputShape が与えられてMLOperandDescriptor を作成する結果とする。
グラフ接続を作成する:
1. outputを、this と desc が与えられてMLOperand を作成する結果とする。
2. operatorを、options が与えられた "resample2d" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputを input に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

具体的なsamplingアルゴリズムは、既存のMachine Learningフレームワークで広く使用されているものに基づく。たとえば、次の[4, 4]入力テンソル（spatial dimensionsのみを考慮）からlinear resamplingを行う場合:

[   0   1   2   3  ]
[   0   1   2   3  ]
[  12  13  14  15  ]
[  12  13  14  15  ]

[8, 8]出力テンソルについて、期待される値は次のとおりである:

[   0   0.25   0.75   1.25   1.75   2.25   2.75   3  ]
[   0   0.25   0.75   1.25   1.75   2.25   2.75   3  ]
[   0   0.25   0.75   1.25   1.75   2.25   2.75   3  ]
[   3   3.25   3.75   4.25   4.75   5.25   5.75   6  ]
[   9   9.25   9.75  10.25  10.75  11.25  11.75  12  ]
[  12  12.25  12.75  13.25  13.75  14.25  14.75  15  ]
[  12  12.25  12.75  13.25  13.75  14.25  14.75  15  ]
[  12  12.25  12.75  13.25  13.75  14.25  14.75  15  ]

これは、samplingが均等に分散され、対称で、image mirroringに対して堅牢で、corner値が揃うという便利な性質を持つ。

8.9.42. reshape

テンソルの形状を新しい形状へ変更する。Reshapeはテンソルの内容をcopyしたり変更したりしない。後続の演算のためにテンソルの論理形状を変更するだけである。

partial interface MLGraphBuilder {
  MLOperand reshape(MLOperand input,
                    sequence<[EnforceRange] unsigned long> newShape,
                    optional MLOperatorOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits reshape;
};

引数:

input: MLOperand。入力テンソル。
newShape: sequence<unsigned long>。出力テンソルの形状。 newShape により含意される要素数は、入力テンソル内の要素数と同じでなければならない。
options: MLOperatorOptions。演算の任意パラメーターを指定する。

戻り値: MLOperand。出力テンソル。出力テンソルの値は入力テンソルの値と同じである。出力テンソルの形状はnewShape により指定される。

`reshape()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	任意	`"float32"`, `"float16"`, `"int32"`, `"int8"`, `"uint8"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`, `"int32"`, `"int8"`, `"uint8"`	N	0 から 5

MLOpSupportLimits は reshape() について次のメンバーを持つ:

reshape, 型は MLSingleInputSupportLimits: reshape()演算子のサポート制限。

reshape(input, newShape, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input を用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
newShapeのsizeが、（この表に従って）output tensor の許可される rankでない場合、TypeErrorを投げる。
outputShapeを unsigned longの空配列とする。
newShapeのsizeが 0 の場合、outputShapeをスカラー用の空のリストに設定する。
newShape 内のいずれかの項目が有効な次元でない場合、TypeErrorを投げる。
inputElementCountを、 inputのshape内のすべての項目の積とする。空の次元は inputElementCount 1 をもたらす。
newShape 内のすべての値の積が inputElementCount に等しくない場合、TypeErrorを投げる。
descを input.[[descriptor]]のコピーとする。
desc.shape を newShape に設定する。
グラフ接続を作成する:
1. outputを、this と desc が与えられてMLOperand を作成する結果とする。
2. operatorを、options が与えられた "reshape" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputを input に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

8.9.43. reverse

指定された軸に沿ってテンソルをreverseする。

dictionary MLReverseOptions : MLOperatorOptions {
  sequence<[EnforceRange] unsigned long> axes;
};

partial interface MLGraphBuilder {
  MLOperand reverse(MLOperand input, optional MLReverseOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits reverse;
};

MLReverseOptions は次のメンバーを持つ:

axes, 型はsequence<[EnforceRange] unsigned long>: reverseする入力次元へのindex。このメンバーが存在しない場合、すべての次元がreverseされるものとして扱われる。明示的に空として渡された場合、どの次元もreverseされない。

引数:

input: MLOperand。入力テンソル。
options: MLOperatorOptions。演算の任意パラメーターを指定する。

戻り値:

MLOperand。 inputと同じ形状の出力テンソル。

`reverse()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	任意	`"float32"`, `"float16"`, `"int32"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`, `"int32"`	同じ `input`	0 から 5

MLOpSupportLimits は reverse() について次のメンバーを持つ:

reverse, 型は MLSingleInputSupportLimits: reverse()演算子のサポート制限。

reverse(input, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input を用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
inputのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
inputRankを inputのrankとする。
axes が与えられていない場合、axesを 0 から inputRank までの範囲（終端を含まない）とする。
そうでない場合で、axes が重複値を含む、またはその要素のいずれかが 0 から inputRank までの範囲（終端を含まない）内にない場合、failure を返す。
グラフ接続を作成する:
1. outputを、input が与えられてMLOperand をコピーする結果とする。
2. operatorを、"reverse" 操作と options のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputを input に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

8.9.44. scatterElements

indicesに従って、axisに沿って、updatesテンソルからの値を入力テンソルのcopyの上にscatterする。

dictionary MLScatterOptions : MLOperatorOptions {
  [EnforceRange] unsigned long axis = 0;
};

partial interface MLGraphBuilder {
  MLOperand scatterElements(MLOperand input,
                            MLOperand indices,
                            MLOperand updates,
                            optional MLScatterOptions options = {});
};

dictionary MLScatterSupportLimits {
  MLTensorLimits input;
  MLTensorLimits indices;
  MLTensorLimits updates;
  MLTensorLimits output;
};

partial dictionary MLOpSupportLimits {
  MLScatterSupportLimits scatterElements;
};

MLScatterOptions は次のメンバーを持つ:

axis, 型はunsigned long、デフォルトは0: scattered valuesが取得される軸。その値は、入力テンソルのrankをNとすると、 [0, N-1]の範囲内でなければならない。

引数:

input: MLOperand。出力を初期化するための input N-D テンソル。
indices: MLOperand。 scatter する input 値の indices N-D テンソル。値は "int32", "uint32", または "int64" 型でなければならず、options.axis によって index される input 次元のサイズを N とするとき、-N（含む）から N（含まない）の範囲内でなければならない。また、負の index は次元の末尾から index することを意味する。
updates: MLOperand。 input 上で置き換える新しい値で、indices と同じ shape を持つ。
options: 任意の MLScatterOptions。操作の任意パラメーター。

戻り値: MLOperand。 inputの rankに等しいrankを持つ output N-D テンソル。

`scatterElements()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	任意	`"float32"`, `"float16"`, `"int32"`	1 から N	1 から 5
`indices`	`"int32"`, `"uint32"`, `"int64"`	`"int32"`	同じ `input`	1 から 5
`updates`	同じ `input`	`"float32"`, `"float16"`, `"int32"`	同じ `input`	1 から 5
output	同じ `input`	`"float32"`, `"float16"`, `"int32"`	同じ `input`	1 から 5

MLScatterSupportLimits は次のメンバーを持つ:

input, 型は MLTensorLimits: input オペランド用のMLTensorLimits。
indices, 型は MLTensorLimits: indices オペランド用のMLTensorLimits。
updates, 型は MLTensorLimits: updates オペランド用のMLTensorLimits。
output, 型は MLTensorLimits: output オペランド用のMLTensorLimits。

MLOpSupportLimits は scatterElements() について次のメンバーを持つ:

scatterElements, 型は MLScatterSupportLimits: scatterElements()演算子のサポート制限。

indices パラメーターは、graph が構築される時点では input が実行時まで不明であるため、scatterElements() に対して許可される範囲へ clamp できない。指定された clamping 振る舞いが基盤プラットフォームによって提供されない場合、実装はコンパイル済み graph に clamp() を導入できる。同様に、基盤プラットフォームが負の indices をサポートしない場合、実装はコンパイル済み graph に操作を導入して、次元の末尾からの負の index を正の index へ変換できる。

scatterElements(input, indices, updates, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input、indices および updates のいずれかを用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
indicesのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
updatesのdataTypeが inputのdataTypeに等しくない場合、TypeErrorを投げる。
input、indices、または updates のいずれかのrankがその許可される rankでない場合、 TypeErrorを投げる。
axisを options.axis とする。
axis が inputのrank以上である場合、TypeErrorを投げる。
indicesShapeExpectedを inputのshapeのコピーとする。
indicesShapeExpected[axis] を indicesのshape[axis] に設定する。
indicesのshapeが indicesShapeExpected に等しくない場合、TypeErrorを投げる。
updatesのshapeが indicesのshapeに等しくない場合、TypeErrorを投げる。
グラフ接続を作成する:
1. outputを、input が与えられてMLOperand をコピーする結果とする。
2. operatorを、input、indices、updates、および options が与えられた "scatterElements" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputsを input、indices、および updates に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

scatterElementsが異なるslicing schemesでどのように動作するかの例。

// input of shape [4,3]:
//   [[ 0,  1,  2],
//    [10, 11, 12],
//    [20, 21, 22],
//    [30, 31, 32]]
// indices of shape [2,3]:
//   [[3, 1, 1],
//    [2, 0, 3]]
// updates of shape [2,3]:
//   [[-1, -2, -3],
//    [-4, -5, -6]]
// axis = 0 (default)
// output of shape [4,3]:
//   [[ 0, -5,  2],
//    [10, -2, -3],
//    [-4, 21, 22],
//    [-1, 31, -6]]

const input1 = builder.constant(
  {dataType: 'float32', shape: [4, 3]},
  new Float32Array([0, 1, 2, 10, 11, 12, 20, 21, 22, 30, 31, 32]));

const indices1 = builder.constant(
  {dataType: 'uint32', shape: [2, 3]}, new Uint32Array([3, 1, 1, 2, 0, 3]));

const updates1 = builder.constant(
  {dataType: 'float32', shape: [2, 3]},
  new Uint32Array([-1, -2, -3, -4, -5, -6]));

const output1 = builder.scatterElements(input1, indices1, updates1);

// input of shape [4,3]:
//   [[ 0,  1,  2],
//    [10, 11, 12],
//    [20, 21, 22],
//    [30, 31, 32]]
// indices of shape [4,1]:
//   [[2],
//    [1],
//    [0],
//    [2]],
// updates of shape [4,1]:
//   [[-1],
//    [-2],
//    [-3],
//    [-4]],
// axis = 1
// output of shape [4,3]:
//   [[ 0,  1, -1],
//    [10, -2, 12],
//    [-3, 21, 22],
//    [30, 31, -4]]

const indices2 = builder.constant(
  {dataType: 'uint32', shape: [4, 1]}, new Uint32Array([2, 1, 0, 2]));

const updates2 = builder.constant(
  {dataType: 'float32', shape: [4, 1]}, new Uint32Array([-1, -2, -3, -4]));

const output2 = builder.scatterElements(input1, indices2, updates2, {axis: 1});

// input of shape [4,2,2]:
//   [[[  0,   1],
//     [ 10,  11]],
//    [[100, 101],
//     [110, 111]],
//    [[200, 201],
//     [210, 211]],
//    [[300, 301],
//     [310, 311]],]
// indices of shape [1,2,2]:
//   [[[0, 2],
//     [1, 3]]],
// updates of shape [1,2,2]:
//   [[[-1, -2],
//     [-3, -4]]],
// axis = 0
// output of shape [4,2,2]:
//   [[[ -1,   1],
//     [ 10,  11]],
//    [[100, 101],
//     [ -3, 111]],
//    [[200,  -2],
//     [210, 211]],
//    [[300, 301],
//     [310,  -4]],]

const inputData3 = new Float32Array(
  [0, 1, 10, 11, 100, 101, 110, 111, 200, 201, 210, 211, 300, 301, 310, 311]);

const input3 =
  builder.constant({dataType: 'float32', shape: [4, 2, 2]}, inputData3);

const indices3 = builder.constant(
  {dataType: 'uint32', shape: [1, 2, 2]}, new Uint32Array([0, 2, 1, 3]));

const updates3 = builder.constant(
  {dataType: 'float32', shape: [1, 2, 2]}, new Uint32Array([-1, -2, -3, -4]));

const output3 = builder.scatterElements(input3, indices3, updates3, {axis: 0});

8.9.45. scatterND

indicesに従って、updateテンソルからの値のslicesを入力テンソルのcopyの上にscatterする。

partial interface MLGraphBuilder {
  MLOperand scatterND(MLOperand input,
                      MLOperand indices,
                      MLOperand updates,
                      optional MLOperatorOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLScatterSupportLimits scatterND;
};

引数:

input: MLOperand。出力を初期化するための input N-D テンソル。
indices: MLOperand。 indices 配列は output テンソル内への完全な座標を含み、最右次元が座標ごとの次元数を保持する。したがって shape [10,1] の indices テンソルは 10 個の単一軸 indices を保持し、shape [4,3] は 3D 座標の 4 個の indices を保持する。値は "int32", "uint32", または "int64" 型でなければならず、それぞれ対応する output 次元のサイズを N とするとき、-N（含む）から N（含まない）の範囲内でなければならない。また、負の index は対応する次元の末尾から index することを意味する。
updates: MLOperand。 input 上で置き換える新しい値。
options: 任意の MLScatterOptions。操作の任意パラメーター。

戻り値: MLOperand。 inputの rank + indicesの rank - indicesの shape[-1] - 1 に等しいrankを持つ output N-D テンソル。

`scatterND()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	任意	`"float32"`, `"float16"`, `"int32"`	1 から N	1 から 5
`indices`	`"int32"`, `"uint32"`, `"int64"`	`"int32"`	1 から N	1 から 5
`updates`	同じ `input`	`"float32"`, `"float16"`, `"int32"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`, `"int32"`	1 から N	1 から 5

MLOpSupportLimits は scatterND() について次のメンバーを持つ:

scatterND, 型は MLScatterSupportLimits: scatterND()演算子のサポート制限。

indices パラメーターは、graph が構築される時点では input が実行時まで不明であるため、scatterND() に対して許可される範囲へ clamp できない。指定された clamping 振る舞いが基盤プラットフォームによって提供されない場合、実装はコンパイル済み graph に clamp() を導入できる。同様に、基盤プラットフォームが負の indices をサポートしない場合、実装はコンパイル済み graph に操作を導入して、次元の末尾からの負の index を正の index へ変換できる。

scatterND(input, indices, updates, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
オペランドの検証が、this と input、indices および updates のいずれかを用いて false を返す場合、TypeErrorを投げる。
indicesのdataTypeが、（この表に従って）許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
updatesのdataTypeが inputのdataTypeに等しくない場合、TypeErrorを投げる。
input、indices、または updates のいずれかのrankがその許可される rankでない場合、 TypeErrorを投げる。
inputShapeを inputのshapeとし、inputRankを inputのrankとする。
indicesShapeを indicesのshapeとし、indicesRank を indicesのrankとする。
indexableSizeを indicesRank - 1 とする。
coordinateSizeを indicesShape[indexableSize] とする。
coordinateSize が inputRank より大きい場合、TypeErrorを投げる。
expectedUpdatesShapeを空リストとする。
各 index について、0 から indexableSize までの範囲（終端を含まない）で反復する:
1. indicesShape[index] を expectedUpdatesShape に追加する。
各 index について、coordinateSize から inputRank までの範囲（終端を含まない）で反復する:
1. inputShape[index] を expectedUpdatesShape に追加する。
updatesのshapeが expectedUpdatesShape に等しくない場合、TypeErrorを投げる。
outputShapeを inputのshapeのコピーとする。
outputDescを、inputのdataTypeと outputShape が与えられてMLOperandDescriptor を作成する結果とする。
グラフ接続を作成する:
1. outputを、outputDesc が与えられてMLOperand を作成する結果とする。
2. operatorを、input、indices、updates、および options が与えられた "scatterND" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputsを input、indices、および updates に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

scatterNDが異なるslicing schemesでどのように動作するかの例。

// input of shape [8]:
//   [0, 1, 2, 3, 4, 5, 6, 7]
// indices of shape [4, 1]:
//   [[4],
//    [3],
//    [1],
//    [7]]
// updates of shape [4]:
//   [-1, -2, -3, -4]
// output of shape [8]:
//   [0, -3, 2, -2, -1, 5, 6, -4]

const input1 = builder.constant(
  {dataType: 'float32', shape: [8]},
  new Float32Array([0, 1, 2, 3, 4, 5, 6, 7]));

const indices1 = builder.constant(
  {dataType: 'uint32', shape: [4, 1]}, new Uint32Array([4, 3, 1, 7]));

const updates1 = builder.constant(
  {dataType: 'uint32', shape: [4]}, new Uint32Array([-1, -2, -3, -4]));

const output1 = builder.scatterND(input1, indices1, updates1);

// input of shape [2,2]:
//   [[0, 1],
//    [2, 3]]
// indices of shape [2,2]:
//   [[0, 0],
//    [1, 1]]
// updates of shape [2]:
//   [-1, -2]
// output of shape [2,2]:
//   [[-1,  1],   <= -1 written to output coordinate [0, 0]
//    [ 2, -2]]   <= -2 written to output coordinate [1, 1]

const input2 = builder.constant(
  {dataType: 'float32', shape: [2, 2]}, new Float32Array([0, 1, 2, 3]));

const indices2 = builder.constant(
  {dataType: 'uint32', shape: [2, 2]}, new Uint32Array([0, 0, 1, 1]));

const updates2 =
  builder.constant({dataType: 'uint32', shape: [2]}, new Uint32Array([-1, -2]));

const output2 = builder.scatterND(input2, indices2, updates2);

// input of shape [3,2]:
//   [[0, 1],
//    [2, 3],
//    [4, 5]]
// indices of shape [2,1]:
//   [[2],
//    [0]]
// updates of shape [2,2]:
//   [[-1, -2],
//    [-3, -4]]
// output of shape [3,2]:
//   [[-3 ,-4],    <= [-3, -4] written to output coordinates [0, *]
//    [ 2,  3],
//    [-1, -2]]    <= [-1, -2] written to output coordinates [2, *]

const input3 = builder.constant(
  {dataType: 'float32', shape: [3, 2]}, new Float32Array([0, 1, 2, 3, 4, 5]));

const indices3 = builder.constant(
  {dataType: 'uint32', shape: [2, 1]}, new Uint32Array([1, 0]));

const updates3 = builder.constant(
  {dataType: 'uint32', shape: [2, 2]}, new Uint32Array([-1, -2, -3, 4]));

const output3 = builder.scatterND(input3, indices3, updates3);

// input of shape [2,2,2]:
//   [[[0, 1],
//     [2, 3]],
//    [[4, 5],
//     [6, 7]]]
// indices of shape [2,2]:
//   [[0, 1],
//    [1, 0]]
// updates of shape [2,2]:
//   [[-1, -2],
//    [-3, -4]]
// output of shape [2,2,2]:
//   [[[ 0,  1],
//     [-1, -2]],   <= [-1, -2] written to output coordinates [0, 1, *]
//    [[-3, -4],    <= [-3, -4] written to output coordinates [1, 0, *]
//     [ 6,  7]]]

const input4 = builder.constant(
  {dataType: 'float32', shape: [2, 2, 2]},
  new Float32Array([0, 1, 2, 3, 4, 5, 6, 7]));

const indices4 = builder.constant(
  {dataType: 'uint32', shape: [2, 2]}, new Uint32Array([0, 1, 1, 0]));

const updates4 = builder.constant(
  {dataType: 'uint32', shape: [2, 2]}, new Uint32Array([-1, -2, -3, 4]));

const output4 = builder.scatterND(input4, indices4, updates4);

8.9.46. sigmoid

入力テンソルのsigmoid functionを計算する。計算は式1 / (exp(-x) + 1)に従う。

partial interface MLGraphBuilder {
  MLOperand sigmoid(MLOperand input, optional MLOperatorOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits sigmoid;
};

引数:

input: MLOperand。入力テンソル。
options: MLOperatorOptions。演算の任意パラメーターを指定する。

戻り値:

MLOperand。 inputと同じ形状の出力テンソル。

`sigmoid()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`	同じ `input`	0 から 5

MLOpSupportLimits は sigmoid() について次のメンバーを持つ:

sigmoid, 型は MLSingleInputSupportLimits: sigmoid()演算子のサポート制限。

sigmoid(input, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input を用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
inputのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
グラフ接続を作成する:
1. outputを、input が与えられてMLOperand をコピーする結果とする。
2. operatorを、options が与えられた "sigmoid" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputを input に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

この演算の振る舞いは、ユーザーエージェントは通常より効率的な実装を持つものの、次のように他の演算の使用から一般的にエミュレートできる。基盤となるプラットフォームが演算を直接サポートしない場合、この分解は実装を導くテンプレートとして使用できる。

function sigmoid(builder, input) {
  return builder.div(
    builder.constant(input.dataType, 1),
    builder.add(
      builder.exp(builder.neg(input)), builder.constant(input.dataType, 1)));
}

8.9.47. slice

入力テンソルのsliceを生成する。

dictionary MLSliceOptions : MLOperatorOptions {
  sequence<[EnforceRange] unsigned long> strides;
};

partial interface MLGraphBuilder {
  MLOperand slice(MLOperand input,
                  sequence<[EnforceRange] unsigned long> starts,
                  sequence<[EnforceRange] unsigned long> sizes,
                  optional MLSliceOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits slice;
};

MLSliceOptions は次のメンバーを持つ:

strides, 型はsequence<[EnforceRange] unsigned long>: 各axisに沿って各入力をstep overするstride。 strides配列の長さは入力テンソルのrankと等しくなければならない。デフォルトは、すべて1からなるrank長の配列である。例: 3-Dテンソルの場合は[1,1,1]。 Stridesは0より大きくなければならない。

引数:

input: MLOperand。入力テンソル。
starts: sequence<unsigned long>。各入力次元のsliceを開始するindexで、長さNを持つ。ここでNは入力テンソルのrankである。 inputの各次元dについて、 starts[d] はその次元でsliceを開始するindexを示す。開始indexはその次元で[0, input size - 1]の範囲内でなければならない。
sizes: sequence<unsigned long>。各入力次元でsliceする要素数で、長さNを持つ。ここでNは入力テンソルのrankである。 inputの各次元dについて、 sizes[d] はその次元でsliceする要素数を示す。sizeは0であってはならず、その次元で starting index + size <= input sizeという制約を満たさなければならない。
options: MLSliceOptions。演算の任意パラメーターを指定する。

戻り値: MLOperand。各次元の指定された開始indexおよび終了indexにtensor値が切り詰められた、入力テンソルと同じrankの出力テンソル。

`slice()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	任意	`"float32"`, `"float16"`, `"int32"`, `"int8"`, `"uint8"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`, `"int32"`, `"int8"`, `"uint8"`	同じ `input`	0 から 5

MLOpSupportLimits はslice()について次のメンバーを持つ:

slice, 型はMLSingleInputSupportLimits: slice()演算子のサポート制限。

slice(input, starts, sizes, options) メソッドの手順は次のとおりである:

thisがbuildできない場合、"InvalidStateError" DOMExceptionをthrowする。
オペランドを検証することをthisおよびinputとともに行った結果がfalseを返す場合、TypeErrorをthrowする。
sizesのitemsのいずれかが0である場合、 TypeErrorをthrowする。
startsのsizeおよびsizesのsizeの両方が inputのrankと等しくない場合、TypeErrorをthrowする。
stridesを新しいlistとする。
options.strides が存在する場合:
1. stridesをoptions.stridesに設定する。
2. stridesのsizeが inputのrankと等しくない場合、TypeErrorをthrowする。
inputShapeをinputのshapeとし、inputRankをinputのrankとする。
outputShapeを新しいlistとする。
0からinputRankまで（含まない）の範囲内の各indexについて実行する:
1. inputSizeをinputShape[index]とする。
2. inputSliceSizeをsizes[index]とする。
3. strideを、空でない場合はstrides[index]、そうでない場合は1とする:
4. inputSliceSizeが0の場合、TypeErrorをthrowする。
  
  0-size dimensionsが許可される場合、これらの手順を改訂すること。[Issue #391]
5. strideが1未満の場合、TypeErrorをthrowする。
6. starts[index]がinputSizeより大きい場合、TypeErrorをthrowする。
7. starts[index] + inputSliceSizeが inputSizeより大きい場合、TypeErrorをthrowする。
8. outputSizeRoundingExcessを、inputSliceSize % stride != 0の場合は1、そうでない場合は0とする。
9. outputSizeをfloor(inputSliceSize / stride) + outputSizeRoundingExcessとする:
10. outputSizeをoutputShapeにappendする。
outputDescを、inputのdataTypeおよび outputShapeが与えられてMLOperandDescriptorを作成する結果とする。
グラフ接続を作成する:
1. outputを、outputDescが与えられてMLOperandを作成する結果とする。
2. operatorを、starts、sizes、およびoptionsが与えられた"slice"演算用の演算子とする。
3. output.[[operator]] をoperatorに設定する。
4. operatorのinputをinputに設定する。
5. operatorのoutputをoutputに設定する。
outputを返す。

8.9.48. softmax

指定されたaxisに沿って、N-D入力テンソルのsoftmax値を計算する。

partial interface MLGraphBuilder {
  MLOperand softmax(MLOperand input,
                    [EnforceRange] unsigned long axis,
                    optional MLOperatorOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits softmax;
};

引数:

input: MLOperand。入力N-Dテンソル。
axis: unsigned long スカラー。reductionが実行される次元。
options: MLOperatorOptions。演算の任意パラメーターを指定する。

戻り値:

MLOperand。 softmax結果を含む、inputと同じ形状の出力N-Dテンソル。

`softmax()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	1 から N	1 から 5
output	同じ `input`	`"float32"`, `"float16"`	同じ `input`	1 から 5

MLOpSupportLimits は softmax() について次のメンバーを持つ:

softmax, 型は MLSingleInputSupportLimits: softmax()演算子のサポート制限。

softmax(input, axis, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input を用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
inputのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
axis が inputのrank以上である場合、TypeErrorを投げる。
グラフ接続を作成する:
1. outputを、input が与えられてMLOperand をコピーする結果とする。
2. operatorを、axis および options が与えられた "softmax" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputを input に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

function softmax(builder, input, axis) {
  // This sample deploys a well-known implementation trick [1] to compute the
  // exponentials of the distances to the max value, instead of the exponentials
  // of the input values itself, in order to increase the numerical stability of
  // the result.
  // [1]: https://cs231n.github.io/linear-classify/#softmax
  const maxX = builder.reduceMax(input, {axes: [axis], keepDimensions: true});
  const expX = builder.exp(builder.sub(input, maxX));
  return builder.div(
    expX, builder.reduceSum(expX, {axes: [axis], keepDimensions: true}));
}

8.9.49. softplus

入力テンソルのsoftplus functionを計算する。計算は式ln(1 + exp(x))に従う。

partial interface MLGraphBuilder {
  MLOperand softplus(MLOperand input, optional MLOperatorOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits softplus;
};

引数:

input: MLOperand。入力テンソル。
options: MLOperatorOptions。演算の任意パラメーターを指定する。

戻り値:

MLOperand。 inputと同じ形状の出力テンソル。

`softplus()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`	同じ `input`	0 から 5

MLOpSupportLimits は softplus() について次のメンバーを持つ:

softplus, 型は MLSingleInputSupportLimits: softplus()演算子のサポート制限。

softplus(input, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input を用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
inputのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
グラフ接続を作成する:
1. outputを、input が与えられてMLOperand をコピーする結果とする。
2. operatorを、"softplus" 操作と options のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputを input に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

function softplus(builder, input) {
  return builder.log(
    builder.add(builder.exp(input), builder.constant(input.dataType, 1)));
}

8.9.50. softsign

入力テンソルのsoftsign functionを計算する。計算は式x / (1 + |x|)に従う。

partial interface MLGraphBuilder {
  MLOperand softsign(MLOperand input, optional MLOperatorOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits softsign;
};

function softsign(builder, input) {
  return builder.div(
    input,
    builder.add(builder.constant(input.dataType, 1), builder.abs(input)));
}

引数:

input: MLOperand。入力テンソル。
options: MLOperatorOptions。演算の任意パラメーターを指定する。

戻り値:

MLOperand。 inputと同じ形状の出力テンソル。

`softsign()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`	同じ `input`	0 から 5

MLOpSupportLimits は softsign() について次のメンバーを持つ:

softsign, 型は MLSingleInputSupportLimits: softsign()演算子のサポート制限。

softsign(input, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input を用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
inputのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
グラフ接続を作成する:
1. outputを、input が与えられてMLOperand をコピーする結果とする。
2. operatorを、"softsign" 操作と options のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputを input に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

8.9.51. split

指定されたaxisに沿って、入力テンソルをいくつかのsub tensorsに分割する。

dictionary MLSplitOptions : MLOperatorOptions {
  [EnforceRange] unsigned long axis = 0;
};

partial interface MLGraphBuilder {
  sequence<MLOperand> split(
      MLOperand input,
      ([EnforceRange] unsigned long or sequence<[EnforceRange] unsigned long>) splits,
      optional MLSplitOptions options = {});
};

dictionary MLSplitSupportLimits {
  MLTensorLimits input;
  MLTensorLimits outputs;
};

partial dictionary MLOpSupportLimits {
  MLSplitSupportLimits split;
};

引数:

input: MLOperand。入力テンソル。
splits: unsigned long またはsequence<unsigned long>。 unsigned longの場合、 axisに沿った出力テンソルの数を指定する。この数は、inputの axis に沿った次元サイズを割り切らなければならない。 sequence<unsigned long>の場合、 axisに沿った各出力テンソルのサイズを指定する。 sizesの合計は、inputの axisに沿った次元サイズと等しくなければならない。
options: 任意のMLSplitOptions。演算の任意パラメーター。

戻り値: sequence<MLOperand>。分割された出力テンソル。splits がunsigned longの場合、出力のsizeはsplitsと等しい。各出力テンソルのshapeは、axisの次元サイズが、inputの axisに沿った次元サイズをsplitsで割った商に等しいことを除き、inputと同じである。 splits がsequence<unsigned long>の場合、出力のsizeはsplitsのsizeと等しい。 i番目の出力テンソルのshapeは、axisに沿った次元サイズがsplits[i]であることを除き、 inputと同じである。

MLSplitOptions は次のメンバーを持つ:

axis, 型はunsigned long、デフォルトは0: splitする次元。その値は、入力テンソルのrankをNとして、 [0, N-1]の範囲内でなければならない。

`split()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	任意	`"float32"`, `"float16"`, `"int32"`	1 から N	1 から 5
outputs	同じ `input`	`"float32"`, `"float16"`, `"int32"`	同じ `input`	1 から 5

MLSplitSupportLimits は次のメンバーを持つ:

input, 型はMLTensorLimits: inputオペランド用のMLTensorLimits。
outputs, 型はMLTensorLimits: すべてのoutputオペランド用のMLTensorLimits。

MLOpSupportLimits はsplit()について次のメンバーを持つ:

split, 型はMLSplitSupportLimits: split()演算子のサポート制限。

split(input, splits, options) メソッドの手順は次のとおりである:

thisがbuildできない場合、"InvalidStateError" DOMExceptionをthrowする。
オペランドを検証することをthisおよびinputとともに行った結果がfalseを返す場合、TypeErrorをthrowする。
axisをoptions.axisとする。
axisがinputのrank以上である場合、TypeErrorをthrowする。
splitsがunsigned longである場合:
1. splitsが有効なテンソル数でない場合、TypeErrorをthrowする。
2. inputのshape[axis] % splitsが 0でない場合、TypeErrorをthrowする。
3. そうでない場合、splitCountをsplitsとする。
splitsがsequence<unsigned long>である場合:
1. splitsのsizeが有効なテンソル数でない場合、TypeErrorをthrowする。
2. そのitemsのいずれかが0と等しい場合、TypeErrorをthrowする。
  
  0-size dimensionsが許可される場合、上記の手順を改訂すること。[Issue #391]
3. そのitemsの合計がinputのshape[axis]と等しくない場合、TypeErrorをthrowする。
4. そうでない場合、splitCountをsplitsのsizeとする。
グラフ接続を作成する:
1. operatorを、splitsおよびoptionsが与えられた "split"演算用の演算子とする。
2. outputsを新しいlistとする。
3. 0からsplitCountまで（含まない）の範囲内の各indexについて実行する:
  1. operandを、inputが与えられてMLOperandをcopyする結果とする。
  2. splitsがunsigned longである場合、 newDimensionをoperandのshape[axis] / splitsとする。
  3. そうでない場合、newDimensionをsplits[index]とする。
  4. operandのshape[axis]を newDimensionに設定する。
  5. operand.[[operator]] をoperatorに設定する。
  6. operandを outputsにappendする。
4. operatorのinputをinputに設定する。
5. operatorのoutputsをoutputsに設定する。
outputsを返す。

function split(builder, input, splits, options) {
  // This sample shows the case that the splits parameter is an array.
  const outputs = [];
  const inputShape = input.shape;
  const inputRank = inputShape.length;
  let starts = Array(inputRank).fill(0);
  let sizes = inputShape;
  let start = 0;
  for (const size of splits) {
    starts[options.axis] = start;
    sizes[options.axis] = size;
    outputs.push(builder.slice(input, starts, sizes));
    start += size;
  }
  return outputs;
}

8.9.52. tanh

入力テンソルのhyperbolic tangent functionを計算する。計算は式(exp(2 * x) - 1) / (exp(2 * x) + 1)に従う。

partial interface MLGraphBuilder {
  MLOperand tanh(MLOperand input, optional MLOperatorOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits tanh;
};

引数:

input: MLOperand。入力テンソル。
options: MLOperatorOptions。演算の任意パラメーターを指定する。

戻り値:

MLOperand。 inputと同じ形状の出力テンソル。

`tanh()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	`"float32"`, `"float16"`	`"float32"`, `"float16"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`	同じ `input`	0 から 5

MLOpSupportLimits は tanh() について次のメンバーを持つ:

tanh, 型は MLSingleInputSupportLimits: tanh()演算子のサポート制限。

tanh(input, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input を用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
inputのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
グラフ接続を作成する:
1. outputを、input が与えられてMLOperand をコピーする結果とする。
2. operatorを、options が与えられた "tanh" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputを input に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

function tanh(builder, input) {
  return builder.div(
    builder.sub(
      builder.exp(builder.mul(builder.constant(input.dataType, 2), input)),
      builder.constant(input.dataType, 1)),
    builder.add(
      builder.exp(builder.mul(builder.constant(input.dataType, 2), input)),
      builder.constant(input.dataType, 1)));
}

8.9.53. tile

各次元に沿って、指定された回数だけテンソルを繰り返す。

partial interface MLGraphBuilder {
  MLOperand tile(MLOperand input,
                 sequence<unsigned long> repetitions,
                 optional MLOperatorOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits tile;
};

引数:

input: MLOperand。入力N-Dテンソル。
repetitions: 各次元について、その次元を何回繰り返すかのcount。sizeは inputの rankと一致しなければならず、同じサイズを保持すべきaxisには1を使用する。
options: 任意のMLOperatorOptions。演算の任意パラメーター。

戻り値: MLOperand。反転されたN-Dテンソル。

`tile()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	任意	`"float32"`, `"float16"`, `"int32"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`, `"int32"`	同じ `input`	0 から 5

MLOpSupportLimits はtile()について次のメンバーを持つ:

tile, 型はMLSingleInputSupportLimits: tile()演算子のサポート制限。

tile(input, repetitions, options) メソッドの手順は次のとおりである:

thisがbuildできない場合、"InvalidStateError" DOMExceptionをthrowする。
オペランドを検証することをthisおよびinputとともに行った結果がfalseを返す場合、TypeErrorをthrowする。
repetitionsのsizeが inputのrankと等しくない場合、TypeErrorをthrowする。
repetitionsの値が0を含む場合、TypeErrorをthrowする。

0-size dimensionsが許可される場合、これらの手順を改訂すること。[Issue #391]
outputShapeをinputのshapeのcopyとする。
0からoutputShapeのsizeまで（含まない）の範囲内の各indexについて実行する:
1. outputShape[index]をoutputShape[index] * repetitions[index]に設定する。
outputDescriptorを、inputのdataTypeおよび outputShapeが与えられてMLOperandDescriptorを作成する結果とする。
グラフ接続を作成する:
1. outputを、outputDescriptorが与えられてMLOperandを作成する結果とする。
2. operatorを、optionsが与えられた"tile"演算用の演算子とする。
3. output.[[operator]] をoperatorに設定する。
4. operatorのinputをinputに設定する。
5. operatorのoutputをoutputに設定する。
outputを返す。

8.9.54. transpose

permutationに従って入力テンソルの次元を置換する。

dictionary MLTransposeOptions : MLOperatorOptions {
  sequence<[EnforceRange] unsigned long> permutation;
};

partial interface MLGraphBuilder {
  MLOperand transpose(MLOperand input, optional MLTransposeOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits transpose;
};

MLTransposeOptions は次のメンバーを持つ:

permutation, 型は sequence<[EnforceRange] unsigned long>: 出力shapeを置換するために使用される値。デフォルトは[N-1, ..., 0]であり、ここでNは入力テンソルのrankである。例: 3-Dテンソルでは[2,1,0]。これらのデフォルト値により、出力は入力の転置テンソルになる。指定された場合、値の数は入力テンソルのrankと同じでなければならず、その値は重複なしで 0からN-1までの範囲内でなければならない。

引数:

input: MLOperand。入力N-Dテンソル。
options: 任意のMLTransposeOptions。演算の任意パラメーター。

戻り値: MLOperand。置換または転置されたN-Dテンソル。

`transpose()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	任意	`"float32"`, `"float16"`, `"int32"`	N	0 から 5
output	同じ `input`	`"float32"`, `"float16"`, `"int32"`	同じ `input`	0 から 5

MLOpSupportLimits はtranspose()について次のメンバーを持つ:

transpose, 型はMLSingleInputSupportLimits: transpose()演算子のサポート制限。

transpose(input, options) メソッドの手順は次のとおりである:

thisがbuildできない場合、"InvalidStateError" DOMExceptionをthrowする。
オペランドを検証することをthisおよびinputとともに行った結果がfalseを返す場合、TypeErrorをthrowする。
options.permutation が存在しない場合、options.permutation をinputのshapeのすべてのindicesの逆順sequenceとする。
そうでない場合、options.permutation が存在する場合:
1. そのsizeがinputのrankと等しくない場合、TypeErrorをthrowする。
2. そのitemsが0からinputのrankまで（含まない）の範囲内にない場合、 TypeErrorをthrowする。
3. 重複値を含む場合、TypeErrorをthrowする。
グラフ接続を作成する:
1. outputを、inputが与えられてMLOperandをcopyする結果とする。
2. operatorを、optionsが与えられた"transpose"演算用の演算子とする。
3. output.[[operator]] をoperatorに設定する。
4. operatorのinputをinputに設定する。
5. operatorのoutputをoutputに設定する。
outputを返す。

8.9.55. triangular

2-Dテンソル（matrix）が与えられた場合、入力テンソルの上三角部分または下三角部分のいずれかを含む2-Dテンソルを返す。入力テンソルが2次元を超える場合、matricesのbatchとして扱われ、結果は同じshapeを持つ。

dictionary MLTriangularOptions : MLOperatorOptions {
  boolean upper = true;
  [EnforceRange] long diagonal = 0;
};

partial interface MLGraphBuilder {
  MLOperand triangular(MLOperand input, optional MLTriangularOptions options = {});
};

partial dictionary MLOpSupportLimits {
  MLSingleInputSupportLimits triangular;
};

MLTriangularOptions は次のメンバーを持つ:

upper, 型はboolean、デフォルトはtrue: 入力matrixの上部または下部のどちらを出力で保持するかを示す。Trueは上部を保持することを示す。
diagonal, 型はlong、デフォルトは0: 入力matrixの主対角線の上または下にある対角線をいくつ保持または除外するかを指定する。値0は、主対角線以外の対角線が影響を受けないことを意味する。

引数:

input: MLOperand。少なくとも2-Dである入力テンソル。
options: 任意のMLTriangularOptions。演算の任意パラメーター。

戻り値: MLOperand。入力と同じshapeである、三角matrixまたはmatricesのbatchを表す出力テンソル。

`triangular()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`input`	任意	`"float32"`, `"float16"`	2 から N	2 から 5
output	同じ `input`	`"float32"`, `"float16"`	同じ `input`	2 から 5

MLOpSupportLimits は triangular() について次のメンバーを持つ:

triangular, 型は MLSingleInputSupportLimits: triangular()演算子のサポート制限。

triangular(input, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と input を用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
inputのrankが、（この表に従って）その許可される rankのいずれでもない場合、TypeErrorを投げる。
グラフ接続を作成する:
1. outputを、input が与えられてMLOperand をコピーする結果とする。
2. operatorを、options が与えられた "triangular" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputを input に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

triangularが異なるdiagonal設定でどのように動作するかの例。

// input:
//   [[7, 1, 2],
//    [9, 4, 8],
//    [2, 6, 3]]
const input = builder.constant(
  {dataType: 'float32', shape: [3, 3]},
  new Float32Array([7, 1, 2, 9, 4, 8, 2, 6, 3]));

// upper triangular matrix:
//   [[7, 1, 2],
//    [0, 4, 8],
//    [0, 0, 3]]
const upper = builder.triangular(input);

// upper triangular matrix with one additional set of diagonals excluded:
//   [[0, 1, 2],
//    [0, 0, 8],
//    [0, 0, 0]]
const upperPositive = builder.triangular(input, {diagonal: 1});

// upper triangular matrix with one additional set of diagonals retained:
//   [[7, 1, 2],
//    [9, 4, 8],
//    [0, 6, 3]]
const upperNegative = builder.triangular(input, {diagonal: -1});

// lower triangular matrix:
//   [[7, 0, 0],
//    [9, 4, 0],
//    [2, 6, 3]]
const lower = builder.triangular(input, {upper: false});

// lower triangular matrix with one additional set of diagonals retained:
//   [[7, 1, 0],
//    [9, 4, 8],
//    [2, 6, 3]]
const lowerPositive = builder.triangular(input, {upper: false, diagonal: 1});

// lower triangular matrix with one additional set of diagonals excluded:
//   [[0, 0, 0],
//    [9, 0, 0],
//    [2, 6, 0]]
const lowerNegative = builder.triangular(input, {upper: false, diagonal: -1})

// lower triangular matrix with two batches:
//   [[[7, 0, 0],
//     [9, 4, 0],
//     [2, 6, 3]],
//    [[1, 0, 0],
//     [4, 5, 0],
//     [7, 8, 9]]]
const lowerWithBatches = builder.triangular(input, {upper: false});

8.9.56. where

condition テンソルの対応する値に応じて、trueValue またはfalseValue テンソルから値を選択する。ここで非0はtrue、0はfalseである。condition テンソルは、element-wise logical operationsのいずれかの出力であることが多い。

この演算はbroadcast される。これは[numpy-broadcasting-rule]に従う。入力テンソルはbidirectionally broadcastableでなければならない。出力テンソルのrankは、入力テンソルの rankの最大値である。出力テンソルの各次元について、そのサイズは入力テンソルのその次元に沿った最大サイズである。

partial interface MLGraphBuilder {
  MLOperand where(MLOperand condition,
                  MLOperand trueValue,
                  MLOperand falseValue,
                  optional MLOperatorOptions options = {});
};

dictionary MLWhereSupportLimits {
  MLTensorLimits condition;
  MLTensorLimits trueValue;
  MLTensorLimits falseValue;
  MLTensorLimits output;
};

partial dictionary MLOpSupportLimits {
  MLWhereSupportLimits where;
};

引数:

condition: MLOperand。 conditionテンソル。
trueValue: MLOperand。対応する要素のconditionがtrueに設定されている場合に、値が選択されるテンソル。
falseValue: MLOperand。対応する要素のconditionがfalseに設定されている場合に、値が選択されるテンソル。
options: MLOperatorOptions。演算の任意パラメーターを指定する。

戻り値: MLOperand。 trueValue またはfalseValue テンソルのいずれかからelement-wiseに選択された値を含む出力テンソル。

`where()`のテンソル制限
オペランド	許可されるデータ型	必須データ型	許可される rank	必須 rank
`condition`	`"uint8"`	`"uint8"`	N	0 から 5
`trueValue`	任意	`"float32"`, `"float16"`, `"int32"`	N	0 から 5
`falseValue`	同じ `trueValue`	`"float32"`, `"float16"`, `"int32"`	N	0 から 5
output	同じ `trueValue`	`"float32"`, `"float16"`, `"int32"`	N	0 から 5

MLWhereSupportLimits は次のメンバーを持つ:

condition, 型は MLTensorLimits: condition オペランド用のMLTensorLimits。
trueValue, 型は MLTensorLimits: trueValue オペランド用のMLTensorLimits。
falseValue, 型は MLTensorLimits: falseValue オペランド用のMLTensorLimits。
output, 型は MLTensorLimits: output オペランド用のMLTensorLimits。

MLOpSupportLimits は where() について次のメンバーを持つ:

where, 型は MLWhereSupportLimits: where()演算子のサポート制限。

where(condition, trueValue, falseValue, options) メソッドの手順は次のとおりである:

this が構築できない場合、"InvalidStateError" DOMExceptionを投げる。
this と condition、trueValue および falseValue のいずれかを用いたオペランドの検証が false を返す場合、TypeErrorを投げる。
condition、trueValue、または falseValue のいずれかのdataTypeが、（この表に従って）その許可されるデータ型のいずれでもない場合、TypeErrorを投げる。
outputShapeを、trueValueのshapeと falseValueのshapeを双方向にブロードキャストする結果とする。
1. それが failure を返す場合、TypeErrorを投げる。
outputShapeを、conditionのshapeと outputShapeを双方向にブロードキャストする結果に設定する。
1. それが failure を返す場合、TypeErrorを投げる。
descriptorを、trueValueのdataTypeと outputShape が与えられてMLOperandDescriptor を作成する結果とする。
グラフ接続を作成する:
1. outputを、this と descriptor が与えられてMLOperand を作成する結果とする。
2. operatorを、condition、trueValue、falseValue、および options が与えられた "where" 操作のoperatorとする。
3. output.[[operator]] を operator に設定する。
4. operatorのinputsを condition、 trueValue および falseValue に設定する。
5. operatorのoutputを output に設定する。
outputを返す。

function where(builder, condition, trueValue, falseValue) {
  const c = builder.clamp(condition, {'minValue': 0, 'maxValue': 1});
  builder.add(
    builder.mul(trueValue, builder.cast(c, trueValue.dataType)),
    builder.mul(
      falseValue, builder.cast(builder.logicalNot(c), falseValue.dataType)));
}

9. アルゴリズム

9.1. ブロードキャスト

ブロードキャストは、グラフ構築および計算中に、WebNNが異なるshapeを持つテンソルをどのように扱うかを説明する。これは[NumPy]の影響を強く受けており、[numpy-broadcasting-rule]に従う。おおまかに言えば、小さいテンソルに対する演算を大きいテンソルのshape全体へ「broadcast」できるようにすることで、 copyを作らずに同じデータを繰り返し適用できるようにする。

最も単純な例は、add() やmul() などのelement-wise二項演算で、スカラー定数をN次元テンソルに適用することである。スカラー定数の複数のcopyを含む、一致するN次元テンソルを確保して埋める必要はなく、これらのelement-wise演算ではスカラー定数を直接使用でき、スカラー値をN次元テンソル全体へ broadcastできる。次の考慮事項により、同じロジックは他の次元のテンソルにも適用される。

入力テンソルのshapeは互換でなければならない。あるテンソルは、最後（右端）の次元から開始して、 sizeが1であるaxisに沿ってそのテンソルを繰り返すか、新しい次元にまたがって繰り返すことで、最初のテンソルを「引き伸ばせる」場合、別のテンソルへ一方向にbroadcast可能である。例えば、[4]テンソルは、それを5回繰り返すことで[5, 4]テンソルへbroadcastできる。 [1]テンソルは、最後の次元で4回、前の次元で5回繰り返すことで[5,4]テンソルへbroadcastできる。一方向ブロードキャストは、targetテンソルshapeが明示的に与えられるexpand() のような演算で重要である。

2つのテンソルは、最後の次元から開始して、それぞれのさまざまな次元にまたがって相互に「引き伸ばせる」（繰り返せる）場合、双方向にbroadcast可能である。例えば、 [5,1]テンソルは、最初のテンソルを最後の次元で6回繰り返し、2つ目のテンソルを前の次元で5回繰り返すことで、[1,6]テンソルと双方向にbroadcastできる。演算の結果は[5,6]テンソルになる。双方向ブロードキャストはelement-wise演算に便利である。

すべての次元を整数倍でtargetテンソルのshapeへupsampleできる場合、テンソルはblockwise broadcast可能である。例えば、[4,5]テンソルは、各要素を第1次元で4回、最後の次元で2回繰り返すことで、正確な倍数（16 % 4 = 0、10 % 5 = 0）であるため、[16,10]テンソルへblockwise broadcastできる（例えば、最後の次元の値[1,2,3,4,5]は [1,1,2,2,3,3,4,4,5,5]へ繰り返される）。しかし、[4,5]テンソルは、両方の次元で0でない余り（9 % 4 = 1、3 % 5 = 3）があるため、[9,3]テンソルとは互換でない。 Blockwise broadcastingは、memoryを節約するために大きなblockで共通の値を共有するのに有用である。両方のテンソルは同じrankを持つことが期待され、出力shapeは単に、小さい方がupsampleされるtargetテンソルのshapeである。

一部の演算は、特別なsemanticsを持つブロードキャストを許可する。例えば、matmul() は入力テンソルの最後の2次元をmatrixの行および列として扱い、最初のmatrixの列数は2つ目のmatrixの行数と等しくなければならない。行列乗算は、入力テンソルを乗算するmatricesのstackとして扱い、追加の任意の次元にまたがって双方向にbroadcastされる。

shapeFromおよびshapeToのshapeを一方向にbroadcastするには、次の手順を実行する。 shapeFromおよびshapeToは、テンソルの次元を表す正の整数のlistであり、この手順は正の整数の新しいlist、またはfailureを返す。

sizeFromをshapeFromのsizeとする。
sizeToをshapeToのsizeとする。
sizeFrom > sizeToの場合、failureを返す。
paddedShapeFromをshapeFromのcloneとする。
paddedShapeFromのsizeが sizeToより小さい間、paddedShapeFromに1をprependする。
outputShapeを新しいlistとする。
0からsizeToまで（含まない）の範囲内の各indexについて実行する:
1. dimFromをpaddedShapeFrom[index]とする。
2. dimToをshapeTo[index]とする。
3. dimToがdimFromと等しくなく、かつdimFromが1と等しくない場合、 failureを返す。
4. dimToをoutputShapeにAppendする。
outputShapeを返す。

shapeFromおよびshapeToを一方向にbroadcastingすることがfailureにならない場合、 shapeFromはshapeToへ一方向にbroadcast可能である。

shapeAおよびshapeBのshapeを双方向にbroadcastするには、次の手順を実行する。 shapeAおよびshapeBは、テンソルの次元を表す正の整数のlistであり、この手順は正の整数の新しいlist、またはfailureを返す。

sizeAをshapeAのsizeとする。
sizeBをshapeBのsizeとする。
outputSizeをsizeAおよびsizeBの最大値とする。
paddedAをshapeAのcloneとする。
paddedAのsizeが outputSizeより小さい間、paddedAに1をprependする。
paddedBをshapeBのcloneとする。
paddedBのsizeが outputSizeより小さい間、paddedBに1をprependする。
outputShapeを新しいlistとする。
0からoutputSizeまで（含まない）の範囲内の各indexについて実行する:
1. dimAをpaddedA[index]とする。
2. dimBをpaddedB[index]とする。
3. dimAがdimBと等しくなく、かつdimAが1と等しくなく、かつdimBが1と等しくない場合、failureを返す。
4. dimAおよびdimBの最大値をoutputShapeにAppendする。
outputShapeを返す。

shapeAおよびshapeBを双方向にbroadcastingすることがfailureにならない場合、 shapeAはshapeBへ双方向にbroadcast可能である。

shapeFromおよびshapeToのshapeを blockwise broadcastするには、次の手順を実行する。 shapeFromおよびshapeToは、テンソルの次元を表す正の整数のlistであり、この手順はtrueまたはfalseを返す。

shapeFromのsizeがshapeToのsizeと等しくない場合、falseを返す。
0からshapeToのsizeまで（含まない）の範囲内の各indexについて実行する:
1. shapeFrom[index]がshapeTo[index]を正確に割り切らない場合、falseを返す。
trueを返す。

shapeFromおよびshapeToをblockwise broadcastingすることがtrueを返す場合、shapeFromはshapeToへ blockwise broadcast可能である。

9.2. Casting

明示的な数値castingは、MLNumber またはdouble として渡されたパラメーターを、入力または出力MLOperandの MLOperandDataType に一致するように変換する必要があるアルゴリズムで使用される。

数値xを与えられたMLOperandDataType dataTypeへcastするには、次の手順を実行する。これらは数値を返す。

dataType に応じて分岐する:

"float32"

ConvertToFloat(x, 32) を返す。

"float16"

ConvertToFloat(x, 16) を返す。

"int64"

ConvertToInt(x, 64, "signed") を返す。

"uint64"

ConvertToInt(x, 64, "unsigned") を返す。

"int32"

ConvertToInt(x, 32, "signed") を返す。

"uint32"

ConvertToInt(x, 32, "signed") を返す。

"int8"

ConvertToInt(x, 8, "signed") を返す。

"uint8"

ConvertToInt(x, 8, "unsigned") を返す。

注記: castへの入力は、無制限の範囲と精度を持つ抽象数値であり、 special valuesであるInfinity、-InfinityおよびNaNを含む。出力も抽象数値であるが、指定された型として正確に表現可能である。

ConvertToFloat(x, bitLength)の手順は次のとおりである:

xがNaNである場合、NaNを返す。
bitLengthでswitchする:
32
1. upperBoundを2¹²⁸とする。
2. lowerBoundを-2¹²⁸とする。
3. Sを、-0を除く[IEEE-754-2019] binary32浮動小数点値の集合とする。ただしspecial valuesである upperBoundおよびlowerBoundを追加する。
16
1. upperBoundを2¹⁶とする。
2. lowerBoundを-2¹⁶とする。
3. Sを、-0を除く[IEEE-754-2019] binary16浮動小数点値の集合とする。ただしspecial valuesである upperBoundおよびlowerBoundを追加する。
yを、S内でxに最も近い数値とする。2つの等しく近い値がある場合は、 significandが偶数である数値を選択する。この目的のため、2つのspecial values lowerBoundおよびupperBoundは偶数のsignificandsを持つと見なされる。
yがupperBoundである場合、+Infinityを返す。
yがlowerBoundである場合、-Infinityを返す。
yが+0であり、xが負である場合、-0を返す。
yを返す。

注記: これは[WEBIDL]の定義に基づくが、 16-bit浮動小数点値をcoverするように拡張されている。

ConvertToInt(x, bitLength, signedness) の手順は次のとおりである:

signednessが"unsigned"である場合:
1. lowerBoundを0とする。
2. upperBoundを2^bitLength - 1とする。
そうでない場合:
1. lowerBoundを-(2^{bitLength - 1})とする。
2. upperBoundを2^{bitLength - 1} - 1とする。
xが-0である場合、xを+0に設定する。
xがNaNである場合、+0を返す。
xをmin(max(x, lowerBound), upperBound)に設定する。
xを最も近い整数へroundし、2つの整数のちょうど中間にある場合は偶数の整数を選択し、 -0ではなく+0を選択する。
xを返す。

注記: これは[WEBIDL]の定義に基づくが、次の違いがある: 64-bit整数は特別に扱われず、入力xは抽象数値であり、clampingは常に実行される。

9.3. その他

list Aは、AのsizeがBの sizeと等しく、 A内の各itemがB内の同じindexにあるitemと等しい場合、 list Bと等しい。

[INFRA]に定義が利用可能になったら、これを削除すること。[whatwg/infra Issue #664]

10. 例

次のbuild graphが与えられた場合:

constant1 ---+
             +--- Add ---> intermediateOutput1 ---+
input1    ---+                                    |
                                                  +--- Mul---> output
constant2 ---+                                    |
             +--- Add ---> intermediateOutput2 ---+
input2    ---+

次のcodeはこのgraphを実装する:

// Use tensors in 4 dimensions.
const TENSOR_SHAPE = [1, 2, 2, 2];
const TENSOR_SIZE = 8;

const context = await navigator.ml.createContext();
const builder = new MLGraphBuilder(context);

// Create MLOperandDescriptor object.
const desc = {
  dataType: 'float32',
  shape: TENSOR_SHAPE
};

// constant1 is a constant MLOperand with the value 0.5.
const constantBuffer1 = new Float32Array(TENSOR_SIZE).fill(0.5);
const constant1 = builder.constant(desc, constantBuffer1);

// input1 is one of the input MLOperands. Its value will be set before
// execution.
const input1 = builder.input('input1', desc);

// constant2 is another constant MLOperand with the value 0.5.
const constantBuffer2 = new Float32Array(TENSOR_SIZE).fill(0.5);
const constant2 = builder.constant(desc, constantBuffer2);

// input2 is another input MLOperand. Its value will be set before execution.
const input2 = builder.input('input2', desc);

// intermediateOutput1 is the output of the first Add operation.
const intermediateOutput1 = builder.add(constant1, input1);

// intermediateOutput2 is the output of the second Add operation.
const intermediateOutput2 = builder.add(constant2, input2);

// output is the output MLOperand of the Mul operation.
const output = builder.mul(intermediateOutput1, intermediateOutput2);

11. 演算子エミュレーション

このsectionは非規範的である。

他のneural network inference APIに存在する演算は、多くの場合、WebNNに存在する演算を使用してemulateできる。

11.1. squeeze

squeeze演算は、 size 1である入力の指定されたすべての次元を削除したテンソルを返す。これは、次のようにreshape() 演算を使用して一般的に実装できる:

function squeeze(builder, input, axes) {
  if (!axes)
    axes = [];
  if (!axes.length)
    input.shape.forEach((item, i) => {
      axes.push(i);
    });
  const shape = Array.from(input.shape);
  for (let axis of axes.sort().reverse())
    if (axis < shape.length && shape[axis] == 1)
      shape.splice(axis, 1);
  return builder.reshape(input, shape);
}

11.2. unsqueeze

unsqueeze演算は、指定された位置にsize 1の次元が挿入された新しいテンソルを返す。これは、次のようにreshape() 演算を使用して一般的に実装できる:

function unsqueeze(builder, input, axes) {
  const shape = Array.from(input.shape);
  for (let axis of axes.sort())
    shape.splice(axis, 0, 1);
  return builder.reshape(input, shape);
}

11.3. flatten

flatten演算は、入力を1次元テンソルにreshapeする。これは、次のようにreshape() 演算を使用して一般的に実装できる:

function flatten(builder, input, axis) {
  if (axis > input.shape.length)
    return input;
  const before = axis.slice(0, axis).reduce((a, b) => a * b, 1);
  const after = axis.slice(axis, input.shape.length).reduce((a, b) => a * b, 1);
  return builder.reshape(input, [before, after]);
}

12. 付録

12.1. `MLOperandDataType` と`ArrayBufferView` の互換性

`MLOperandDataType`	`ArrayBufferView`
`float32`	`Float32Array`
`float16`	`Float16Array`
`int64`	`BigInt64Array`
`uint64`	`BigUint64Array`
`int32`	`Int32Array`
`uint32`	`Uint32Array`
`int8`	`Int8Array`
`uint8`	`Uint8Array`

Float16Array はECMA Stage 3にあり、そのdesignが完了していることを示している。 native implementationsに先立ってこの型を有効にしたい実装者は、Uint16Array を介してraw bitsを渡すことにより、この型をemulateできる。 [Issue webnn#373]

13. 謝辞

この仕様はAndroid Neural Networks API C APIの概念に従っている。

use casesについて、Tomoyuki Shimizu、Ningxin Hu、Zhiqiang YuおよびBelem Zhangに感謝する。

API仕様への貢献について、Nikhil Thorat、Daniel Smilkov、Ganesan Ramalingam、Rafael Cintronおよび Benjamin Poulainに感謝する。

web architecture fit、design consistencyおよびdeveloper ergonomicsについてこの仕様をreviewしてくれた Sangwhan MoonおよびW3C Technical Architecture Groupに感謝する。

アルゴリズムを追加し、この仕様のnavigationを快適な体験にしてくれたZoltan Kisに感謝する。仕様をmodern editorial conventionsに合わせてくれたJoshua Bellに感謝する。注意深いreviewおよびcommentsについて、 Ningxin Hu、Lisha Guo、Shiyi Zou、Mingming Xu、Junwei Fu、Bruce DaiおよびBin Miaoに感謝する。

privacy and security reviewおよびfeedbackについて、W3C Privacy Interest Groupに感謝する。

security reviewおよびquestionsについて、Alex GoughおよびChrome Security teamに感謝する。

ONNXからの実践的なguidelinesおよびlearningsを共有してくれたMichal Karzynskiに感謝する。

feedbackおよびprivacy considerationsについて、Kaustubha GovindおよびChrome privacy reviewersに感謝する。

Chromium implementation reviewおよびfeedbackについて、Jiewei Qianに感謝する。

transformer supportの調査およびrecommendationの提供に関する作業について、Dwayne Robinson、Joshua LochnerおよびWanming Linに感謝する。 operator conformanceおよびweb-platform-tests implementationのreviewsを提供してくれたDwayneおよびWanmingにも追加で感謝する。

web-platform-testsを仕様とともに進化させ続ける継続的な貢献について、Feng Daiに感謝する。

reviewsおよびsuggestionsについて、Fuqiao XueおよびW3C Internationalization Activityに感謝する。

14. 変更

このsectionは非規範的である。

このsectionは、Classes of Changesに基づき、前回のmajor publication以降にこの仕様へ行われた変更を記録する。

Candidate Recommendation Snapshot 11 April 2024と22 January 2026の間の詳細な変更

新機能（class 4）

dequantizeLinear、quantizeLinear、およびattention演算を含む新しい演算子でoperator set "wave 3"を拡張 (#805)
WebNNとWebGPUの間のbuffer sharing、および複数のMLGraphsにまたがるreuseのためのinterfaceであるMLTensor APIを追加 (#787)
NaNおよびinfinite valuesをcheckするための新しいelement-wise演算子、isNaNおよびisInfinite演算子を追加 (#858)
banker’s roundingを使用する新しいrounding演算子、roundEven演算子を追加 (#859)
ML acceleratorsを選択するための簡単なmechanismであるaccelerator selection mechanismを追加 (#895)
windowおよびdedicated workerを超えてWebNN availabilityを拡張するため、WebNN APIをshared workersおよびservice workersに公開 (#823)
より診断可能なerror messagesのために、任意のoperator labelsを追加 (#742)
任意の型のnumeric inputsを指定するための統一型、MLNumberを導入 (#647)
support limitsでrank rangesを指定できるようにするため、opSupportLimits()にrankRangeを追加 (#828)
rankRange supportをoutput tensorsへ拡張するため、op output tensorsでrankRangeをサポート (#857)
~~MLDeviceType npu、Neural Processing Unit（NPU）device typeを追加 (#696)~~ - このfeatureは前回のpublication以降に追加され、device selectionを簡素化するため削除された。#809を参照
MLContextおよびMLGraphにdestroy() methodsを追加し、context loss behaviorおよびerror reportingを指定 (#744)
softmax演算に任意のaxis parameterを追加 (#649)
output data typeの指定をサポートするため、argmin/argmaxにoutputDataTypeを追加 (#730)
resample2dを任意のaxesで動作するように一般化するため、resample2dに任意のaxesを許可 (#752)
resampleを8-bit integer typesに対応させるため、Resample data type uint8/int8を追加 (#891)
conv2dおよびpool2d演算のoperand layout supportを簡素化し、pool2dからMLRoundingTypeを削除し、 layout supportを簡素化 (#770)
backend limitsにより適合するようpadding optionsを制限 (#843)

新機能を追加しないその他の変更（class 3）

MLTensor APIを優先し、MLContext.compute() methodを削除 (#795)
device type enumerationを削除してdevice selectionを簡素化するため、MLDeviceTypeを削除 (#809)
MLOperand methodsをreadonly attributesへ変換し、dataType()およびshape()をmethodsからattributesへ変更 (#774)
MLOperandDescriptor.shapeをrequired propertyに変更 (#764)
build methodをsingle invocationに制限するため、MLGraphBuilder.build()を1回だけ呼び出せるようにする (#717)
sequence-based constant creationを削除するため、constant()のfillSequence overloadを削除 (#656)
consistencyのためにparameterを並べ替えるため、scalar constant() operand methodのparametersを入れ替え (#650)
argmin/argmax APIを簡素化するため、argmin/argmaxのselectLastIndex parameterを削除 (#722)
API全体のconsistencyのため、cast/constant parameter typeをdataTypeにrename (#888)
recurrent network activations用のより具体的な型として、MLActivationをMLRecurrentNetworkActivationに置き換え (#718)
より良いUnicode supportのため、DOMStringをUSVStringに変更 (#715)
clarityのためにparameter namingを改善するようwhereのparameter namesをrename (#719)
consistencyのため、MLLstmCellSupportLimitsのmember outputをoutputsにrename (#757)
適切なpromise rejectionのため、destroyされたMLTensor上のin-progress operationsのpromisesをreject (#799)
data type validation rulesを可能にするため、operationsのoperand data type constraintsを指定 (#646)
validation improvementsのため、いくつかのopsにmissing validation stepsを追加 (#820)
array operationsのvalidationを強化するため、pad()、slice()、およびsplit()にmissing validationを追加 (#690)
recurrent network validation改善のため、GRU/LSTMのvalidationを簡素化、修正、および追加 (#659)
GRUおよびLSTM operatorsのhidden sizeをvalidate (#644)
transposed convolutionのより厳密なvalidationのため、convTranspose2dにおけるoutput paddingの制限をvalidate (#631)
gather operation validationを強化 (#642)
一般的なvalidation improvements (#643)
resource validationを改善 (#622)
buffer transfersのerrorsを適切に扱うため、MLNamedArrayBufferViews transfer algorithmのerror handlingを定義 (#723)
dimension valid rangeをsigned integerに更新 (#738)
dimension validityを形式化するため、"valid dimension" conceptを導入 (#641)
object creationで適切なrealm handlingを行うため、object creationがrealmを指定することを確保 (#810)
division operator roundingを明確化 (#909)
pad scalar inconsistencyを修正 (#894)
split opのopSupportLimits errorを修正 (#776)
softmax() axis argumentで適切なrange enforcementのため、EnforceRangeを使用 (#746)
conv2d algorithmsにinputShapeのmissing definitionsを追加 (#680)
unidirectionally broadcast shapes stepsを修正 (#663)
buffer transferringが失敗した場合のcompute() promise rejection behaviorを修正 (#639)
ArrayBufferView compatibility tableを64-bit integer typesで更新 (#698)

文書の解釈に機能的な影響を与えない変更（class 2）

validationへの体系的なapproachのため、operand data typeおよびrank validationをtable-drivenに変更 (#657)
category別のoperatorsの非規範的tableを追加 (#868)
異なるdata types間のcast() op behaviorを明確化 (#726)
resample2dのinterpolation algorithmsを明確化 (#816)
edge casesを明確化するため、empty axesおよびscalar inputを伴うreductionを明確化 (#741)
no-op graphsに関するnoteを追加 (#665)
reduction operation behaviorを明確化するため、reduction opsのkeepDimensionsに関するnoteを追加 (#648)
emulation documentationを再編成 (#598)
reduceLogSum、reduceLogSumExp、およびreduceSumSquareのdecompositionsを追加 (#637)
clamp() minValue == maxValueに関するinterop issuesについてのobsolete noteを削除 (#684)
specification boilerplate metadata informationを更新 (#769)

Horizontal（class 2およびclass 3）

architectural resource contention considerationsを追加 (#765)
Unicodeに関するsecurity considerationsを追加 (#851)
computation control-flow attacksに関するsecurity considerationを追加 (#725)
privacy considerationsを改訂 (#890)
opSupportLimits() fingerprintingに関するprivacy considerationsを追加 (#881)
accessibility considerationsを追加 (#869)
label usageに関するinternationalization noteを追加 (#841)

Editorial（class 2）

さまざまなeditorial improvements (#834)
さまざまなstyleおよびwording tweaks (#797)
grammarおよびspelling corrections (#782)
helpers algorithmsでspecification stepsを簡素化 (#737)
type referencesを改善 (#735)
WebIDL transferable definitionを参照 (#732)
terminology usage改善のため、proseで"sequence"を避ける (#729)
lintingおよびvalidation強化のため、algorithm stepsをvalidatingするlogicを改善 (#727)
cross-references改善のため、method argument definitionsをlink (#721)
organization改善のため、不要なsubsectionsを削除 (#711)
adjectiveが意図される場合に"empty"ではなく"is empty"へlink (#708)
この仕様のauthoringおよびreviewを容易にするutilitiesを追加 (#702)
"transferred" cross-referenceを修正 (#679)
authoring experienceを改善するため、"generically emulated" textをmacro化 (#638)
'backward'および'both' directionsによるLSTMのemulation errorを修正 (#802)
'backward'および'both' directionsによるGRUのemulation errorを修正 (#803)
decompositions内のtypos/JS errorsを修正 (#699)

WebニューラルネットワークAPI

概要

この文書の位置付け

1. はじめに

2. ユース ケース

2.1. アプリケーションユースケース

2.1.1. 人物検出

2.1.2. セマンティックセグメンテーション

2.1.3. 骨格検出

2.1.4. 顔認識

2.1.5. 顔ランドマーク検出

2.1.6. スタイル転送

2.1.7. 超解像

2.1.8. 画像キャプション生成

2.1.9. テキストから画像への生成

2.1.10. 機械翻訳

2.1.11. 感情分析

2.1.12. 動画要約

2.1.13. ノイズ抑制

2.1.14. 音声認識

2.1.15. テキスト生成

2.1.16. フェイク動画の検出

2.2. フレームワークユースケース

2.2.1. カスタムレイヤー

2.2.2. ネットワーク結合

2.2.3. 性能適応

2.2.4. 演算レベル実行

2.2.5. リアルタイム動画処理との統合

3. アクセシビリティの考慮事項

4. セキュリティの考慮事項

4.1. 新しい演算に関するガイドライン

5. プライバシーの考慮事項

5.1. フィンガープリンティング

5.2. 実行時間分析

5.3. WebGPUとの比較

6. 倫理的考慮事項

7. プログラミングモデル

7.1. 概要

7.2. デバイス選択

7.3. 演算子

7.4. タスクソース

7.5. 権限ポリシー統合

8. API

8.1. navigator.mlインターフェイス

8.2. ML インターフェイス

8.2.1. MLContextOptions

8.2.2. createContext()

8.3. MLContext インターフェイス

8.3.1. dispatch()

8.3.1.1. 例

8.3.2. createTensor()

8.3.3. createConstantTensor()

8.3.4. readTensor(tensor)

8.3.5. readTensor(tensor, outputData)

8.3.6. writeTensor()

8.3.7. opSupportLimits()

8.3.7.1. MLOpSupportLimits 辞書

8.3.7.2. MLRankRange 辞書

8.3.7.3. MLTensorLimits 辞書

8.3.7.4. MLBinarySupportLimits 辞書

8.3.7.5. MLSingleInputSupportLimits 辞書

8.3.8. destroy()

8.3.9. エラー

8.4. MLGraph インターフェイス

8.4.1. destroy()

8.5. MLOperandDescriptor 辞書

8.6. MLOperand インターフェイス

8.6.1. MLOperandの作成

8.6.1.1. MLNumber

8.7. MLTensorDescriptor 辞書

8.8. MLTensor インターフェイス

8.8.1. MLTensorの作成

8.8.2. destroy()

8.8.3. 定数MLTensorの作成

8.9. MLGraphBuilder インターフェイス

8.9.1. MLGraphBuilder コンストラクター

8.9.2. 入力オペランド

8.9.3. 定数オペランド

8.9.3.1. constant(descriptor, buffer)

8.9.3.2. constant(tensor)

2. ユースケース

8.2. `ML` インターフェイス

8.2.1. `MLContextOptions`

8.2.2. `createContext()`

8.3. `MLContext` インターフェイス

8.3.1. `dispatch()`

8.3.2. `createTensor()`

8.3.3. `createConstantTensor()`

8.3.4. `readTensor(tensor)`

8.3.5. `readTensor(tensor, outputData)`

8.3.6. `writeTensor()`

8.3.7. `opSupportLimits()`

8.3.7.1. `MLOpSupportLimits` 辞書

8.3.7.2. `MLRankRange` 辞書

8.3.7.3. `MLTensorLimits` 辞書

8.3.7.4. `MLBinarySupportLimits` 辞書

8.3.7.5. `MLSingleInputSupportLimits` 辞書

8.3.8. `destroy()`

8.4. `MLGraph` インターフェイス

8.4.1. `destroy()`

8.5. `MLOperandDescriptor` 辞書

8.6. `MLOperand` インターフェイス

8.6.1. `MLOperand`の作成

8.6.1.1. `MLNumber`

8.7. `MLTensorDescriptor` 辞書

8.8. `MLTensor` インターフェイス

8.8.1. `MLTensor`の作成

8.8.2. `destroy()`

8.8.3. 定数`MLTensor`の作成

8.9. `MLGraphBuilder` インターフェイス

8.9.1. `MLGraphBuilder` コンストラクター

8.9.3.1. `constant(descriptor, buffer)`

8.9.3.2. `constant(tensor)`

8.9.3.3. `constant(dataType, value)`

12.1. `MLOperandDataType` と`ArrayBufferView` の互換性