Эта страница переведена с помощью Cloud Translation API.

'сди' Диалект

Шарди-диалект (SDY)

Диалект Shardy (SDY) определяет представление сегментации тензоров на основе осей, а также дополнительные компоненты API для привязки сегментов к тензорам.

Журнал версий: 0.0.1: Добавлены неуменьшенные оси в TensorShardingAttr.

Операции

`sdy.all_gather` (sdy::AllGatherOp)

Осуществляет всестороннюю связь по осям.

Синтаксис:

operation ::= `sdy.all_gather` $gathering_axes $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

Собирает фрагменты тензора вдоль осей, указанных в gathering_axes .

Параметр gathering_axes представляет собой список списков осей. Внешний список охватывает измерения тензора. Каждый внутренний список указывает оси, вдоль которых следует выполнить отдельную операцию gather для соответствующего измерения. Она будет применена к сегментации операнда ( tensor ) для получения сегментации результата ( out_sharding ).

Обратите внимание, что out_sharding не используется для определения сегментации результата. Вместо этого сегментация результата определяется сегментацией операнда и параметра gathering_axes , и out_sharding должен соответствовать этой выведенной сегментации.

Пример:

%1 = stablehlo.tanh(%0) {sdy.sharding = #sdy.sharding_per_value<[<@mesh, [{"a", "b", "c"}, {}, {"d"}\]>]>} : tensor<8x8x8xf32>
%2 = sdy.all_gather [{"b", "c"}, {}, {"d"}\] %1 out_sharding=<@mesh, [{"a"}, {}, {}\]> : tensor<8x8x8xf32>

Ограничения:

Необходимо соблюдать ограничения, указанные в Sdy_CollectiveOpInterface .
Элементы в gathering_axes должны удовлетворять ограничениям, перечисленным в AxisRefListAttr .
Применение gathering_axes к операнду sharding приводит к получению out_sharding .

Характеристики: SameOperandsAndResultType

Интерфейсы: InferTypeOpInterface , Sdy_CollectiveOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	Тип MLIR	Описание
`gathering_axes`	::mlir::sdy::ListOfAxisRefListsAttr	Список ссылок на оси
`out_sharding`	::mlir::sdy::TensorShardingAttr	Разделение тензоров

Операнды:

Операнд	Описание
`tensor`	имеет форму любых значений, не являющихся токенами.

Результаты:

Результат	Описание
`result`	имеет форму любых значений, не являющихся токенами.

`sdy.all_reduce` (sdy::AllReduceOp)

Выполните обмен данными по всем осям путем редукции.

Синтаксис:

operation ::= `sdy.all_reduce` ($reduction_op^)? $reduction_axes $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

Уменьшает фрагменты тензора вдоль осей, указанных в reduction_axes . Порядок параметров reduction_axes не важен для результата, но может влиять на порядок соответствующих групп реплик.

Ограничения:

Необходимо соблюдать ограничения, указанные в Sdy_CollectiveOpInterface .
reduction_axes должен удовлетворять ограничениям, перечисленным в AxisRefListAttr .
Сортировка reduction_axes должна производиться относительно сетки.
Оперы sharding и out_sharding должны иметь эквивалентные размерности сегментирования.
reduction_axes не должна перекрываться с осью сегментирования и реплицированными осями операнда (она может перекрываться с неуменьшенными осями).
reduction_axes не должны перекрываться с неуменьшенными осями out_sharding ). Другими словами, out_sharding должен быть продублирован вдоль reduction_axes ) (явно или неявно).

Характеристики: SameOperandsAndResultType

Интерфейсы: CollectiveOpInterface , InferTypeOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	Тип MLIR	Описание
`reduction_axes`	::mlir::sdy::AxisRefListAttr	Список ссылок на оси
`reduction_op`	::mlir::sdy::ReductionOpAttr	сокращение открытого перечисления
`out_sharding`	::mlir::sdy::TensorShardingAttr	Разделение тензоров

Операнды:

Операнд	Описание
`tensor`	имеет форму любых значений, не являющихся токенами.

Результаты:

Результат	Описание
`result`	имеет форму любых значений, не являющихся токенами.

`sdy.all_slice` (sdy::AllSliceOp)

Выполняет операцию динамического разрезания вдоль осей.

Синтаксис:

operation ::= `sdy.all_slice` $slicing_axes $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

Функция sdy.all_slice разделяет тензор на фрагменты вдоль осей, указанных в slicing_axes . Между sdy.all_slice и sdy.all_gather существует алгебраическая двойственность.

slicing_axes — это список списков осей. Внешний список охватывает измерения тензора. Каждый внутренний список указывает оси, вдоль которых следует выполнить срез по соответствующему измерению. Он будет применен к сегментированию операнда ( tensor ) для получения сегментирования результата ( out_sharding ).

Обратите внимание, что out_sharding не используется для определения сегментации результата. Вместо этого сегментация результата определяется сегментацией операнда и slicing_axes , и out_sharding должен соответствовать этой выведенной сегментации.

Пример:

%1 = stablehlo.tanh(%0) {sdy.sharding = #sdy.sharding_per_value<[<@mesh, [{"a"}, {}, {}\]>]>} : tensor<8x8x8xf32>
%2 = sdy.all_slice [{"b", "c"}, {}, {"d"}\] %1 out_sharding=<@mesh, [{"a", "b", "c"}, {}, {"d"}\]> : tensor<8x8x8xf32>

Ограничения:

Необходимо соблюдать ограничения, указанные в Sdy_CollectiveOpInterface .
Элементы в slicing_axes должны удовлетворять ограничениям, перечисленным в AxisRefListAttr .
Применение slicing_axes к операнду sharding дает значение out_sharding .

Характеристики: SameOperandsAndResultType

Интерфейсы: CollectiveOpInterface , InferTypeOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	Тип MLIR	Описание
`slicing_axes`	::mlir::sdy::ListOfAxisRefListsAttr	Список ссылок на оси
`out_sharding`	::mlir::sdy::TensorShardingAttr	Разделение тензоров

Операнды:

Операнд	Описание
`tensor`	имеет форму любых значений, не являющихся токенами.

Результаты:

Результат	Описание
`result`	имеет форму любых значений, не являющихся токенами.

`sdy.all_to_all` (sdy::AllToAllOp)

Осуществляет связь «все со всеми» по осям.

Синтаксис:

operation ::= `sdy.all_to_all` $params $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

Для каждого кортежа (axes, src_dim, tgt_dim) в списке параметров эта операция разбивает тензор на фрагменты вдоль размерности tgt_dim и осей, указанных в axes , распределяет эти фрагменты вдоль осей и объединяет их вдоль размерности src_dim .

Эта операция по сути представляет собой комбинацию операции сбора данных вдоль src_dim и axes , за которой следует операция разрезания данных вдоль tgt_dim и axes , то есть к размерности сегментирования осей src_dim входного тензора добавляется суффикс размерности сегментирования осей tgt_dim на выходном тензоре.

Метод all-to-all будет применен к сегментации операнда ( tensor ) для получения сегментации результата ( out_sharding ).

Обратите внимание, что out_sharding не используется для определения сегментации результата. Вместо этого сегментация результата определяется сегментацией операнда, src_dim , tgt_dim и axes , и out_sharding должен соответствовать этой предполагаемой сегментации.

Пример:

%1 = stablehlo.tanh(%0) {sdy.sharding = #sdy.sharding_per_value<[<@mesh, [{"a", "b"}, {"c"}, {}, {}\]>]>} : tensor<8x8x4x4x32>
%2 = sdy.all_to_all [{"b"}: 0->2, {"c"}: 1->3] %1 out_sharding=<@mesh, [{"a"}, {}, {"b"}, {"c"}\]> : tensor<8x8x4x4x32>

Ограничения:

Необходимо соблюдать ограничения, указанные в Sdy_CollectiveOpInterface .
Список параметров не должен быть пустым.
Для каждого параметра в params :
- Элементы в axes должны удовлетворять ограничениям, заданным в AxisRefAttr .
- src_dim и tgt_dim должны быть допустимыми размерностями (неотрицательными и меньше ранга тензора).
- Любое src_dim или tgt_dim должно быть уникальным для всех параметров.
- src_dim должно быть отсортировано в порядке возрастания по всем параметрам.
Перемещение axes из src_dim в tgt_dim в операнде sharding приводит к out_sharding .

Характеристики: SameOperandsAndResultType

Интерфейсы: InferTypeOpInterface , Sdy_CollectiveOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	Тип MLIR	Описание
`params`	::mlir::sdy::AllToAllParamListAttr	Список параметров «все ко всем»
`out_sharding`	::mlir::sdy::TensorShardingAttr	Разделение тензоров

Операнды:

Операнд	Описание
`tensor`	имеет форму любых значений, не являющихся токенами.

Результаты:

Результат	Описание
`result`	имеет форму любых значений, не являющихся токенами.

`sdy.collective_permute` (sdy::CollectivePermuteOp)

Выполняет коллективно-перестановочную коммуникацию для замены осей.

Синтаксис:

operation ::= `sdy.collective_permute` $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

Отправляет фрагмент входного тензора с каждого устройства на другое для переупорядочивания/замены осей, которые разделяют тензор.

Коллективная перестановка может преобразовать входное сегментирование таким образом, что каждое измерение должно быть сегментировано так же, как и раньше, то есть оно должно быть сегментировано вдоль осей, произведение размеров которых совпадает с произведением размеров осей, которые ранее сегментировали тензор.

Это полезно для изменения порядка осей в одном измерении или в разных измерениях, а также для замены сегментированных осей на реплицированные.

В приведенном ниже примере размер сегментированного тензора равен tensor<1x4x2xf32> , и это сохраняется при коллективной перестановке.

Пример:

sdy.mesh @mesh = <["a"=2, "b"=2, "c"=4, "d"=2, "e"=2, "f"=2]>
%1 = stablehlo.tanh(%0) {sdy.sharding = #sdy.sharding_per_value<[<@mesh, [{"a", "c"}, {"f"}, {"d", "e"}\]>]>} : tensor<8x8x8xf32>
%2 = sdy.collective_permute %1 out_sharding=<@mesh, [{"c":(1)2, "b", "f"}, {"a"}, {"e", "d"}\]> : tensor<8x8x8xf32>

Ограничения:

Необходимо соблюдать ограничения, указанные в Sdy_CollectiveOpInterface .
Если входные и выходные сегменты имеют разные сетки, то эти сетки должны иметь абсолютно одинаковые оси и разный порядок идентификаторов устройств.
Для каждого измерения произведение размеров осей сегментирования в out_sharding должно совпадать с размером сегментирования соответствующего операнда измерения.

Характеристики: SameOperandsAndResultType

Интерфейсы: CollectiveOpInterface , InferTypeOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	Тип MLIR	Описание
`out_sharding`	::mlir::sdy::TensorShardingAttr	Разделение тензоров

Операнды:

Операнд	Описание
`tensor`	имеет форму любых значений, не являющихся токенами.

Результаты:

Результат	Описание
`result`	имеет форму любых значений, не являющихся токенами.

`sdy.constant` (sdy::ConstantOp)

Постоянная работа

Создает output тензор из постоянного value .

См.: https://github.com/openxla/stablehlo/blob/main/docs/spec.md#constant

Пример:

%output = sdy.constant dense<[[0.0, 1.0], [2.0, 3.0]]> : tensor<2x2xf32>

Характеристики: AlwaysSpeculatableImplTrait

Интерфейсы: ConditionallySpeculatable , InferTypeOpInterface , NoMemoryEffect (MemoryEffectOpInterface)

Эффекты: MemoryEffects::Effect{}

Атрибуты:

Атрибут	Тип MLIR	Описание
`value`	::mlir::ElementsAttr	постоянный векторный/тензорный атрибут

Результаты:

Результат	Описание
`output`	тензор статически заданной формы, содержащий любые значения, не являющиеся токенами.

`sdy.data_flow_edge` (sdy::DataFlowEdgeOp)

Операция на границе потока данных.

Синтаксис:

operation ::= `sdy.data_flow_edge` $input (`sharding````=``` $sharding^)? attr-dict `:` type($result)

Ребро потока данных некоторой операции X определяет мост между набором источников (каждый из которых является либо операндом X, либо операндом терминатора блока X) и набором целей (каждая из которых является либо результатом X, либо аргументом блока X), таким образом, что все источники и цели должны быть разделены одинаковым образом.

Операция может иметь несколько ребер потока данных, ортогональных друг другу.

Например:

  y_0, ..., y_n = while (x_0, ..., x_n)
                  ((pred_arg_0,... , pred_arg_n) { ... })
                  ((body_arg_0,..., body_arg_n) {
                    ...
                    return return_value_0, ..., return_value_n
                  })

В то время как операция имеет n ребер потока данных, i-е ребро потока данных находится между источниками x_i , return_value_i и целями y_i , pred_arg_i , body_arg_i .

Функция sdy.data_flow_edge принимает на вход владельца ребра (это может быть любой из целевых объектов, но предпочтительнее результат операции, а не аргумент блока), который не должен иметь других применений. Эта операция не является чистой, поскольку может принимать на вход данные, которые изначально не имели никаких применений.

Объект sdy.data_flow_edge также содержит необязательный сегмент для всех целевых объектов ребра, и этот сегмент должен обновляться вместо сегмента целевых объектов (если его можно прикрепить) во время распространения. Это полезно, когда операция имеет много ребер, поскольку гораздо эффективнее:

распространяться по каждому краю отдельно.
Обновляйте сегментацию каждого ребра отдельно, а не всех целей одновременно (например, операция имеет один неизменяемый TensorShardingPerValueAttr для сегментации результатов).
Добавляйте каждое ребро в список задач отдельно при изменении сегментации источника.

В результате операции шардирование будет происходить между всеми источниками и целями sdy.data_flow_edge так же, как если бы это была обычная операция, где источники являются операндами, а цели — результатами, и используется правило идентичности sdy.op_sharding_rule . Это означает, что прямое распространение происходит от источников к целям, а обратное — от целей к источникам.

Мы не разрешаем определять входные данные для sdy.data_flow_edge операцией SdyDialect , поэтому можем предположить, что они определяются операцией с незарегистрированным атрибутом sdy.sharding .

Характеристики: SameOperandsAndResultType

Интерфейсы: InferTypeOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	Тип MLIR	Описание
`sharding`	::mlir::sdy::TensorShardingAttr	Разделение тензоров

Операнды:

Операнд	Описание
`input`	имеет форму любых значений, не являющихся токенами.

Результаты:

Результат	Описание
`result`	имеет форму любых значений, не являющихся токенами.

`sdy.func_data_flow_edge` (sdy::FuncDataFlowEdgeOp)

Функция обработки входных/выходных данных.

Синтаксис:

operation ::= `sdy.func_data_flow_edge` $operand attr-dict `:` type($result)

Операция, использующая ребро потока данных, но для аргументов функций или результатов вызовов. Когда её операндом является BlockArgument, она служит мостом от аргумента вызывающей функции к пользователям аргумента функции. Для каждого аргумента функции существует одно ребро потока данных. Когда её операндом является OpResult, она служит мостом от возвращаемого значения вызываемой функции к пользователям результата вызова. Для каждого результата вызова существует одно ребро потока данных.

Характеристики: SameOperandsAndResultType

Интерфейсы: InferTypeOpInterface , SymbolUserOpInterface

Операнды:

Операнд	Описание
`operand`	имеет форму любых значений, не являющихся токенами.

Результаты:

Результат	Описание
`result`	имеет форму любых значений, не являющихся токенами.

`sdy.manual_computation` (sdy::ManualComputationOp)

Параллельная работа нескольких устройств с ручным объединением операций.

Синтаксис:

operation ::= `sdy.manual_computation` `(`operands`)`
              `in_shardings````=```custom<StrippedTensorShardingPerValueAttr>($in_shardings)
              `out_shardings````=```custom<StrippedTensorShardingPerValueAttr>($out_shardings)
              `manual_axes````=```$manual_axes
              custom<SingleBlockRegionNoBlockId>($body)
              attr-dict
              `:`
              functional-type(operands, results)

Перейдите в область, описанную с использованием локального кода для каждого устройства с явными коллективными операциями, где логические формы соответствуют локальным физическим формам буферов для каждого устройства, а коллективные операции точно соответствуют физической связи между устройствами.

Тело графика является локальным относительно списка manual_axes. Распространение будет происходить через тело графика по любым свободным осям — тем, которые не входят в список manual_axes.

Обратите внимание, что для любых неранжированных тензоров ожидается сегментирование с рангом 0, то есть полное дублирование.

Ограничения:

Элементы во in_shardings и out_shardings должны удовлетворять ограничениям, перечисленным в TensorShardingAttr .
Количество глобальных и локальных тензорных входов/выходов в области операции должно совпадать.
В каждом сегменте данных оси, заданные вручную, должны располагаться перед любыми свободными осями.
В ручных настройках осей нельзя добавлять отступы. А именно, размер габарита должен быть кратен соответствующему размеру, заданному в ручных настройках.
Глобальная и локальная формы областей действия аргументов/результатов должны совпадать.

Характеристики: IsolatedFromAbove , RecursiveMemoryEffects , SingleBlockImplicitTerminator<ReturnOp> , SingleBlock

Интерфейсы: ShardableDataFlowOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	Тип MLIR	Описание
`in_shardings`	::mlir::sdy::TensorShardingPerValueAttr	Разделение тензора на операнды/результаты операций
`out_shardings`	::mlir::sdy::TensorShardingPerValueAttr	Разделение тензора на операнды/результаты операций
`manual_axes`	::mlir::sdy::ManualAxesAttr	Список осей, по которым операция ManualComputationOp выполняется вручную.

Операнды:

Операнд	Описание
`tensors`	вариативный любого нетокенного типа

Результаты:

Результат	Описание
`results`	вариативный любого нетокенного типа

`sdy.mesh` (sdy::MeshOp)

Именованная сетка

Синтаксис:

operation ::= `sdy.mesh` $sym_name `=` $mesh attr-dict

Определяет новую именованную сетку. Все сетки в модуле должны иметь одинаковое количество устройств (за исключением сеток с одним device_id). Сетка представляет собой операцию Symbol , которая отображается в SymbolTable модуля и может быть указана по своему name .

Характеристики: HasParent<ModuleOp>

Интерфейсы: Symbol

Атрибуты:

Атрибут	Тип MLIR	Описание
`sym_name`	::mlir::StringAttr	строковый атрибут
`mesh`	::mlir::sdy::MeshAttr	Сетка осей и список устройств

`sdy.named_computation` (sdy::NamedComputationOp)

Именованная операция вычисления

Синтаксис:

operation ::= `sdy.named_computation` `<`$name`>` `` `(` $operands `)`
              (`in_shardings````=```custom<StrippedTensorShardingPerValueAttr>($in_shardings)^)?
              (`out_shardings````=```custom<StrippedTensorShardingPerValueAttr>($out_shardings)^)?
              custom<SingleBlockRegionNoBlockId>($body)
              attr-dict
              `:` functional-type($operands, results)

Группирует вычисления, то есть блок операций, и присваивает ему имя. Распространение будет происходить внутри/вне этой области так, как если бы все операции были встроены.

Это можно использовать для передачи инструкций вызова другим функциям. Любые пользователи Shardy должны написать проход импорта/экспорта, который преобразует их операции вызова в операции sdy.named_computation , дублируя/копируя тело вызываемой функции в тело named_computation .

Тип аргументов каждого блока и возвращаемых значений в регионе должен совпадать с типом операндов и типом результатов операции.

Пример:

%1 = sdy.named_computation<"foo">(%0) (%arg1: tensor<16x32xf32>) {
  sdy.return %arg1 : tensor<16x32xf32>
} : (tensor<16x32xf32>) -> tensor<16x32xf32>

Характеристики: IsolatedFromAbove , RecursiveMemoryEffects , RecursivelySpeculatableImplTrait , SingleBlockImplicitTerminator<ReturnOp> , SingleBlock

Интерфейсы: ConditionallySpeculatable , InferTypeOpInterface , ShardableDataFlowOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	Тип MLIR	Описание
`name`	::mlir::StringAttr	строковый атрибут
`in_shardings`	::mlir::sdy::TensorShardingPerValueAttr	Разделение тензора на операнды/результаты операций
`out_shardings`	::mlir::sdy::TensorShardingPerValueAttr	Разделение тензора на операнды/результаты операций

Операнды:

Операнд	Описание
`operands`	вариативный любого нетокенного типа

Результаты:

Результат	Описание
«безымянный»	вариативный любого нетокенного типа

`sdy.propagation_barrier` (sdy::PropagationBarrierOp)

Работа барьера распространения

Синтаксис:

operation ::= `sdy.propagation_barrier` $input `allowed_direction````=```$allowed_direction attr-dict `:` type($input)

Эта операция работает как операция идентичности, выдавая на выходе то же значение, что и на входе. Но с точки зрения распространения, это позволит передавать данные только в определенном направлении.

Это предотвращает распространение фрагментации данных между использованием результата операции барьера и её операнда.

FORWARD означает, что фрагменты данных могут передаваться только от операнда к результату.
BACKWARD означает, что фрагменты могут передаваться только от результата к операнду.
Значение NONE означает, что в рамках этой операции шардинг невозможен.
Невозможно указать BOTH , так как это будет излишним.

Характеристики: AlwaysSpeculatableImplTrait , SameOperandsAndResultType

Интерфейсы: ConditionallySpeculatable , InferTypeOpInterface , NoMemoryEffect (MemoryEffectOpInterface)

Эффекты: MemoryEffects::Effect{}

Атрибуты:

Атрибут	Тип MLIR	Описание
`allowed_direction`	::mlir::sdy::PropagationDirectionAttr	перечисление направлений распространения

Операнды:

Операнд	Описание
`input`	ранжированный тензор любых значений, не являющихся токенами.

Результаты:

Результат	Описание
`result`	ранжированный тензор любых значений, не являющихся токенами.

`sdy.reduce_scatter` (sdy::ReduceScatterOp)

Обеспечивает связь с уменьшенным рассеянием вдоль осей.

Синтаксис:

operation ::= `sdy.reduce_scatter` ($reduction_op^)? $reduce_scatter_axes $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

Эта операция уменьшает фрагменты тензора вдоль осей, указанных в reduce_scatter_axes , а затем распределяет результат вдоль тех же осей. По сути, эта операция представляет собой комбинацию sdy.all_reduce с последующим sdy.all_slice вдоль тех же reduce_scatter_axes .

Ограничения:

Необходимо соблюдать ограничения, указанные в Sdy_CollectiveOpInterface .
Элементы в reduce_scatter_axes должны удовлетворять ограничениям, перечисленным в AxisRefListAttr .
Применение функции reduce_scatter_axes к операнду sharding дает значение out_sharding .

Характеристики: SameOperandsAndResultType

Интерфейсы: CollectiveOpInterface , InferTypeOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	Тип MLIR	Описание
`reduce_scatter_axes`	::mlir::sdy::ListOfAxisRefListsAttr	Список ссылок на оси
`reduction_op`	::mlir::sdy::ReductionOpAttr	сокращение открытого перечисления
`out_sharding`	::mlir::sdy::TensorShardingAttr	Разделение тензоров

Операнды:

Операнд	Описание
`tensor`	имеет форму любых значений, не являющихся токенами.

Результаты:

Результат	Описание
`result`	имеет форму любых значений, не являющихся токенами.

`sdy.replicated_to_unreduced` (sdy::ReplicatedToUnreducedOp)

Переместите неявно или явно реплицированные оси на нередуцированные оси.

Синтаксис:

operation ::= `sdy.replicated_to_unreduced` $axes $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

axes должны быть неявно или явно продублированы в операнде. Эта операция делает их нередуцированными в результате. Мы имеем следующее соотношение:

all-reduce(replicated-to-unreduced(x, axes), axes) = x

Пример:

%1 = stablehlo.tanh(%0) {sdy.sharding = #sdy.sharding_per_value<[<@mesh, [{"b"}, {}, {}\], replicated={"c", "d"}, unreduced={"e"}>]>} : tensor<8x8x8xf32>
%2 = sdy.replicated_to_unreduced {"a", "c", "f"} %1 out_sharding=<@mesh, [{"b"}, {}, {}\], replicated={"d"}, unreduced={"a", "c", "e", "f"}> : tensor<8x8x8xf32>

Ограничения:

Необходимо соблюдать ограничения, указанные в Sdy_CollectiveOpInterface .
axes должны удовлетворять ограничениям, перечисленным в AxisRefListAttr .
axes должны быть отсортированы относительно сетки.
axes не пусты.
Входные и выходные сегменты должны иметь одинаковые размерности.
axes должны быть неявно или явно продублированы в операнде сегментирования.
inUnreducedAxes + axes = outUnreducedAxes.

Характеристики: SameOperandsAndResultType

Интерфейсы: InferTypeOpInterface , Sdy_CollectiveOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	Тип MLIR	Описание
`axes`	::mlir::sdy::AxisRefListAttr	Список ссылок на оси
`out_sharding`	::mlir::sdy::TensorShardingAttr	Разделение тензоров

Операнды:

Операнд	Описание
`tensor`	имеет форму любых значений, не являющихся токенами.

Результаты:

Результат	Описание
`result`	имеет форму любых значений, не являющихся токенами.

`sdy.reshard` (sdy::ReshardOp)

Перераспределяет тензор на другой сегмент.

Синтаксис:

operation ::= `sdy.reshard` $input $sharding attr-dict `:` type($result)

Изменяет структуру входного тензора, добавляя указанное распределение шардов, отличающееся от существующего распределения шардов во входном тензоре.

И ShardingConstraintOp, и ReshardOp добавляют сегментацию к тензору. Срок их действия:

Перед распространением шардинга пользователи добавляют параметр ShardingConstraintOp.
При распространении шардинга используется операция ShardingConstraintOp. В результатах распространения шардинга операция ShardingConstraintOp отсутствует. Вместо этого при необходимости может быть добавлена операция ReshardOp.
Разделитель преобразует операцию ReshardOp в коллективную операцию (или операцию идентификации). В результатах работы разделителя не должно быть операций ReshardOp.

Характеристики: AlwaysSpeculatableImplTrait , SameOperandsAndResultType

Интерфейсы: ConditionallySpeculatable , InferTypeOpInterface , NoMemoryEffect (MemoryEffectOpInterface) , SymbolUserOpInterface

Эффекты: MemoryEffects::Effect{}

Атрибуты:

Атрибут	Тип MLIR	Описание
`sharding`	::mlir::sdy::TensorShardingAttr	Разделение тензоров

Операнды:

Операнд	Описание
`input`	любой тип, не являющийся токеном

Результаты:

Результат	Описание
`result`	любой тип, не являющийся токеном

`sdy.return` (sdy::ReturnOp)

Операция sdy.return завершает работу областей, связанных с операциями на основе областей sdy и любыми другими операциями на основе областей Shardy. Она является вариативной: в качестве аргументов принимает список значений, типы которых могут быть любыми (но одного и того же типа, например, AnyTensor ), и поэтому может быть повторно использована на различных уровнях стека Shardy IR.

Синтаксис:

operation ::= `sdy.return` attr-dict ($results^ `:` type($results))?

Характеристики: AlwaysSpeculatableImplTrait , ReturnLike , Terminator

Интерфейсы: ConditionallySpeculatable , NoMemoryEffect (MemoryEffectOpInterface) , RegionBranchTerminatorOpInterface

Эффекты: MemoryEffects::Effect{}

Операнды:

Операнд	Описание
`results`	вариативный любого нетокенного типа

`sdy.sharded_to_unreduced` (sdy::ShardedToUnreducedOp)

Переместите некоторые фрагментированные оси операнда на нередуцированные оси результата.

Синтаксис:

operation ::= `sdy.sharded_to_unreduced` $axes $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

axes следует использовать для разделения операнда на части. Эта операция делает их нередуцированными в результате. Мы имеем следующее соотношение:

all-gather(x, axes) = all-reduce(sharded-to-unreduced(x, axes), axes), где all-gather, sharded-to-unreduced и all-reduce применяются к одним и тем же осям.

Пример:

%1 = stablehlo.tanh(%0) {sdy.sharding = #sdy.sharding_per_value<[<@mesh, [{"a", "b", "c"}, {}, {"d"}\], unreduced={"e"}>]>} : tensor<8x8x8xf32>
%2 = sdy.sharded_to_unreduced [{"b", "c"}, {}, {"d"}\] %1 out_sharding=<@mesh, [{"a"}, {}, {}\], unreduced={"b", "c", "d", "e"}> : tensor<8x8x8xf32>

Ограничения:

Необходимо соблюдать ограничения, указанные в Sdy_CollectiveOpInterface .
Элементы axes должны удовлетворять ограничениям, перечисленным в AxisRefListAttr .
Применение axes к операнду sharding дает значение out_sharding .

Характеристики: SameOperandsAndResultType

Интерфейсы: InferTypeOpInterface , Sdy_CollectiveOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	Тип MLIR	Описание
`axes`	::mlir::sdy::ListOfAxisRefListsAttr	Список ссылок на оси
`out_sharding`	::mlir::sdy::TensorShardingAttr	Разделение тензоров

Операнды:

Операнд	Описание
`tensor`	имеет форму любых значений, не являющихся токенами.

Результаты:

Результат	Описание
`result`	имеет форму любых значений, не являющихся токенами.

`sdy.sharding_constraint` (sdy::ShardingConstraintOp)

Ограничивает тензор заданным сегментированием.

Синтаксис:

operation ::= `sdy.sharding_constraint` $input $sharding attr-dict `:` type($result)

Добавляет параметр сегментации к промежуточному тензору (например, результату вычисления матриц), чтобы указать, как следует сегментировать этот тензор или подмножество его применений.

Если сегментирование имеет открытые измерения и не ограниченные оси, это означает, что тензор может быть дополнительно сегментирован вдоль открытых измерений.

Эта операция может быть либо:

Не имеют дополнительных функций (висячие ссылки) — это означает, что прикрепленное сегментирование определяет, как следует сегментировать сам входной тензор.
Наличие вариантов использования означает, что прикрепленное сегментирование определяет, как должны быть сегментированы варианты использования, заданные в рамках ограничения сегментирования, в то время как другие варианты использования входного тензора могут иметь другое сегментирование (если входной тензор не имеет других вариантов использования, то поведение такое же, как и в случае отсутствия вариантов использования).

Характеристики: SameOperandsAndResultType

Интерфейсы: InferTypeOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	Тип MLIR	Описание
`sharding`	::mlir::sdy::TensorShardingAttr	Разделение тензоров

Операнды:

Операнд	Описание
`input`	любой тип, не являющийся токеном

Результаты:

Результат	Описание
`result`	любой тип, не являющийся токеном

`sdy.sharding_group` (sdy::ShardingGroupOp)

Накладывает ограничение на то, чтобы тензоры в группе имели одинаковое сегментирование.

Синтаксис:

operation ::= `sdy.sharding_group` $input `group_id````=```$group_id attr-dict `:` type($input)

Эта операция предоставляет интерфейс для назначения тензоров группам сегментирования (группам тензоров, которые будут иметь одинаковое сегментирование). Во время распространения, как только один элемент группы будет сегментирован, все остальные члены будут сегментированы точно так же. Эта операция принимает в качестве аргумента идентификатор группы и не возвращает никакого результата, а вместо этого изменяет внутреннее представление группы сегментирования, добавляя входной тензор к группе с заданным идентификатором.

Интерфейсы: InferTypeOpInterface

Атрибуты:

Атрибут	Тип MLIR	Описание
`group_id`	::mlir::IntegerAttr	64-битный беззнаковый целочисленный атрибут

Операнды:

Операнд	Описание
`input`	ранжированный тензор любых значений, не являющихся токенами.

Атрибуты

AllToAllParamAttr

Параметр «все ко всем»

Синтаксис:

#sdy.all_to_all_param<
  ::llvm::ArrayRef<AxisRefAttr>,   # axes
  int64_t,   # src_dim
  int64_t   # tgt_dim
>

Кортеж, содержащий оси и исходные/целевые измерения, по которым необходимо выполнить операцию "все ко всем".

Параметры:

Параметр	тип C++	Описание
оси	`::llvm::ArrayRef<AxisRefAttr>`	оси для выполнения операции «все против всех»
src_dim	`int64_t`	индекс размерности источника
tgt_dim	`int64_t`	индекс целевого измерения

AllToAllParamListAttr

Список параметров «все ко всем»

Синтаксис:

#sdy.all_to_all_param_list<
  ::llvm::ArrayRef<AllToAllParamAttr>   # value
>

Параметры:

Параметр	тип C++	Описание
ценить	`::llvm::ArrayRef<AllToAllParamAttr>`

AxisRefAttr

Ссылка либо на полную ось, либо на разделенную подось.

Синтаксис:

#sdy.axis_ref<
  ::llvm::StringRef,   # name
  SubAxisInfoAttr   # sub_axis_info
>

Ограничения:

name должно присутствовать в связанном MeshAttr .
Если sub_axis_info присутствует, он должен удовлетворять ограничениям параметра SubAxisInfoAttr .

Параметры:

Параметр	тип C++	Описание
имя	`::llvm::StringRef`	название этой оси
sub_axis_info	`SubAxisInfoAttr`	Дополнительная информация, если это подось.

AxisRefListAttr

Список ссылок на оси

Синтаксис:

#sdy.axis_ref_list<
  ::llvm::ArrayRef<AxisRefAttr>   # value
>

Ограничения:

Элементы в value должны удовлетворять ограничениям AxisRefAttr .
Отсутствуют дублирующиеся ссылки на оси или перекрывающиеся между собой подоси.
Никакие две смежные оси не являются последовательными подосью одной и той же полной оси, то есть они могут быть объединены в одну подось или в полную ось.

Параметры:

Параметр	тип C++	Описание
ценить	`::llvm::ArrayRef<AxisRefAttr>`

AxisToPropagationDetailsAttr

Детали распространения потока по краям для конкретной оси и источника.

Синтаксис:

#sdy.axis_to_propagation_details<
  ::mlir::sdy::AxisRefAttr,   # axis_name
  ::mlir::sdy::EdgeValueRefAttr,   # source
  ::llvm::ArrayRef<EdgeValueRefAttr>   # targets
>

Сопоставляет ссылку на исходное значение со списком ссылок на целевые значения вдоль определенной оси.

Параметры:

Параметр	тип C++	Описание
axis_name	`::mlir::sdy::AxisRefAttr`	Ссылка либо на полную ось, либо на разделенную подось.
источник	`::mlir::sdy::EdgeValueRefAttr`	Ссылка на конкретный индекс ребра значения `type` .
цели	`::llvm::ArrayRef<EdgeValueRefAttr>`	список целевых значений ребер

DimMappingAttr

Список факторных индексов для измерения

Пустой список указывает на то, что это пустое сопоставление (оно анализируется/выводится с помощью * ), то есть измерение не сопоставлено ни с одним фактором.

Ограничения:

Существует как минимум один факторный индекс.
Индексы факторов должны находиться в диапазоне [0, $factor_sizes ).
Если факторов несколько, ни один из них не может иметь размер 1.
Отсутствуют повторяющиеся индексы факторов.

Параметры:

Параметр	тип C++	Описание
факторные_индексы	`::llvm::ArrayRef<int64_t>`	факторы, к которым соотносится это измерение

DimensionShardingAttr

Разделение измерений

Список названий осей для сегментирования измерения тензора от основной к второстепенной, логическое значение, указывающее, можно ли сегментировать измерение дальше, и необязательное целое число, обозначающее приоритет сегментирования этого измерения, который будет учитываться при распространении сегментирования. Приоритеты определяются пользовательскими аннотациями сегментирования, и меньшее значение обозначает более высокий приоритет. При отсутствии приоритета в аннотации предполагается наивысший приоритет.

Ограничения:

Элементы axes должны удовлетворять ограничениям, перечисленным в AxisRefListAttr .
Если сегментирование измерений имеет приоритет:
- Приоритет больше или равен 0.
- Если размерность замкнута, то она имеет как минимум одну ось.

Параметры:

Параметр	тип C++	Описание
оси	`::llvm::ArrayRef<AxisRefAttr>`	ссылки на оси
is_closed	`bool`	нельзя ли разделить это измерение на более мелкие части.
приоритет	`std::optional<int64_t>`	приоритет, используемый при распространении на основе приоритета пользователя

EdgeValueRefAttr

Ссылка на конкретный индекс ребра значения type .

Синтаксис:

#sdy.edge_value_ref<
  `operand` | `result`,   # type
  int64_t   # index
>

Параметры:

Параметр	тип C++	Описание
тип	`::mlir::sdy::EdgeNodeType`	перечисление типа EdgeNodeType
индекс	`int64_t`	Целочисленный индекс (0, 1, 2 и т. д.)

ListOfAxisRefListsAttr

Список ссылок на оси

Синтаксис:

#sdy.list_of_axis_ref_lists<
  ::llvm::ArrayRef<AxisRefListAttr>   # value
>

Параметры:

Параметр	тип C++	Описание
ценить	`::llvm::ArrayRef<AxisRefListAttr>`

ManualAxesAttr

Список осей, по которым операция ManualComputationOp выполняется вручную.

Синтаксис:

#sdy.manual_axes<
  ::llvm::ArrayRef<StringAttr>   # value
>

Параметры:

Параметр	тип C++	Описание
ценить	`::llvm::ArrayRef<StringAttr>`

MeshAttr

Сетка осей и список устройств

Синтаксис:

#sdy.mesh<
  ::llvm::ArrayRef<MeshAxisAttr>,   # axes
  ::llvm::ArrayRef<int64_t>   # device_ids
>

Сетка представляет собой список осей и необязательный список идентификаторов устройств, определяющих порядок расположения устройств.

Если список осей пуст

Если device_ids не указан, это пустая сетка.
Если параметр device_ids указан, он должен представлять собой одно неотрицательное целое число; мы называем такую сеть максимально сегментированной сетью .

Если предоставлен список осей

Если указан список идентификаторов устройств, произведение размеров осей должно соответствовать количеству устройств.
Если список идентификаторов устройств не указан, используется неявный список идентификаторов устройств iota(product(axes)). Для простоты мы также запрещаем указывать список идентификаторов устройств, совпадающий с iota(product(axes)); в этом случае список идентификаторов устройств указывать не следует.
Это не сетка с максимальным сегментированием, даже если общий размер осей равен 1.

Вот несколько примеров сеток:

Пустая сетка представляет собой сетку-заполнитель, которую можно заменить во время распространения: <[]>
Список сетки без осей и один неотрицательный идентификатор устройства, представляющий собой сетку с максимальным сегментированием: <[], device_ids=[3]>
Сетка с двумя осями и неявными идентификаторами устройств iota(6): <["a"=2, "b"=3]>
Сетка с двумя осями и явно указанными идентификаторами устройств, определяющими порядок устройств: <["a"=3, "b"=2], device_ids=[0, 2, 4, 1, 3, 5]>

Ограничения:

Элементы в device_ids должны быть неотрицательными.
Если axes пуста, размер device_ids может быть равен 0 (пустая сетка) или 1 (сетка с максимальным сегментированием).
Если axes не пуста,
- Элементы в axes не должны иметь одинаковых имен.
- Если указан параметр device_ids , исходный device_ids не iota(product(axis_sizes)) , а отсортированный device_ids равен iota(product(axis_sizes)) .

Параметры:

Параметр	тип C++	Описание
оси	`::llvm::ArrayRef<MeshAxisAttr>`	оси сетки
идентификаторы устройств	`::llvm::ArrayRef<int64_t>`	явный порядок устройств или максимальный идентификатор устройства

MeshAxisAttr

Именованные оси в сетке

Синтаксис:

#sdy.mesh_axis<
  ::llvm::StringRef,   # name
  int64_t   # size
>

Параметры:

Параметр	тип C++	Описание
имя	`::llvm::StringRef`	имя
размер	`int64_t`	размер этой оси

OpShardingRuleAttr

Указывает, как можно разделить операцию на этапы.

Синтаксис:

#sdy.op_sharding_rule<
  ::llvm::ArrayRef<int64_t>,   # factor_sizes
  ::llvm::ArrayRef<TensorMappingAttr>,   # operand_mappings
  ::llvm::ArrayRef<TensorMappingAttr>,   # result_mappings
  ::llvm::ArrayRef<int64_t>,   # reduction_factors
  ::llvm::ArrayRef<int64_t>,   # need_replication_factors
  ::llvm::ArrayRef<int64_t>,   # permutation_factors
  ::llvm::ArrayRef<int64_t>,   # blocked_propagation_factors
  bool   # is_custom_rule
>

Правило сегментирования определяет, как операция может быть разделена в соответствии с различными свойствами операции — любыми атрибутами, формой операндов, формой результатов и т. д. Например:

%0 = stablehlo.add %arg0, %arg1 {
    sdy.sharding_rule = #sdy.op_sharding_rule<
        ([i, j],[i, j])->([i, j])
        {i=8, j=8}>
} : tensor<8x8xf32>

%1 = stablehlo.dot_general %arg2, %arg3, contracting_dims = [1] x [0] {
  sdy.sharding_rule = #sdy.op_sharding_rule<
      ([i, k],[k, j])->([i, j])
      {i=8, j=16, k=8}>
}: (tensor<8x8xf32>, tensor<8x16xf32>) -> tensor<8x16xf32>

Обратите внимание, что мы допускаем факторы размером 1, даже если их нельзя разделить на сегменты. Это сделано в основном для полноты картины, поскольку многие операции, такие как поточечные операции, имеют размерность в один элемент, которая соответствует операндам и результатам.

Типы факторов:

reduction_factors содержит индексы факторов, требующих сокращения, например, размерностей, используемых для сжатия в операции скалярного сложения. Эти факторы могут присутствовать в операндах, но не в результатах.
need_replication_factors содержит индексы факторов, требующих полного копирования, например, отсортированного измерения в операции сортировки.
permutation_factors содержит индексы факторов, требующих коллективной перестановки, если они сегментированы, например, размеры заполнения в операции заполнения.
Все остальные факторы считаются сквозными, то есть факторами, которые не требуют никакой коммуникации, если они распределены одинаковым образом по всем тензорам, которые им сопоставлены.

blocked_propagation_factors содержит факторы, по которым распространение шардингов запрещено. Он ортогонален типам факторов. А именно, фактор, по которому распространение заблокировано, может быть любого из типов факторов.

is_custom_rule описывает, является ли это правило заданным пользователем. Пользователи могут определять правила сегментирования для своих пользовательских вызовов или переопределять предопределенные правила сегментирования для стандартных операций. Пользовательское правило всегда сохраняется/никогда не удаляется.

Ограничения:

Количество сопоставлений операндов и результатов должно соответствовать количеству операндов и результатов операции.
Существует как минимум одно соответствие (нельзя иметь правило для операции без операндов/результатов).
Ранг каждого TensorMappingAttr соответствует рангу соответствующего типа тензора.
Для каждой группы факторов ( reduction_factors , need_replication_factors , permutation_factors ):
- Элементы должны находиться в диапазоне [0, $factor_sizes ].
- Внутри каждой группы и между группами отсутствуют дублирующиеся индексы факторов.

Параметры:

Параметр	тип C++	Описание
factor_sizes	`::llvm::ArrayRef<int64_t>`	размеры всех факторов в этом правиле
сопоставления операндов	`::llvm::ArrayRef<TensorMappingAttr>`	сопоставление операндов
result_mappings	`::llvm::ArrayRef<TensorMappingAttr>`	сопоставление результатов
коэффициенты снижения	`::llvm::ArrayRef<int64_t>`	факторы, требующие сокращения
need_replication_factors	`::llvm::ArrayRef<int64_t>`	факторы, требующие полного воспроизведения
перестановочные_факторы	`::llvm::ArrayRef<int64_t>`	факторы, требующие коллективно-пермутных
blocked_propagation_factors	`::llvm::ArrayRef<int64_t>`	факторы, вдоль которых не происходит распространение фрагментации
is_custom_rule	`bool`	относится ли это правило к stablehlo.custom_call

PropagationEdgesAttr

Метаданные границы распространения для всех этапов распространения.

Синтаксис:

#sdy.propagation_edges<
  ::llvm::ArrayRef<PropagationOneStepAttr>   # value
>

Список сведений о распространении значения по каждой оси, сгруппированный по индексу шага.

Параметры:

Параметр	тип C++	Описание
ценить	`::llvm::ArrayRef<PropagationOneStepAttr>`

PropagationOneStepAttr

Метаданные распространения для каждого этапа.

Синтаксис:

#sdy.propagation_one_step<
  int64_t,   # step_index
  ::llvm::ArrayRef<AxisToPropagationDetailsAttr>   # axis_entries
>

Подробные сведения о распространении по всем осям для одного этапа распространения.

Параметры:

Параметр	тип C++	Описание
шаг_индекс	`int64_t`	индекс шагов
axis_entries	`::llvm::ArrayRef<AxisToPropagationDetailsAttr>`	Детали распространения оси для каждого решения о распространении

SubAxisInfoAttr

Информация о том, как эта вспомогательная ось выводится из полной оси.

Синтаксис:

#sdy.sub_axis_info<
  int64_t,   # pre_size
  int64_t   # size
>

При разделении полной оси на n подосей ось преобразуется в [k_1,...,k_n], и i-я подось может быть выражена произведением размеров всех осей слева от нее m=prod(k_1,...,k_(i-1)) (также называемым предварительным размером) и размером k_i. Следовательно, атрибут sub-axis-info содержит эти два числа и обозначается следующим образом: (m)k для предварительного размера m и размера k.

Ограничения:

pre-size составляет не менее 1.
size больше 1.
pre-size должен делить размер полной оси, то есть и pre-size , и size делят размер полной оси, а подось не выходит за пределы полной оси.
Размер вспомогательной оси не равен размеру соответствующей полной оси, в этом случае следует использовать полную ось.

Параметры:

Параметр	тип C++	Описание
pre_size	`int64_t`	произведение размеров подосей слева от этой подоси
размер	`int64_t`	размер этой подоси

TensorMappingAttr

Факторные отображения для каждого измерения тензора.

Синтаксис:

#sdy.tensor_mapping<
  ::llvm::ArrayRef<DimMappingAttr>   # dim_mappings
>

Ограничения:

Элементы в dim_mappings должны удовлетворять ограничениям, указанным в DimMappingAttr .
Индексы факторов не дублируются по различным измерениям.

Параметры:

Параметр	тип C++	Описание
dim_mappings	`::llvm::ArrayRef<DimMappingAttr>`	сопоставление размерностей

TensorShardingAttr

Разделение тензоров

Синтаксис:

#sdy.sharding<
  ::mlir::Attribute,   # mesh_or_ref
  ::llvm::ArrayRef<DimensionShardingAttr>,   # dim_shardings
  ::llvm::ArrayRef<AxisRefAttr>,   # replicated_axes
  ::llvm::ArrayRef<AxisRefAttr>,   # unreduced_axes
  `sum` | `max` | `min`   # reduction_op
>

Разделение тензора на сегменты привязано к определенной сетке и может ссылаться только на имена осей этой сетки. Разделение по измерениям указывает для каждого измерения тензора, вдоль каких осей (или подосей) он разделен от основной к второстепенной. Все остальные оси, которые не разделяют измерение, либо неявно, либо явно (если они присутствуют в списке реплицируемых осей) реплицируются.

Следует отметить, что отсутствие атрибута сегментирования у тензора не эквивалентно полностью открытому сегментированию тензора.

Сетку, к которой привязан этот сегмент, можно указать либо с помощью имени символа, ссылающегося на соответствующий символ MeshOp , либо с помощью встроенного MeshAttr .

При сегментировании могут быть нередуцированные оси (указывающиеся параметром unreduced_axes ), то есть тензор не редуцирован вдоль этих осей. Например, если сжимающая размерность matmul сегментирована вдоль оси x как в левой, так и в правой части, результат не редуцирован вдоль x . Применение операции all-reduce к тензору вдоль нередуцированных осей приведет к дублированию тензора вдоль этих осей. Однако тензор с нередуцированными осями не обязательно должен быть немедленно подвергнут операции all-reduce; он может оставаться нередуцированным при передаче в линейные операции, такие как stablehlo.add (при условии, что как левая, так и правая части не редуцированы), и затем подвергаться операции all-reduce. Мы предполагаем, что тип редукции — sum, в будущем могут быть поддержаны и другие типы редукции.

Ограничения:

Элементы в dim_shardings должны удовлетворять ограничениям, перечисленным в DimensionShardingAttr .
Элементы в replicated_axes должны удовлетворять ограничениям, перечисленным в AxisRefListAttr .
Элементы в unreduced_axes должны удовлетворять ограничениям, перечисленным в AxisRefListAttr .
If the corresponding tensor type isn't a ShapedType , the sharding must have rank 0 and no replicated axes.
If it is a ShapedType , then:
- The tensor should have a rank.
- The number of dimension shardings is equal to the rank of the tensor.
- Dimensions of size 0 aren't sharded.
There are no duplicate axis-refs or sub-axes that overlap with one another across dim_shardings , replicated_axes , and unreduced_axes .
Items in replicated_axes and unreduced_axes are ordered wrt mesh_or_ref (see AxisRefAttr::getMeshComparator ).

Параметры:

Параметр	C++ type	Описание
mesh_or_ref	`::mlir::Attribute`	mesh attr or flat mesh symbol reference attr
dim_shardings	`::llvm::ArrayRef<DimensionShardingAttr>`	dimension shardings
replicated_axes	`::llvm::ArrayRef<AxisRefAttr>`	axis refs
unreduced_axes	`::llvm::ArrayRef<AxisRefAttr>`	axis refs
reduction_op	`::mlir::sdy::ReductionOp`	an enum of type ReductionOp

TensorShardingPerValueAttr

Tensor sharding per operand/result of an op

Синтаксис:

#sdy.sharding_per_value<
  ::llvm::ArrayRef<TensorShardingAttr>   # shardings
>

A list of TensorShardingAttr s, one for each operand/result of an op.

Ограничения:

Elements in shardings must satisfy the constraints of TensorShardingAttr .

Параметры:

Параметр	C++ type	Описание
shardings	`::llvm::ArrayRef<TensorShardingAttr>`	sharding per value

Перечисления

EdgeNodeType

Edge node type enum

Случаи:

Символ	Ценить	Нить
ОПЕРАНД	`0`	операнд
РЕЗУЛЬТАТ	`1`	результат

PropagationDirection

Propagation direction enum

Случаи:

Символ	Ценить	Нить
НИКТО	`0`	НИКТО
ВПЕРЕД	`1`	ВПЕРЕД
НАЗАД	`2`	НАЗАД
ОБА	`3`	ОБА

ReductionOp

Reduction op enum

Случаи:

Символ	Ценить	Нить
СУММА	`0`	сумма
МАКС	`1`	макс
МИН	`2`	мин

The Shardy (SDY) dialect

The Shardy (SDY) dialect defines an axis-based tensor sharding representation and additional API components to attach shardings to tensors.

Version log: 0.0.1: Add unreduced axes to TensorShardingAttr.

Операции

`sdy.all_gather` (sdy::AllGatherOp)

Performs an all-gather communication along axes

Синтаксис:

operation ::= `sdy.all_gather` $gathering_axes $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

Gathers chunks of a tensor along axes specified in gathering_axes .

The gathering_axes is a list of lists of axes. The outer list is over the dimensions of the tensor. Each inner list specifies the axes along which a separate gather should be performed on the respective dimension. It will be applied to the sharding of the operand ( tensor ) to obtain the sharding of the result ( out_sharding ).

Note that out_sharding is not used to determine the sharding of the result. Instead, the sharding of the result is determined by the sharding of the operand and the gathering_axes , and out_sharding must match this inferred sharding.

Пример:

%1 = stablehlo.tanh(%0) {sdy.sharding = #sdy.sharding_per_value<[<@mesh, [{"a", "b", "c"}, {}, {"d"}\]>]>} : tensor<8x8x8xf32>
%2 = sdy.all_gather [{"b", "c"}, {}, {"d"}\] %1 out_sharding=<@mesh, [{"a"}, {}, {}\]> : tensor<8x8x8xf32>

Ограничения:

Must satisfy the constraints listed in Sdy_CollectiveOpInterface .
Elements in gathering_axes must satisfy the constraints listed in AxisRefListAttr .
Applying gathering_axes to the operand sharding gets out_sharding .

Traits: SameOperandsAndResultType

Interfaces: InferTypeOpInterface , Sdy_CollectiveOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`gathering_axes`	::mlir::sdy::ListOfAxisRefListsAttr	List of axis ref lists
`out_sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`tensor`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.all_reduce` (sdy::AllReduceOp)

Perform an all-reduce comunication along axes

Синтаксис:

operation ::= `sdy.all_reduce` ($reduction_op^)? $reduction_axes $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

Reduces chunks of a tensor along axes specified in reduction_axes . The order of reduction_axes is not important for the result, but can affect the order of the corresponding replica groups.

Ограничения:

Must satisfy the constraints listed in Sdy_CollectiveOpInterface .
reduction_axes must satisfy the constraints listed in AxisRefListAttr .
reduction_axes must be sorted wrt the mesh.
The operand sharding and out_sharding must have equivalent dimension shardings.
reduction_axes must not overlap with the operand dimension sharding and replicated axes (it can overlap with unreduced axes).
reduction_axes must not overlap with the unreduced axes of out_sharding . In other words, out_sharding must be be replicated along reduction_axes (implicitly or explicitly).

Traits: SameOperandsAndResultType

Interfaces: CollectiveOpInterface , InferTypeOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`reduction_axes`	::mlir::sdy::AxisRefListAttr	List of axis refs
`reduction_op`	::mlir::sdy::ReductionOpAttr	reduction op enum
`out_sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`tensor`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.all_slice` (sdy::AllSliceOp)

Performs a dynamic-slice operation along axes

Синтаксис:

operation ::= `sdy.all_slice` $slicing_axes $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

Slices chunks of a tensor along axes specified in slicing_axes . There is an algebric duality between sdy.all_slice and sdy.all_gather .

The slicing_axes is a list of lists of axes. The outer list is over the dimensions of the tensor. Each inner list specifies the axes along which a slice should be performed on the respective dimension. It will be applied to the sharding of the operand ( tensor ) to obtain the sharding of the result ( out_sharding ).

Note that out_sharding is not used to determine the sharding of the result. Instead, the sharding of the result is determined by the sharding of the operand and the slicing_axes , and out_sharding must match this inferred sharding.

Пример:

%1 = stablehlo.tanh(%0) {sdy.sharding = #sdy.sharding_per_value<[<@mesh, [{"a"}, {}, {}\]>]>} : tensor<8x8x8xf32>
%2 = sdy.all_slice [{"b", "c"}, {}, {"d"}\] %1 out_sharding=<@mesh, [{"a", "b", "c"}, {}, {"d"}\]> : tensor<8x8x8xf32>

Ограничения:

Must satisfy the constraints listed in Sdy_CollectiveOpInterface .
Elements in slicing_axes must satisfy the constraints listed in AxisRefListAttr .
Applying slicing_axes to the operand sharding gets out_sharding .

Traits: SameOperandsAndResultType

Interfaces: CollectiveOpInterface , InferTypeOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`slicing_axes`	::mlir::sdy::ListOfAxisRefListsAttr	List of axis ref lists
`out_sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`tensor`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.all_to_all` (sdy::AllToAllOp)

Performs an all-to-all communication along axes

Синтаксис:

operation ::= `sdy.all_to_all` $params $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

For each (axes, src_dim, tgt_dim) tuple in the parameter list, this operation slices chunks of a tensor along dimension tgt_dim and axes specified in axes , scatteres those chunks along the axes, and concatenates them along dimension src_dim .

This operation is essentially a combination of an all-gather along src_dim and axes , followed by an all-slice along tgt_dim and axes , ie, a suffix of the axes sharding dimension src_dim on the input tensor is appended to the axes sharding dimension tgt_dim on the output tensor.

The all-to-all will be applied to the sharding of the operand ( tensor ) to obtain the sharding of the result ( out_sharding ).

Note that out_sharding is not used to determine the sharding of the result. Instead, the sharding of the result is determined by the sharding of the operand, src_dim , tgt_dim , and axes , and out_sharding must match this inferred sharding.

Пример:

%1 = stablehlo.tanh(%0) {sdy.sharding = #sdy.sharding_per_value<[<@mesh, [{"a", "b"}, {"c"}, {}, {}\]>]>} : tensor<8x8x4x4x32>
%2 = sdy.all_to_all [{"b"}: 0->2, {"c"}: 1->3] %1 out_sharding=<@mesh, [{"a"}, {}, {"b"}, {"c"}\]> : tensor<8x8x4x4x32>

Ограничения:

Must satisfy the constraints listed in Sdy_CollectiveOpInterface .
The parameter list must not be empty.
For each parameter in params :
- Elements in axes must satisfy the constraints of AxisRefAttr .
- src_dim and tgt_dim must be valid dimensions (non-negative and less than rank of tensor).
- Any src_dim or tgt_dim must be unique across all parameters.
- src_dim must be sorted in ascending order across all parameters.
Moving axes from src_dim to tgt_dim in the operand sharding gets out_sharding .

Traits: SameOperandsAndResultType

Interfaces: InferTypeOpInterface , Sdy_CollectiveOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`params`	::mlir::sdy::AllToAllParamListAttr	List of all-to-all parameters
`out_sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`tensor`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.collective_permute` (sdy::CollectivePermuteOp)

Performs a collective-permute communication to replace axes

Синтаксис:

operation ::= `sdy.collective_permute` $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

Sends a chunk of the input tensor from each device to another to reorder/replace the axes that shard the tensor.

A collective permute can transform the input sharding such that each dimension must be as sharded as it was before, ie, it must be sharded along axes whose product of sizes matches that of the axes that previously sharded the tensor.

This is useful for reordering axes in a single dimension or across different dimensions, and swapping sharded axes with replicated ones.

In the below example, the sharded tensor size is tensor<1x4x2xf32> , and that is preserved by the collective permute.

Пример:

sdy.mesh @mesh = <["a"=2, "b"=2, "c"=4, "d"=2, "e"=2, "f"=2]>
%1 = stablehlo.tanh(%0) {sdy.sharding = #sdy.sharding_per_value<[<@mesh, [{"a", "c"}, {"f"}, {"d", "e"}\]>]>} : tensor<8x8x8xf32>
%2 = sdy.collective_permute %1 out_sharding=<@mesh, [{"c":(1)2, "b", "f"}, {"a"}, {"e", "d"}\]> : tensor<8x8x8xf32>

Ограничения:

Must satisfy the constraints listed in Sdy_CollectiveOpInterface .
If input and output sharding have different meshes, then those meshes must have exactly the same axes and different order of device ids.
For each dimension, the product of sharding axis sizes in out_sharding must match that of the corresponding operand dimension sharding.

Traits: SameOperandsAndResultType

Interfaces: CollectiveOpInterface , InferTypeOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`out_sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`tensor`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.constant` (sdy::ConstantOp)

Constant operation

Produces an output tensor from a constant value .

See: https://github.com/openxla/stablehlo/blob/main/docs/spec.md#constant

Пример:

%output = sdy.constant dense<[[0.0, 1.0], [2.0, 3.0]]> : tensor<2x2xf32>

Traits: AlwaysSpeculatableImplTrait

Interfaces: ConditionallySpeculatable , InferTypeOpInterface , NoMemoryEffect (MemoryEffectOpInterface)

Effects: MemoryEffects::Effect{}

Атрибуты:

Атрибут	MLIR Type	Описание
`value`	::mlir::ElementsAttr	constant vector/tensor attribute

Результаты:

Результат	Описание
`output`	statically shaped tensor of any non-token type values

`sdy.data_flow_edge` (sdy::DataFlowEdgeOp)

Data flow edge op.

Синтаксис:

operation ::= `sdy.data_flow_edge` $input (`sharding````=``` $sharding^)? attr-dict `:` type($result)

A data flow edge of some op X defines a bridge between a set of sources (each is either an operand of X or an operand of X's block terminator) and a set of targets (each is either a result of X or a block argument of X), such that all sources and targets should be sharded in the same way.

An op can have multiple data flow edges that are orthogonal to one another.

Например:

  y_0, ..., y_n = while (x_0, ..., x_n)
                  ((pred_arg_0,... , pred_arg_n) { ... })
                  ((body_arg_0,..., body_arg_n) {
                    ...
                    return return_value_0, ..., return_value_n
                  })

This while op has n data flow edges, the i-th data flow edges is between sources x_i , return_value_i and targets y_i , pred_arg_i , body_arg_i .

An sdy.data_flow_edge takes as input the owner of an edge (can be any of the targets, but preferably an op result rather than a block argument), which shouldn't have any other uses. This op isn't pure because it can take an input that originally didn't have any uses.

The sdy.data_flow_edge also holds an optional sharding for all targets of the edge, and that sharding should be updated instead of the targets' sharding (if can be attached) during propagation. This is useful when an op has many edges, as it's much more efficient to:

propagate through each edge separately.
update the sharding of each edge separately instead of all targets at once (eg an op has a single immutable TensorShardingPerValueAttr for result shardings).
add each edge to the worklist separately when the sharding of a source has changed.

Propagation will propagate shardings between all sources and targets of a sdy.data_flow_edge as if it was a regular op with the sources as operands and targets as results, and an identity sdy.op_sharding_rule . That means that forward propagation is from sources to targets and backwards propagation is from targets to sources.

We don't allow the input of a sdy.data_flow_edge to be defined by an SdyDialect op, so we can assume that it's defined by an op that has unregistered sdy.sharding attribute.

Traits: SameOperandsAndResultType

Interfaces: InferTypeOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`input`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.func_data_flow_edge` (sdy::FuncDataFlowEdgeOp)

Func input/output data flow edge op.

Синтаксис:

operation ::= `sdy.func_data_flow_edge` $operand attr-dict `:` type($result)

A data flow edge op but for func arguments or call results. When its operand is a BlockArgument; it is a bridge from the caller callOp's argument to the users of the func argument. There is one func data flow edge for each func argument. When its operand is an OpResult; it is a bridge from the called funcOp's return value to the users of the call result. There is one func data flow edge for each call result.

Traits: SameOperandsAndResultType

Interfaces: InferTypeOpInterface , SymbolUserOpInterface

Operands:

Операнд	Описание
`operand`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.manual_computation` (sdy::ManualComputationOp)

Multi-device parallelism operation with manual collectives

Синтаксис:

operation ::= `sdy.manual_computation` `(`operands`)`
              `in_shardings````=```custom<StrippedTensorShardingPerValueAttr>($in_shardings)
              `out_shardings````=```custom<StrippedTensorShardingPerValueAttr>($out_shardings)
              `manual_axes````=```$manual_axes
              custom<SingleBlockRegionNoBlockId>($body)
              attr-dict
              `:`
              functional-type(operands, results)

Jump into a region written in terms of per-device local code with explicit collectives, where logical shapes match local per-device physical buffer shapes and collectives correspond exactly to physical cross-device communication.

The body is local wrt the manual_axes. Propagation will occur through the body on any free axes - those not in the manual_axes list.

Note that any unranked tensors are expected to have a sharding with rank 0, ie fully replicated.

Ограничения:

Elements in in_shardings and out_shardings must satisfy the constraints listed in TensorShardingAttr .
The number of global and local tensor inputs/outputs of the op region must match.
The manual axes must come before any free axes in each dim sharding.
The manual axes cannot introduce padding. Namely, the dimension size must be divisible by the corresponding manual axes size.
The global and local shapes of the op regions arguments/results must match.

Traits: IsolatedFromAbove , RecursiveMemoryEffects , SingleBlockImplicitTerminator<ReturnOp> , SingleBlock

Interfaces: ShardableDataFlowOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`in_shardings`	::mlir::sdy::TensorShardingPerValueAttr	Tensor sharding per operand/result of an op
`out_shardings`	::mlir::sdy::TensorShardingPerValueAttr	Tensor sharding per operand/result of an op
`manual_axes`	::mlir::sdy::ManualAxesAttr	A list of axes that a ManualComputationOp is manual on

Operands:

Операнд	Описание
`tensors`	variadic of any non-token type

Результаты:

Результат	Описание
`results`	variadic of any non-token type

`sdy.mesh` (sdy::MeshOp)

Named mesh

Синтаксис:

operation ::= `sdy.mesh` $sym_name `=` $mesh attr-dict

Defines a new named mesh. All meshes in a module must have the same number of devices (except for meshes with a single device_id). The mesh is a Symbol operation that appears in the module's SymbolTable and can be referenced by its name .

Traits: HasParent<ModuleOp>

Interfaces: Symbol

Атрибуты:

Атрибут	MLIR Type	Описание
`sym_name`	::mlir::StringAttr	string attribute
`mesh`	::mlir::sdy::MeshAttr	Mesh of axes and a list of devices

`sdy.named_computation` (sdy::NamedComputationOp)

Named computation operation

Синтаксис:

operation ::= `sdy.named_computation` `<`$name`>` `` `(` $operands `)`
              (`in_shardings````=```custom<StrippedTensorShardingPerValueAttr>($in_shardings)^)?
              (`out_shardings````=```custom<StrippedTensorShardingPerValueAttr>($out_shardings)^)?
              custom<SingleBlockRegionNoBlockId>($body)
              attr-dict
              `:` functional-type($operands, results)

Groups a computation, ie a block of operations, and gives it a name. Propagation will flow in/out of the region as if everything was inlined.

This can be used to handle propagating through call instructions to other functions. Any users of Shardy should write an import/export pass that converts their call ops to sdy.named_computation ops, duplicating/copying the body of the called function into the body of the named_computation .

The type of each block arguments and returned values in the region must be the same as the type of the operands and results type of the op.

Пример:

%1 = sdy.named_computation<"foo">(%0) (%arg1: tensor<16x32xf32>) {
  sdy.return %arg1 : tensor<16x32xf32>
} : (tensor<16x32xf32>) -> tensor<16x32xf32>

Traits: IsolatedFromAbove , RecursiveMemoryEffects , RecursivelySpeculatableImplTrait , SingleBlockImplicitTerminator<ReturnOp> , SingleBlock

Interfaces: ConditionallySpeculatable , InferTypeOpInterface , ShardableDataFlowOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`name`	::mlir::StringAttr	string attribute
`in_shardings`	::mlir::sdy::TensorShardingPerValueAttr	Tensor sharding per operand/result of an op
`out_shardings`	::mlir::sdy::TensorShardingPerValueAttr	Tensor sharding per operand/result of an op

Operands:

Операнд	Описание
`operands`	variadic of any non-token type

Результаты:

Результат	Описание
«unnamed»	variadic of any non-token type

`sdy.propagation_barrier` (sdy::PropagationBarrierOp)

Propagation barrier operation

Синтаксис:

operation ::= `sdy.propagation_barrier` $input `allowed_direction````=```$allowed_direction attr-dict `:` type($input)

This op operates like an identity op, outputting the same value it took as input. But in terms of propagation, this will only allow propagation to flow through it in a certain direction.

This prevents shardings from being propagated between the uses of the result of the barrier op and its operand.

FORWARD means shardings can only flow from the operand to the result.
BACKWARD means shardings can only flow from the result to the operand.
NONE means no sharding can propagate through this op.
Cannot specify BOTH , as this op would be redundant.

Traits: AlwaysSpeculatableImplTrait , SameOperandsAndResultType

Interfaces: ConditionallySpeculatable , InferTypeOpInterface , NoMemoryEffect (MemoryEffectOpInterface)

Effects: MemoryEffects::Effect{}

Атрибуты:

Атрибут	MLIR Type	Описание
`allowed_direction`	::mlir::sdy::PropagationDirectionAttr	propagation direction enum

Operands:

Операнд	Описание
`input`	ranked tensor of any non-token type values

Результаты:

Результат	Описание
`result`	ranked tensor of any non-token type values

`sdy.reduce_scatter` (sdy::ReduceScatterOp)

Performs a reduce-scatter communication along axes

Синтаксис:

operation ::= `sdy.reduce_scatter` ($reduction_op^)? $reduce_scatter_axes $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

Reduces chunks of a tensor along axes specified in reduce_scatter_axes , and then scatters the result along the same axes. This operation is essentially a combination of an sdy.all_reduce followed by an sdy.all_slice along the same reduce_scatter_axes .

Ограничения:

Must satisfy the constraints listed in Sdy_CollectiveOpInterface .
Elements in reduce_scatter_axes must satisfy the constraints listed in AxisRefListAttr .
Applying reduce_scatter_axes to the operand sharding gets out_sharding .

Traits: SameOperandsAndResultType

Interfaces: CollectiveOpInterface , InferTypeOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`reduce_scatter_axes`	::mlir::sdy::ListOfAxisRefListsAttr	List of axis ref lists
`reduction_op`	::mlir::sdy::ReductionOpAttr	reduction op enum
`out_sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`tensor`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.replicated_to_unreduced` (sdy::ReplicatedToUnreducedOp)

Move implicitly or explicitly replicated axes to unreduced axes.

Синтаксис:

operation ::= `sdy.replicated_to_unreduced` $axes $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

The axes should be implicitly or explicitly replicated in the operand. This operation makes them unreduced in the result. We have the following relationship:

all-reduce(replicated-to-unreduced(x, axes), axes) = x

Пример:

%1 = stablehlo.tanh(%0) {sdy.sharding = #sdy.sharding_per_value<[<@mesh, [{"b"}, {}, {}\], replicated={"c", "d"}, unreduced={"e"}>]>} : tensor<8x8x8xf32>
%2 = sdy.replicated_to_unreduced {"a", "c", "f"} %1 out_sharding=<@mesh, [{"b"}, {}, {}\], replicated={"d"}, unreduced={"a", "c", "e", "f"}> : tensor<8x8x8xf32>

Ограничения:

Must satisfy the constraints listed in Sdy_CollectiveOpInterface .
axes must satisfy the constraints listed in AxisRefListAttr .
axes must be sorted wrt the mesh.
axes are not empty.
The input and output sharding must have the same dimension shardings.
axes must be implicitly or explicitly replicated in the operand sharding.
inUnreducedAxes + axes = outUnreducedAxes.

Traits: SameOperandsAndResultType

Interfaces: InferTypeOpInterface , Sdy_CollectiveOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`axes`	::mlir::sdy::AxisRefListAttr	List of axis refs
`out_sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`tensor`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.reshard` (sdy::ReshardOp)

Reshards a tensor to a different sharding

Синтаксис:

operation ::= `sdy.reshard` $input $sharding attr-dict `:` type($result)

Reshards the input tensor with the specified sharding, which is different from the input tensor's existing sharding.

Both ShardingConstraintOp and ReshardOp attach a sharding to a tensor. Their lifespan is:

Before sharding propagation, ShardingConstraintOp is added by users.
Sharding propagation consumes ShardingConstraintOp. There is no ShardingConstraintOp in the results of sharding propagation. Instead, ReshardOp may be added if needed.
A partitioner converts a ReshardOp into a collective op (or an identity op). There should be no ReshardOp in the results of the partitioner.

Traits: AlwaysSpeculatableImplTrait , SameOperandsAndResultType

Interfaces: ConditionallySpeculatable , InferTypeOpInterface , NoMemoryEffect (MemoryEffectOpInterface) , SymbolUserOpInterface

Effects: MemoryEffects::Effect{}

Атрибуты:

Атрибут	MLIR Type	Описание
`sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`input`	any non-token type

Результаты:

Результат	Описание
`result`	any non-token type

`sdy.return` (sdy::ReturnOp)

The sdy.return operation terminates the regions attached to sdy region-based ops and any other Shardy region-based ops. It is variadic: it takes as arguments a list of values whose types can be any (but of the same kind, eg AnyTensor ) and therefore can be reused at various levels of the Shardy IR stack.

Синтаксис:

operation ::= `sdy.return` attr-dict ($results^ `:` type($results))?

Traits: AlwaysSpeculatableImplTrait , ReturnLike , Terminator

Interfaces: ConditionallySpeculatable , NoMemoryEffect (MemoryEffectOpInterface) , RegionBranchTerminatorOpInterface

Effects: MemoryEffects::Effect{}

Operands:

Операнд	Описание
`results`	variadic of any non-token type

`sdy.sharded_to_unreduced` (sdy::ShardedToUnreducedOp)

Move some sharded axes of the operand to unreduced axes of the result.

Синтаксис:

operation ::= `sdy.sharded_to_unreduced` $axes $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

The axes should be used to shard the operand. This operation makes them unreduced in the result. We have the following relationship:

all-gather(x, axes) = all-reduce(sharded-to-unreduced(x, axes), axes), where all-gather, sharded-to-unreduced, all-reduce are applied on the same axes.

Пример:

%1 = stablehlo.tanh(%0) {sdy.sharding = #sdy.sharding_per_value<[<@mesh, [{"a", "b", "c"}, {}, {"d"}\], unreduced={"e"}>]>} : tensor<8x8x8xf32>
%2 = sdy.sharded_to_unreduced [{"b", "c"}, {}, {"d"}\] %1 out_sharding=<@mesh, [{"a"}, {}, {}\], unreduced={"b", "c", "d", "e"}> : tensor<8x8x8xf32>

Ограничения:

Must satisfy the constraints listed in Sdy_CollectiveOpInterface .
Elements in axes must satisfy the constraints listed in AxisRefListAttr .
Applying axes to the operand sharding gets out_sharding .

Traits: SameOperandsAndResultType

Interfaces: InferTypeOpInterface , Sdy_CollectiveOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`axes`	::mlir::sdy::ListOfAxisRefListsAttr	List of axis ref lists
`out_sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`tensor`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.sharding_constraint` (sdy::ShardingConstraintOp)

Constrains a tensor to the specified sharding

Синтаксис:

operation ::= `sdy.sharding_constraint` $input $sharding attr-dict `:` type($result)

Attaches a sharding to an intermediate tensor (eg the result of a matmul) to indicate that this is how that tensor, or a subset of its uses, should be sharded.

If the sharding has open dimensions and unconstraint axes, it means the tensor can be further sharded along the open dimensions.

This op can either:

Have no uses (dangling) - which means the attached sharding is how the input tensor itself should be sharded.
Have uses - which means the attached sharding is how the uses of the sharding constraint op should be sharded, while other uses of the input tensor might have a different sharding (if the input tensor has no other uses then the behavior is the same as the no uses case).

Traits: SameOperandsAndResultType

Interfaces: InferTypeOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`input`	any non-token type

Результаты:

Результат	Описание
`result`	any non-token type

`sdy.sharding_group` (sdy::ShardingGroupOp)

Constrains tensors in the group to have the same sharding.

Синтаксис:

operation ::= `sdy.sharding_group` $input `group_id````=```$group_id attr-dict `:` type($input)

This op provides an interface to assign tensors to sharding groups ( groups of tensors that will be enforced to have identical shardings). During propagation, as soon as one group element is sharded, all other members will be sharded in exactly the same way. This operation takes the argument group ID and returns no result, but instead modifies the internal sharding group representation to add the input tensor to the group with the given ID.

Interfaces: InferTypeOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`group_id`	::mlir::IntegerAttr	64-bit signless integer attribute

Operands:

Операнд	Описание
`input`	ranked tensor of any non-token type values

Атрибуты

AllToAllParamAttr

All-to-all parameter

Синтаксис:

#sdy.all_to_all_param<
  ::llvm::ArrayRef<AxisRefAttr>,   # axes
  int64_t,   # src_dim
  int64_t   # tgt_dim
>

A tuple containing the axes and source/target dimensions to perform all-to-all on.

Параметры:

Параметр	C++ type	Описание
оси	`::llvm::ArrayRef<AxisRefAttr>`	the axes to perform all-to-all on
src_dim	`int64_t`	the source dimension index
tgt_dim	`int64_t`	the target dimension index

AllToAllParamListAttr

List of all-to-all parameters

Синтаксис:

#sdy.all_to_all_param_list<
  ::llvm::ArrayRef<AllToAllParamAttr>   # value
>

Параметры:

Параметр	C++ type	Описание
ценить	`::llvm::ArrayRef<AllToAllParamAttr>`

AxisRefAttr

Reference to either a full axis or a split sub-axis

Синтаксис:

#sdy.axis_ref<
  ::llvm::StringRef,   # name
  SubAxisInfoAttr   # sub_axis_info
>

Ограничения:

name must be present in the bound MeshAttr .
If sub_axis_info is present, it must satisfy the constraints of SubAxisInfoAttr .

Параметры:

Параметр	C++ type	Описание
имя	`::llvm::StringRef`	name of this axis
sub_axis_info	`SubAxisInfoAttr`	additional info if this is a sub axis

AxisRefListAttr

List of axis refs

Синтаксис:

#sdy.axis_ref_list<
  ::llvm::ArrayRef<AxisRefAttr>   # value
>

Ограничения:

Elements in value must satisfy the constraints of AxisRefAttr .
There are no duplicate axis-refs or sub-axes that overlap with one another.
No two adjacent axis-refs are consecutive sub-axes of that same full axis, ie, they can be merged into one sub-axis or the full axis.

Параметры:

Параметр	C++ type	Описание
ценить	`::llvm::ArrayRef<AxisRefAttr>`

AxisToPropagationDetailsAttr

Propagation edge flow details for a specific axis and source.

Синтаксис:

#sdy.axis_to_propagation_details<
  ::mlir::sdy::AxisRefAttr,   # axis_name
  ::mlir::sdy::EdgeValueRefAttr,   # source
  ::llvm::ArrayRef<EdgeValueRefAttr>   # targets
>

Maps a source value reference to a list of target value references along a particular axis.

Параметры:

Параметр	C++ type	Описание
axis_name	`::mlir::sdy::AxisRefAttr`	Reference to either a full axis or a split sub-axis
источник	`::mlir::sdy::EdgeValueRefAttr`	Reference to a particular index of a value edge of type `type` .
цели	`::llvm::ArrayRef<EdgeValueRefAttr>`	list of edge target values

DimMappingAttr

List of factor indices for a dimension

An empty list indicates that this is a null mapping (this is parsed/printed with * ), ie the dimension isn't mapped to any factors.

Ограничения:

There is at least one factor index.
Factor indices must be in range [0, $factor_sizes ).
If there are multiple factors, none of them can have size 1.
No duplicate factor indices.

Параметры:

Параметр	C++ type	Описание
factor_indices	`::llvm::ArrayRef<int64_t>`	factors this dimension is mapped to

DimensionShardingAttr

Dimension sharding

List of axis names to shard a tensor dimension on from major to minor, a boolean indicating whether the dimension can be further sharded, and an optional integer denoting the priority of this dimension sharding, which will respected during sharding propagation. Priorities originate from user sharding annotations and a lower value denotes a higher priority. The highest priority is assumed when the priority is missing in the annotation.

Ограничения:

Elements in axes must satisfy the constraints listed in AxisRefListAttr .
If a dimension sharding has a priority:
- The priority is greater than or equal to 0.
- The dimension has at least one axis if it is closed.

Параметры:

Параметр	C++ type	Описание
оси	`::llvm::ArrayRef<AxisRefAttr>`	axis refs
is_closed	`bool`	whether this dimension can't be further sharded
приоритет	`std::optional<int64_t>`	the priority used during user priority based propagation

EdgeValueRefAttr

Reference to a particular index of a value edge of type type .

Синтаксис:

#sdy.edge_value_ref<
  `operand` | `result`,   # type
  int64_t   # index
>

Параметры:

Параметр	C++ type	Описание
тип	`::mlir::sdy::EdgeNodeType`	an enum of type EdgeNodeType
индекс	`int64_t`	The integer index (0, 1, 2, etc.)

ListOfAxisRefListsAttr

List of axis ref lists

Синтаксис:

#sdy.list_of_axis_ref_lists<
  ::llvm::ArrayRef<AxisRefListAttr>   # value
>

Параметры:

Параметр	C++ type	Описание
ценить	`::llvm::ArrayRef<AxisRefListAttr>`

ManualAxesAttr

A list of axes that a ManualComputationOp is manual on

Синтаксис:

#sdy.manual_axes<
  ::llvm::ArrayRef<StringAttr>   # value
>

Параметры:

Параметр	C++ type	Описание
ценить	`::llvm::ArrayRef<StringAttr>`

MeshAttr

Mesh of axes and a list of devices

Синтаксис:

#sdy.mesh<
  ::llvm::ArrayRef<MeshAxisAttr>,   # axes
  ::llvm::ArrayRef<int64_t>   # device_ids
>

A mesh is a list of axes and an optional list of device IDs specifying the device ordering.

If the list of axes is empty

If the device_ids is not provided, it is an empty mesh.
If the device_ids is provided, it must be a single non-negative integer, we call it a maximal-sharding mesh .

If the list of axes is provided

If a device ID list is specified, the product of the axis sizes should match the number of devices.
If a device ID list is not specified, the implicit device ID list is iota(product(axes)). For simplicity, we also disallow specifying a device ID list that is the same as iota(product(axes)); in this case, a device ID list shouldn't be specified.
It is not a maximal-sharding mesh even if the total size of axes is 1.

Here are some examples of meshes:

An empty mesh represents a placeholder mesh that can be replaced during propagation: <[]>
A mesh without axes list and a single non-negative device ID, which is a maximal-sharding mesh: <[], device_ids=[3]>
A mesh with two axes and implicit device IDs iota(6): <["a"=2, "b"=3]>
A mesh with two axes and explicit device IDs specifying the device ordering: <["a"=3, "b"=2], device_ids=[0, 2, 4, 1, 3, 5]>

Ограничения:

Elements in device_ids should be non-negative.
If axes is empty, the size of device_ids can be 0 (empty mesh) or 1 (maximal-sharding mesh).
If axes is not empty,
- Elements in axes must not have duplicate names.
- If device_ids is specified, the original device_ids is not iota(product(axis_sizes)) and the sorted device_ids is iota(product(axis_sizes)) .

Параметры:

Параметр	C++ type	Описание
оси	`::llvm::ArrayRef<MeshAxisAttr>`	mesh axes
device_ids	`::llvm::ArrayRef<int64_t>`	explicit device ordering or maximal device id

MeshAxisAttr

Named axis in a mesh

Синтаксис:

#sdy.mesh_axis<
  ::llvm::StringRef,   # name
  int64_t   # size
>

Параметры:

Параметр	C++ type	Описание
имя	`::llvm::StringRef`	имя
размер	`int64_t`	size of this axis

OpShardingRuleAttr

Specifies how an operation can be partitioned.

Синтаксис:

#sdy.op_sharding_rule<
  ::llvm::ArrayRef<int64_t>,   # factor_sizes
  ::llvm::ArrayRef<TensorMappingAttr>,   # operand_mappings
  ::llvm::ArrayRef<TensorMappingAttr>,   # result_mappings
  ::llvm::ArrayRef<int64_t>,   # reduction_factors
  ::llvm::ArrayRef<int64_t>,   # need_replication_factors
  ::llvm::ArrayRef<int64_t>,   # permutation_factors
  ::llvm::ArrayRef<int64_t>,   # blocked_propagation_factors
  bool   # is_custom_rule
>

A sharding rule specifies how an operation can be partitioned according to various properties on the op - any attributes, the shape of operands, the shape of the results, etc. For example:

%0 = stablehlo.add %arg0, %arg1 {
    sdy.sharding_rule = #sdy.op_sharding_rule<
        ([i, j],[i, j])->([i, j])
        {i=8, j=8}>
} : tensor<8x8xf32>

%1 = stablehlo.dot_general %arg2, %arg3, contracting_dims = [1] x [0] {
  sdy.sharding_rule = #sdy.op_sharding_rule<
      ([i, k],[k, j])->([i, j])
      {i=8, j=16, k=8}>
}: (tensor<8x8xf32>, tensor<8x16xf32>) -> tensor<8x16xf32>

Note that we allow factors with size 1 even though they cannot be sharded, this is mainly for completeness as many ops such as pointwise ops have size one dimensions that correspond across operands and results.

Factor types:

reduction_factors contains the indices of factors requiring reduction, such as the contracting dimensions in a dot operation. These factors can be in operands but not in results.
need_replication_factors contains the indices of factors requiring full replication, such as the sorted dimension in a sort operation.
permutation_factors contains the indices of factors requiring collective-permute if they are sharded, such as the padding dimensions in a pad operation.
All other factors are considered as pass-through factors, ie, factors that don't require any communication if sharded in the same way across all tensors that are mapped to them.

blocked_propagation_factors contains the factors along which shardings are not allowed to be propagated. It is orthogonal to the factor types. Namely, a blocked-propagation factor can be any of the factor types.

is_custom_rule describes whether this is a rule defined by a user. Users can define sharding rules for their custom calls or overwrite the pre-defined sharding rules for the standard operations. A custom rule is always preserved/never removed.

Ограничения:

Number of operand/result mappings must match the number of operands/results of the op.
There is at least one mapping (can't have a rule for an op with no operands/results).
Rank of each TensorMappingAttr matches the rank of the corresponding tensor type.
For each group of factors ( reduction_factors , need_replication_factors , permutation_factors ):
- Elements must be in range [0, $factor_sizes ].
- No duplicate factor indices within each group and across groups.

Параметры:

Параметр	C++ type	Описание
factor_sizes	`::llvm::ArrayRef<int64_t>`	sizes of all factors in this rule
operand_mappings	`::llvm::ArrayRef<TensorMappingAttr>`	operand mappings
result_mappings	`::llvm::ArrayRef<TensorMappingAttr>`	result mappings
reduction_factors	`::llvm::ArrayRef<int64_t>`	factors requiring reduction
need_replication_factors	`::llvm::ArrayRef<int64_t>`	factors requiring full replication
permutation_factors	`::llvm::ArrayRef<int64_t>`	factors requiring collective-permute
blocked_propagation_factors	`::llvm::ArrayRef<int64_t>`	factors along which shardings are not propagated
is_custom_rule	`bool`	whether the rule is for a stablehlo.custom_call

PropagationEdgesAttr

Propagation edge metadata for all propagation steps.

Синтаксис:

#sdy.propagation_edges<
  ::llvm::ArrayRef<PropagationOneStepAttr>   # value
>

A list of per-axis propagation details for a value, grouped by step index.

Параметры:

Параметр	C++ type	Описание
ценить	`::llvm::ArrayRef<PropagationOneStepAttr>`

PropagationOneStepAttr

Per-step propagation metadata.

Синтаксис:

#sdy.propagation_one_step<
  int64_t,   # step_index
  ::llvm::ArrayRef<AxisToPropagationDetailsAttr>   # axis_entries
>

Propagation details for all axes for a single propagation step.

Параметры:

Параметр	C++ type	Описание
step_index	`int64_t`	step index
axis_entries	`::llvm::ArrayRef<AxisToPropagationDetailsAttr>`	Axis propagation details per propagation decision

SubAxisInfoAttr

Info about how this sub-axis is derived from the full axis

Синтаксис:

#sdy.sub_axis_info<
  int64_t,   # pre_size
  int64_t   # size
>

When splitting a full axis into n sub-axes, the axis is reshaped into [k_1,...,k_n], and the ith sub-axis can be expressed by the product of all axis sizes to its left m=prod(k_1,...,k_(i-1)) (aka pre-size) and size k_i. Therefore, the sub-axis-info attribute holds those two numbers and is denoted as follows: (m)k for pre-size m and size k.

Ограничения:

pre-size is at least 1.
size is greater than 1.
pre-size must divide the size of the full axis, ie, both pre-size and size divide the size of the full axis, and the sub-axis doesn't go beyond the full axis.
The size of the sub-axis isn't equal to the size of the corresponding full axis, in which case the full axis should be used instead.

Параметры:

Параметр	C++ type	Описание
pre_size	`int64_t`	product of sub-axis sizes to the left of this sub-axis
размер	`int64_t`	size of this sub-axis

TensorMappingAttr

Factor mappings for each dimension of a tensor.

Синтаксис:

#sdy.tensor_mapping<
  ::llvm::ArrayRef<DimMappingAttr>   # dim_mappings
>

Ограничения:

Elements in dim_mappings must satisfy the constraints in DimMappingAttr .
No duplicate factors indices across dimensions.

Параметры:

Параметр	C++ type	Описание
dim_mappings	`::llvm::ArrayRef<DimMappingAttr>`	dimension mappings

TensorShardingAttr

Tensor sharding

Синтаксис:

#sdy.sharding<
  ::mlir::Attribute,   # mesh_or_ref
  ::llvm::ArrayRef<DimensionShardingAttr>,   # dim_shardings
  ::llvm::ArrayRef<AxisRefAttr>,   # replicated_axes
  ::llvm::ArrayRef<AxisRefAttr>,   # unreduced_axes
  `sum` | `max` | `min`   # reduction_op
>

A tensor sharding is bound to a specific mesh, and can only reference axis names from that mesh. The dimension shardings tell us for each dimension of the tensor, along which axes (or sub-axes) it is sharded from major to minor. All other axes that don't shard a dimension are either implicitly or explicitly (if they appear in the list of replicated axes) replicated.

Note that no sharding attribute on a tensor is equivalent to a fully open tensor sharding.

The mesh this sharding is bound to can either be specified by a symbol name, referencing a corresponding MeshOp symbol, or an inlined MeshAttr .

A sharding can have unreduced axes (specified by unreduced_axes ), meaning the tensor is unreduced along these axes. For example, if the contracting dimension of a matmul is sharded along axis x in both the lhs and rhs, the result is unreduced along x . Applying an all-reduce on the tensor along the unreduced axes will make the tensor replicated along those axes. However, a tensor with unreduced axes doesn't have to be all-reduced immediately, it can remain unreduced when passed to linear operations like stablehlo.add (as long as both lhs and rhs are unreduced) and all-reduced afterwards. We assume the reduction type is sum, other reductions may be supported in the future.

Ограничения:

Elements in dim_shardings must satisfy the constraints listed in DimensionShardingAttr .
Elements in replicated_axes must satisfy the constraints listed in AxisRefListAttr .
Elements in unreduced_axes must satisfy the constraints listed in AxisRefListAttr .
If the corresponding tensor type isn't a ShapedType , the sharding must have rank 0 and no replicated axes.
If it is a ShapedType , then:
- The tensor should have a rank.
- The number of dimension shardings is equal to the rank of the tensor.
- Dimensions of size 0 aren't sharded.
There are no duplicate axis-refs or sub-axes that overlap with one another across dim_shardings , replicated_axes , and unreduced_axes .
Items in replicated_axes and unreduced_axes are ordered wrt mesh_or_ref (see AxisRefAttr::getMeshComparator ).

Параметры:

Параметр	C++ type	Описание
mesh_or_ref	`::mlir::Attribute`	mesh attr or flat mesh symbol reference attr
dim_shardings	`::llvm::ArrayRef<DimensionShardingAttr>`	dimension shardings
replicated_axes	`::llvm::ArrayRef<AxisRefAttr>`	axis refs
unreduced_axes	`::llvm::ArrayRef<AxisRefAttr>`	axis refs
reduction_op	`::mlir::sdy::ReductionOp`	an enum of type ReductionOp

TensorShardingPerValueAttr

Tensor sharding per operand/result of an op

Синтаксис:

#sdy.sharding_per_value<
  ::llvm::ArrayRef<TensorShardingAttr>   # shardings
>

A list of TensorShardingAttr s, one for each operand/result of an op.

Ограничения:

Elements in shardings must satisfy the constraints of TensorShardingAttr .

Параметры:

Параметр	C++ type	Описание
shardings	`::llvm::ArrayRef<TensorShardingAttr>`	sharding per value

Перечисления

EdgeNodeType

Edge node type enum

Случаи:

Символ	Ценить	Нить
ОПЕРАНД	`0`	операнд
РЕЗУЛЬТАТ	`1`	результат

PropagationDirection

Propagation direction enum

Случаи:

Символ	Ценить	Нить
НИКТО	`0`	НИКТО
ВПЕРЕД	`1`	ВПЕРЕД
НАЗАД	`2`	НАЗАД
ОБА	`3`	ОБА

ReductionOp

Reduction op enum

Случаи:

Символ	Ценить	Нить
СУММА	`0`	сумма
МАКС	`1`	макс
МИН	`2`	мин

The Shardy (SDY) dialect

The Shardy (SDY) dialect defines an axis-based tensor sharding representation and additional API components to attach shardings to tensors.

Version log: 0.0.1: Add unreduced axes to TensorShardingAttr.

Операции

`sdy.all_gather` (sdy::AllGatherOp)

Performs an all-gather communication along axes

Синтаксис:

operation ::= `sdy.all_gather` $gathering_axes $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

Gathers chunks of a tensor along axes specified in gathering_axes .

Пример:

%1 = stablehlo.tanh(%0) {sdy.sharding = #sdy.sharding_per_value<[<@mesh, [{"a", "b", "c"}, {}, {"d"}\]>]>} : tensor<8x8x8xf32>
%2 = sdy.all_gather [{"b", "c"}, {}, {"d"}\] %1 out_sharding=<@mesh, [{"a"}, {}, {}\]> : tensor<8x8x8xf32>

Ограничения:

Must satisfy the constraints listed in Sdy_CollectiveOpInterface .
Elements in gathering_axes must satisfy the constraints listed in AxisRefListAttr .
Applying gathering_axes to the operand sharding gets out_sharding .

Traits: SameOperandsAndResultType

Interfaces: InferTypeOpInterface , Sdy_CollectiveOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`gathering_axes`	::mlir::sdy::ListOfAxisRefListsAttr	List of axis ref lists
`out_sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`tensor`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.all_reduce` (sdy::AllReduceOp)

Perform an all-reduce comunication along axes

Синтаксис:

operation ::= `sdy.all_reduce` ($reduction_op^)? $reduction_axes $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

Reduces chunks of a tensor along axes specified in reduction_axes . The order of reduction_axes is not important for the result, but can affect the order of the corresponding replica groups.

Ограничения:

Must satisfy the constraints listed in Sdy_CollectiveOpInterface .
reduction_axes must satisfy the constraints listed in AxisRefListAttr .
reduction_axes must be sorted wrt the mesh.
The operand sharding and out_sharding must have equivalent dimension shardings.
reduction_axes must not overlap with the operand dimension sharding and replicated axes (it can overlap with unreduced axes).
reduction_axes must not overlap with the unreduced axes of out_sharding . In other words, out_sharding must be be replicated along reduction_axes (implicitly or explicitly).

Traits: SameOperandsAndResultType

Interfaces: CollectiveOpInterface , InferTypeOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`reduction_axes`	::mlir::sdy::AxisRefListAttr	List of axis refs
`reduction_op`	::mlir::sdy::ReductionOpAttr	reduction op enum
`out_sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`tensor`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.all_slice` (sdy::AllSliceOp)

Performs a dynamic-slice operation along axes

Синтаксис:

operation ::= `sdy.all_slice` $slicing_axes $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

Slices chunks of a tensor along axes specified in slicing_axes . There is an algebric duality between sdy.all_slice and sdy.all_gather .

Пример:

%1 = stablehlo.tanh(%0) {sdy.sharding = #sdy.sharding_per_value<[<@mesh, [{"a"}, {}, {}\]>]>} : tensor<8x8x8xf32>
%2 = sdy.all_slice [{"b", "c"}, {}, {"d"}\] %1 out_sharding=<@mesh, [{"a", "b", "c"}, {}, {"d"}\]> : tensor<8x8x8xf32>

Ограничения:

Must satisfy the constraints listed in Sdy_CollectiveOpInterface .
Elements in slicing_axes must satisfy the constraints listed in AxisRefListAttr .
Applying slicing_axes to the operand sharding gets out_sharding .

Traits: SameOperandsAndResultType

Interfaces: CollectiveOpInterface , InferTypeOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`slicing_axes`	::mlir::sdy::ListOfAxisRefListsAttr	List of axis ref lists
`out_sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`tensor`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.all_to_all` (sdy::AllToAllOp)

Performs an all-to-all communication along axes

Синтаксис:

operation ::= `sdy.all_to_all` $params $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

The all-to-all will be applied to the sharding of the operand ( tensor ) to obtain the sharding of the result ( out_sharding ).

Пример:

%1 = stablehlo.tanh(%0) {sdy.sharding = #sdy.sharding_per_value<[<@mesh, [{"a", "b"}, {"c"}, {}, {}\]>]>} : tensor<8x8x4x4x32>
%2 = sdy.all_to_all [{"b"}: 0->2, {"c"}: 1->3] %1 out_sharding=<@mesh, [{"a"}, {}, {"b"}, {"c"}\]> : tensor<8x8x4x4x32>

Ограничения:

Must satisfy the constraints listed in Sdy_CollectiveOpInterface .
The parameter list must not be empty.
For each parameter in params :
- Elements in axes must satisfy the constraints of AxisRefAttr .
- src_dim and tgt_dim must be valid dimensions (non-negative and less than rank of tensor).
- Any src_dim or tgt_dim must be unique across all parameters.
- src_dim must be sorted in ascending order across all parameters.
Moving axes from src_dim to tgt_dim in the operand sharding gets out_sharding .

Traits: SameOperandsAndResultType

Interfaces: InferTypeOpInterface , Sdy_CollectiveOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`params`	::mlir::sdy::AllToAllParamListAttr	List of all-to-all parameters
`out_sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`tensor`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.collective_permute` (sdy::CollectivePermuteOp)

Performs a collective-permute communication to replace axes

Синтаксис:

operation ::= `sdy.collective_permute` $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

Sends a chunk of the input tensor from each device to another to reorder/replace the axes that shard the tensor.

This is useful for reordering axes in a single dimension or across different dimensions, and swapping sharded axes with replicated ones.

In the below example, the sharded tensor size is tensor<1x4x2xf32> , and that is preserved by the collective permute.

Пример:

sdy.mesh @mesh = <["a"=2, "b"=2, "c"=4, "d"=2, "e"=2, "f"=2]>
%1 = stablehlo.tanh(%0) {sdy.sharding = #sdy.sharding_per_value<[<@mesh, [{"a", "c"}, {"f"}, {"d", "e"}\]>]>} : tensor<8x8x8xf32>
%2 = sdy.collective_permute %1 out_sharding=<@mesh, [{"c":(1)2, "b", "f"}, {"a"}, {"e", "d"}\]> : tensor<8x8x8xf32>

Ограничения:

Must satisfy the constraints listed in Sdy_CollectiveOpInterface .
If input and output sharding have different meshes, then those meshes must have exactly the same axes and different order of device ids.
For each dimension, the product of sharding axis sizes in out_sharding must match that of the corresponding operand dimension sharding.

Traits: SameOperandsAndResultType

Interfaces: CollectiveOpInterface , InferTypeOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`out_sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`tensor`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.constant` (sdy::ConstantOp)

Constant operation

Produces an output tensor from a constant value .

See: https://github.com/openxla/stablehlo/blob/main/docs/spec.md#constant

Пример:

%output = sdy.constant dense<[[0.0, 1.0], [2.0, 3.0]]> : tensor<2x2xf32>

Traits: AlwaysSpeculatableImplTrait

Interfaces: ConditionallySpeculatable , InferTypeOpInterface , NoMemoryEffect (MemoryEffectOpInterface)

Effects: MemoryEffects::Effect{}

Атрибуты:

Атрибут	MLIR Type	Описание
`value`	::mlir::ElementsAttr	constant vector/tensor attribute

Результаты:

Результат	Описание
`output`	statically shaped tensor of any non-token type values

`sdy.data_flow_edge` (sdy::DataFlowEdgeOp)

Data flow edge op.

Синтаксис:

operation ::= `sdy.data_flow_edge` $input (`sharding````=``` $sharding^)? attr-dict `:` type($result)

An op can have multiple data flow edges that are orthogonal to one another.

Например:

  y_0, ..., y_n = while (x_0, ..., x_n)
                  ((pred_arg_0,... , pred_arg_n) { ... })
                  ((body_arg_0,..., body_arg_n) {
                    ...
                    return return_value_0, ..., return_value_n
                  })

This while op has n data flow edges, the i-th data flow edges is between sources x_i , return_value_i and targets y_i , pred_arg_i , body_arg_i .

propagate through each edge separately.
update the sharding of each edge separately instead of all targets at once (eg an op has a single immutable TensorShardingPerValueAttr for result shardings).
add each edge to the worklist separately when the sharding of a source has changed.

We don't allow the input of a sdy.data_flow_edge to be defined by an SdyDialect op, so we can assume that it's defined by an op that has unregistered sdy.sharding attribute.

Traits: SameOperandsAndResultType

Interfaces: InferTypeOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`input`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.func_data_flow_edge` (sdy::FuncDataFlowEdgeOp)

Func input/output data flow edge op.

Синтаксис:

operation ::= `sdy.func_data_flow_edge` $operand attr-dict `:` type($result)

Traits: SameOperandsAndResultType

Interfaces: InferTypeOpInterface , SymbolUserOpInterface

Operands:

Операнд	Описание
`operand`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.manual_computation` (sdy::ManualComputationOp)

Multi-device parallelism operation with manual collectives

Синтаксис:

operation ::= `sdy.manual_computation` `(`operands`)`
              `in_shardings````=```custom<StrippedTensorShardingPerValueAttr>($in_shardings)
              `out_shardings````=```custom<StrippedTensorShardingPerValueAttr>($out_shardings)
              `manual_axes````=```$manual_axes
              custom<SingleBlockRegionNoBlockId>($body)
              attr-dict
              `:`
              functional-type(operands, results)

The body is local wrt the manual_axes. Propagation will occur through the body on any free axes - those not in the manual_axes list.

Note that any unranked tensors are expected to have a sharding with rank 0, ie fully replicated.

Ограничения:

Elements in in_shardings and out_shardings must satisfy the constraints listed in TensorShardingAttr .
The number of global and local tensor inputs/outputs of the op region must match.
The manual axes must come before any free axes in each dim sharding.
The manual axes cannot introduce padding. Namely, the dimension size must be divisible by the corresponding manual axes size.
The global and local shapes of the op regions arguments/results must match.

Traits: IsolatedFromAbove , RecursiveMemoryEffects , SingleBlockImplicitTerminator<ReturnOp> , SingleBlock

Interfaces: ShardableDataFlowOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`in_shardings`	::mlir::sdy::TensorShardingPerValueAttr	Tensor sharding per operand/result of an op
`out_shardings`	::mlir::sdy::TensorShardingPerValueAttr	Tensor sharding per operand/result of an op
`manual_axes`	::mlir::sdy::ManualAxesAttr	A list of axes that a ManualComputationOp is manual on

Operands:

Операнд	Описание
`tensors`	variadic of any non-token type

Результаты:

Результат	Описание
`results`	variadic of any non-token type

`sdy.mesh` (sdy::MeshOp)

Named mesh

Синтаксис:

operation ::= `sdy.mesh` $sym_name `=` $mesh attr-dict

Traits: HasParent<ModuleOp>

Interfaces: Symbol

Атрибуты:

Атрибут	MLIR Type	Описание
`sym_name`	::mlir::StringAttr	string attribute
`mesh`	::mlir::sdy::MeshAttr	Mesh of axes and a list of devices

`sdy.named_computation` (sdy::NamedComputationOp)

Named computation operation

Синтаксис:

operation ::= `sdy.named_computation` `<`$name`>` `` `(` $operands `)`
              (`in_shardings````=```custom<StrippedTensorShardingPerValueAttr>($in_shardings)^)?
              (`out_shardings````=```custom<StrippedTensorShardingPerValueAttr>($out_shardings)^)?
              custom<SingleBlockRegionNoBlockId>($body)
              attr-dict
              `:` functional-type($operands, results)

Groups a computation, ie a block of operations, and gives it a name. Propagation will flow in/out of the region as if everything was inlined.

The type of each block arguments and returned values in the region must be the same as the type of the operands and results type of the op.

Пример:

%1 = sdy.named_computation<"foo">(%0) (%arg1: tensor<16x32xf32>) {
  sdy.return %arg1 : tensor<16x32xf32>
} : (tensor<16x32xf32>) -> tensor<16x32xf32>

Traits: IsolatedFromAbove , RecursiveMemoryEffects , RecursivelySpeculatableImplTrait , SingleBlockImplicitTerminator<ReturnOp> , SingleBlock

Interfaces: ConditionallySpeculatable , InferTypeOpInterface , ShardableDataFlowOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`name`	::mlir::StringAttr	string attribute
`in_shardings`	::mlir::sdy::TensorShardingPerValueAttr	Tensor sharding per operand/result of an op
`out_shardings`	::mlir::sdy::TensorShardingPerValueAttr	Tensor sharding per operand/result of an op

Operands:

Операнд	Описание
`operands`	variadic of any non-token type

Результаты:

Результат	Описание
«unnamed»	variadic of any non-token type

`sdy.propagation_barrier` (sdy::PropagationBarrierOp)

Propagation barrier operation

Синтаксис:

operation ::= `sdy.propagation_barrier` $input `allowed_direction````=```$allowed_direction attr-dict `:` type($input)

This op operates like an identity op, outputting the same value it took as input. But in terms of propagation, this will only allow propagation to flow through it in a certain direction.

This prevents shardings from being propagated between the uses of the result of the barrier op and its operand.

FORWARD means shardings can only flow from the operand to the result.
BACKWARD means shardings can only flow from the result to the operand.
NONE means no sharding can propagate through this op.
Cannot specify BOTH , as this op would be redundant.

Traits: AlwaysSpeculatableImplTrait , SameOperandsAndResultType

Interfaces: ConditionallySpeculatable , InferTypeOpInterface , NoMemoryEffect (MemoryEffectOpInterface)

Effects: MemoryEffects::Effect{}

Атрибуты:

Атрибут	MLIR Type	Описание
`allowed_direction`	::mlir::sdy::PropagationDirectionAttr	propagation direction enum

Operands:

Операнд	Описание
`input`	ranked tensor of any non-token type values

Результаты:

Результат	Описание
`result`	ranked tensor of any non-token type values

`sdy.reduce_scatter` (sdy::ReduceScatterOp)

Performs a reduce-scatter communication along axes

Синтаксис:

operation ::= `sdy.reduce_scatter` ($reduction_op^)? $reduce_scatter_axes $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

Ограничения:

Must satisfy the constraints listed in Sdy_CollectiveOpInterface .
Elements in reduce_scatter_axes must satisfy the constraints listed in AxisRefListAttr .
Applying reduce_scatter_axes to the operand sharding gets out_sharding .

Traits: SameOperandsAndResultType

Interfaces: CollectiveOpInterface , InferTypeOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`reduce_scatter_axes`	::mlir::sdy::ListOfAxisRefListsAttr	List of axis ref lists
`reduction_op`	::mlir::sdy::ReductionOpAttr	reduction op enum
`out_sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`tensor`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.replicated_to_unreduced` (sdy::ReplicatedToUnreducedOp)

Move implicitly or explicitly replicated axes to unreduced axes.

Синтаксис:

operation ::= `sdy.replicated_to_unreduced` $axes $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

The axes should be implicitly or explicitly replicated in the operand. This operation makes them unreduced in the result. We have the following relationship:

all-reduce(replicated-to-unreduced(x, axes), axes) = x

Пример:

%1 = stablehlo.tanh(%0) {sdy.sharding = #sdy.sharding_per_value<[<@mesh, [{"b"}, {}, {}\], replicated={"c", "d"}, unreduced={"e"}>]>} : tensor<8x8x8xf32>
%2 = sdy.replicated_to_unreduced {"a", "c", "f"} %1 out_sharding=<@mesh, [{"b"}, {}, {}\], replicated={"d"}, unreduced={"a", "c", "e", "f"}> : tensor<8x8x8xf32>

Ограничения:

Must satisfy the constraints listed in Sdy_CollectiveOpInterface .
axes must satisfy the constraints listed in AxisRefListAttr .
axes must be sorted wrt the mesh.
axes are not empty.
The input and output sharding must have the same dimension shardings.
axes must be implicitly or explicitly replicated in the operand sharding.
inUnreducedAxes + axes = outUnreducedAxes.

Traits: SameOperandsAndResultType

Interfaces: InferTypeOpInterface , Sdy_CollectiveOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`axes`	::mlir::sdy::AxisRefListAttr	List of axis refs
`out_sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`tensor`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.reshard` (sdy::ReshardOp)

Reshards a tensor to a different sharding

Синтаксис:

operation ::= `sdy.reshard` $input $sharding attr-dict `:` type($result)

Reshards the input tensor with the specified sharding, which is different from the input tensor's existing sharding.

Both ShardingConstraintOp and ReshardOp attach a sharding to a tensor. Their lifespan is:

Before sharding propagation, ShardingConstraintOp is added by users.
Sharding propagation consumes ShardingConstraintOp. There is no ShardingConstraintOp in the results of sharding propagation. Instead, ReshardOp may be added if needed.
A partitioner converts a ReshardOp into a collective op (or an identity op). There should be no ReshardOp in the results of the partitioner.

Traits: AlwaysSpeculatableImplTrait , SameOperandsAndResultType

Interfaces: ConditionallySpeculatable , InferTypeOpInterface , NoMemoryEffect (MemoryEffectOpInterface) , SymbolUserOpInterface

Effects: MemoryEffects::Effect{}

Атрибуты:

Атрибут	MLIR Type	Описание
`sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`input`	any non-token type

Результаты:

Результат	Описание
`result`	any non-token type

`sdy.return` (sdy::ReturnOp)

Синтаксис:

operation ::= `sdy.return` attr-dict ($results^ `:` type($results))?

Traits: AlwaysSpeculatableImplTrait , ReturnLike , Terminator

Interfaces: ConditionallySpeculatable , NoMemoryEffect (MemoryEffectOpInterface) , RegionBranchTerminatorOpInterface

Effects: MemoryEffects::Effect{}

Operands:

Операнд	Описание
`results`	variadic of any non-token type

`sdy.sharded_to_unreduced` (sdy::ShardedToUnreducedOp)

Move some sharded axes of the operand to unreduced axes of the result.

Синтаксис:

operation ::= `sdy.sharded_to_unreduced` $axes $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

The axes should be used to shard the operand. This operation makes them unreduced in the result. We have the following relationship:

all-gather(x, axes) = all-reduce(sharded-to-unreduced(x, axes), axes), where all-gather, sharded-to-unreduced, all-reduce are applied on the same axes.

Пример:

%1 = stablehlo.tanh(%0) {sdy.sharding = #sdy.sharding_per_value<[<@mesh, [{"a", "b", "c"}, {}, {"d"}\], unreduced={"e"}>]>} : tensor<8x8x8xf32>
%2 = sdy.sharded_to_unreduced [{"b", "c"}, {}, {"d"}\] %1 out_sharding=<@mesh, [{"a"}, {}, {}\], unreduced={"b", "c", "d", "e"}> : tensor<8x8x8xf32>

Ограничения:

Must satisfy the constraints listed in Sdy_CollectiveOpInterface .
Elements in axes must satisfy the constraints listed in AxisRefListAttr .
Applying axes to the operand sharding gets out_sharding .

Traits: SameOperandsAndResultType

Interfaces: InferTypeOpInterface , Sdy_CollectiveOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`axes`	::mlir::sdy::ListOfAxisRefListsAttr	List of axis ref lists
`out_sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`tensor`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.sharding_constraint` (sdy::ShardingConstraintOp)

Constrains a tensor to the specified sharding

Синтаксис:

operation ::= `sdy.sharding_constraint` $input $sharding attr-dict `:` type($result)

Attaches a sharding to an intermediate tensor (eg the result of a matmul) to indicate that this is how that tensor, or a subset of its uses, should be sharded.

If the sharding has open dimensions and unconstraint axes, it means the tensor can be further sharded along the open dimensions.

This op can either:

Have no uses (dangling) - which means the attached sharding is how the input tensor itself should be sharded.
Have uses - which means the attached sharding is how the uses of the sharding constraint op should be sharded, while other uses of the input tensor might have a different sharding (if the input tensor has no other uses then the behavior is the same as the no uses case).

Traits: SameOperandsAndResultType

Interfaces: InferTypeOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`input`	any non-token type

Результаты:

Результат	Описание
`result`	any non-token type

`sdy.sharding_group` (sdy::ShardingGroupOp)

Constrains tensors in the group to have the same sharding.

Синтаксис:

operation ::= `sdy.sharding_group` $input `group_id````=```$group_id attr-dict `:` type($input)

Interfaces: InferTypeOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`group_id`	::mlir::IntegerAttr	64-bit signless integer attribute

Operands:

Операнд	Описание
`input`	ranked tensor of any non-token type values

Атрибуты

AllToAllParamAttr

All-to-all parameter

Синтаксис:

#sdy.all_to_all_param<
  ::llvm::ArrayRef<AxisRefAttr>,   # axes
  int64_t,   # src_dim
  int64_t   # tgt_dim
>

A tuple containing the axes and source/target dimensions to perform all-to-all on.

Параметры:

Параметр	C++ type	Описание
оси	`::llvm::ArrayRef<AxisRefAttr>`	the axes to perform all-to-all on
src_dim	`int64_t`	the source dimension index
tgt_dim	`int64_t`	the target dimension index

AllToAllParamListAttr

List of all-to-all parameters

Синтаксис:

#sdy.all_to_all_param_list<
  ::llvm::ArrayRef<AllToAllParamAttr>   # value
>

Параметры:

Параметр	C++ type	Описание
ценить	`::llvm::ArrayRef<AllToAllParamAttr>`

AxisRefAttr

Reference to either a full axis or a split sub-axis

Синтаксис:

#sdy.axis_ref<
  ::llvm::StringRef,   # name
  SubAxisInfoAttr   # sub_axis_info
>

Ограничения:

name must be present in the bound MeshAttr .
If sub_axis_info is present, it must satisfy the constraints of SubAxisInfoAttr .

Параметры:

Параметр	C++ type	Описание
имя	`::llvm::StringRef`	name of this axis
sub_axis_info	`SubAxisInfoAttr`	additional info if this is a sub axis

AxisRefListAttr

List of axis refs

Синтаксис:

#sdy.axis_ref_list<
  ::llvm::ArrayRef<AxisRefAttr>   # value
>

Ограничения:

Elements in value must satisfy the constraints of AxisRefAttr .
There are no duplicate axis-refs or sub-axes that overlap with one another.
No two adjacent axis-refs are consecutive sub-axes of that same full axis, ie, they can be merged into one sub-axis or the full axis.

Параметры:

Параметр	C++ type	Описание
ценить	`::llvm::ArrayRef<AxisRefAttr>`

AxisToPropagationDetailsAttr

Propagation edge flow details for a specific axis and source.

Синтаксис:

#sdy.axis_to_propagation_details<
  ::mlir::sdy::AxisRefAttr,   # axis_name
  ::mlir::sdy::EdgeValueRefAttr,   # source
  ::llvm::ArrayRef<EdgeValueRefAttr>   # targets
>

Maps a source value reference to a list of target value references along a particular axis.

Параметры:

Параметр	C++ type	Описание
axis_name	`::mlir::sdy::AxisRefAttr`	Reference to either a full axis or a split sub-axis
источник	`::mlir::sdy::EdgeValueRefAttr`	Reference to a particular index of a value edge of type `type` .
цели	`::llvm::ArrayRef<EdgeValueRefAttr>`	list of edge target values

DimMappingAttr

List of factor indices for a dimension

An empty list indicates that this is a null mapping (this is parsed/printed with * ), ie the dimension isn't mapped to any factors.

Ограничения:

There is at least one factor index.
Factor indices must be in range [0, $factor_sizes ).
If there are multiple factors, none of them can have size 1.
No duplicate factor indices.

Параметры:

Параметр	C++ type	Описание
factor_indices	`::llvm::ArrayRef<int64_t>`	factors this dimension is mapped to

DimensionShardingAttr

Dimension sharding

Ограничения:

Elements in axes must satisfy the constraints listed in AxisRefListAttr .
If a dimension sharding has a priority:
- The priority is greater than or equal to 0.
- The dimension has at least one axis if it is closed.

Параметры:

Параметр	C++ type	Описание
оси	`::llvm::ArrayRef<AxisRefAttr>`	axis refs
is_closed	`bool`	whether this dimension can't be further sharded
приоритет	`std::optional<int64_t>`	the priority used during user priority based propagation

EdgeValueRefAttr

Reference to a particular index of a value edge of type type .

Синтаксис:

#sdy.edge_value_ref<
  `operand` | `result`,   # type
  int64_t   # index
>

Параметры:

Параметр	C++ type	Описание
тип	`::mlir::sdy::EdgeNodeType`	an enum of type EdgeNodeType
индекс	`int64_t`	The integer index (0, 1, 2, etc.)

ListOfAxisRefListsAttr

List of axis ref lists

Синтаксис:

#sdy.list_of_axis_ref_lists<
  ::llvm::ArrayRef<AxisRefListAttr>   # value
>

Параметры:

Параметр	C++ type	Описание
ценить	`::llvm::ArrayRef<AxisRefListAttr>`

ManualAxesAttr

A list of axes that a ManualComputationOp is manual on

Синтаксис:

#sdy.manual_axes<
  ::llvm::ArrayRef<StringAttr>   # value
>

Параметры:

Параметр	C++ type	Описание
ценить	`::llvm::ArrayRef<StringAttr>`

MeshAttr

Mesh of axes and a list of devices

Синтаксис:

#sdy.mesh<
  ::llvm::ArrayRef<MeshAxisAttr>,   # axes
  ::llvm::ArrayRef<int64_t>   # device_ids
>

A mesh is a list of axes and an optional list of device IDs specifying the device ordering.

If the list of axes is empty

If the device_ids is not provided, it is an empty mesh.
If the device_ids is provided, it must be a single non-negative integer, we call it a maximal-sharding mesh .

If the list of axes is provided

If a device ID list is specified, the product of the axis sizes should match the number of devices.
If a device ID list is not specified, the implicit device ID list is iota(product(axes)). For simplicity, we also disallow specifying a device ID list that is the same as iota(product(axes)); in this case, a device ID list shouldn't be specified.
It is not a maximal-sharding mesh even if the total size of axes is 1.

Here are some examples of meshes:

An empty mesh represents a placeholder mesh that can be replaced during propagation: <[]>
A mesh without axes list and a single non-negative device ID, which is a maximal-sharding mesh: <[], device_ids=[3]>
A mesh with two axes and implicit device IDs iota(6): <["a"=2, "b"=3]>
A mesh with two axes and explicit device IDs specifying the device ordering: <["a"=3, "b"=2], device_ids=[0, 2, 4, 1, 3, 5]>

Ограничения:

Elements in device_ids should be non-negative.
If axes is empty, the size of device_ids can be 0 (empty mesh) or 1 (maximal-sharding mesh).
If axes is not empty,
- Elements in axes must not have duplicate names.
- If device_ids is specified, the original device_ids is not iota(product(axis_sizes)) and the sorted device_ids is iota(product(axis_sizes)) .

Параметры:

Параметр	C++ type	Описание
оси	`::llvm::ArrayRef<MeshAxisAttr>`	mesh axes
device_ids	`::llvm::ArrayRef<int64_t>`	explicit device ordering or maximal device id

MeshAxisAttr

Named axis in a mesh

Синтаксис:

#sdy.mesh_axis<
  ::llvm::StringRef,   # name
  int64_t   # size
>

Параметры:

Параметр	C++ type	Описание
имя	`::llvm::StringRef`	имя
размер	`int64_t`	size of this axis

OpShardingRuleAttr

Specifies how an operation can be partitioned.

Синтаксис:

#sdy.op_sharding_rule<
  ::llvm::ArrayRef<int64_t>,   # factor_sizes
  ::llvm::ArrayRef<TensorMappingAttr>,   # operand_mappings
  ::llvm::ArrayRef<TensorMappingAttr>,   # result_mappings
  ::llvm::ArrayRef<int64_t>,   # reduction_factors
  ::llvm::ArrayRef<int64_t>,   # need_replication_factors
  ::llvm::ArrayRef<int64_t>,   # permutation_factors
  ::llvm::ArrayRef<int64_t>,   # blocked_propagation_factors
  bool   # is_custom_rule
>

A sharding rule specifies how an operation can be partitioned according to various properties on the op - any attributes, the shape of operands, the shape of the results, etc. For example:

%0 = stablehlo.add %arg0, %arg1 {
    sdy.sharding_rule = #sdy.op_sharding_rule<
        ([i, j],[i, j])->([i, j])
        {i=8, j=8}>
} : tensor<8x8xf32>

%1 = stablehlo.dot_general %arg2, %arg3, contracting_dims = [1] x [0] {
  sdy.sharding_rule = #sdy.op_sharding_rule<
      ([i, k],[k, j])->([i, j])
      {i=8, j=16, k=8}>
}: (tensor<8x8xf32>, tensor<8x16xf32>) -> tensor<8x16xf32>

Factor types:

reduction_factors contains the indices of factors requiring reduction, such as the contracting dimensions in a dot operation. These factors can be in operands but not in results.
need_replication_factors contains the indices of factors requiring full replication, such as the sorted dimension in a sort operation.
permutation_factors contains the indices of factors requiring collective-permute if they are sharded, such as the padding dimensions in a pad operation.
All other factors are considered as pass-through factors, ie, factors that don't require any communication if sharded in the same way across all tensors that are mapped to them.

Ограничения:

Number of operand/result mappings must match the number of operands/results of the op.
There is at least one mapping (can't have a rule for an op with no operands/results).
Rank of each TensorMappingAttr matches the rank of the corresponding tensor type.
For each group of factors ( reduction_factors , need_replication_factors , permutation_factors ):
- Elements must be in range [0, $factor_sizes ].
- No duplicate factor indices within each group and across groups.

Параметры:

Параметр	C++ type	Описание
factor_sizes	`::llvm::ArrayRef<int64_t>`	sizes of all factors in this rule
operand_mappings	`::llvm::ArrayRef<TensorMappingAttr>`	operand mappings
result_mappings	`::llvm::ArrayRef<TensorMappingAttr>`	result mappings
reduction_factors	`::llvm::ArrayRef<int64_t>`	factors requiring reduction
need_replication_factors	`::llvm::ArrayRef<int64_t>`	factors requiring full replication
permutation_factors	`::llvm::ArrayRef<int64_t>`	factors requiring collective-permute
blocked_propagation_factors	`::llvm::ArrayRef<int64_t>`	factors along which shardings are not propagated
is_custom_rule	`bool`	whether the rule is for a stablehlo.custom_call

PropagationEdgesAttr

Propagation edge metadata for all propagation steps.

Синтаксис:

#sdy.propagation_edges<
  ::llvm::ArrayRef<PropagationOneStepAttr>   # value
>

A list of per-axis propagation details for a value, grouped by step index.

Параметры:

Параметр	C++ type	Описание
ценить	`::llvm::ArrayRef<PropagationOneStepAttr>`

PropagationOneStepAttr

Per-step propagation metadata.

Синтаксис:

#sdy.propagation_one_step<
  int64_t,   # step_index
  ::llvm::ArrayRef<AxisToPropagationDetailsAttr>   # axis_entries
>

Propagation details for all axes for a single propagation step.

Параметры:

Параметр	C++ type	Описание
step_index	`int64_t`	step index
axis_entries	`::llvm::ArrayRef<AxisToPropagationDetailsAttr>`	Axis propagation details per propagation decision

SubAxisInfoAttr

Info about how this sub-axis is derived from the full axis

Синтаксис:

#sdy.sub_axis_info<
  int64_t,   # pre_size
  int64_t   # size
>

Ограничения:

pre-size is at least 1.
size is greater than 1.
pre-size must divide the size of the full axis, ie, both pre-size and size divide the size of the full axis, and the sub-axis doesn't go beyond the full axis.
The size of the sub-axis isn't equal to the size of the corresponding full axis, in which case the full axis should be used instead.

Параметры:

Параметр	C++ type	Описание
pre_size	`int64_t`	product of sub-axis sizes to the left of this sub-axis
размер	`int64_t`	size of this sub-axis

TensorMappingAttr

Factor mappings for each dimension of a tensor.

Синтаксис:

#sdy.tensor_mapping<
  ::llvm::ArrayRef<DimMappingAttr>   # dim_mappings
>

Ограничения:

Elements in dim_mappings must satisfy the constraints in DimMappingAttr .
No duplicate factors indices across dimensions.

Параметры:

Параметр	C++ type	Описание
dim_mappings	`::llvm::ArrayRef<DimMappingAttr>`	dimension mappings

TensorShardingAttr

Tensor sharding

Синтаксис:

#sdy.sharding<
  ::mlir::Attribute,   # mesh_or_ref
  ::llvm::ArrayRef<DimensionShardingAttr>,   # dim_shardings
  ::llvm::ArrayRef<AxisRefAttr>,   # replicated_axes
  ::llvm::ArrayRef<AxisRefAttr>,   # unreduced_axes
  `sum` | `max` | `min`   # reduction_op
>

Note that no sharding attribute on a tensor is equivalent to a fully open tensor sharding.

The mesh this sharding is bound to can either be specified by a symbol name, referencing a corresponding MeshOp symbol, or an inlined MeshAttr .

Ограничения:

Elements in dim_shardings must satisfy the constraints listed in DimensionShardingAttr .
Elements in replicated_axes must satisfy the constraints listed in AxisRefListAttr .
Elements in unreduced_axes must satisfy the constraints listed in AxisRefListAttr .
If the corresponding tensor type isn't a ShapedType , the sharding must have rank 0 and no replicated axes.
If it is a ShapedType , then:
- The tensor should have a rank.
- The number of dimension shardings is equal to the rank of the tensor.
- Dimensions of size 0 aren't sharded.
There are no duplicate axis-refs or sub-axes that overlap with one another across dim_shardings , replicated_axes , and unreduced_axes .
Items in replicated_axes and unreduced_axes are ordered wrt mesh_or_ref (see AxisRefAttr::getMeshComparator ).

Параметры:

Параметр	C++ type	Описание
mesh_or_ref	`::mlir::Attribute`	mesh attr or flat mesh symbol reference attr
dim_shardings	`::llvm::ArrayRef<DimensionShardingAttr>`	dimension shardings
replicated_axes	`::llvm::ArrayRef<AxisRefAttr>`	axis refs
unreduced_axes	`::llvm::ArrayRef<AxisRefAttr>`	axis refs
reduction_op	`::mlir::sdy::ReductionOp`	an enum of type ReductionOp

TensorShardingPerValueAttr

Tensor sharding per operand/result of an op

Синтаксис:

#sdy.sharding_per_value<
  ::llvm::ArrayRef<TensorShardingAttr>   # shardings
>

A list of TensorShardingAttr s, one for each operand/result of an op.

Ограничения:

Elements in shardings must satisfy the constraints of TensorShardingAttr .

Параметры:

Параметр	C++ type	Описание
shardings	`::llvm::ArrayRef<TensorShardingAttr>`	sharding per value

Перечисления

EdgeNodeType

Edge node type enum

Случаи:

Символ	Ценить	Нить
ОПЕРАНД	`0`	операнд
РЕЗУЛЬТАТ	`1`	результат

PropagationDirection

Propagation direction enum

Случаи:

Символ	Ценить	Нить
НИКТО	`0`	НИКТО
ВПЕРЕД	`1`	ВПЕРЕД
НАЗАД	`2`	НАЗАД
ОБА	`3`	ОБА

ReductionOp

Reduction op enum

Случаи:

Символ	Ценить	Нить
СУММА	`0`	сумма
МАКС	`1`	макс
МИН	`2`	мин

The Shardy (SDY) dialect

The Shardy (SDY) dialect defines an axis-based tensor sharding representation and additional API components to attach shardings to tensors.

Version log: 0.0.1: Add unreduced axes to TensorShardingAttr.

Операции

`sdy.all_gather` (sdy::AllGatherOp)

Performs an all-gather communication along axes

Синтаксис:

operation ::= `sdy.all_gather` $gathering_axes $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

Gathers chunks of a tensor along axes specified in gathering_axes .

Пример:

%1 = stablehlo.tanh(%0) {sdy.sharding = #sdy.sharding_per_value<[<@mesh, [{"a", "b", "c"}, {}, {"d"}\]>]>} : tensor<8x8x8xf32>
%2 = sdy.all_gather [{"b", "c"}, {}, {"d"}\] %1 out_sharding=<@mesh, [{"a"}, {}, {}\]> : tensor<8x8x8xf32>

Ограничения:

Must satisfy the constraints listed in Sdy_CollectiveOpInterface .
Elements in gathering_axes must satisfy the constraints listed in AxisRefListAttr .
Applying gathering_axes to the operand sharding gets out_sharding .

Traits: SameOperandsAndResultType

Interfaces: InferTypeOpInterface , Sdy_CollectiveOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`gathering_axes`	::mlir::sdy::ListOfAxisRefListsAttr	List of axis ref lists
`out_sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`tensor`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.all_reduce` (sdy::AllReduceOp)

Perform an all-reduce comunication along axes

Синтаксис:

operation ::= `sdy.all_reduce` ($reduction_op^)? $reduction_axes $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

Reduces chunks of a tensor along axes specified in reduction_axes . The order of reduction_axes is not important for the result, but can affect the order of the corresponding replica groups.

Ограничения:

Must satisfy the constraints listed in Sdy_CollectiveOpInterface .
reduction_axes must satisfy the constraints listed in AxisRefListAttr .
reduction_axes must be sorted wrt the mesh.
The operand sharding and out_sharding must have equivalent dimension shardings.
reduction_axes must not overlap with the operand dimension sharding and replicated axes (it can overlap with unreduced axes).
reduction_axes must not overlap with the unreduced axes of out_sharding . In other words, out_sharding must be be replicated along reduction_axes (implicitly or explicitly).

Traits: SameOperandsAndResultType

Interfaces: CollectiveOpInterface , InferTypeOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`reduction_axes`	::mlir::sdy::AxisRefListAttr	List of axis refs
`reduction_op`	::mlir::sdy::ReductionOpAttr	reduction op enum
`out_sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`tensor`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.all_slice` (sdy::AllSliceOp)

Performs a dynamic-slice operation along axes

Синтаксис:

operation ::= `sdy.all_slice` $slicing_axes $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

Slices chunks of a tensor along axes specified in slicing_axes . There is an algebric duality between sdy.all_slice and sdy.all_gather .

Пример:

%1 = stablehlo.tanh(%0) {sdy.sharding = #sdy.sharding_per_value<[<@mesh, [{"a"}, {}, {}\]>]>} : tensor<8x8x8xf32>
%2 = sdy.all_slice [{"b", "c"}, {}, {"d"}\] %1 out_sharding=<@mesh, [{"a", "b", "c"}, {}, {"d"}\]> : tensor<8x8x8xf32>

Ограничения:

Must satisfy the constraints listed in Sdy_CollectiveOpInterface .
Elements in slicing_axes must satisfy the constraints listed in AxisRefListAttr .
Applying slicing_axes to the operand sharding gets out_sharding .

Traits: SameOperandsAndResultType

Interfaces: CollectiveOpInterface , InferTypeOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`slicing_axes`	::mlir::sdy::ListOfAxisRefListsAttr	List of axis ref lists
`out_sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`tensor`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.all_to_all` (sdy::AllToAllOp)

Performs an all-to-all communication along axes

Синтаксис:

operation ::= `sdy.all_to_all` $params $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

The all-to-all will be applied to the sharding of the operand ( tensor ) to obtain the sharding of the result ( out_sharding ).

Пример:

%1 = stablehlo.tanh(%0) {sdy.sharding = #sdy.sharding_per_value<[<@mesh, [{"a", "b"}, {"c"}, {}, {}\]>]>} : tensor<8x8x4x4x32>
%2 = sdy.all_to_all [{"b"}: 0->2, {"c"}: 1->3] %1 out_sharding=<@mesh, [{"a"}, {}, {"b"}, {"c"}\]> : tensor<8x8x4x4x32>

Ограничения:

Must satisfy the constraints listed in Sdy_CollectiveOpInterface .
The parameter list must not be empty.
For each parameter in params :
- Elements in axes must satisfy the constraints of AxisRefAttr .
- src_dim and tgt_dim must be valid dimensions (non-negative and less than rank of tensor).
- Any src_dim or tgt_dim must be unique across all parameters.
- src_dim must be sorted in ascending order across all parameters.
Moving axes from src_dim to tgt_dim in the operand sharding gets out_sharding .

Traits: SameOperandsAndResultType

Interfaces: InferTypeOpInterface , Sdy_CollectiveOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`params`	::mlir::sdy::AllToAllParamListAttr	List of all-to-all parameters
`out_sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`tensor`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.collective_permute` (sdy::CollectivePermuteOp)

Performs a collective-permute communication to replace axes

Синтаксис:

operation ::= `sdy.collective_permute` $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

Sends a chunk of the input tensor from each device to another to reorder/replace the axes that shard the tensor.

This is useful for reordering axes in a single dimension or across different dimensions, and swapping sharded axes with replicated ones.

In the below example, the sharded tensor size is tensor<1x4x2xf32> , and that is preserved by the collective permute.

Пример:

sdy.mesh @mesh = <["a"=2, "b"=2, "c"=4, "d"=2, "e"=2, "f"=2]>
%1 = stablehlo.tanh(%0) {sdy.sharding = #sdy.sharding_per_value<[<@mesh, [{"a", "c"}, {"f"}, {"d", "e"}\]>]>} : tensor<8x8x8xf32>
%2 = sdy.collective_permute %1 out_sharding=<@mesh, [{"c":(1)2, "b", "f"}, {"a"}, {"e", "d"}\]> : tensor<8x8x8xf32>

Ограничения:

Must satisfy the constraints listed in Sdy_CollectiveOpInterface .
If input and output sharding have different meshes, then those meshes must have exactly the same axes and different order of device ids.
For each dimension, the product of sharding axis sizes in out_sharding must match that of the corresponding operand dimension sharding.

Traits: SameOperandsAndResultType

Interfaces: CollectiveOpInterface , InferTypeOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`out_sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`tensor`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.constant` (sdy::ConstantOp)

Constant operation

Produces an output tensor from a constant value .

See: https://github.com/openxla/stablehlo/blob/main/docs/spec.md#constant

Пример:

%output = sdy.constant dense<[[0.0, 1.0], [2.0, 3.0]]> : tensor<2x2xf32>

Traits: AlwaysSpeculatableImplTrait

Interfaces: ConditionallySpeculatable , InferTypeOpInterface , NoMemoryEffect (MemoryEffectOpInterface)

Effects: MemoryEffects::Effect{}

Атрибуты:

Атрибут	MLIR Type	Описание
`value`	::mlir::ElementsAttr	constant vector/tensor attribute

Результаты:

Результат	Описание
`output`	statically shaped tensor of any non-token type values

`sdy.data_flow_edge` (sdy::DataFlowEdgeOp)

Data flow edge op.

Синтаксис:

operation ::= `sdy.data_flow_edge` $input (`sharding````=``` $sharding^)? attr-dict `:` type($result)

An op can have multiple data flow edges that are orthogonal to one another.

Например:

  y_0, ..., y_n = while (x_0, ..., x_n)
                  ((pred_arg_0,... , pred_arg_n) { ... })
                  ((body_arg_0,..., body_arg_n) {
                    ...
                    return return_value_0, ..., return_value_n
                  })

This while op has n data flow edges, the i-th data flow edges is between sources x_i , return_value_i and targets y_i , pred_arg_i , body_arg_i .

propagate through each edge separately.
update the sharding of each edge separately instead of all targets at once (eg an op has a single immutable TensorShardingPerValueAttr for result shardings).
add each edge to the worklist separately when the sharding of a source has changed.

We don't allow the input of a sdy.data_flow_edge to be defined by an SdyDialect op, so we can assume that it's defined by an op that has unregistered sdy.sharding attribute.

Traits: SameOperandsAndResultType

Interfaces: InferTypeOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`input`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.func_data_flow_edge` (sdy::FuncDataFlowEdgeOp)

Func input/output data flow edge op.

Синтаксис:

operation ::= `sdy.func_data_flow_edge` $operand attr-dict `:` type($result)

Traits: SameOperandsAndResultType

Interfaces: InferTypeOpInterface , SymbolUserOpInterface

Operands:

Операнд	Описание
`operand`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.manual_computation` (sdy::ManualComputationOp)

Multi-device parallelism operation with manual collectives

Синтаксис:

operation ::= `sdy.manual_computation` `(`operands`)`
              `in_shardings````=```custom<StrippedTensorShardingPerValueAttr>($in_shardings)
              `out_shardings````=```custom<StrippedTensorShardingPerValueAttr>($out_shardings)
              `manual_axes````=```$manual_axes
              custom<SingleBlockRegionNoBlockId>($body)
              attr-dict
              `:`
              functional-type(operands, results)

The body is local wrt the manual_axes. Propagation will occur through the body on any free axes - those not in the manual_axes list.

Note that any unranked tensors are expected to have a sharding with rank 0, ie fully replicated.

Ограничения:

Elements in in_shardings and out_shardings must satisfy the constraints listed in TensorShardingAttr .
The number of global and local tensor inputs/outputs of the op region must match.
The manual axes must come before any free axes in each dim sharding.
The manual axes cannot introduce padding. Namely, the dimension size must be divisible by the corresponding manual axes size.
The global and local shapes of the op regions arguments/results must match.

Traits: IsolatedFromAbove , RecursiveMemoryEffects , SingleBlockImplicitTerminator<ReturnOp> , SingleBlock

Interfaces: ShardableDataFlowOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`in_shardings`	::mlir::sdy::TensorShardingPerValueAttr	Tensor sharding per operand/result of an op
`out_shardings`	::mlir::sdy::TensorShardingPerValueAttr	Tensor sharding per operand/result of an op
`manual_axes`	::mlir::sdy::ManualAxesAttr	A list of axes that a ManualComputationOp is manual on

Operands:

Операнд	Описание
`tensors`	variadic of any non-token type

Результаты:

Результат	Описание
`results`	variadic of any non-token type

`sdy.mesh` (sdy::MeshOp)

Named mesh

Синтаксис:

operation ::= `sdy.mesh` $sym_name `=` $mesh attr-dict

Traits: HasParent<ModuleOp>

Interfaces: Symbol

Атрибуты:

Атрибут	MLIR Type	Описание
`sym_name`	::mlir::StringAttr	string attribute
`mesh`	::mlir::sdy::MeshAttr	Mesh of axes and a list of devices

`sdy.named_computation` (sdy::NamedComputationOp)

Named computation operation

Синтаксис:

operation ::= `sdy.named_computation` `<`$name`>` `` `(` $operands `)`
              (`in_shardings````=```custom<StrippedTensorShardingPerValueAttr>($in_shardings)^)?
              (`out_shardings````=```custom<StrippedTensorShardingPerValueAttr>($out_shardings)^)?
              custom<SingleBlockRegionNoBlockId>($body)
              attr-dict
              `:` functional-type($operands, results)

Groups a computation, ie a block of operations, and gives it a name. Propagation will flow in/out of the region as if everything was inlined.

The type of each block arguments and returned values in the region must be the same as the type of the operands and results type of the op.

Пример:

%1 = sdy.named_computation<"foo">(%0) (%arg1: tensor<16x32xf32>) {
  sdy.return %arg1 : tensor<16x32xf32>
} : (tensor<16x32xf32>) -> tensor<16x32xf32>

Traits: IsolatedFromAbove , RecursiveMemoryEffects , RecursivelySpeculatableImplTrait , SingleBlockImplicitTerminator<ReturnOp> , SingleBlock

Interfaces: ConditionallySpeculatable , InferTypeOpInterface , ShardableDataFlowOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`name`	::mlir::StringAttr	string attribute
`in_shardings`	::mlir::sdy::TensorShardingPerValueAttr	Tensor sharding per operand/result of an op
`out_shardings`	::mlir::sdy::TensorShardingPerValueAttr	Tensor sharding per operand/result of an op

Operands:

Операнд	Описание
`operands`	variadic of any non-token type

Результаты:

Результат	Описание
«unnamed»	variadic of any non-token type

`sdy.propagation_barrier` (sdy::PropagationBarrierOp)

Propagation barrier operation

Синтаксис:

operation ::= `sdy.propagation_barrier` $input `allowed_direction````=```$allowed_direction attr-dict `:` type($input)

This op operates like an identity op, outputting the same value it took as input. But in terms of propagation, this will only allow propagation to flow through it in a certain direction.

This prevents shardings from being propagated between the uses of the result of the barrier op and its operand.

FORWARD means shardings can only flow from the operand to the result.
BACKWARD means shardings can only flow from the result to the operand.
NONE means no sharding can propagate through this op.
Cannot specify BOTH , as this op would be redundant.

Traits: AlwaysSpeculatableImplTrait , SameOperandsAndResultType

Interfaces: ConditionallySpeculatable , InferTypeOpInterface , NoMemoryEffect (MemoryEffectOpInterface)

Effects: MemoryEffects::Effect{}

Атрибуты:

Атрибут	MLIR Type	Описание
`allowed_direction`	::mlir::sdy::PropagationDirectionAttr	propagation direction enum

Operands:

Операнд	Описание
`input`	ranked tensor of any non-token type values

Результаты:

Результат	Описание
`result`	ranked tensor of any non-token type values

`sdy.reduce_scatter` (sdy::ReduceScatterOp)

Performs a reduce-scatter communication along axes

Синтаксис:

operation ::= `sdy.reduce_scatter` ($reduction_op^)? $reduce_scatter_axes $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

Ограничения:

Must satisfy the constraints listed in Sdy_CollectiveOpInterface .
Elements in reduce_scatter_axes must satisfy the constraints listed in AxisRefListAttr .
Applying reduce_scatter_axes to the operand sharding gets out_sharding .

Traits: SameOperandsAndResultType

Interfaces: CollectiveOpInterface , InferTypeOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`reduce_scatter_axes`	::mlir::sdy::ListOfAxisRefListsAttr	List of axis ref lists
`reduction_op`	::mlir::sdy::ReductionOpAttr	reduction op enum
`out_sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`tensor`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.replicated_to_unreduced` (sdy::ReplicatedToUnreducedOp)

Move implicitly or explicitly replicated axes to unreduced axes.

Синтаксис:

operation ::= `sdy.replicated_to_unreduced` $axes $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

The axes should be implicitly or explicitly replicated in the operand. This operation makes them unreduced in the result. We have the following relationship:

all-reduce(replicated-to-unreduced(x, axes), axes) = x

Пример:

%1 = stablehlo.tanh(%0) {sdy.sharding = #sdy.sharding_per_value<[<@mesh, [{"b"}, {}, {}\], replicated={"c", "d"}, unreduced={"e"}>]>} : tensor<8x8x8xf32>
%2 = sdy.replicated_to_unreduced {"a", "c", "f"} %1 out_sharding=<@mesh, [{"b"}, {}, {}\], replicated={"d"}, unreduced={"a", "c", "e", "f"}> : tensor<8x8x8xf32>

Ограничения:

Must satisfy the constraints listed in Sdy_CollectiveOpInterface .
axes must satisfy the constraints listed in AxisRefListAttr .
axes must be sorted wrt the mesh.
axes are not empty.
The input and output sharding must have the same dimension shardings.
axes must be implicitly or explicitly replicated in the operand sharding.
inUnreducedAxes + axes = outUnreducedAxes.

Traits: SameOperandsAndResultType

Interfaces: InferTypeOpInterface , Sdy_CollectiveOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`axes`	::mlir::sdy::AxisRefListAttr	List of axis refs
`out_sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`tensor`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.reshard` (sdy::ReshardOp)

Reshards a tensor to a different sharding

Синтаксис:

operation ::= `sdy.reshard` $input $sharding attr-dict `:` type($result)

Reshards the input tensor with the specified sharding, which is different from the input tensor's existing sharding.

Both ShardingConstraintOp and ReshardOp attach a sharding to a tensor. Their lifespan is:

Before sharding propagation, ShardingConstraintOp is added by users.
Sharding propagation consumes ShardingConstraintOp. There is no ShardingConstraintOp in the results of sharding propagation. Instead, ReshardOp may be added if needed.
A partitioner converts a ReshardOp into a collective op (or an identity op). There should be no ReshardOp in the results of the partitioner.

Traits: AlwaysSpeculatableImplTrait , SameOperandsAndResultType

Interfaces: ConditionallySpeculatable , InferTypeOpInterface , NoMemoryEffect (MemoryEffectOpInterface) , SymbolUserOpInterface

Effects: MemoryEffects::Effect{}

Атрибуты:

Атрибут	MLIR Type	Описание
`sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`input`	any non-token type

Результаты:

Результат	Описание
`result`	any non-token type

`sdy.return` (sdy::ReturnOp)

Синтаксис:

operation ::= `sdy.return` attr-dict ($results^ `:` type($results))?

Traits: AlwaysSpeculatableImplTrait , ReturnLike , Terminator

Interfaces: ConditionallySpeculatable , NoMemoryEffect (MemoryEffectOpInterface) , RegionBranchTerminatorOpInterface

Effects: MemoryEffects::Effect{}

Operands:

Операнд	Описание
`results`	variadic of any non-token type

`sdy.sharded_to_unreduced` (sdy::ShardedToUnreducedOp)

Move some sharded axes of the operand to unreduced axes of the result.

Синтаксис:

operation ::= `sdy.sharded_to_unreduced` $axes $tensor `out_sharding````=```$out_sharding attr-dict `:` type($result)

The axes should be used to shard the operand. This operation makes them unreduced in the result. We have the following relationship:

all-gather(x, axes) = all-reduce(sharded-to-unreduced(x, axes), axes), where all-gather, sharded-to-unreduced, all-reduce are applied on the same axes.

Пример:

%1 = stablehlo.tanh(%0) {sdy.sharding = #sdy.sharding_per_value<[<@mesh, [{"a", "b", "c"}, {}, {"d"}\], unreduced={"e"}>]>} : tensor<8x8x8xf32>
%2 = sdy.sharded_to_unreduced [{"b", "c"}, {}, {"d"}\] %1 out_sharding=<@mesh, [{"a"}, {}, {}\], unreduced={"b", "c", "d", "e"}> : tensor<8x8x8xf32>

Ограничения:

Must satisfy the constraints listed in Sdy_CollectiveOpInterface .
Elements in axes must satisfy the constraints listed in AxisRefListAttr .
Applying axes to the operand sharding gets out_sharding .

Traits: SameOperandsAndResultType

Interfaces: InferTypeOpInterface , Sdy_CollectiveOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`axes`	::mlir::sdy::ListOfAxisRefListsAttr	List of axis ref lists
`out_sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`tensor`	shaped of any non-token type values

Результаты:

Результат	Описание
`result`	shaped of any non-token type values

`sdy.sharding_constraint` (sdy::ShardingConstraintOp)

Constrains a tensor to the specified sharding

Синтаксис:

operation ::= `sdy.sharding_constraint` $input $sharding attr-dict `:` type($result)

Attaches a sharding to an intermediate tensor (eg the result of a matmul) to indicate that this is how that tensor, or a subset of its uses, should be sharded.

If the sharding has open dimensions and unconstraint axes, it means the tensor can be further sharded along the open dimensions.

This op can either:

Have no uses (dangling) - which means the attached sharding is how the input tensor itself should be sharded.
Have uses - which means the attached sharding is how the uses of the sharding constraint op should be sharded, while other uses of the input tensor might have a different sharding (if the input tensor has no other uses then the behavior is the same as the no uses case).

Traits: SameOperandsAndResultType

Interfaces: InferTypeOpInterface , SymbolUserOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`sharding`	::mlir::sdy::TensorShardingAttr	Tensor sharding

Operands:

Операнд	Описание
`input`	any non-token type

Результаты:

Результат	Описание
`result`	any non-token type

`sdy.sharding_group` (sdy::ShardingGroupOp)

Constrains tensors in the group to have the same sharding.

Синтаксис:

operation ::= `sdy.sharding_group` $input `group_id````=```$group_id attr-dict `:` type($input)

Interfaces: InferTypeOpInterface

Атрибуты:

Атрибут	MLIR Type	Описание
`group_id`	::mlir::IntegerAttr	64-bit signless integer attribute

Operands:

Операнд	Описание
`input`	ranked tensor of any non-token type values

Атрибуты

AllToAllParamAttr

All-to-all parameter

Синтаксис:

#sdy.all_to_all_param<
  ::llvm::ArrayRef<AxisRefAttr>,   # axes
  int64_t,   # src_dim
  int64_t   # tgt_dim
>

A tuple containing the axes and source/target dimensions to perform all-to-all on.

Параметры:

Параметр	C++ type	Описание
оси	`::llvm::ArrayRef<AxisRefAttr>`	the axes to perform all-to-all on
src_dim	`int64_t`	the source dimension index
tgt_dim	`int64_t`	the target dimension index

AllToAllParamListAttr

List of all-to-all parameters

Синтаксис:

#sdy.all_to_all_param_list<
  ::llvm::ArrayRef<AllToAllParamAttr>   # value
>

Параметры:

Параметр	C++ type	Описание
ценить	`::llvm::ArrayRef<AllToAllParamAttr>`

AxisRefAttr

Reference to either a full axis or a split sub-axis

Синтаксис:

#sdy.axis_ref<
  ::llvm::StringRef,   # name
  SubAxisInfoAttr   # sub_axis_info
>

Ограничения:

name must be present in the bound MeshAttr .
If sub_axis_info is present, it must satisfy the constraints of SubAxisInfoAttr .

Параметры:

Параметр	C++ type	Описание
имя	`::llvm::StringRef`	name of this axis
sub_axis_info	`SubAxisInfoAttr`	additional info if this is a sub axis

AxisRefListAttr

List of axis refs

Синтаксис:

#sdy.axis_ref_list<
  ::llvm::ArrayRef<AxisRefAttr>   # value
>

Ограничения:

Elements in value must satisfy the constraints of AxisRefAttr .
There are no duplicate axis-refs or sub-axes that overlap with one another.
No two adjacent axis-refs are consecutive sub-axes of that same full axis, ie, they can be merged into one sub-axis or the full axis.

Параметры:

Параметр	C++ type	Описание
ценить	`::llvm::ArrayRef<AxisRefAttr>`

AxisToPropagationDetailsAttr

Propagation edge flow details for a specific axis and source.

Синтаксис:

#sdy.axis_to_propagation_details<
  ::mlir::sdy::AxisRefAttr,   # axis_name
  ::mlir::sdy::EdgeValueRefAttr,   # source
  ::llvm::ArrayRef<EdgeValueRefAttr>   # targets
>

Maps a source value reference to a list of target value references along a particular axis.

Параметры:

Параметр	C++ type	Описание
axis_name	`::mlir::sdy::AxisRefAttr`	Reference to either a full axis or a split sub-axis
источник	`::mlir::sdy::EdgeValueRefAttr`	Reference to a particular index of a value edge of type `type` .
цели	`::llvm::ArrayRef<EdgeValueRefAttr>`	list of edge target values

DimMappingAttr

List of factor indices for a dimension

An empty list indicates that this is a null mapping (this is parsed/printed with * ), ie the dimension isn't mapped to any factors.

Ограничения:

There is at least one factor index.
Factor indices must be in range [0, $factor_sizes ).
If there are multiple factors, none of them can have size 1.
No duplicate factor indices.

Параметры:

Параметр	C++ type	Описание
factor_indices	`::llvm::ArrayRef<int64_t>`	factors this dimension is mapped to

DimensionShardingAttr

Dimension sharding

Ограничения:

Elements in axes must satisfy the constraints listed in AxisRefListAttr .
If a dimension sharding has a priority:
- The priority is greater than or equal to 0.
- The dimension has at least one axis if it is closed.

Параметры:

Параметр	C++ type	Описание
оси	`::llvm::ArrayRef<AxisRefAttr>`	axis refs
is_closed	`bool`	whether this dimension can't be further sharded
приоритет	`std::optional<int64_t>`	the priority used during user priority based propagation

EdgeValueRefAttr

Reference to a particular index of a value edge of type type .

Синтаксис:

#sdy.edge_value_ref<
  `operand` | `result`,   # type
  int64_t   # index
>

Параметры:

Параметр	C++ type	Описание
тип	`::mlir::sdy::EdgeNodeType`	an enum of type EdgeNodeType
индекс	`int64_t`	The integer index (0, 1, 2, etc.)

ListOfAxisRefListsAttr

List of axis ref lists

Синтаксис:

#sdy.list_of_axis_ref_lists<
  ::llvm::ArrayRef<AxisRefListAttr>   # value
>

Параметры:

Параметр	C++ type	Описание
ценить	`::llvm::ArrayRef<AxisRefListAttr>`

ManualAxesAttr

A list of axes that a ManualComputationOp is manual on

Синтаксис:

#sdy.manual_axes<
  ::llvm::ArrayRef<StringAttr>   # value
>

Параметры:

Параметр	C++ type	Описание
ценить	`::llvm::ArrayRef<StringAttr>`

MeshAttr

Mesh of axes and a list of devices

Синтаксис:

#sdy.mesh<
  ::llvm::ArrayRef<MeshAxisAttr>,   # axes
  ::llvm::ArrayRef<int64_t>   # device_ids
>

A mesh is a list of axes and an optional list of device IDs specifying the device ordering.

If the list of axes is empty

If the device_ids is not provided, it is an empty mesh.
If the device_ids is provided, it must be a single non-negative integer, we call it a maximal-sharding mesh .

If the list of axes is provided

If a device ID list is specified, the product of the axis sizes should match the number of devices.
If a device ID list is not specified, the implicit device ID list is iota(product(axes)). For simplicity, we also disallow specifying a device ID list that is the same as iota(product(axes)); in this case, a device ID list shouldn't be specified.
It is not a maximal-sharding mesh even if the total size of axes is 1.

Here are some examples of meshes:

An empty mesh represents a placeholder mesh that can be replaced during propagation: <[]>
A mesh without axes list and a single non-negative device ID, which is a maximal-sharding mesh: <[], device_ids=[3]>
A mesh with two axes and implicit device IDs iota(6): <["a"=2, "b"=3]>
A mesh with two axes and explicit device IDs specifying the device ordering: <["a"=3, "b"=2], device_ids=[0, 2, 4, 1, 3, 5]>

Ограничения:

Elements in device_ids should be non-negative.
If axes is empty, the size of device_ids can be 0 (empty mesh) or 1 (maximal-sharding mesh).
If axes is not empty,
- Elements in axes must not have duplicate names.
- If device_ids is specified, the original device_ids is not iota(product(axis_sizes)) and the sorted device_ids is iota(product(axis_sizes)) .

Параметры:

Параметр	C++ type	Описание
оси	`::llvm::ArrayRef<MeshAxisAttr>`	mesh axes
device_ids	`::llvm::ArrayRef<int64_t>`	explicit device ordering or maximal device id

MeshAxisAttr

Named axis in a mesh

Синтаксис:

#sdy.mesh_axis<
  ::llvm::StringRef,   # name
  int64_t   # size
>

Параметры:

Параметр	C++ type	Описание
имя	`::llvm::StringRef`	имя
размер	`int64_t`	size of this axis

OpShardingRuleAttr

Specifies how an operation can be partitioned.

Синтаксис:

#sdy.op_sharding_rule<
  ::llvm::ArrayRef<int64_t>,   # factor_sizes
  ::llvm::ArrayRef<TensorMappingAttr>,   # operand_mappings
  ::llvm::ArrayRef<TensorMappingAttr>,   # result_mappings
  ::llvm::ArrayRef<int64_t>,   # reduction_factors
  ::llvm::ArrayRef<int64_t>,   # need_replication_factors
  ::llvm::ArrayRef<int64_t>,   # permutation_factors
  ::llvm::ArrayRef<int64_t>,   # blocked_propagation_factors
  bool   # is_custom_rule
>

A sharding rule specifies how an operation can be partitioned according to various properties on the op - any attributes, the shape of operands, the shape of the results, etc. For example:

%0 = stablehlo.add %arg0, %arg1 {
    sdy.sharding_rule = #sdy.op_sharding_rule<
        ([i, j],[i, j])->([i, j])
        {i=8, j=8}>
} : tensor<8x8xf32>

%1 = stablehlo.dot_general %arg2, %arg3, contracting_dims = [1] x [0] {
  sdy.sharding_rule = #sdy.op_sharding_rule<
      ([i, k],[k, j])->([i, j])
      {i=8, j=16, k=8}>
}: (tensor<8x8xf32>, tensor<8x16xf32>) -> tensor<8x16xf32>

Factor types:

reduction_factors contains the indices of factors requiring reduction, such as the contracting dimensions in a dot operation. These factors can be in operands but not in results.
need_replication_factors contains the indices of factors requiring full replication, such as the sorted dimension in a sort operation.
permutation_factors contains the indices of factors requiring collective-permute if they are sharded, such as the padding dimensions in a pad operation.
All other factors are considered as pass-through factors, ie, factors that don't require any communication if sharded in the same way across all tensors that are mapped to them.

Ограничения:

Number of operand/result mappings must match the number of operands/results of the op.
There is at least one mapping (can't have a rule for an op with no operands/results).
Rank of each TensorMappingAttr matches the rank of the corresponding tensor type.
For each group of factors ( reduction_factors , need_replication_factors , permutation_factors ):
- Elements must be in range [0, $factor_sizes ].
- No duplicate factor indices within each group and across groups.

Параметры:

Параметр	C++ type	Описание
factor_sizes	`::llvm::ArrayRef<int64_t>`	sizes of all factors in this rule
operand_mappings	`::llvm::ArrayRef<TensorMappingAttr>`	operand mappings
result_mappings	`::llvm::ArrayRef<TensorMappingAttr>`	result mappings
reduction_factors	`::llvm::ArrayRef<int64_t>`	factors requiring reduction
need_replication_factors	`::llvm::ArrayRef<int64_t>`	factors requiring full replication
permutation_factors	`::llvm::ArrayRef<int64_t>`	factors requiring collective-permute
blocked_propagation_factors	`::llvm::ArrayRef<int64_t>`	factors along which shardings are not propagated
is_custom_rule	`bool`	whether the rule is for a stablehlo.custom_call

PropagationEdgesAttr

Propagation edge metadata for all propagation steps.

Синтаксис:

#sdy.propagation_edges<
  ::llvm::ArrayRef<PropagationOneStepAttr>   # value
>

A list of per-axis propagation details for a value, grouped by step index.

Параметры:

Параметр	C++ type	Описание
ценить	`::llvm::ArrayRef<PropagationOneStepAttr>`

PropagationOneStepAttr

Per-step propagation metadata.

Синтаксис:

#sdy.propagation_one_step<
  int64_t,   # step_index
  ::llvm::ArrayRef<AxisToPropagationDetailsAttr>   # axis_entries
>

Propagation details for all axes for a single propagation step.

Параметры:

Параметр	C++ type	Описание
step_index	`int64_t`	step index
axis_entries	`::llvm::ArrayRef<AxisToPropagationDetailsAttr>`	Axis propagation details per propagation decision

SubAxisInfoAttr

Info about how this sub-axis is derived from the full axis

Синтаксис:

#sdy.sub_axis_info<
  int64_t,   # pre_size
  int64_t   # size
>

Ограничения:

pre-size is at least 1.
size is greater than 1.
pre-size must divide the size of the full axis, ie, both pre-size and size divide the size of the full axis, and the sub-axis doesn't go beyond the full axis.
The size of the sub-axis isn't equal to the size of the corresponding full axis, in which case the full axis should be used instead.

Параметры:

Параметр	C++ type	Описание
pre_size	`int64_t`	product of sub-axis sizes to the left of this sub-axis
размер	`int64_t`	size of this sub-axis

TensorMappingAttr

Factor mappings for each dimension of a tensor.

Синтаксис:

#sdy.tensor_mapping<
  ::llvm::ArrayRef<DimMappingAttr>   # dim_mappings
>

Ограничения:

Elements in dim_mappings must satisfy the constraints in DimMappingAttr .
No duplicate factors indices across dimensions.

Параметры:

Параметр	C++ type	Описание
dim_mappings	`::llvm::ArrayRef<DimMappingAttr>`	dimension mappings

TensorShardingAttr

Tensor sharding

Синтаксис:

#sdy.sharding<
  ::mlir::Attribute,   # mesh_or_ref
  ::llvm::ArrayRef<DimensionShardingAttr>,   # dim_shardings
  ::llvm::ArrayRef<AxisRefAttr>,   # replicated_axes
  ::llvm::ArrayRef<AxisRefAttr>,   # unreduced_axes
  `sum` | `max` | `min`   # reduction_op
>

Note that no sharding attribute on a tensor is equivalent to a fully open tensor sharding.

The mesh this sharding is bound to can either be specified by a symbol name, referencing a corresponding MeshOp symbol, or an inlined MeshAttr .

Ограничения:

Elements in dim_shardings must satisfy the constraints listed in DimensionShardingAttr .
Elements in replicated_axes must satisfy the constraints listed in AxisRefListAttr .
Elements in unreduced_axes must satisfy the constraints listed in AxisRefListAttr .
If the corresponding tensor type isn't a ShapedType , the sharding must have rank 0 and no replicated axes.
If it is a ShapedType , then:
- The tensor should have a rank.
- The number of dimension shardings is equal to the rank of the tensor.
- Dimensions of size 0 aren't sharded.
There are no duplicate axis-refs or sub-axes that overlap with one another across dim_shardings , replicated_axes , and unreduced_axes .
Items in replicated_axes and unreduced_axes are ordered wrt mesh_or_ref (see AxisRefAttr::getMeshComparator ).

Параметры:

Параметр	C++ type	Описание
mesh_or_ref	`::mlir::Attribute`	mesh attr or flat mesh symbol reference attr
dim_shardings	`::llvm::ArrayRef<DimensionShardingAttr>`	dimension shardings
replicated_axes	`::llvm::ArrayRef<AxisRefAttr>`	axis refs
unreduced_axes	`::llvm::ArrayRef<AxisRefAttr>`	axis refs
reduction_op	`::mlir::sdy::ReductionOp`	an enum of type ReductionOp

TensorShardingPerValueAttr

Tensor sharding per operand/result of an op

Синтаксис:

#sdy.sharding_per_value<
  ::llvm::ArrayRef<TensorShardingAttr>   # shardings
>

A list of TensorShardingAttr s, one for each operand/result of an op.

Ограничения:

Elements in shardings must satisfy the constraints of TensorShardingAttr .

Параметры:

Параметр	C++ type	Описание
shardings	`::llvm::ArrayRef<TensorShardingAttr>`	sharding per value

Перечисления

EdgeNodeType

Edge node type enum

Случаи:

Символ	Ценить	Нить
ОПЕРАНД	`0`	операнд
РЕЗУЛЬТАТ	`1`	результат

PropagationDirection

Propagation direction enum

Случаи:

Символ	Ценить	Нить
НИКТО	`0`	НИКТО
ВПЕРЕД	`1`	ВПЕРЕД
НАЗАД	`2`	НАЗАД
ОБА	`3`	ОБА

ReductionOp

Reduction op enum

Случаи:

Символ	Ценить	Нить
СУММА	`0`	сумма
МАКС	`1`	макс
МИН	`2`	мин

'сди' Диалект Оптимизируйте свои подборки Сохраняйте и классифицируйте контент в соответствии со своими настройками.

Операции

sdy.all_gather (sdy::AllGatherOp)

Атрибуты:

Операнды:

Результаты:

sdy.all_reduce (sdy::AllReduceOp)

Атрибуты:

Операнды:

Результаты:

sdy.all_slice (sdy::AllSliceOp)

Атрибуты:

Операнды:

Результаты:

sdy.all_to_all (sdy::AllToAllOp)

Атрибуты:

Операнды:

Результаты:

sdy.collective_permute (sdy::CollectivePermuteOp)

Атрибуты:

Операнды:

Результаты:

sdy.constant (sdy::ConstantOp)

Атрибуты:

Результаты:

sdy.data_flow_edge (sdy::DataFlowEdgeOp)

Атрибуты:

Операнды:

Результаты:

sdy.func_data_flow_edge (sdy::FuncDataFlowEdgeOp)

Операнды:

Результаты:

sdy.manual_computation (sdy::ManualComputationOp)

Атрибуты:

Операнды:

Результаты:

sdy.mesh (sdy::MeshOp)

Атрибуты:

sdy.named_computation (sdy::NamedComputationOp)

Атрибуты:

Операнды:

Результаты:

sdy.propagation_barrier (sdy::PropagationBarrierOp)

Атрибуты:

Операнды:

Результаты:

sdy.reduce_scatter (sdy::ReduceScatterOp)

Атрибуты:

Операнды:

Результаты:

sdy.replicated_to_unreduced (sdy::ReplicatedToUnreducedOp)

Атрибуты:

Операнды:

Результаты:

sdy.reshard (sdy::ReshardOp)

Атрибуты:

Операнды:

Результаты:

sdy.return (sdy::ReturnOp)

Операнды:

sdy.sharded_to_unreduced (sdy::ShardedToUnreducedOp)

Атрибуты:

Операнды:

Результаты:

sdy.sharding_constraint (sdy::ShardingConstraintOp)

Атрибуты:

Операнды:

Результаты:

sdy.sharding_group (sdy::ShardingGroupOp)

Атрибуты:

Операнды:

Атрибуты

AllToAllParamAttr

Параметры:

AllToAllParamListAttr

Параметры:

AxisRefAttr

Параметры:

AxisRefListAttr

Параметры:

'сди' Диалект

`sdy.all_gather` (sdy::AllGatherOp)

`sdy.all_reduce` (sdy::AllReduceOp)

`sdy.all_slice` (sdy::AllSliceOp)

`sdy.all_to_all` (sdy::AllToAllOp)

`sdy.collective_permute` (sdy::CollectivePermuteOp)

`sdy.constant` (sdy::ConstantOp)

`sdy.data_flow_edge` (sdy::DataFlowEdgeOp)

`sdy.func_data_flow_edge` (sdy::FuncDataFlowEdgeOp)

`sdy.manual_computation` (sdy::ManualComputationOp)

`sdy.mesh` (sdy::MeshOp)

`sdy.named_computation` (sdy::NamedComputationOp)

`sdy.propagation_barrier` (sdy::PropagationBarrierOp)

`sdy.reduce_scatter` (sdy::ReduceScatterOp)

`sdy.replicated_to_unreduced` (sdy::ReplicatedToUnreducedOp)

`sdy.reshard` (sdy::ReshardOp)

`sdy.return` (sdy::ReturnOp)

`sdy.sharded_to_unreduced` (sdy::ShardedToUnreducedOp)

`sdy.sharding_constraint` (sdy::ShardingConstraintOp)

`sdy.sharding_group` (sdy::ShardingGroupOp)

`sdy.all_gather` (sdy::AllGatherOp)

`sdy.all_reduce` (sdy::AllReduceOp)

`sdy.all_slice` (sdy::AllSliceOp)

`sdy.all_to_all` (sdy::AllToAllOp)

`sdy.collective_permute` (sdy::CollectivePermuteOp)

`sdy.constant` (sdy::ConstantOp)

`sdy.data_flow_edge` (sdy::DataFlowEdgeOp)

`sdy.func_data_flow_edge` (sdy::FuncDataFlowEdgeOp)

`sdy.manual_computation` (sdy::ManualComputationOp)