Basic Usage
In the following example, we demonstrate the einsum notation for basic tensor operations.
Einsum notation
To specify the operation, the user can either use the @ein_str
-string literal or the EinCode
object. For example, both the following code snippets define the matrix multiplication operation:
julia> using OMEinsum
julia> code1 = ein"ij,jk -> ik" # the string literal
ij, jk -> ik
julia> ixs = [[1, 2], [2, 3]] # the input indices
2-element Vector{Vector{Int64}}: [1, 2] [2, 3]
julia> iy = [1, 3] # the output indices
2-element Vector{Int64}: 1 3
julia> code2 = EinCode(ixs, iy) # the EinCode object (equivalent to the string literal)
1∘2, 2∘3 -> 1∘3
The @ein_str
macro can be used to define the einsum notation directly in the function call.
julia> A, B = randn(2, 3), randn(3, 4);
julia> code1(A, B) # matrix multiplication
2×4 Matrix{Float64}: -0.113748 0.475082 0.162445 -0.613051 0.897045 -2.70397 0.259365 -3.94346
julia> size_dict = OMEinsum.get_size_dict(getixsv(code1), (A, B)) # get the size of the labels
Dict{Char, Int64} with 3 entries: 'j' => 3 'i' => 2 'k' => 4
julia> einsum(code1, (A, B), size_dict) # lower-level function
2×4 Matrix{Float64}: -0.113748 0.475082 0.162445 -0.613051 0.897045 -2.70397 0.259365 -3.94346
julia> einsum!(code1, (A, B), zeros(2, 4), true, false, size_dict) # the in-place operation
2×4 Matrix{Float64}: -0.113748 0.475082 0.162445 -0.613051 0.897045 -2.70397 0.259365 -3.94346
julia> @ein C[i,k] := A[i,j] * B[j,k] # all-in-one macro
2×4 Matrix{Float64}: -0.113748 0.475082 0.162445 -0.613051 0.897045 -2.70397 0.259365 -3.94346
Here, we show that the @ein
macro combines the einsum notation defintion and the operation in a single line, which is more convenient for simple operations. Separating the einsum notation and the operation (the first approach) can be useful for reusing the einsum notation for multiple input tensors. Lower level functions, einsum
and einsum!
, can be used for more control over the operation.
For more than two input tensors, the @ein_str
macro does not optimize the contraction order. In such cases, the user can use the @optein_str
string literal to optimize the contraction order or specify the contraction order manually.
julia> tensors = [randn(100, 100) for _ in 1:4];
julia> optein"ij,jk,kl,lm->im"(tensors...) # optimized contraction (without knowing the size)
100×100 Matrix{Float64}: 1515.58 -933.392 -23.2939 … -322.86 -195.654 -625.719 1267.64 -787.69 1766.53 1072.41 788.287 757.697 1318.22 -1467.46 -454.072 221.148 -2325.38 -1526.44 1763.06 870.86 -1152.97 -699.157 241.433 681.715 368.99 1121.16 841.942 -658.904 -865.063 1396.92 486.982 -464.514 -462.872 … 372.413 -1406.71 -214.992 -1441.52 33.659 -2322.0 -1806.89 -799.964 -723.715 917.642 529.481 764.808 36.9285 782.646 -49.5901 -1217.36 920.767 -679.782 434.433 328.293 686.897 -1349.4 -1009.01 2153.61 1138.54 1181.72 37.9518 ⋮ ⋱ -478.231 27.9846 -676.124 -2099.75 -89.6706 -560.617 110.353 2186.97 -1359.95 638.35 -912.002 -210.819 -9.69087 438.441 702.887 1133.1 146.279 -532.485 -596.816 -738.136 -475.25 -923.82 -596.468 -1613.61 1525.27 533.03 -372.88 … 61.5346 -819.652 -586.914 636.534 673.761 -733.101 728.116 5.42234 -78.5712 -866.727 186.031 1624.71 -511.186 2977.92 1450.75 -475.972 -373.477 -507.227 -625.753 1334.22 -695.644 -719.134 347.944 -625.585 396.207 -1195.56 381.909
julia> ein"(ij,jk),(kl,lm)->im"(tensors...) # manually specified contraction
100×100 Matrix{Float64}: 1515.58 -933.392 -23.2939 … -322.86 -195.654 -625.719 1267.64 -787.69 1766.53 1072.41 788.287 757.697 1318.22 -1467.46 -454.072 221.148 -2325.38 -1526.44 1763.06 870.86 -1152.97 -699.157 241.433 681.715 368.99 1121.16 841.942 -658.904 -865.063 1396.92 486.982 -464.514 -462.872 … 372.413 -1406.71 -214.992 -1441.52 33.659 -2322.0 -1806.89 -799.964 -723.715 917.642 529.481 764.808 36.9285 782.646 -49.5901 -1217.36 920.767 -679.782 434.433 328.293 686.897 -1349.4 -1009.01 2153.61 1138.54 1181.72 37.9518 ⋮ ⋱ -478.231 27.9846 -676.124 -2099.75 -89.6706 -560.617 110.353 2186.97 -1359.95 638.35 -912.002 -210.819 -9.69087 438.441 702.887 1133.1 146.279 -532.485 -596.816 -738.136 -475.25 -923.82 -596.468 -1613.61 1525.27 533.03 -372.88 … 61.5346 -819.652 -586.914 636.534 673.761 -733.101 728.116 5.42234 -78.5712 -866.727 186.031 1624.71 -511.186 2977.92 1450.75 -475.972 -373.477 -507.227 -625.753 1334.22 -695.644 -719.134 347.944 -625.585 396.207 -1195.56 381.909
Sometimes, manually optimizing the contraction order can be beneficial. Please check Contraction order optimization for more details.
Einsum examples
We first define the tensors and then demonstrate the einsum notation for various tensor operations.
julia> using OMEinsum
julia> s = fill(1) # scalar
0-dimensional Array{Int64, 0}: 1
julia> w, v = [1, 2], [4, 5]; # vectors
julia> A, B = [1 2; 3 4], [5 6; 7 8]; # matrices
julia> T1, T2 = reshape(1:8, 2, 2, 2), reshape(9:16, 2, 2, 2); # 3D tensor
Unary examples
julia> ein"i->"(w) # sum of the elements of a vector.
0-dimensional Array{Int64, 0}: 3
julia> ein"ij->i"(A) # sum of the rows of a matrix.
2-element Vector{Int64}: 3 7
julia> ein"ii->"(A) # sum of the diagonal elements of a matrix, i.e., the trace.
0-dimensional Array{Int64, 0}: 5
julia> ein"ij->"(A) # sum of the elements of a matrix.
0-dimensional Array{Int64, 0}: 10
julia> ein"i->ii"(w) # create a diagonal matrix.
2×2 Matrix{Int64}: 1 0 0 2
julia> ein"i->ij"(w; size_info=Dict('j'=>2)) # repeat a vector to form a matrix.
2×2 Matrix{Int64}: 1 1 2 2
julia> ein"ijk->ikj"(T1) # permute the dimensions of a tensor.
2×2×2 Array{Int64, 3}: [:, :, 1] = 1 5 2 6 [:, :, 2] = 3 7 4 8
Binary examples
julia> ein"ij, jk -> ik"(A, B) # matrix multiplication.
2×2 Matrix{Int64}: 19 22 43 50
julia> ein"ijb,jkb->ikb"(T1, T2) # batch matrix multiplication.
2×2×2 Array{Int64, 3}: [:, :, 1] = 39 47 58 70 [:, :, 2] = 163 187 190 218
julia> ein"ij,ij->ij"(A, B) # element-wise multiplication.
2×2 Matrix{Int64}: 5 12 21 32
julia> ein"ij,ij->"(A, B) # sum of the element-wise multiplication.
0-dimensional Array{Int64, 0}: 70
julia> ein"ij,->ij"(A, s) # element-wise multiplication by a scalar.
2×2 Matrix{Int64}: 1 2 3 4
Nary examples
julia> optein"ai,aj,ak->ijk"(A, A, B) # star contraction.
2×2×2 Array{Int64, 3}: [:, :, 1] = 68 94 94 132 [:, :, 2] = 78 108 108 152
julia> optein"ia,ajb,bkc,cld,dm->ijklm"(A, T1, T2, T1, A) # tensor train contraction.
2×2×2×2×2 Array{Int64, 5}: [:, :, 1, 1, 1] = 9500 14564 21604 33420 [:, :, 2, 1, 1] = 11084 17012 25204 39036 [:, :, 1, 2, 1] = 13644 20916 31028 47996 [:, :, 2, 2, 1] = 15932 24452 36228 56108 [:, :, 1, 1, 2] = 13214 20258 30050 46486 [:, :, 2, 1, 2] = 15414 23658 35050 54286 [:, :, 1, 2, 2] = 19430 29786 44186 68350 [:, :, 2, 2, 2] = 22686 34818 51586 79894