CAS, CASA, CASAL, CASL
Compare and swap word or doubleword in memory
This instruction
reads a 32-bit word or 64-bit
doubleword from memory, and compares it against the value held in a
first register. If the comparison is equal, the value in a second register
is written to memory.
If the write is performed, the read and write occur atomically such
that no other modification of the memory location can take place
between the read and write.
CASA and CASAL load from memory with acquire semantics.
CASL and CASAL store to memory with release semantics.
CAS has neither acquire nor release semantics.
The architecture permits that the data read clears any exclusive
monitors associated with that location, even if the compare
subsequently fails.
If the instruction generates a synchronous Data Abort, the register
which is compared and loaded, that is <Ws>, or
<Xs>, is restored to the value held in the register
before the instruction was executed.
For a CAS or CASA instruction, when <Ws>
or <Xs> specifies the same register as <Wt> or <Xt>,
this signals to the memory system that an additional subsequent CAS,
CASA, CASAL, or CASL
access to the specified location is likely to occur in the near future. The memory system can respond by
taking actions that are expected to enable the subsequent CAS,
CASA, CASAL, or CASL access to succeed when it does occur.
A code sequence starting with a CAS or CASA instruction for which
<Ws> or <Xs> specifies the same register as <Wt>
or <Xt>, and ending with a subsequent CAS, CASA,
CASAL, or CASL to the same location, exhibits the following
properties for best performance when the location may be accessed concurrently, on one or more other PEs:
The sequence does not contain any direct system register writes, address translation instructions, cache or TLB
maintenance operations, exception producing instructions, exception returns, or ISB barriers.
The execution of the sequence includes 32 or fewer instructions.
The value provided in <Ws> or <Xs> of the first CAS or CASA
is a value likely to result in the comparison failing.
A failing comparison result may lead to better performance due to the hardware not performing a write to memory.
For a CAS or CASA instruction, when <Ws> or
<Xs> specifies the same register as <Wt> or <Xt>, the
value in memory is not modified, because the CAS or CASA either fails its compare or writes the same value back to memory.
For more information about memory ordering semantics, see
Load-Acquire, Store-Release.
For information about addressing modes, see
Load/Store addressing modes.
1
x
0
0
1
0
0
0
1
1
1
1
1
1
1
0
0
0
CAS <Ws>, <Wt>, [<Xn|SP>{, #0}]
0
1
0
CASA <Ws>, <Wt>, [<Xn|SP>{, #0}]
0
1
1
CASAL <Ws>, <Wt>, [<Xn|SP>{, #0}]
0
0
1
CASL <Ws>, <Wt>, [<Xn|SP>{, #0}]
1
0
0
CAS <Xs>, <Xt>, [<Xn|SP>{, #0}]
1
1
0
CASA <Xs>, <Xt>, [<Xn|SP>{, #0}]
1
1
1
CASAL <Xs>, <Xt>, [<Xn|SP>{, #0}]
1
0
1
CASL <Xs>, <Xt>, [<Xn|SP>{, #0}]
if !IsFeatureImplemented(FEAT_LSE) then UNDEFINED;
constant integer s = UInt(Rs);
constant integer t = UInt(Rt);
constant integer n = UInt(Rn);
constant integer datasize = 8 << UInt(size);
constant integer regsize = if datasize == 64 then 64 else 32;
constant boolean acquire = L == '1';
constant boolean release = o0 == '1';
constant boolean tagchecked = n != 31;
<Ws>
Is the 32-bit name of the general-purpose register to be compared and loaded, encoded in the "Rs" field.
<Wt>
Is the 32-bit name of the general-purpose register to be conditionally stored, encoded in the "Rt" field.
<Xn|SP>
Is the 64-bit name of the general-purpose base register or stack pointer, encoded in the "Rn" field.
<Xs>
Is the 64-bit name of the general-purpose register to be compared and loaded, encoded in the "Rs" field.
<Xt>
Is the 64-bit name of the general-purpose register to be conditionally stored, encoded in the "Rt" field.
bits(64) address;
bits(datasize) comparevalue;
bits(datasize) newvalue;
constant boolean privileged = PSTATE.EL != EL0;
constant AccessDescriptor accdesc = CreateAccDescAtomicOp(MemAtomicOp_CAS, acquire, release,
tagchecked, privileged);
comparevalue = X[s, datasize];
newvalue = X[t, datasize];
if n == 31 then
CheckSPAlignment();
address = SP[];
else
address = X[n, 64];
constant bits(datasize) data = MemAtomic(address, comparevalue, newvalue, accdesc);
X[s, regsize] = ZeroExtend(data, regsize);