thr3ads.net - llvm dev - [LLVMdev] Reviving the new LLVM concurrency model [Aug 2011]

If this information is useful, please help other people find it:
Share via:

Eli Friedman

2011-Aug-22 19:02 UTC

[LLVMdev] Reviving the new LLVM concurrency model

On Mon, Aug 22, 2011 at 11:17 AM, Jianzhou Zhao <jianzhou at
seas.upenn.edu> wrote:> On Mon, Aug 22, 2011 at 1:02 PM, Eli Friedman <eli.friedman at
gmail.com> wrote:
>> On Mon, Aug 22, 2011 at 9:55 AM, Jianzhou Zhao <jianzhou at
seas.upenn.edu> wrote:
>>> In the definition of 'monotonic' ordering,
>>> ... "If an address is written monotonically by one thread, and
other
>>> threads monotonically read that address repeatedly, the other
threads
>>> must eventually see the write"...
>>
>> It's supposed to mean that if you have a something like looks like
a
>> spinloop with monotonic reads, it shouldn't spin forever if the
value
>> changes.  I'll take another look at rewording that.
>>
>>> Does this mean if a thread does multi-writes monotonically,
monotonic
>>> reads from other threads should see all of them? But intuitively,
it
>>> seems to be fine for a read to ``miss'' some of the writes
as long as
>>> the writes seen are monotonic in the sense that later reads should
see
>>> the same write of earlier reads, or any write monotonically after
the
>>> writes seen.
>>>
>>> In the case there is only one monotonic write, what does
'eventually'
>>> mean? Can we know a write must be seen when some condition holds,
for
>>> example, a number of instructions executed, the thread that did the
>>> write executes a fence, ...?
>>>
>>> C++ memory model does not have ``unordered'', and
"monotonic", but
>>> have "modification ordering" (is it same to the relaxed
atomic
>>> variables the LLVM IR mentions?). If I am compiling C++ to LLVM,
can
>>> all modification atomic be compiled to monotonic? And when should
we
>>> use "unordered"?
>>
>> http://llvm.org/docs/Atomics.html is an attempt to make things much
>> more straightforward than the stuff in LangRef.
>
> This is cool.
>
> At the end of the "optimization outside atomic" section there are
> discussions about "returning undef". Is it the following correct?
> * a store/store data race in LLVM leads to undefined behaviors,
What exactly is a store-store "race"?  That sounds wrong.
> * a store/load data race does not result in undefined behavior, but
> the load returns undef
> * if two memory accesses are of data races, then at least one of them
> is NonAtomic.
The model isn't really defined in terms of races, but these two sound
roughly correct.
> My question is suppose a load L and a store S have a data race, and L
> runs earlier than S in an execution, L is well-ordered with earlier
> writes by happens-before, then at the point when L runs, but S has not
> run yet, should the L also return undef or what ever write it can see
> w/o races so far?
>
> Although non-synchronized writes from other threads may propagate to
> another thread in different orders, but the writes that a read from a
> different thread can see should have already executed before the read
> (in a global time). So in the above case, it seems fine to allow the
> load to return a 'defined' value. Is there any case that makes
'undef'
> possible?
If there is a load and a store to the same address with no
happens-before relationship, the load returns undef.  "L runs earlier
than S in an execution" doesn't make sense; if the load and store
aren't atomic, the compiler is allowed to, for example, rematerialize
the load, so that it happens both before and after the store.


-Eli

Jianzhou Zhao

2011-Aug-22 20:09 UTC

head link

[LLVMdev] Reviving the new LLVM concurrency model

On Mon, Aug 22, 2011 at 3:02 PM, Eli Friedman <eli.friedman at gmail.com>
wrote:> On Mon, Aug 22, 2011 at 11:17 AM, Jianzhou Zhao <jianzhou at
seas.upenn.edu> wrote:
>> On Mon, Aug 22, 2011 at 1:02 PM, Eli Friedman <eli.friedman at
gmail.com> wrote:
>>> On Mon, Aug 22, 2011 at 9:55 AM, Jianzhou Zhao <jianzhou at
seas.upenn.edu> wrote:
>>>> In the definition of 'monotonic' ordering,
>>>> ... "If an address is written monotonically by one thread,
and other
>>>> threads monotonically read that address repeatedly, the other
threads
>>>> must eventually see the write"...
>>>
>>> It's supposed to mean that if you have a something like looks
like a
>>> spinloop with monotonic reads, it shouldn't spin forever if the
value
>>> changes.  I'll take another look at rewording that.
>>>
>>>> Does this mean if a thread does multi-writes monotonically,
monotonic
>>>> reads from other threads should see all of them? But
intuitively, it
>>>> seems to be fine for a read to ``miss'' some of the
writes as long as
>>>> the writes seen are monotonic in the sense that later reads
should see
>>>> the same write of earlier reads, or any write monotonically
after the
>>>> writes seen.
>>>>
>>>> In the case there is only one monotonic write, what does
'eventually'
>>>> mean? Can we know a write must be seen when some condition
holds, for
>>>> example, a number of instructions executed, the thread that did
the
>>>> write executes a fence, ...?
>>>>
>>>> C++ memory model does not have ``unordered'', and
"monotonic", but
>>>> have "modification ordering" (is it same to the
relaxed atomic
>>>> variables the LLVM IR mentions?). If I am compiling C++ to
LLVM, can
>>>> all modification atomic be compiled to monotonic? And when
should we
>>>> use "unordered"?
>>>
>>> http://llvm.org/docs/Atomics.html is an attempt to make things much
>>> more straightforward than the stuff in LangRef.
>>
>> This is cool.
>>
>> At the end of the "optimization outside atomic" section there
are
>> discussions about "returning undef". Is it the following
correct?
>> * a store/store data race in LLVM leads to undefined behaviors,
>
> What exactly is a store-store "race"?  That sounds wrong.
>
>> * a store/load data race does not result in undefined behavior, but
>> the load returns undef
>> * if two memory accesses are of data races, then at least one of them
>> is NonAtomic.
>
> The model isn't really defined in terms of races, but these two sound
> roughly correct.
>
>> My question is suppose a load L and a store S have a data race, and L
>> runs earlier than S in an execution, L is well-ordered with earlier
>> writes by happens-before, then at the point when L runs, but S has not
>> run yet, should the L also return undef or what ever write it can see
>> w/o races so far?
>>
>> Although non-synchronized writes from other threads may propagate to
>> another thread in different orders, but the writes that a read from a
>> different thread can see should have already executed before the read
>> (in a global time). So in the above case, it seems fine to allow the
>> load to return a 'defined' value. Is there any case that makes
'undef'
>> possible?
>
> If there is a load and a store to the same address with no
> happens-before relationship, the load returns undef.  "L runs earlier
> than S in an execution" doesn't make sense; if the load and store
> aren't atomic, the compiler is allowed to, for example, rematerialize
> the load, so that it happens both before and after the store.
I agree that if we consider all possible executions of the program,
"L runs earlier
than S in an execution" doesn't make sense, since L and S can run in
any orders.

My confusion was more about a concrete execution of the program. If
the execution schedules S before L, then L is reasonable to return any
values (at least the write from S, or the recent write happens before
L), so undef is fine with me.

If the execution runs L earlier than S, and L can only see one write
till the point, can the L still return any value? L can return any
value in term of the C++ DRF memory model because eventually the L has
a data race with the future S, which makes the program behavior
undefined--- so reading any value is a good behavior.

At this case, LLVM only allows some particular behaviors--namely, the
load can return any value, but returning a value that none earlier
writes have does not seem to be consistent for the execution. Did I
misunderstand any concept here?
>
>
> -Eli
>


-- 
Jianzhou

Eli Friedman

2011-Aug-22 20:22 UTC

head link

[LLVMdev] Reviving the new LLVM concurrency model

On Mon, Aug 22, 2011 at 1:09 PM, Jianzhou Zhao <jianzhou at
seas.upenn.edu> wrote:> On Mon, Aug 22, 2011 at 3:02 PM, Eli Friedman <eli.friedman at
gmail.com> wrote:
>> On Mon, Aug 22, 2011 at 11:17 AM, Jianzhou Zhao <jianzhou at
seas.upenn.edu> wrote:
>>> On Mon, Aug 22, 2011 at 1:02 PM, Eli Friedman <eli.friedman at
gmail.com> wrote:
>>>> On Mon, Aug 22, 2011 at 9:55 AM, Jianzhou Zhao <jianzhou at
seas.upenn.edu> wrote:
>>>>> In the definition of 'monotonic' ordering,
>>>>> ... "If an address is written monotonically by one
thread, and other
>>>>> threads monotonically read that address repeatedly, the
other threads
>>>>> must eventually see the write"...
>>>>
>>>> It's supposed to mean that if you have a something like
looks like a
>>>> spinloop with monotonic reads, it shouldn't spin forever if
the value
>>>> changes.  I'll take another look at rewording that.
>>>>
>>>>> Does this mean if a thread does multi-writes monotonically,
monotonic
>>>>> reads from other threads should see all of them? But
intuitively, it
>>>>> seems to be fine for a read to ``miss'' some of the
writes as long as
>>>>> the writes seen are monotonic in the sense that later reads
should see
>>>>> the same write of earlier reads, or any write monotonically
after the
>>>>> writes seen.
>>>>>
>>>>> In the case there is only one monotonic write, what does
'eventually'
>>>>> mean? Can we know a write must be seen when some condition
holds, for
>>>>> example, a number of instructions executed, the thread that
did the
>>>>> write executes a fence, ...?
>>>>>
>>>>> C++ memory model does not have ``unordered'', and
"monotonic", but
>>>>> have "modification ordering" (is it same to the
relaxed atomic
>>>>> variables the LLVM IR mentions?). If I am compiling C++ to
LLVM, can
>>>>> all modification atomic be compiled to monotonic? And when
should we
>>>>> use "unordered"?
>>>>
>>>> http://llvm.org/docs/Atomics.html is an attempt to make things
much
>>>> more straightforward than the stuff in LangRef.
>>>
>>> This is cool.
>>>
>>> At the end of the "optimization outside atomic" section
there are
>>> discussions about "returning undef". Is it the following
correct?
>>> * a store/store data race in LLVM leads to undefined behaviors,
>>
>> What exactly is a store-store "race"?  That sounds wrong.
>>
>>> * a store/load data race does not result in undefined behavior, but
>>> the load returns undef
>>> * if two memory accesses are of data races, then at least one of
them
>>> is NonAtomic.
>>
>> The model isn't really defined in terms of races, but these two
sound
>> roughly correct.
>>
>>> My question is suppose a load L and a store S have a data race, and
L
>>> runs earlier than S in an execution, L is well-ordered with earlier
>>> writes by happens-before, then at the point when L runs, but S has
not
>>> run yet, should the L also return undef or what ever write it can
see
>>> w/o races so far?
>>>
>>> Although non-synchronized writes from other threads may propagate
to
>>> another thread in different orders, but the writes that a read from
a
>>> different thread can see should have already executed before the
read
>>> (in a global time). So in the above case, it seems fine to allow
the
>>> load to return a 'defined' value. Is there any case that
makes 'undef'
>>> possible?
>>
>> If there is a load and a store to the same address with no
>> happens-before relationship, the load returns undef.  "L runs
earlier
>> than S in an execution" doesn't make sense; if the load and
store
>> aren't atomic, the compiler is allowed to, for example,
rematerialize
>> the load, so that it happens both before and after the store.
>
> I agree that if we consider all possible executions of the program,
> "L runs earlier
> than S in an execution" doesn't make sense, since L and S can run
in any orders.
>
> My confusion was more about a concrete execution of the program. If
> the execution schedules S before L, then L is reasonable to return any
> values (at least the write from S, or the recent write happens before
> L), so undef is fine with me.
>
> If the execution runs L earlier than S, and L can only see one write
> till the point, can the L still return any value? L can return any
> value in term of the C++ DRF memory model because eventually the L has
> a data race with the future S, which makes the program behavior
> undefined--- so reading any value is a good behavior.
>
> At this case, LLVM only allows some particular behaviors--namely, the
> load can return any value, but returning a value that none earlier
> writes have does not seem to be consistent for the execution. Did I
> misunderstand any concept here?
For non-atomic loads, we don't guarantee that a load corresponds to a
single hardware memory access.  Fo example, a 64-bit load on x86-32
will generally be split. See also
http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2007/n2176.html#undefined
.  Also,

-Eli

Jianzhou Zhao

2011-Aug-22 20:24 UTC

head link

[LLVMdev] Reviving the new LLVM concurrency model

On Mon, Aug 22, 2011 at 3:02 PM, Eli Friedman <eli.friedman at gmail.com>
wrote:> On Mon, Aug 22, 2011 at 11:17 AM, Jianzhou Zhao <jianzhou at
seas.upenn.edu> wrote:
>> On Mon, Aug 22, 2011 at 1:02 PM, Eli Friedman <eli.friedman at
gmail.com> wrote:
>>> On Mon, Aug 22, 2011 at 9:55 AM, Jianzhou Zhao <jianzhou at
seas.upenn.edu> wrote:
>>>> In the definition of 'monotonic' ordering,
>>>> ... "If an address is written monotonically by one thread,
and other
>>>> threads monotonically read that address repeatedly, the other
threads
>>>> must eventually see the write"...
>>>
>>> It's supposed to mean that if you have a something like looks
like a
>>> spinloop with monotonic reads, it shouldn't spin forever if the
value
>>> changes.  I'll take another look at rewording that.
>>>
>>>> Does this mean if a thread does multi-writes monotonically,
monotonic
>>>> reads from other threads should see all of them? But
intuitively, it
>>>> seems to be fine for a read to ``miss'' some of the
writes as long as
>>>> the writes seen are monotonic in the sense that later reads
should see
>>>> the same write of earlier reads, or any write monotonically
after the
>>>> writes seen.
>>>>
>>>> In the case there is only one monotonic write, what does
'eventually'
>>>> mean? Can we know a write must be seen when some condition
holds, for
>>>> example, a number of instructions executed, the thread that did
the
>>>> write executes a fence, ...?
>>>>
>>>> C++ memory model does not have ``unordered'', and
"monotonic", but
>>>> have "modification ordering" (is it same to the
relaxed atomic
>>>> variables the LLVM IR mentions?). If I am compiling C++ to
LLVM, can
>>>> all modification atomic be compiled to monotonic? And when
should we
>>>> use "unordered"?
>>>
>>> http://llvm.org/docs/Atomics.html is an attempt to make things much
>>> more straightforward than the stuff in LangRef.
>>
>> This is cool.
>>
>> At the end of the "optimization outside atomic" section there
are
>> discussions about "returning undef". Is it the following
correct?
>> * a store/store data race in LLVM leads to undefined behaviors,
>
> What exactly is a store-store "race"?  That sounds wrong.
If a program has two stores that are not ordered by happens-before
relations, does the program have undefined behaviors?

At the end of the "optimization outside atomic" section

...
However, LLVM is not allowed to transform the former to the latter: it
could introduce undefined behavior if another thread can access x at
the same time.
...

Is the ``undefined behavior'' introduced by the addition write "x
xtemp"? For example, if a[i] is always false in the original program,
there is no the write.
But I don't think "int xtemp = x;" can introduce undefined
behavior,
if LLVM does not take load/store race as undefined. Am I correct?
>
>> * a store/load data race does not result in undefined behavior, but
>> the load returns undef
>> * if two memory accesses are of data races, then at least one of them
>> is NonAtomic.
>
> The model isn't really defined in terms of races, but these two sound
> roughly correct.
>
>> My question is suppose a load L and a store S have a data race, and L
>> runs earlier than S in an execution, L is well-ordered with earlier
>> writes by happens-before, then at the point when L runs, but S has not
>> run yet, should the L also return undef or what ever write it can see
>> w/o races so far?
>>
>> Although non-synchronized writes from other threads may propagate to
>> another thread in different orders, but the writes that a read from a
>> different thread can see should have already executed before the read
>> (in a global time). So in the above case, it seems fine to allow the
>> load to return a 'defined' value. Is there any case that makes
'undef'
>> possible?
>
> If there is a load and a store to the same address with no
> happens-before relationship, the load returns undef.  "L runs earlier
> than S in an execution" doesn't make sense; if the load and store
> aren't atomic, the compiler is allowed to, for example, rematerialize
> the load, so that it happens both before and after the store.
>
>
> -Eli
>


-- 
Jianzhou

Eli Friedman

2011-Aug-22 20:37 UTC

head link

[LLVMdev] Reviving the new LLVM concurrency model

On Mon, Aug 22, 2011 at 1:24 PM, Jianzhou Zhao <jianzhou at
seas.upenn.edu> wrote:> On Mon, Aug 22, 2011 at 3:02 PM, Eli Friedman <eli.friedman at
gmail.com> wrote:
>> On Mon, Aug 22, 2011 at 11:17 AM, Jianzhou Zhao <jianzhou at
seas.upenn.edu> wrote:
>>> On Mon, Aug 22, 2011 at 1:02 PM, Eli Friedman <eli.friedman at
gmail.com> wrote:
>>>> On Mon, Aug 22, 2011 at 9:55 AM, Jianzhou Zhao <jianzhou at
seas.upenn.edu> wrote:
>>>>> In the definition of 'monotonic' ordering,
>>>>> ... "If an address is written monotonically by one
thread, and other
>>>>> threads monotonically read that address repeatedly, the
other threads
>>>>> must eventually see the write"...
>>>>
>>>> It's supposed to mean that if you have a something like
looks like a
>>>> spinloop with monotonic reads, it shouldn't spin forever if
the value
>>>> changes.  I'll take another look at rewording that.
>>>>
>>>>> Does this mean if a thread does multi-writes monotonically,
monotonic
>>>>> reads from other threads should see all of them? But
intuitively, it
>>>>> seems to be fine for a read to ``miss'' some of the
writes as long as
>>>>> the writes seen are monotonic in the sense that later reads
should see
>>>>> the same write of earlier reads, or any write monotonically
after the
>>>>> writes seen.
>>>>>
>>>>> In the case there is only one monotonic write, what does
'eventually'
>>>>> mean? Can we know a write must be seen when some condition
holds, for
>>>>> example, a number of instructions executed, the thread that
did the
>>>>> write executes a fence, ...?
>>>>>
>>>>> C++ memory model does not have ``unordered'', and
"monotonic", but
>>>>> have "modification ordering" (is it same to the
relaxed atomic
>>>>> variables the LLVM IR mentions?). If I am compiling C++ to
LLVM, can
>>>>> all modification atomic be compiled to monotonic? And when
should we
>>>>> use "unordered"?
>>>>
>>>> http://llvm.org/docs/Atomics.html is an attempt to make things
much
>>>> more straightforward than the stuff in LangRef.
>>>
>>> This is cool.
>>>
>>> At the end of the "optimization outside atomic" section
there are
>>> discussions about "returning undef". Is it the following
correct?
>>> * a store/store data race in LLVM leads to undefined behaviors,
>>
>> What exactly is a store-store "race"?  That sounds wrong.
>
> If a program has two stores that are not ordered by happens-before
> relations, does the program have undefined behaviors?
In the LLVM memory model, you cannot get undefined behavior by any
combination of non-atomic loads and stores to valid memory locations;
the undefined behavior only comes into play when you try to use an
'undef'. (although a load which subsequently examines the result would
be 'undef').
> At the end of the "optimization outside atomic" section
>
> ...
> However, LLVM is not allowed to transform the former to the latter: it
> could introduce undefined behavior if another thread can access x at
> the same time.
> ...
>
> Is the ``undefined behavior'' introduced by the addition write
"x > xtemp"? For example, if a[i] is always false in the original
program,
> there is no the write.
Yes (indirectly).
> But I don't think "int xtemp = x;" can introduce undefined
behavior,
> if LLVM does not take load/store race as undefined. Am I correct?
Yes, the speculative load is allowed.

-Eli

Apparently Analagous Threads

Search for more possibly parallel threads

llvm dev - Aug 2011 - [LLVMdev] Reviving the new LLVM concurrency model

[LLVMdev] Reviving the new LLVM concurrency model

[LLVMdev] Reviving the new LLVM concurrency model

[LLVMdev] Reviving the new LLVM concurrency model

[LLVMdev] Reviving the new LLVM concurrency model

[LLVMdev] Reviving the new LLVM concurrency model

Apparently Analagous Threads